Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
Community Blog & Articles
New Article
community
guide
open source collab
partnerships
research
NLP
Audio
CV
RL
ethics
Diffusion
Game Development
RLHF
Leaderboard
Case Studies
LeRobot
Inference Providers
Community Articles
view all
ML Intern Takes Our Post-Training Internship Test
7 days ago
•
28
NEO-unify: Building Native Multimodal Unified Models End to End
Mar 5
•
146
Pallas for people who know JAX but not kernels yet
1 day ago
•
13
KV Caching Explained: Optimizing Transformer Inference Efficiency
Jan 30, 2025
•
313
Running AI agents to automate outreach at scale
3 days ago
•
10
Gemma 4 VLA Demo on Jetson Orin Nano Super
8 days ago
•
9
OpenRA-RL: An Open Platform for AI Agents in Real-Time Strategy Games
3 days ago
•
8
DenseOn with the LateOn: Open State-of-the-Art Single and Multi-Vector Models
9 days ago
•
36
Mastering Tensor Dimensions in Transformers
Jan 12, 2025
•
168
Adaptive Ultrasound Imaging with Physics-Informed NV-Raw2Insights-US AI
3 days ago
•
4
Multilingual Tool Calling in 70+ Languages, On Device
10 days ago
•
9
Hy3 preview: A Rebuilt Hunyuan, a 21B-Active MoE, and a New Reasoning Receipe
7 days ago
•
5
Code a simple RAG from scratch
Oct 29, 2024
•
327
Performant local mixture-of-experts CPU inference with GPU acceleration in llama.cpp
Jan 30
•
22
Building a Fast Multilingual OCR Model with Synthetic Data
13 days ago
•
31
Darwin-TTS: We Gave a TTS Model 3% of an LLM's Brain — It Started Showing Emotion
15 days ago
•
13
Building long-horizon SWE environments on Hugging Face: Frontier SWE × OpenEnv
4 days ago
•
3
BiomedBERT Small: Medical models at 22.7M parameters
2 days ago
•
3
The MCP Era Feels Like Déjà Vu
about 22 hours ago
•
3
The Large Language Model Course
Jan 16, 2025
•
228
research
nlp
Block Sparse Matrices for Smaller and Faster Language Models
September 10, 2020
research
nlp
The Reformer - Pushing the limits of language modeling
3
July 3, 2020
guide
nlp
Hot
How to generate text: using different decoding methods for language generation with Transformers
295
March 1, 2020
guide
nlp
How to train a new language model from scratch using Transformers and Tokenizers
61
February 14, 2020
Previous
1
...
54
55
56
Next
Community Articles
Sort: Trending
NEW
Articles from
Team
or
Enterprise
organizations will get promoted to the main section.
ML Intern Takes Our Post-Training Internship Test
7 days ago
•
28
NEO-unify: Building Native Multimodal Unified Models End to End
Mar 5
•
146
Pallas for people who know JAX but not kernels yet
1 day ago
•
13
KV Caching Explained: Optimizing Transformer Inference Efficiency
Jan 30, 2025
•
313
Running AI agents to automate outreach at scale
3 days ago
•
10
Gemma 4 VLA Demo on Jetson Orin Nano Super
8 days ago
•
9
OpenRA-RL: An Open Platform for AI Agents in Real-Time Strategy Games
3 days ago
•
8
DenseOn with the LateOn: Open State-of-the-Art Single and Multi-Vector Models
9 days ago
•
36
Mastering Tensor Dimensions in Transformers
Jan 12, 2025
•
168
Adaptive Ultrasound Imaging with Physics-Informed NV-Raw2Insights-US AI
3 days ago
•
4
Multilingual Tool Calling in 70+ Languages, On Device
10 days ago
•
9
Hy3 preview: A Rebuilt Hunyuan, a 21B-Active MoE, and a New Reasoning Receipe
7 days ago
•
5
Code a simple RAG from scratch
Oct 29, 2024
•
327
Performant local mixture-of-experts CPU inference with GPU acceleration in llama.cpp
Jan 30
•
22
Building a Fast Multilingual OCR Model with Synthetic Data
13 days ago
•
31
Darwin-TTS: We Gave a TTS Model 3% of an LLM's Brain — It Started Showing Emotion
15 days ago
•
13
Building long-horizon SWE environments on Hugging Face: Frontier SWE × OpenEnv
4 days ago
•
3
BiomedBERT Small: Medical models at 22.7M parameters
2 days ago
•
3
The MCP Era Feels Like Déjà Vu
about 22 hours ago
•
3
The Large Language Model Course
Jan 16, 2025
•
228
View all articles