Community Blog & Articles

Community Articles

Nemotron 3 Nano \- A new Standard for Efficient, Open, and Intelligent Agentic Models

Qwen-Image-i2L: Training Strategies for Image-to-LoRA Generation

EuroLLM-22B

Why You Should Care About Partial Differential Equations (PDEs)

MiniGuard-v0.1: Prem's Guardrail Model Redefining the Pareto Frontier

Spinning Up a CPU-Only Micro-LLM with LoRA for Literary Style

Gotchas in Tokenizer Behavior Every Developer Should Know

How We Use Claude Code Skills to Run 1,000+ ML Experiments a Day

LLM based TTS models

Make and publish your Reachy Mini App

Phare LLM benchmark V2: Reasoning models don't guarantee better security

Announcing LiteCoder-Terminal: Lightweight Terminal Agents with <1k Synthesized Trajectories

KV Caching Explained: Optimizing Transformer Inference Efficiency

Uncensor any LLM with abliteration

What is the Hugging Face Community Building?

Muon vs MuonClip vs Muon+AdamW for Fine-Tuning

I Built a RAG System That Listens to Live BBC News and Answers Questions About "What Happened 10 Minutes Ago"

cua-bench: A Framework for Benchmarking, Training Data, and RL Environments for Computer-Use Agents

Open Collaboration in Action: Inside the Open Safeguard Hackathon

about 14 hours ago

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

open-source-collab

Hugging Face on PyTorch / XLA TPUs

February 9, 2021

Faster TensorFlow models in Hugging Face Transformers

January 26, 2021

Fit More and Train Faster With ZeRO via DeepSpeed and FairScale

January 19, 2021

How we sped up transformer inference 100x for 🤗 API customers

January 18, 2021

Leveraging Pre-trained Language Model Checkpoints for Encoder-Decoder Models

November 9, 2020

open-source-collabnlp

Porting fairseq wmt19 translation system to transformers

November 3, 2020

open-source-collabnlp

Hyperparameter Search with Transformers and Ray Tune

November 2, 2020

Transformer-based Encoder-Decoder Models

October 10, 2020

Block Sparse Matrices for Smaller and Faster Language Models

September 10, 2020

The Reformer - Pushing the limits of language modeling

How to generate text: using different decoding methods for language generation with Transformers

How to train a new language model from scratch using Transformers and Tokenizers

February 14, 2020

Community Articles

NEW Articles from Team or Enterprise organizations will get promoted to the main section.

Nemotron 3 Nano \- A new Standard for Efficient, Open, and Intelligent Agentic Models

Qwen-Image-i2L: Training Strategies for Image-to-LoRA Generation

EuroLLM-22B

Why You Should Care About Partial Differential Equations (PDEs)

MiniGuard-v0.1: Prem's Guardrail Model Redefining the Pareto Frontier

Spinning Up a CPU-Only Micro-LLM with LoRA for Literary Style

Gotchas in Tokenizer Behavior Every Developer Should Know

How We Use Claude Code Skills to Run 1,000+ ML Experiments a Day

LLM based TTS models

Make and publish your Reachy Mini App

Phare LLM benchmark V2: Reasoning models don't guarantee better security

Announcing LiteCoder-Terminal: Lightweight Terminal Agents with <1k Synthesized Trajectories

KV Caching Explained: Optimizing Transformer Inference Efficiency

Uncensor any LLM with abliteration

What is the Hugging Face Community Building?

Muon vs MuonClip vs Muon+AdamW for Fine-Tuning

I Built a RAG System That Listens to Live BBC News and Answers Questions About "What Happened 10 Minutes Ago"

cua-bench: A Framework for Benchmarking, Training Data, and RL Environments for Computer-Use Agents

Open Collaboration in Action: Inside the Open Safeguard Hackathon

about 14 hours ago

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

View all articles