Community Blog & Articles

Community Articles

olmo-eval: An evaluation workbench for the model development loop

Profiling in PyTorch (Part 2): From nn.Linear to a Fused MLP

Introducing North Mini Code: Cohere’s First Model For Developers

How an Agent Built a 3D Paris Gallery by Chaining Two Hugging Face Spaces

Migrating Your GitHub CI to Hugging Face Jobs

The Open Source Community is backing OpenEnv for Agentic RL

Nemotron 3.5 Content Safety: Customizable Multimodal Safety for Global Enterprise AI

Designing the hf CLI as an agent-optimized way to work with the Hub

Direct Preference Optimization Beyond Chatbots

Adding MCP Tools to Reachy Mini

Holo3.1: Fast & Local Computer Use Agents

Introducing Mellum2: A 12B Mixture-of-Experts Model by JetBrains

Beyond LLMs: Why Scalable Enterprise AI Adoption Depends on Agent Logic

Profiling in PyTorch (Part 1): A Beginner's Guide to torch.profiler

NEW Articles from Team or Enterprise organizations will get promoted to the main section.

Community Blog & Articles

Can Voice Agents Handle Bilingual Customers? Benchmarking Frontier ASR on Code-Switched Speech

Arcee Becomes the First Major American AI Lab to Replace AWS S3 with Hugging Face Private Storage, in a Multi-Million Dollar Commercial Partnership

MTEB Leaderboard: From a slow demo to feature-rich leaderboard

How to Fine-Tune Nemotron 3.5 ASR for Your Language, Domain, or Accent

Eyes, ears, and a voice: building Reachy Mini's media stack

Her · हेर — a detective for your Claude Code sessions

Welcome NVIDIA Cosmos 3: The First Open Omni-model for Physical AI Reasoning and Action

36 Prompts, One Infinite City

Lolaby — AI-powered lullabies

Fine-tune FLUX.2 [klein] with a LoRA under 60 minutes

Introducing Serge: GitHub-Native AI Code Review

LeRobot Humanoid: An Open, Low-Cost, 3D-Printed Humanoid for Robot Learning

EVA-Bench Data 2.0: 3 Domains, 121 Tools, 213 Scenarios

Task-Seeded Synthetic Q&A Generation for Nemotron Pretraining

Code a simple RAG from scratch

MiniMax-01 is Now Open-Source: Scaling Lightning Attention for the AI Agent Era

KV Caching Explained: Optimizing Transformer Inference Efficiency

Build Small Hackathon With Cohere Models

Thousand Token Wood: shipping a multi-agent economy on a 3B model

Run Claude Code, OpenCode & Frontier Coding Models on Your Own AI Infrastructure with DEH

olmo-eval: An evaluation workbench for the model development loop

Profiling in PyTorch (Part 2): From nn.Linear to a Fused MLP

Introducing North Mini Code: Cohere’s First Model For Developers

How an Agent Built a 3D Paris Gallery by Chaining Two Hugging Face Spaces

Migrating Your GitHub CI to Hugging Face Jobs

The Open Source Community is backing OpenEnv for Agentic RL

Nemotron 3.5 Content Safety: Customizable Multimodal Safety for Global Enterprise AI

Designing the hf CLI as an agent-optimized way to work with the Hub

Direct Preference Optimization Beyond Chatbots

Adding MCP Tools to Reachy Mini

Holo3.1: Fast & Local Computer Use Agents

Introducing Mellum2: A 12B Mixture-of-Experts Model by JetBrains

Beyond LLMs: Why Scalable Enterprise AI Adoption Depends on Agent Logic

Profiling in PyTorch (Part 1): A Beginner's Guide to torch.profiler

Can Voice Agents Handle Bilingual Customers? Benchmarking Frontier ASR on Code-Switched Speech

Arcee Becomes the First Major American AI Lab to Replace AWS S3 with Hugging Face Private Storage, in a Multi-Million Dollar Commercial Partnership

MTEB Leaderboard: From a slow demo to feature-rich leaderboard

How to Fine-Tune Nemotron 3.5 ASR for Your Language, Domain, or Accent

Eyes, ears, and a voice: building Reachy Mini's media stack

Her · हेर — a detective for your Claude Code sessions

Welcome NVIDIA Cosmos 3: The First Open Omni-model for Physical AI Reasoning and Action

36 Prompts, One Infinite City

Lolaby — AI-powered lullabies

Fine-tune FLUX.2 [klein] with a LoRA under 60 minutes

Introducing Serge: GitHub-Native AI Code Review

LeRobot Humanoid: An Open, Low-Cost, 3D-Printed Humanoid for Robot Learning

EVA-Bench Data 2.0: 3 Domains, 121 Tools, 213 Scenarios

Task-Seeded Synthetic Q&A Generation for Nemotron Pretraining

Code a simple RAG from scratch

MiniMax-01 is Now Open-Source: Scaling Lightning Attention for the AI Agent Era

KV Caching Explained: Optimizing Transformer Inference Efficiency

Build Small Hackathon With Cohere Models

Thousand Token Wood: shipping a multi-agent economy on a 3B model

Run Claude Code, OpenCode & Frontier Coding Models on Your Own AI Infrastructure with DEH