Community Blog & Articles

Community Articles

The Optimal Architecture for Small Language Models

Deriving the PPO Loss from First Principles

Continuity as a First-Class System Property in Artificial Intelligence

KV Caching Explained: Optimizing Transformer Inference Efficiency

Uncensor any LLM with abliteration

LLM based Audio models

Skill is All You Need: Lessons from Building Marketing Agents at Noumena

Deriving the DPO Loss from First Principles

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

What makes good reasoning data

Nemotron 3 Nano - A new Standard for Efficient, Open, and Intelligent Agentic Models

Red Teaming with RL: Exploiting Tinker API for Harmful RL on 235B Model

about 12 hours ago

Mastering Tensor Dimensions in Transformers

Small Language Models (SLM): A Comprehensive Overview

Qwen-Image-i2L: Training Strategies for Image-to-LoRA Generation

Code a simple RAG from scratch

Why Did MiniMax M2 End Up as a Full Attention Model?

Why You Should Care About Partial Differential Equations (PDEs)

Encoding the World's Medical Knowledge into 970K

Topic 23: What is LLM Inference, it's challenges and solutions for it

AprielGuard: A Guardrail for Safety and Adversarial Robustness in Modern LLM Systems

December 23, 2025

tokenizers, transformers, open-source

Tokenization in Transformers v5: Simpler, Clearer, and More Modular

December 18, 2025

The Open Evaluation Standard: Benchmarking NVIDIA Nemotron 3 Nano with NeMo Evaluator

December 17, 2025

CUGA on Hugging Face: Democratizing Configurable AI Agents

December 15, 2025

New in llama.cpp: Model Management

December 11, 2025

llm, fine-tuning, open-source

Codex is Open Sourcing AI models

December 11, 2025

swift, hub, open-source

Introducing swift-huggingface: The Complete Swift Client for Hugging Face

December 5, 2025

llm, reasoning, agents

DeepMath: A lightweight math reasoning Agent with smolagents

December 4, 2025

llm, fine-tuning, open-source

We Got Claude to Fine-Tune an Open Source LLM

December 4, 2025

transformers, v5, community

Transformers v5: Simple model definitions powering the AI ecosystem

December 1, 2025

diffusers, flux, quantization

Diffusers welcomes FLUX-2

November 25, 2025

transformers, pytorch, optimization

Continuous batching from first principles

November 25, 2025

Building Deep Research: How we Achieved State of the Art

November 24, 2025

OVHcloud on Hugging Face Inference Providers 🔥

November 24, 2025
