-

All you need to know about Chain of Causation reasoning and the current state of…
9 min read -

Why your final LLM layer is OOMing and how to fix it with a custom…
18 min read -

All you need to know about a fast, readable and PyTorch-ready softmax kernel!
10 min read -

Tiled GEMM, GPU memory, coalescing, and much more!
13 min read -

Temporal-Difference Learning and the Importance of Exploration: An Illustrated Guide
Artificial IntelligenceComparing model-free and model-based RL methods on a dynamic grid world
18 min read -

The basics of GPU programming, optimisation, and your first Triton kernel
9 min read -

Everything you need to assemble the DQN Megazord in JAX.
20 min read -

All the tricks and details you wish you knew about PPO
9 min read -

Solving the CartPole environment with DQN in under a second
11 min read -

Understanding the fundamental building blocks of Transformers.
14 min read