Hi, Alex 👌
alexdremov.me
📚 MIPT alumnus — graduated with honors in PSAMI Informatics and Computational Technologies.
🇨🇭 EPFL student — current Master's in Data Science student
- Compute-Optimal Quantization-Aware Training — Aleksandr Dremov, David Grangier, Angelos Katharopoulos, Awni Hannun
ICLR 2026 - Training dynamics of the cooldown stage in warmup-stable-decay learning rate scheduler — Aleksandr Dremov, Alexander Hägele, Atli Kosson, Martin Jaggi
TMLR, J2C Certification (ICLR 2026)
- 🔥 Understanding Flash Attention: Writing the Algorithm from Scratch in Triton
- 🌮 Speed Up PyTorch With Custom Kernels. But It Gets Progressively Darker
- 🔥 Simple Ways to Speed Up Your PyTorch Model Training
- 🚀 Swift Actors — Common Problems and Tips
- ❤️ I Contributed to PyTorch. Here's What I Learned
| Aleksandr Dremov | @aldrmv | alex@alexdremov.me |





