Debtirtha Saha
I reimplement the architectures I want to understand, then write down how it actually went.
latest posts
| Jun 07, 2026 | stateful-agent: an agent that actually remembers |
|---|---|
| Jun 07, 2026 | A VLA from scratch: 29% tokens, 0% grasps, and a GRPO that wouldn't budge |
| May 23, 2026 | A haiku VLM: SFT did the work, KTO collapsed at λ=1.0 |
| May 22, 2026 | A 7B math fine-tune on 8× H100: SFT +6.4, DPO +0.6 |
| May 17, 2026 | Eight A100s, $61, and 124M parameters |
| Apr 25, 2026 | BPE from scratch, and why your LLM can't count L's |