Debtirtha Saha

I reimplement the architectures I want to understand, then write down how it actually went.

latest posts

Jun 07, 2026	stateful-agent: an agent that actually remembers
Jun 07, 2026	A VLA from scratch: 29% tokens, 0% grasps, and a GRPO that wouldn't budge
May 23, 2026	A haiku VLM: SFT did the work, KTO collapsed at λ=1.0
May 22, 2026	A 7B math fine-tune on 8× H100: SFT +6.4, DPO +0.6
May 17, 2026	Eight A100s, $61, and 124M parameters
Apr 25, 2026	BPE from scratch, and why your LLM can't count L's