2025  12

June  3

Why I think Tesla’s Robotaxi will do laps on Waymo

June 24, 2025 · 1 min · Computer Vision & Personal Opinions

Deploying a production-grade ML model: a full walkthrough

June 14, 2025 · 1 min · ML Systems

Beyond Accuracy: Why Most ML Models Fail in Production (and what real-world ML looks like)

June 6, 2025 · 1 min · ML Systems

May  1

Designing AlphaTicTacToe (and crawling towards AlphaTicTacToeZero)

May 20, 2025 · 1 min · ML Theory

April  1

Mamba (and its peers) aren’t replacing Attention anytime soon

April 3, 2025 · 1 min · ML Theory + Code

March  2

Vision Models are truly underrated

March 10, 2025 · 1 min · ML Theory + Code

Building Transformers and Multi-head attention from scratch

March 3, 2025 · 1 min · ML Theory + Code

February  2

How did we get to transformers? The rise of the Attention Mechanism

February 28, 2025 · 1 min · ML Theory + Code

Crypto’s asymmetric upside: HODL Ethereum for the next 20 years

February 14, 2025 · 1 min · Crypto Theory + Code

January  3

Why VAEs Changed the Way I Think About Modeling

January 27, 2025 · 1 min · ML Theory

Word2Vec from scratch: Intuition to Implementation

January 10, 2025 · 11 min · ML Theory + Code

What are Xavier and He initializations? Why do they (almost always) help our eval?

January 4, 2025 · 1 min · ML Theory + Code

2024  1

December  1

My goals for this blog

December 29, 2024 · 1 min · Personal thoughts