Re: Implementation Series
Author: Shuqi Wang
Welcome to my open notebook.
I've found that the best way to truly understand a concept is to build it from scratch. This series documents my journey as I re-implement influential AI research papers and models, line by line.
The goal isn't to create production-ready software, but to build a strong intuition for how these architectures actually work. I'm keeping my code minimal and pedagogical, focusing on clarity over optimization.
Episode 01: Decoder-Only GPT
We start with the foundation of modern LLMs: the decoder-only transformer. Following Andrej Karpathy's nanoGPT, I build a character-level language model from the ground up, implementing self-attention, multi-head attention, and the standard transformer block in PyTorch.
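The core of that episode is the masked (causal) self-attention head. As a rough sketch of the nanoGPT-style version, one head might look like this (the class name and hyperparameters here are illustrative, not taken from my actual code):

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class CausalSelfAttentionHead(nn.Module):
    """One head of masked (causal) self-attention, nanoGPT-style sketch."""

    def __init__(self, n_embd, head_size, block_size):
        super().__init__()
        self.key = nn.Linear(n_embd, head_size, bias=False)
        self.query = nn.Linear(n_embd, head_size, bias=False)
        self.value = nn.Linear(n_embd, head_size, bias=False)
        # lower-triangular mask: position t may only attend to positions <= t
        self.register_buffer("tril", torch.tril(torch.ones(block_size, block_size)))

    def forward(self, x):
        B, T, C = x.shape
        k = self.key(x)    # (B, T, head_size)
        q = self.query(x)  # (B, T, head_size)
        # scaled dot-product attention scores
        wei = q @ k.transpose(-2, -1) * k.shape[-1] ** -0.5   # (B, T, T)
        wei = wei.masked_fill(self.tril[:T, :T] == 0, float("-inf"))
        wei = F.softmax(wei, dim=-1)
        v = self.value(x)  # (B, T, head_size)
        return wei @ v     # (B, T, head_size)
```

Multi-head attention then just runs several of these heads in parallel and concatenates their outputs before a final linear projection.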
Episode 02: Mixture of Experts (MoE)
Building on the basic GPT, I implement a sparse Mixture of Experts architecture. I explore how to scale model capacity without exploding inference costs by using conditional computation, noisy top-k gating, and an auxiliary loss for load balancing.
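The gating idea can be sketched in a few lines. This is a simplified illustration, not my exact implementation: the noise scheme follows Shazeer et al.'s noisy top-k gating, and the auxiliary loss shown here is a simple squared-coefficient-of-variation penalty on per-expert importance, which is one common choice among several:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class NoisyTopKGate(nn.Module):
    """Noisy top-k gating sketch for a sparse MoE layer (illustrative)."""

    def __init__(self, n_embd, num_experts, k=2):
        super().__init__()
        self.k = k
        self.gate = nn.Linear(n_embd, num_experts, bias=False)
        self.noise = nn.Linear(n_embd, num_experts, bias=False)

    def forward(self, x):
        # x: (tokens, n_embd)
        logits = self.gate(x)
        if self.training:
            # learned Gaussian noise encourages exploration across experts
            logits = logits + torch.randn_like(logits) * F.softplus(self.noise(x))
        topk_vals, topk_idx = logits.topk(self.k, dim=-1)
        # keep only the top-k logits; the rest become -inf, so their
        # softmax weight is exactly zero (sparse routing)
        masked = torch.full_like(logits, float("-inf")).scatter(-1, topk_idx, topk_vals)
        weights = F.softmax(masked, dim=-1)  # (tokens, num_experts)
        # load-balancing auxiliary loss: penalize imbalance in how much
        # total gate weight ("importance") each expert receives
        importance = weights.sum(0)
        aux_loss = importance.var() / (importance.mean() ** 2 + 1e-10)
        return weights, topk_idx, aux_loss
```

Each token's output is then a weighted sum of only its k selected experts, which is what keeps inference cost roughly constant as the expert count grows.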
Upcoming
I'm currently exploring Vision Transformers (ViT) and Diffusion Models. More notes will be added here as I finish them.
If you find these notes helpful or spot any errors in my implementation, feel free to reach out. We learn better together.