Building AI infrastructure — from data pipelines to GPU kernels. I write about data engineering, LLMOps, GPU optimization, and the systems behind reinforcement learning.
Every LLM is built on five foundational pillars: Basics, Systems, Scaling Laws, Data, and Alignment. This post maps out what they are and why mastering them is the path to building real AI systems.
Test-time training lets models update their own weights during inference. Learn how TTT layers work, their GPU implications, and why this changes AI infrastructure.
RAG isn't magic: it's Extract, Transform, Load with vectors. I break down how your existing pipeline skills map directly to building production AI systems.
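To make the ETL framing concrete, here is a minimal, dependency-free sketch of a RAG index as an extract-transform-load pipeline. The `embed` function is a toy stand-in (a bag-of-characters hash, purely for illustration); a real system would use a model-backed embedder and a proper vector database.

```python
import math

def embed(text: str, dim: int = 8) -> list[float]:
    # Toy embedding: fold character counts into a fixed-size,
    # L2-normalized vector. A real pipeline would call a model here.
    vec = [0.0] * dim
    for ch in text.lower():
        vec[ord(ch) % dim] += 1.0
    norm = math.sqrt(sum(x * x for x in vec)) or 1.0
    return [x / norm for x in vec]

def cosine(a: list[float], b: list[float]) -> float:
    # Vectors are already normalized, so the dot product is cosine similarity.
    return sum(x * y for x, y in zip(a, b))

# Extract: pull raw documents from any source.
docs = [
    "GPU kernels need tuning",
    "ETL pipelines move data",
    "RAG retrieves context for the model",
]

# Transform: chunk (trivially, here) and embed each document.
# Load: the (text, vector) pairs act as an in-memory "vector store".
index = [(doc, embed(doc)) for doc in docs]

def retrieve(query: str) -> str:
    # Query step: embed the question and return the closest document.
    q = embed(query)
    return max(index, key=lambda pair: cosine(q, pair[1]))[0]

print(retrieve("which pipelines move data?"))
```

The point of the sketch is the shape, not the math: extract, transform, and load map one-to-one onto ingestion, embedding, and indexing, so existing pipeline tooling (orchestration, backfills, schema checks) carries over directly.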