Software Is Eating the World (But Actually This Time)
Coding was just the first workload, and almost everyone is underestimating how much inference demand will grow.
Topic Guide · 10 essays
Inference, agents, routing, evaluation, and model tooling.
These essays are about the machinery underneath modern AI products: inference constraints, orchestration layers, model tooling, and the developer workflows that emerge around them.
If you care more about how systems actually behave than about benchmark theater, start here. The through line is that once models are useful, the hard problems move into routing, evaluation, interfaces, and operations.
A short note on distilling Qwen2.5-Coder-7B for numeric vulnerability triage, where a narrow specialist slightly outperformed GPT-5.2 on a real benchmark.
A technical deep dive into inference optimizations for background coding agents, from semantic caching to speculative execution and smarter scheduling.
AI code generation is flooding repos with changes. The next bottleneck in software is clearing, sequencing, testing, and deploying them safely.
AI models, reasoning, tool use, and real-world data.
Building basketball shot-classification models taught me a classic ML lesson: sometimes human operations beat a more complicated AI system.
An exploration of the role vector databases play in managing multimodal data, including video and voice content.
How Attentive leveraged LLMs to rapidly prototype a product-recommendations model, cutting development time from months to a week.
How Attentive integrated LLMs into traditional ML workflows, focusing on data cleaning and organization.
This post explores an ML recommendation engine built around a self-serve portal, where companies match their data with suitable ML…