Siddharth Ramakrishnan

Writing

Topic Guide · 8 essays

AI Systems

Inference, agents, routing, evaluation, and model tooling.

These essays are about the machinery underneath modern AI products: inference constraints, orchestration layers, model tooling, and the developer workflows that emerge around them.

If you care more about how systems actually behave than about benchmark theater, start here. The through line is that once models are useful, the hard problems move into routing, evaluation, interfaces, and operations.

Title card for "Background Agent Optimizations" about How caching, routing, and scheduling change when coding agents run off the critical path.

Background Agent Optimizations

March 8, 2026

A technical deep dive into inference optimizations for background coding agents, from semantic caching to speculative execution and smarter scheduling.

Title card for "The Clearinghouse for Code" about Why code review is only the first bottleneck in an agent-scale software delivery stack.

The Clearinghouse for Code

March 17, 2026

AI code generation is flooding repos with changes. The next bottleneck in software is clearing, sequencing, testing, and deploying them safely.

Title card for "A Bitter Lesson" about A computer vision side quest on hooper.gg ended with a cheaper, simpler answer: pay labelers.

A Bitter Lesson

August 14, 2025

Building basketball shot-classification models taught me a classic ML lesson: sometimes human operations beat a more complicated AI system.

Title card for "Vector DBs 🤝 Multimodality" about Exploring the pivotal role of vector databases in managing multimodal data, including video and voice content.

Vector DBs

May 25, 2024

Exploring the pivotal role of vector databases in managing multimodal data, including video and voice content.

Title card for "Prototyping ML Models" about Explore how Attentive leveraged LLMs to rapidly prototype a product recommendations model, significantly reducing development time from months to just a week.

Prototyping ML Models

May 10, 2024

Explore how Attentive leveraged LLMs to rapidly prototype a product recommendations model, significantly reducing development time from months to just a week.

Title card for "LLMs for ML" about Explore the integration of LLMs in enhancing traditional ML workflows at Attentive, focusing on data cleaning and organization.

LLMs in ML

May 14, 2024

Explore the integration of LLMs in enhancing traditional ML workflows at Attentive, focusing on data cleaning and organization.

Title card for "Next Top Model" about Discussing an innovative ML Recommendation Engine, this blog post explores a self-serve portal idea for companies to match their data with suitable ML….

Next Top Model

May 12, 2024

Discussing an innovative ML Recommendation Engine, this blog post explores a self-serve portal idea for companies to match their data with suitable ML…