Writing

Blog

15 posts

AI Hardware, Cloud Strategy

AI Silicon Strategy 2026: What the Hardware Wars Mean for Your Cloud Spend

OpenAI, Google, Anthropic, and China's Four Dragons are betting billions on different silicon strategies — here's what it means for which cloud you should trust with your AI workloads.

5 min
LLM, Token Optimization

Stop Wasting 80% of Your LLM Tokens: The Caveman + Graphify Framework

A three-step framework — compressed prompting, knowledge graph indexing, and project-specific shorthand — that cuts token usage by 80-90% without sacrificing technical accuracy.

6 min
Code Review, VS Code

AI-Powered Code Review at Scale: Building a Multi-Provider VS Code Extension

How I built a bring-your-own-AI code review VS Code extension that found 12 critical security issues in my own codebase — including SQL injection risks I'd missed for months.

5 min
LLM, AI Architecture

ModelRouter: Intelligent Multi-LLM Routing to Stop Burning Claude Pro Tokens

How I built a local proxy that classifies prompts and routes them to the right model — Gemma for trivial tasks, Gemini for features, Codex for debugging, Claude only for hard architecture problems.

5 min
LLM, Fine-tuning

The Context Gap: Why Frontier LLMs Fail at Local Business Problems

Frontier LLMs are trained on Western-centric internet data and fail at regional supply chains, local business etiquette, and jurisdiction-specific law — here's how localized fine-tuning fixes this on commodity hardware.

5 min
MCP, Infrastructure

MCP as Infrastructure Control Plane: Building a Real-Time Monitoring Dashboard in 2 Hours

How I used a lightweight MCP shell server to build a real-time infrastructure monitoring dashboard — and why MCP is more than just an AI agent protocol.

5 min
AI Architecture, RAG

Beyond RAG: Building AI Systems That Actually Know Your Business

Why vanilla RAG isn't enough, and how the four-layer architecture — memory, identity, tools, and feedback loops — builds AI that genuinely represents how your organization thinks.

6 min
AI, Indic Languages

Building a Multilingual Meeting Notes System for Indian Business: OdiaMeet

How I built a system that transcribes meetings in 10 Indian languages, including code-mixed conversations, and generates clean English meeting minutes at near-zero cost.

5 min
WhatsApp, Conversational AI

WhatsApp as Enterprise Interface: Conversational AI for Business Automation

Why WhatsApp beats native apps for customer-facing business automation, and the architecture pattern that makes it work across bookings, support, onboarding, and more.

6 min
MCP, AI Agents

Building Production AI Agents with MCP and Anthropic Claude

How I built a production-grade AI agent using Model Context Protocol (MCP) connected directly to a client's production database — delivering real-time natural language data intelligence.

3 min
Solr, NoSQL

Apache Solr as a NoSQL Database: When Search Beats Storage

After running 150+ Solr nodes in production at TransUnion CIBIL under RBI regulation, here's when Solr outperforms MongoDB and when it doesn't.

5 min
AI Agents, Knowledge Graphs

Graphify: Temporal Knowledge Graphs for Stateful AI Agents

How I built Graphify — a tool that turns any folder of code, docs, or notes into a navigable knowledge graph — and why temporal memory is the missing piece in most AI agent architectures.

4 min
Apache Solr, High Availability

Zero-Downtime Apache Solr in Production: VIP + Keepalived Architecture

How we achieved zero-downtime patching and version upgrades across a 150+ node Solr cluster serving a national credit reporting platform under RBI regulatory constraints.

4 min