10+ posts published
Notes on AI engineering, backend systems, and building in production.
Building a RAG Pipeline from Scratch — pgvector, Embeddings, and Retrieval That Actually Works
How I built a production retrieval system using pgvector, chunking strategies, and embedding models without reaching for a framework.
gRPC vs REST for Microservices — When to Use Each and Why I Chose gRPC
A practical comparison drawn from building real microservice systems: performance, DX, and the tradeoffs nobody talks about.
How I Fixed the Binance -2022 Bug in a Live Trading Bot
The December 2025 algo order migration broke my live trading system. Here's what happened, why it happened, and how I fixed it under pressure.
Docker Compose for Microservices — Base/Override Pattern, Traefik, and Per-Service DB Isolation
How I structure Docker Compose for real microservice projects — secrets, network isolation, and Traefik routing without the mess.
pgvector vs Qdrant — Choosing a Vector Store for Production
A comparison of two vector storage approaches — pgvector for tight Postgres integration versus Qdrant for dedicated vector performance.
NestJS Microservices with RabbitMQ — Patterns That Scale
Production patterns for building NestJS microservices connected via RabbitMQ — from message routing to error handling.
Building a Backtesting Engine in TypeScript
How I built a performant backtesting engine in TypeScript — from data loading and order simulation to performance metrics.
Go vs Node.js for Backend Microservices — A Real Comparison
A practical comparison based on building production systems in both — where each shines and where each hurts.
LLM-as-a-Judge — How to Evaluate AI Outputs Without Human Labels
Using LLMs to evaluate LLM outputs — prompt design, cross-model judging, and when to trust the judge.
gRPC in Go — Connect-ES, buf, and the Polyrepo Setup
My full setup for gRPC services in Go — the toolchain, proto management, and polyrepo code generation.