The Blog
Writing about what works.
Engineering deep dives on Go, distributed systems, and AI tooling — tested in production, written for people who build.
Quantization, Explained: Why Big Models Run on Small Hardware
A 7B model is 14 GB in full precision. A 70B is 140 GB. Quantization is the trick that brings those numbers down to something your machine can hold — and the tradeoff is smaller than most people expect.
DeerFlow 2.0 Isn't a Framework — It's a Harness
ByteDance open-sourced DeerFlow 2.0 and it hit #1 on GitHub Trending in hours. The number isn't the story. The shift from framework to batteries-included harness is.
Agent Skills Are Becoming the Vendor's Job
1,497 agent skills now sit in one repo, and the best ones come straight from Stripe, Cloudflare, Figma, and Anthropic. Here's why vendor-written skills change how you set up an agent—and five worth installing first.
The Real AI Shift: Agents Are Moving Inside the Tools
Palmier Pro hit 3,500 GitHub stars in 48 hours—not because it reinvented editing, but because it ships an MCP server. Here's why that matters.
Distributed Tracing in Go with OpenTelemetry: A Production-Ready Implementation Guide
An in-depth exploration of implementing distributed tracing in Go microservices using OpenTelemetry, covering propagation context, sampling strategies, and performance optimization.
Beyond JSON: Achieving Sub-millisecond Latency with Go, NATS, and Protobuf
Why traditional HTTP/JSON architectures fail at scale, and how a binary-first, event-driven stack delivers real-time performance.