The Blog

Writing about what works.

Engineering deep dives on Go, distributed systems, and AI tooling — tested in production, written for people who build.

Latest Jun 29, 2026 5 min read

Quantization, Explained: Why Big Models Run on Small Hardware

A 7B model is about 14 GB at 16-bit precision. A 70B is about 140 GB. Quantization is the trick that brings those numbers down to something your machine can hold, and the tradeoff is smaller than most people expect.
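The 14 GB and 140 GB figures correspond to 2 bytes per parameter (16-bit weights). A rough sketch of that arithmetic, assuming 1 GB = 10^9 bytes and counting weights only; the function name `model_size_gb` is hypothetical, and real quantized files run somewhat larger because of scale metadata, with runtime memory (such as the KV cache) on top:

```python
def model_size_gb(params_billions: float, bits_per_weight: float) -> float:
    """Approximate weight storage in GB (1 GB = 1e9 bytes).

    Counts weights only; ignores quantization metadata and runtime memory.
    """
    # billions of params * bits per weight / 8 bits per byte = billions of bytes = GB
    return params_billions * bits_per_weight / 8


if __name__ == "__main__":
    for name, params in [("7B", 7), ("70B", 70)]:
        for bits in (16, 8, 4):
            print(f"{name} at {bits}-bit: {model_size_gb(params, bits):.1f} GB")
```

Under these assumptions, 4-bit weights bring the 7B model to roughly 3.5 GB and the 70B model to roughly 35 GB.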

ai, llm, local-llm, quantization, tooling


Jun 28, 2026 4 min

DeerFlow 2.0 Isn't a Framework — It's a Harness

ByteDance open-sourced DeerFlow 2.0 and it hit #1 on GitHub Trending in hours. The number isn't the story. The shift from framework to batteries-included harness is.

ai, agents, open-source

Jun 28, 2026 4 min

Agent Skills Are Becoming the Vendor's Job

1,497 agent skills now sit in one repo, and the best ones come straight from Stripe, Cloudflare, Figma, and Anthropic. Here's why vendor-written skills change how you set up an agent—and five worth installing first.

ai, agents, skills

Jun 27, 2026 7 min

The Real AI Shift: Agents Are Moving Inside the Tools

Palmier Pro hit 3,500 GitHub stars in 48 hours—not because it reinvented editing, but because it ships an MCP server. Here's why that matters.

ai, agents, mcp

Jan 12, 2026 10 min

Distributed Tracing in Go with OpenTelemetry: A Production-Ready Implementation Guide

An in-depth guide to implementing distributed tracing in Go microservices using OpenTelemetry, covering context propagation, sampling strategies, and performance optimization.

golang, distributed-systems, observability

Jan 4, 2026 8 min

Beyond JSON: Achieving Sub-millisecond Latency with Go, NATS, and Protobuf

Why traditional HTTP/JSON architectures fail at scale, and how a binary-first, event-driven stack delivers real-time performance.

Go, NATS, Microservices