Metehan Ariman.

The Blog

Writing about what works.

Engineering deep dives on Go, distributed systems, and AI tooling — tested in production, written for people who build.

Latest · 5 min read

Quantization, Explained: Why Big Models Run on Small Hardware

A 7B model takes about 14 GB at 16-bit precision. A 70B model takes about 140 GB. Quantization is the trick that brings those numbers down to something your machine can hold, and the tradeoff is smaller than most people expect.
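The arithmetic behind those figures can be sketched in a few lines. This is a rough estimate, not the article's own method: it assumes the 14 GB figure corresponds to 16 bits (2 bytes) per weight, counts 1 GB as 10^9 bytes, and ignores quantization metadata such as scales and zero points, as well as the KV cache and activations. The function name `model_size_gb` is made up for illustration.

```python
def model_size_gb(params_billions: float, bits_per_weight: float) -> float:
    """Approximate weight storage in GB (1 GB = 1e9 bytes), weights only."""
    total_bytes = params_billions * 1e9 * bits_per_weight / 8
    return total_bytes / 1e9


# Compare 16-bit weights against 8-bit and 4-bit quantized weights.
for name, params in [("7B", 7), ("70B", 70)]:
    for bits in (16, 8, 4):
        print(f"{name} @ {bits}-bit: {model_size_gb(params, bits):.1f} GB")
```

Under these assumptions, a 7B model drops from 14 GB to roughly 3.5 GB at 4 bits per weight, and a 70B model from 140 GB to roughly 35 GB. Real quantized files are somewhat larger because of the per-block metadata this sketch leaves out.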

ai, llm, local-llm, quantization, tooling
4 min

DeerFlow 2.0 Isn't a Framework — It's a Harness

ByteDance open-sourced DeerFlow 2.0 and it hit #1 on GitHub Trending in hours. The number isn't the story. The shift from framework to batteries-included harness is.

ai, agents, open-source
4 min

Agent Skills Are Becoming the Vendor's Job

1,497 agent skills now sit in one repo, and the best ones come straight from Stripe, Cloudflare, Figma, and Anthropic. Here's why vendor-written skills change how you set up an agent—and five worth installing first.

ai, agents, skills
7 min

The Real AI Shift: Agents Are Moving Inside the Tools

Palmier Pro hit 3,500 GitHub stars in 48 hours—not because it reinvented editing, but because it ships an MCP server. Here's why that matters.

ai, agents, mcp
10 min

Distributed Tracing in Go with OpenTelemetry: A Production-Ready Implementation Guide

An in-depth guide to implementing distributed tracing in Go microservices with OpenTelemetry, covering context propagation, sampling strategies, and performance optimization.

golang, distributed-systems, observability
8 min

Beyond JSON: Achieving Sub-millisecond Latency with Go, NATS, and Protobuf

Why traditional HTTP/JSON architectures fail at scale, and how a binary-first, event-driven stack delivers real-time performance.

Go, NATS, Microservices