Field Notes from Production · SRE Chronicles
Field Notes from Production
SRE
Chronicles
Real incidents. Real debugging. Real lessons. Each chapter is a story from the trenches - the kind of production problems that don't show up in documentation, only in 2am alerts and packet captures.
Filter by topic
Chapter 01
The 5-Second Time Bomb
Every Node.js server ships with a silent misconfiguration. Most never notice it - until nginx does.
Node.jsnginxKubernetesTCP· 8 min readApr 2025
Chapter 02
CF-RAY: -
curl from the pod worked. The app from the same pod did not. Cloudflare returned a 400 for a request it pretended never existed.
CloudflareJavaSOAPTLS· 9 min readMay 2026
Chapter 03
The Ghost Connections
A Java service was taking 21 seconds to call an API that responded in 179ms. The wire logs exonerated the connection pool, the firewall, and the network.
Apache HC5CamelKafkaJava· 10 min readMay 2026