OpenTelemetry for AI Systems — Tracing LLM Calls, Token Usage, and Agent Loops
Trace LLM inference with OpenTelemetry semantic conventions. Monitor token counts, latency, agent loops, and RAG pipeline steps with structured observability.
Deploy OpenTelemetry with auto-instrumentation, custom spans, metrics, and the Collector pipeline. Export to Jaeger, Tempo, or Datadog.
Complete OpenTelemetry setup for Node.js, auto-instrumentation, custom spans, trace propagation, OTLP export to Tempo/Jaeger, sampling strategies, and production alerting.
Eliminate dual-write problems with the outbox pattern. Learn polling publishers, CDC with Debezium, and building reliable event-driven systems.
A junior engineer with access to production and insufficient guardrails runs a database migration directly on prod. Or force-pushes to main. Or deletes an S3 bucket thinking it was the staging one. The fix isn't surveillance; it's systems that make the catastrophic mistake require extra steps.