OpenTelemetry meets OpenAI: manual instrumentation

This post explores three approaches to manual OpenTelemetry instrumentation for OpenAI calls in LangChain and LlamaIndex.

May 4, 2023 · 6 min

OpenTelemetry meets OpenAI

Using automatic instrumentation to quickly assess which APIs are called by popular Python LLM demo apps.

April 23, 2023 · 5 min

Emergency Procedures in SRE

Emergency procedures aim to stabilize the system in a degraded state. When used properly, they result in faster incident response and …

January 24, 2023 · 5 min

Most common design issues found during Production Readiness and Post-Incident Reviews

Operating software in production offers great insights into software quality. Learning from production incidents is key to improving…

May 24, 2020 · 9 min