pi-autoresearch optimization loops

The autoresearch technique promoted by Andrej Karpathy is easily accessible in the pi coding agent.

March 21, 2026 · 12 min

Cupcake: policy enforcement for AI coding agents

Cupcake enables policy enforcement for AI coding agents like Claude Code, Cursor and OpenCode

March 14, 2026 · 6 min

The Hunter, Skeptic, Arbiter trio as an example of debate-style prompting

This post explores Hunter, Skeptic, Arbiter prompting for finding vulnerabilities in code as an example of debate-style prompting.

March 8, 2026 · 13 min

MCP Sampling

This post explores MCP sampling and illustrates how important guardrails are to protect clients against malicious servers.

February 19, 2026 · 5 min

MCP servers: what happens behind the scenes?

The codebases of MCP servers tend to be early-stage API wrappers, far away in quality from existing SDKs. Non-determinism of LLMs adds…

April 6, 2025 · 12 min

OpenTelemetry meets OpenAI: manual instrumentation

The posts explores three approaches to manual OpenTelemetry instrumentation for OpenAI calls in Langchain and LlamaIndex.

May 4, 2023 · 6 min

OpenTelemetry meets OpenAI

Using automatic instrumentation to quickly assess which APIs are called by popular Python LLM demo apps.

April 23, 2023 · 5 min

Emergency Procedures in SRE

Emergency procedures aim at stabilizing the system in a degraded state. When used properly, they result in faster incident response and …

January 24, 2023 · 5 min

Building and shipping in iterations

This posts showcases strategies for shipping software in iterations, inspired by TDD, Coding Katas, and Testing in Production.

November 2, 2022 · 11 min

Accepting and supporting changes

Explores how individuals can react to changes and focus their efforts to support the change to foster their own development.

July 10, 2022 · 6 min