Agent Almanac

Forthcoming · 2026

Recurring publication of capability, performance, and cost benchmarks of agents on real-world task suites. Frontier APIs and small open-weights in the same harness.

Awesome-LLM-Prod

Active

Curated list of production-grade LLM tooling, infrastructure, and operational patterns. Continuous PRs.