🔓 Open Source Rust 🌍 Global free

About

TensorZero is an open-source LLMOps platform that combines an LLM gateway, observability, evaluation, optimization, and experimentation in one stack. The gateway exposes a single API to all major LLM providers with under 1 ms of p99 latency overhead, and supports multimodal inference, caching, and prompt templating. The platform stores inference data, monitors metrics, automates prompt and model optimization, and runs A/B tests, all available through its UI and programmatic interfaces.

Features

  • Unified LLM Gateway: High-performance, low-latency, multi-provider support with advanced inference features
  • Comprehensive LLM Observability: Data storage, metric monitoring, OpenTelemetry/Prometheus export
  • Intelligent LLM Evaluation: Heuristic and LLM-judge evaluators for both individual inferences and end-to-end workflows
  • Automated Optimization: Leveraging production feedback to optimize prompts, models, and inference strategies (e.g., GEPA, SFT)
  • Robust Experimentation: Built-in A/B testing, routing, fallbacks, and retries for confident releases
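
To make the routing, fallback, and retry ideas in the list above concrete, here is a minimal, generic sketch in Python. It is not TensorZero's API: the function names `pick_variant` and `call_with_fallbacks`, the retry count, and the provider callables are all hypothetical and chosen only for illustration.

```python
import random
from typing import Callable, Dict, Optional, Sequence


def pick_variant(weights: Dict[str, float], rng: random.Random) -> str:
    """Sample one variant name in proportion to its weight (a simple A/B split)."""
    names = list(weights)
    return rng.choices(names, weights=[weights[n] for n in names], k=1)[0]


def call_with_fallbacks(
    providers: Sequence[Callable[[str], str]],
    prompt: str,
    retries: int = 2,
) -> str:
    """Try each provider in order, attempting each up to `retries` times.

    Hypothetical helper: a real gateway would also apply timeouts and backoff.
    """
    last_error: Optional[Exception] = None
    for provider in providers:
        for _ in range(retries):
            try:
                return provider(prompt)
            except Exception as err:  # any provider failure triggers a retry or fallback
                last_error = err
    raise RuntimeError("all providers failed") from last_error
```

In this sketch, a caller would first use `pick_variant` to choose which prompt or model variant serves a request, then pass an ordered list of provider functions to `call_with_fallbacks` so that a failing provider is retried and then skipped in favor of the next one.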

Supported Platforms

web