<p><picture><img src="https://github.com/user-attachments/assets/47d67430-386d-4675-82ad-d4734d3262d9" alt="TensorZero Logo" width="128" height="128"></picture></p>
# TensorZero
<p><picture><img src="https://www.tensorzero.com/github-trending-badge.svg" alt="#1 Repository Of The Day"></picture></p>
**TensorZero is an open-source stack for _industrial-grade LLM applications_:**
- **Gateway:** access every LLM provider through a unified API, built for performance (<1ms p99 latency)
- **Observability:** store inferences and feedback in your database, available programmatically or in the UI
- **Optimization:** collect metrics and human feedback to optimize prompts, models, and inference strategies
- **Evaluation:** benchmark individual inferences or end-to-end workflows using heuristics, LLM judges, etc.
- **Experimentation:** ship with confidence with built-in A/B testing, routing, fallbacks, retries, etc.
Take what you need, adopt incrementally, and complement with other tools.
<video src="https://github.com/user-attachments/assets/04a8466e-27d8-4189-b305-e7cecb6881ee"></video>
---
<p align="center">
<b><a href="https://www.tensorzero.com/" target="_blank">Website</a></b>
·
<b><a href="https://www.tensorzero.com/docs" target="_blank">Docs</a></b>
·
<b><a href="https://www.x.com/tensorzero" target="_blank">Twitter</a></b>
·
<b><a href="https://www.tensorzero.com/slack" target="_blank">Slack</a></b>
·
<b><a href="https://www.tensorzero.com/discord" target="_blank">Discord</a></b>
<br>
<br>
<b><a href="https://www.tensorzero.com/docs/quickstart" target="_blank">Quick Start (5min)</a></b>
·
<b><a href="https://www.tensorzero.com/docs/gateway/deployment" target="_blank">Deployment Guide</a></b>
·
<b><a href="https://www.tensorzero.com/docs/gateway/api-reference" target="_blank">API Reference</a></b>
·
<b><a href="https://www.tensorzero.com/docs/gateway/configuration-reference" target="_blank">Configuration Reference</a></b>
</p>
---
> [!NOTE]
>
> ### **Coming Soon: TensorZero Autopilot**
>
> TensorZero Autopilot is an **automated AI engineer** (powered by the TensorZero Stack) that analyzes LLM observability data, optimizes prompts and models, sets up evals, and runs A/B tests.
>
> **Learn more** · **Join the waitlist**
## Features
### 🌐 LLM Gateway
**Integrate with TensorZero once and access every major LLM provider.**
- [x] **Call any LLM** (API or self-hosted) through a single unified API
- [x] Infer with **streaming**, **tool use**, **structured outputs (JSON)**, **batch**, **embeddings**, **multimodal (images, files)**, **caching**, etc.
- [x] **Create prompt templates and schemas** to enforce a consistent, typed interface between your application and the LLMs
- [x] Satisfy extreme throughput and latency needs, thanks to 🦀 Rust: **<1ms p99 latency overhead at 10k+ QPS**
- [x] Use any programming language: **integrate via our Python SDK, any OpenAI SDK, or our HTTP API**
- [x] **Ensure high availability** with routing, retries, fallbacks, load balancing, granular timeouts, etc.
- [x] **Enforce custom rate limits** with granular scopes (e.g. user-defined tags) to keep usage under control
- [x] **Set up auth for TensorZero** to allow clients to access models without sharing provider API keys
- [ ] Soon: spend tracking and budgeting
<br>
**Supported Model Providers:**
**Anthropic**,
**AWS Bedrock**,
**AWS SageMaker**,
**Azure**,
**DeepSeek**,
**Fireworks**,
**GCP Vertex AI Anthropic**,
**GCP Vertex AI Gemini**,
**Google AI Studio (Gemini API)**,
**Groq**,
**Hyperbolic**,
**Mistral**,
**OpenAI**,
**OpenRouter**,
**SGLang**,
**TGI**,
**Together AI**,
**vLLM**, and
**xAI (Grok)**.
Need something else? TensorZero also supports **any OpenAI-compatible API (e.g. Ollama)**.
<br>
<details open>
<summary><b>Usage: Python — TensorZero SDK</b></summary>
You can access any provider using the TensorZero Python SDK.
```bash
pip install tensorzero
```
Optional: Set up the TensorZero configuration.
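A minimal `tensorzero.toml` sketch, modeled on the Quick Start; the `generate_haiku` function name and the model choice are illustrative assumptions, not fixed by this README:

```toml
# tensorzero.toml — illustrative configuration sketch.
# Declares one chat function with a single variant backed by an OpenAI model.
[functions.generate_haiku]
type = "chat"

[functions.generate_haiku.variants.gpt_4o_mini]
type = "chat_completion"
model = "openai::gpt-4o-mini"
```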
Run inference:
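A guarded sketch of an inference call with the TensorZero Python SDK. The `generate_haiku` function name and the `TENSORZERO_GATEWAY_URL` environment variable are assumptions for illustration; the network call only fires when a gateway is actually running:

```python
import os

# Chat-style input payload (OpenAI-like message format).
payload = {
    "messages": [
        {"role": "user", "content": "Write a haiku about artificial intelligence."}
    ]
}

# Assumption: TENSORZERO_GATEWAY_URL points at a running gateway,
# e.g. http://localhost:3000. The call is skipped otherwise.
gateway_url = os.environ.get("TENSORZERO_GATEWAY_URL")
if gateway_url:
    from tensorzero import TensorZeroGateway

    with TensorZeroGateway.build_http(gateway_url=gateway_url) as client:
        response = client.inference(
            function_name="generate_haiku",  # must be defined in tensorzero.toml
            input=payload,
        )
        print(response)
```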
See **Quick Start** for more information.
</details>
<details>
<summary><b>Usage: Python — OpenAI SDK</b></summary>
You can access any provider using the OpenAI Python SDK with TensorZero.
```bash
pip install tensorzero
```
Optional: Set up the TensorZero configuration.
Run inference:
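A guarded sketch using the OpenAI Python SDK pointed at the gateway's OpenAI-compatible endpoint. The `generate_haiku` function name and the `TENSORZERO_GATEWAY_URL` environment variable are assumptions:

```python
import os

# TensorZero routes OpenAI-SDK calls via special model strings:
# "tensorzero::function_name::<function>" (assumed function below) or
# "tensorzero::model_name::<provider>::<model>".
model = "tensorzero::function_name::generate_haiku"
messages = [
    {"role": "user", "content": "Write a haiku about artificial intelligence."}
]

# Only contact the gateway when one is actually running.
base_url = os.environ.get("TENSORZERO_GATEWAY_URL")
if base_url:
    from openai import OpenAI

    # Provider API keys live on the gateway, so the client-side key is a dummy.
    client = OpenAI(base_url=f"{base_url}/openai/v1", api_key="not-used")
    response = client.chat.completions.create(model=model, messages=messages)
    print(response.choices[0].message.content)
```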
See **Quick Start** for more information.
</details>
<details>
<summary><b>Usage: JavaScript / TypeScript (Node) — OpenAI SDK</b></summary>
You can access any provider using the OpenAI Node SDK with TensorZero.
Deploy `tensorzero/gateway` using Docker.
**Detailed instructions →**
Set up the TensorZero configuration.
Run inference:
See **Quick Start** for more information.
</details>
<details>
<summary><b>Usage: Other Languages & Platforms — HTTP API</b></summary>
TensorZero supports virtually any programming language or platform via its HTTP API.
Deploy `tensorzero/gateway` using Docker.
**Detailed instructions →**
Optional: Set up the TensorZero configuration.
Run inference:
See **Quick Start** for more information.
</details>
### 🔍 LLM Observability
**Zoom in to debug individual API calls, or zoom out to monitor metrics across models and prompts over time — all using the open-source TensorZero UI.**
- [x] Store inferences and **feedback (metrics, human edits, etc.)** in your own database
- [x] Dive into individual inferences or high-level aggregate patterns using the TensorZero UI or programmatically
- [x] **Build datasets** for optimization, evaluation, and other workflows
- [x] Replay historical inferences with new prompts, models, inference strategies, etc.
- [x] **Export OpenTelemetry traces (OTLP)** and **Prometheus metrics** to your favorite application observability tools
- [ ] Soon: AI-assisted debugging and root cause analysis; AI-assisted data labeling
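Feedback is attached to an inference by ID. A guarded sketch with the Python SDK; the `haiku_rating` metric, the `generate_haiku` function, and the `TENSORZERO_GATEWAY_URL` variable are illustrative assumptions (metrics are declared in `tensorzero.toml`):

```python
import os

# Assumed metric, e.g. declared as [metrics.haiku_rating] with type = "boolean".
metric_name = "haiku_rating"

gateway_url = os.environ.get("TENSORZERO_GATEWAY_URL")
if gateway_url:
    from tensorzero import TensorZeroGateway

    with TensorZeroGateway.build_http(gateway_url=gateway_url) as client:
        result = client.inference(
            function_name="generate_haiku",
            input={"messages": [{"role": "user", "content": "Write a haiku."}]},
        )
        # Store feedback alongside the inference in your database; it becomes
        # signal for later optimization and evaluation runs.
        client.feedback(
            metric_name=metric_name,
            inference_id=result.inference_id,
            value=True,
        )
```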
<table>
<tr></tr> <!-- flip highlight order -->
<tr>
<td width="50%" align="center" valign="middle"><b>Observability » UI</b></td>
<td width="50%" align="center" valign="middle"><b>Observability » Programmatic</b></td>
</tr>
<tr>
<td width="50%" align="center" valign="middle"><video s…