AI Proxy
Next-generation AI gateway with OpenAI-compatible protocol
[](https://github.com/labring/aiproxy/releases)
[](https://github.com/labring/aiproxy/blob/main/LICENSE)
[](https://github.com/labring/aiproxy/blob/main/core/go.mod)
[](https://github.com/labring/aiproxy/actions)
[English](./README.md) | [įŽäŊ䏿](./README.zh.md)
---
## đ Overview
AI Proxy is a powerful, production-ready AI gateway that provides intelligent request routing, comprehensive monitoring, and seamless multi-tenant management. Built with OpenAI-compatible and Anthropic protocols, it serves as the perfect middleware for AI applications requiring reliability, scalability, and advanced features.
## ⨠Key Features
### đ **Intelligent Request Management**
- **Smart Retry Logic**: Intelligent retry strategies with automatic error recovery
- **Priority-based Channel Selection**: Route requests based on channel priority and error rates
- **Load Balancing**: Efficiently distribute traffic across multiple AI providers
- **Protocol Conversion**: Seamless Claude to OpenAI API protocol conversion
### đ **Comprehensive Monitoring & Analytics**
- **Real-time Alerts**: Proactive notifications for balance warnings, error rates, and anomalies
- **Detailed Logging**: Complete request/response tracking with audit trails
- **Advanced Analytics**: Request volume, error statistics, RPM/TPM metrics, and cost analysis
- **Channel Performance**: Error rate analysis and performance monitoring
### đĸ **Multi-tenant Architecture**
- **Organization Isolation**: Complete separation between different organizations
- **Flexible Access Control**: Token-based authentication with subnet restrictions
- **Resource Quotas**: RPM/TPM limits and usage quotas per group
- **Custom Pricing**: Per-group model pricing and billing configuration
### đ¤ **MCP (Model Context Protocol) Support**
- **Public MCP Servers**: Ready-to-use MCP integrations
- **Organization MCP Servers**: Private MCP servers for organizations
- **Embedded MCP**: Built-in MCP servers with configuration templates
- **OpenAPI to MCP**: Automatic conversion of OpenAPI specs to MCP tools
### đ **Plugin System**
- **Cache Plugin**: High-performance caching for identical requests with Redis/memory storage
- **Web Search Plugin**: Real-time web search capabilities with support for Google, Bing, and Arxiv
- **Think Split Plugin**: Support for reasoning models with content splitting, automatically handling `