AI/new-api: AI模型聚合管理中转分发系统，一个应用管理您的所有AI模型，支持将多种大模型转为统一格式调用，支持OpenAI、Claude、Gemini等格式，可供个人或者企业内部管理与分发渠道使用。 @ 0f1c4c4ebeb3fffeeaa2d60da3778c6bdff7a6f4

AI模型聚合管理中转分发系统，一个应用管理您的所有AI模型，支持将多种大模型转为统一格式调用，支持OpenAI、Claude、Gemini等格式，可供个人或者企业内部管理与分发渠道使用。

45 Vetvy

[email protected] 0f1c4c4ebe fix: Add pagination support to user search functionality		10 mesiacov pred
.github	13d1b8203c chore: update CI	10 mesiacov pred
bin	d84b0b0f5d chore: add model parameter to the time_test script (#245)	2 rokov pred
common	f451268830 feat: Update Claude relay temperature setting	10 mesiacov pred
constant	bf80d71ddf feat: Add Gemini version settings configuration support (close #568)	10 mesiacov pred
controller	1bcf7a3c39 chore: Update Azure OpenAI API version and embedding model detection	10 mesiacov pred
docs	115a181db3 feat: Add thinking-to-content conversion for stream responses	10 mesiacov pred
dto	13ab0f8e4f fix: gemini&claude tool call format #795 #766	10 mesiacov pred
middleware	069f2672c1 refactor: Enhance user context and quota management	10 mesiacov pred
model	bf80d71ddf feat: Add Gemini version settings configuration support (close #568)	10 mesiacov pred
relay	5f0b3f6d6f fix: Improve AWS Claude adaptor request conversion error handling #796	10 mesiacov pred
router	83a37e4653 feat: Add model request rate limiting functionality	10 mesiacov pred
service	13ab0f8e4f fix: gemini&claude tool call format #795 #766	10 mesiacov pred
setting	19a318c943 init openrouter adaptor	10 mesiacov pred
web	0f1c4c4ebe fix: Add pagination support to user search functionality	10 mesiacov pred
.dockerignore	006bc37231 refactor: access_token auth	11 mesiacov pred
.env.example	bf80d71ddf feat: Add Gemini version settings configuration support (close #568)	10 mesiacov pred
.gitignore	5f082d72bb update dockerignore	1 rok pred
BT.md	0dd1953cd6 Update BT.md	1 rok pred
Dockerfile	81591f20e0 refactor: Optimize Dockerfile for Go build process	10 mesiacov pred
LICENSE	fcb8506679 Update LICENSE	1 rok pred
Midjourney.md	bec18ed82d Update README.md	1 rok pred
README.en.md	e9ba392af8 feat: Add model rate limit settings in system configuration	10 mesiacov pred
README.md	bf80d71ddf feat: Add Gemini version settings configuration support (close #568)	10 mesiacov pred
Rerank.md	c44a32efe0 chore: update rerank.md	10 mesiacov pred
Suno.md	bec18ed82d Update README.md	1 rok pred
VERSION	7e80e2da3a fix: add a blank VERSION file (#135)	2 rokov pred
docker-compose.yml	3da1344897 feat: Add user notification settings with quota warning and multiple notification methods	10 mesiacov pred
go.mod	3a2e22443f chore: replace sqlite lib with prue go lib	10 mesiacov pred
go.sum	3a2e22443f chore: replace sqlite lib with prue go lib	10 mesiacov pred
main.go	5937d850d9 refactor: Replace manual goroutine creation with gopool.Go	10 mesiacov pred
makefile	6e54f01435 update makefile	1 rok pred
one-api.service	c6717307d0 chore: update one-api.service	2 rokov pred

![new-api](/web/public/logo.png) # New API 🍥 Next Generation LLM Gateway and AI Asset Management System

📝 Project Description

[!NOTE]
This is an open-source project developed based on One API

[!IMPORTANT]

Users must comply with OpenAI's Terms of Use and relevant laws and regulations. Not to be used for illegal purposes.

This project is for personal learning only. Stability is not guaranteed, and no technical support is provided.

✨ Key Features

🎨 New UI interface (some interfaces pending update)
🌍 Multi-language support (work in progress)
🎨 Added Midjourney-Proxy(Plus) interface support, Integration Guide
💰 Online recharge support, configurable in system settings:
- EasyPay
🔍 Query usage quota by key:
- Works with neko-api-key-tool
📑 Configurable items per page in pagination
🔄 Compatible with original One API database (one-api.db)
💵 Support per-request model pricing, configurable in System Settings - Operation Settings
⚖️ Support channel weighted random selection
📈 Data dashboard (console)
🔒 Configurable model access per token
🤖 Telegram authorization login support:
1. System Settings - Configure Login Registration - Allow Telegram Login
2. Send /setdomain command to @Botfather
3. Select your bot, then enter http(s)://your-website/login
4. Telegram Bot name is the bot username without @
🎵 Added Suno API interface support, Integration Guide
🔄 Support for Rerank models, compatible with Cohere and Jina, can integrate with Dify, Integration Guide
⚡ OpenAI Realtime API - Support for OpenAI's Realtime API, including Azure channels
🧠 Support for setting reasoning effort through model name suffix:
- Add suffix -high to set high reasoning effort (e.g., o3-mini-high)
- Add suffix -medium to set medium reasoning effort
- Add suffix -low to set low reasoning effort
🔄 Thinking to content option thinking_to_content in Channel->Edit->Channel Extra Settings, default is false, when true, the reasoning_content of the thinking content will be converted to <think> tags and concatenated to the content returned.
🔄 Model rate limit, support setting total request limit and successful request limit in System Settings->Rate Limit Settings

Model Support

This version additionally supports:

Third-party model gps (gpt-4-gizmo-*)
Midjourney-Proxy(Plus) interface, Integration Guide
Custom channels with full API URL support
Suno API interface, Integration Guide
Rerank models, supporting Cohere and Jina, Integration Guide
Dify

You can add custom models gpt-4-gizmo-* in channels. These are third-party models and cannot be called with official OpenAI keys.

Additional Configurations Beyond One API

GENERATE_DEFAULT_TOKEN: Generate initial token for new users, default false
STREAMING_TIMEOUT: Set streaming response timeout, default 60 seconds
DIFY_DEBUG: Output workflow and node info to client for Dify channel, default true
FORCE_STREAM_OPTION: Override client stream_options parameter, default true
GET_MEDIA_TOKEN: Calculate image tokens, default true
GET_MEDIA_TOKEN_NOT_STREAM: Calculate image tokens in non-stream mode, default true
UPDATE_TASK: Update async tasks (Midjourney, Suno), default true
GEMINI_MODEL_MAP: Specify Gemini model versions (v1/v1beta), format: "model:version", comma-separated
COHERE_SAFETY_SETTING: Cohere model safety settings, options: NONE, CONTEXTUAL, STRICT, default NONE
GEMINI_VISION_MAX_IMAGE_NUM: Gemini model maximum image number, default 16, set to -1 to disable
MAX_FILE_DOWNLOAD_MB: Maximum file download size in MB, default 20
CRYPTO_SECRET: Encryption key for encrypting database content
AZURE_DEFAULT_API_VERSION: Azure channel default API version, if not specified in channel settings, use this version, default 2024-12-01-preview
NOTIFICATION_LIMIT_DURATION_MINUTE: Duration of notification limit in minutes, default 10
NOTIFY_LIMIT_COUNT: Maximum number of user notifications in the specified duration, default 2

Deployment

[!TIP] Latest Docker image: calciumion/new-api:latest
Default account: root, password: 123456

Multi-Server Deployment

Must set SESSION_SECRET environment variable, otherwise login state will not be consistent across multiple servers.
If using a public Redis, must set CRYPTO_SECRET environment variable, otherwise Redis content will not be able to be obtained in multi-server deployment.

Requirements

Local database (default): SQLite (Docker deployment must mount /data directory)
Remote database: MySQL >= 5.7.8, PgSQL >= 9.6

Deployment with BT Panel

Install BT Panel (version 9.2.0 or above) from BT Panel Official Website, choose the stable version script to download and install.
After installation, log in to BT Panel and click Docker in the menu bar. First-time access will prompt to install Docker service. Click Install Now and follow the prompts to complete installation.
After installation, find New-API in the app store, click install, configure basic options to complete installation.
Pictorial Guide

Docker Deployment

Using Docker Compose (Recommended)

# Clone project
git clone https://github.com/Calcium-Ion/new-api.git
cd new-api
# Edit docker-compose.yml as needed
# nano docker-compose.yml
# vim docker-compose.yml
# Start
docker-compose up -d

Update Version

docker-compose pull
docker-compose up -d

Direct Docker Image Usage

# SQLite deployment:
docker run --name new-api -d --restart always -p 3000:3000 -e TZ=Asia/Shanghai -v /home/ubuntu/data/new-api:/data calciumion/new-api:latest

# MySQL deployment (add -e SQL_DSN="root:123456@tcp(localhost:3306)/oneapi"), modify database connection parameters as needed
# Example:
docker run --name new-api -d --restart always -p 3000:3000 -e SQL_DSN="root:123456@tcp(localhost:3306)/oneapi" -e TZ=Asia/Shanghai -v /home/ubuntu/data/new-api:/data calciumion/new-api:latest

Update Version

# Pull the latest image
docker pull calciumion/new-api:latest
# Stop and remove the old container
docker stop new-api
docker rm new-api
# Run the new container with the same parameters as before
docker run --name new-api -d --restart always -p 3000:3000 -e TZ=Asia/Shanghai -v /home/ubuntu/data/new-api:/data calciumion/new-api:latest

Alternatively, you can use Watchtower for automatic updates (not recommended, may cause database incompatibility):

docker run --rm -v /var/run/docker.sock:/var/run/docker.sock containrrr/watchtower -cR

Channel Retry

Channel retry is implemented, configurable in Settings->Operation Settings->General Settings. Cache recommended.
First retry uses same priority, second retry uses next priority, and so on.

Cache Configuration

REDIS_CONN_STRING: Use Redis as cache
- Example: REDIS_CONN_STRING=redis://default:redispw@localhost:49153
MEMORY_CACHE_ENABLED: Enable memory cache, default false
- Example: MEMORY_CACHE_ENABLED=true

Why Some Errors Don't Retry

Error codes 400, 504, 524 won't retry

To Enable Retry for 400

In Channel->Edit, set Status Code Override to:

{
  "400": "500"
}

Integration Guides

Related Projects

One API: Original project
Midjourney-Proxy: Midjourney interface support
chatnio: Next-gen AI B/C solution
neko-api-key-tool: Query usage quota by key

README.en.md

📝 Project Description

✨ Key Features

Model Support

Additional Configurations Beyond One API

Deployment

Multi-Server Deployment

Requirements

Deployment with BT Panel

Docker Deployment

Using Docker Compose (Recommended)

Update Version

Direct Docker Image Usage

Update Version

Channel Retry

Cache Configuration

Why Some Errors Don't Retry

To Enable Retry for 400

Integration Guides

Related Projects

🌟 Star History