AI/new-api: AI模型聚合管理中转分发系统，一个应用管理您的所有AI模型，支持将多种大模型转为统一格式调用，支持OpenAI、Claude、Gemini等格式，可供个人或者企业内部管理与分发渠道使用。 @ revert-26-main

AI模型聚合管理中转分发系统，一个应用管理您的所有AI模型，支持将多种大模型转为统一格式调用，支持OpenAI、Claude、Gemini等格式，可供个人或者企业内部管理与分发渠道使用。

Seefs 00ba64d837 Revert "Fork Sync: Update from parent repository"		2 meses atrás
.github	63f9d11da9 Merge pull request #20 from QuantumNous/main	2 meses atrás
bin	d84b0b0f5d chore: add model parameter to the time_test script (#245)	2 anos atrás
common	9f4a2d64a3 feat: add sora video submit task	2 meses atrás
constant	9f4a2d64a3 feat: add sora video submit task	2 meses atrás
controller	00ba64d837 Revert "Fork Sync: Update from parent repository"	2 meses atrás
docs	51d71a6e1a ✨ feat: add Spanish feature request template to GitHub issue tracker for improved feature proposal submissions	2 meses atrás
dto	e8966c7374 feat: pplx channel	2 meses atrás
electron	629a534798 chore(deps-dev): bump electron from 28.3.3 to 35.7.5 in /electron	2 meses atrás
logger	39a868faea 💱 feat(settings): introduce site-wide quota display type (USD/CNY/TOKENS/CUSTOM)	2 meses atrás
middleware	2479da4986 feat: add sora video fetch task	2 meses atrás
model	6897a9ffd8 fix: channel remark ignore issue	2 meses atrás
relay	00ba64d837 Revert "Fork Sync: Update from parent repository"	2 meses atrás
router	5a7f498629 Merge pull request #1997 from feitianbubu/pr/add-sora-fetch-task	2 meses atrás
service	76ab8a480a Merge pull request #1401 from feitianbubu/pr/add-qwen-channel-auto-disabled	2 meses atrás
setting	fe9b305232 fix: legal setting	2 meses atrás
types	74f93d41f3 feat: update Gemini API response handling to include block reason and improve error reporting	2 meses atrás
web	268757a670 Merge pull request #25 from QuantumNous/main	2 meses atrás
.dockerignore	fe9b305232 fix: legal setting	2 meses atrás
.env.example	c6cf1b98f8 feat(option): enhance UpdateOption to handle various value types and improve validation	3 meses atrás
.gitignore	fe9b305232 fix: legal setting	2 meses atrás
Dockerfile	5b5f10fe93 🔄 update: add bun.lock file copy to Dockerfile for dependency management	5 meses atrás
LICENSE	4d8189f21b ⚖️ docs(LICENSE): update license information from Apache 2.0 to New API Licensing	5 meses atrás
README.en.md	98261ec9fa chore: update README files	2 meses atrás
README.fr.md	98261ec9fa chore: update README files	2 meses atrás
README.ja.md	98261ec9fa chore: update README files	2 meses atrás
README.md	98261ec9fa chore: update README files	2 meses atrás
VERSION	7e80e2da3a fix: add a blank VERSION file (#135)	2 anos atrás
docker-compose.yml	b0b275b236 chore(docker): add comment for compatibility with older Docker versions	2 meses atrás
go.mod	60dc910a27 fix: update jwt package import to v5 across multiple files	2 meses atrás
go.sum	c4e0fc1837 chore: go version & sonic dep	2 meses atrás
main.go	8e10af82b1 fix(main): conditionally log missing .env file message based on debug mode	2 meses atrás
makefile	27bbd951f0 feat: use bun when develop locally	6 meses atrás
one-api.service	c6717307d0 chore: update one-api.service	2 anos atrás

中文 | English | Français | 日本語

[!NOTE] MT (Machine Translation): This document is machine translated. For the most accurate information, please refer to the Chinese version.

![new-api](/web/public/logo.png) # New API 🍥 Next-Generation Large Model Gateway and AI Asset Management System

📝 Project Description

[!NOTE]
This is an open-source project developed based on One API

[!IMPORTANT]

This project is for personal learning purposes only, with no guarantee of stability or technical support.

Users must comply with OpenAI's Terms of Use and applicable laws and regulations, and must not use it for illegal purposes.

According to the 《Interim Measures for the Management of Generative Artificial Intelligence Services》, please do not provide any unregistered generative AI services to the public in China.

🤝 Trusted Partners

No particular order

<img

src="./docs/images/cherry-studio.png" alt="Cherry Studio" height="120"

/> <img

src="./docs/images/pku.png" alt="Peking University" height="120"

/> <img

src="./docs/images/ucloud.png" alt="UCloud" height="120"

/> <img

src="./docs/images/aliyun.png" alt="Alibaba Cloud" height="120"

/> <img

src="./docs/images/io-net.png" alt="IO.NET" height="120"

📚 Documentation

For detailed documentation, please visit our official Wiki: https://docs.newapi.pro/

You can also access the AI-generated DeepWiki:

✨ Key Features

New API offers a wide range of features, please refer to Features Introduction for details:

🎨 Brand new UI interface
🌍 Multi-language support
💰 Online recharge functionality, currently supports EPay and Stripe
🔍 Support for querying usage quotas with keys (works with neko-api-key-tool)
🔄 Compatible with the original One API database
💵 Support for pay-per-use model pricing
⚖️ Support for weighted random channel selection
📈 Data dashboard (console)
🔒 Token grouping and model restrictions
🤖 Support for more authorization login methods (LinuxDO, Telegram, OIDC)
🔄 Support for Rerank models (Cohere and Jina), API Documentation
⚡ Support for OpenAI Realtime API (including Azure channels), API Documentation
⚡ Support for OpenAI Responses format, API Documentation
⚡ Support for Claude Messages format, API Documentation
⚡ Support for Google Gemini format, API Documentation
🧠 Support for setting reasoning effort through model name suffixes:
1. OpenAI o-series models
  - Add -high suffix for high reasoning effort (e.g.: o3-mini-high)
  - Add -medium suffix for medium reasoning effort (e.g.: o3-mini-medium)
  - Add -low suffix for low reasoning effort (e.g.: o3-mini-low)
2. Claude thinking models
  - Add -thinking suffix to enable thinking mode (e.g.: claude-3-7-sonnet-20250219-thinking)
🔄 Thinking-to-content functionality
🔄 Model rate limiting for users
🔄 Request format conversion functionality, supporting the following three format conversions:
1. OpenAI Chat Completions => Claude Messages
2. Claude Messages => OpenAI Chat Completions (can be used for Claude Code to call third-party models)
3. OpenAI Chat Completions => Gemini Chat
💰 Cache billing support, which allows billing at a set ratio when cache is hit:
1. Set the Prompt Cache Ratio option in System Settings-Operation Settings
2. Set Prompt Cache Ratio in the channel, range 0-1, e.g., setting to 0.5 means billing at 50% when cache is hit
3. Supported channels:
  - OpenAI
  - Azure
  - DeepSeek
  - Claude

Model Support

This version supports multiple models, please refer to API Documentation-Relay Interface for details:

Third-party models gpts (gpt-4-gizmo-*)
Third-party channel Midjourney-Proxy(Plus) interface, API Documentation
Third-party channel Suno API interface, API Documentation
Custom channels, supporting full call address input
Rerank models (Cohere and Jina), API Documentation
Claude Messages format, API Documentation
Google Gemini format, API Documentation
Dify, currently only supports chatflow
For more interfaces, please refer to API Documentation

Environment Variable Configuration

For detailed configuration instructions, please refer to Installation Guide-Environment Variables Configuration:

GENERATE_DEFAULT_TOKEN: Whether to generate initial tokens for newly registered users, default is false
STREAMING_TIMEOUT: Streaming response timeout, default is 300 seconds
DIFY_DEBUG: Whether to output workflow and node information for Dify channels, default is true
GET_MEDIA_TOKEN: Whether to count image tokens, default is true
GET_MEDIA_TOKEN_NOT_STREAM: Whether to count image tokens in non-streaming cases, default is true
UPDATE_TASK: Whether to update asynchronous tasks (Midjourney, Suno), default is true
GEMINI_VISION_MAX_IMAGE_NUM: Maximum number of images for Gemini models, default is 16
MAX_FILE_DOWNLOAD_MB: Maximum file download size in MB, default is 20
CRYPTO_SECRET: Encryption key used for encrypting Redis database content
AZURE_DEFAULT_API_VERSION: Azure channel default API version, default is 2025-04-01-preview
NOTIFICATION_LIMIT_DURATION_MINUTE: Notification limit duration, default is 10 minutes
NOTIFY_LIMIT_COUNT: Maximum number of user notifications within the specified duration, default is 2
ERROR_LOG_ENABLED=true: Whether to record and display error logs, default is false

Deployment

For detailed deployment guides, please refer to Installation Guide-Deployment Methods:

[!TIP] Latest Docker image: calciumion/new-api:latest

Multi-machine Deployment Considerations

Environment variable SESSION_SECRET must be set, otherwise login status will be inconsistent across multiple machines
If sharing Redis, CRYPTO_SECRET must be set, otherwise Redis content cannot be accessed across multiple machines

Deployment Requirements

Local database (default): SQLite (Docker deployment must mount the /data directory)
Remote database: MySQL version >= 5.7.8, PgSQL version >= 9.6

Deployment Methods

Using BaoTa Panel Docker Feature

Install BaoTa Panel (version 9.2.0 or above), find New-API in the application store and install it. Tutorial with images

Using Docker Compose (Recommended)

# Download the project
git clone https://github.com/Calcium-Ion/new-api.git
cd new-api
# Edit docker-compose.yml as needed
# Start
docker-compose up -d

Using Docker Image Directly

# Using SQLite
docker run --name new-api -d --restart always -p 3000:3000 -e TZ=Asia/Shanghai -v /home/ubuntu/data/new-api:/data calciumion/new-api:latest

# Using MySQL
docker run --name new-api -d --restart always -p 3000:3000 -e SQL_DSN="root:123456@tcp(localhost:3306)/oneapi" -e TZ=Asia/Shanghai -v /home/ubuntu/data/new-api:/data calciumion/new-api:latest

Channel Retry and Cache

Channel retry functionality has been implemented, you can set the number of retries in Settings->Operation Settings->General Settings->Failure Retry Count, recommended to enable caching functionality.

Cache Configuration Method

REDIS_CONN_STRING: Set Redis as cache
MEMORY_CACHE_ENABLED: Enable memory cache (no need to set manually if Redis is set)

API Documentation

For detailed API documentation, please refer to API Documentation:

Related Projects

One API: Original project
Midjourney-Proxy: Midjourney interface support
neko-api-key-tool: Query usage quota with key

Other projects based on New API:

new-api-horizon: High-performance optimized version of New API

Help and Support

If you have any questions, please refer to Help and Support:

README.en.md

📝 Project Description

🤝 Trusted Partners

📚 Documentation

✨ Key Features

Model Support

Environment Variable Configuration

Deployment

Multi-machine Deployment Considerations

Deployment Requirements

Deployment Methods

Using BaoTa Panel Docker Feature

Using Docker Compose (Recommended)

Using Docker Image Directly

Channel Retry and Cache

Cache Configuration Method

API Documentation

Related Projects

Help and Support

🌟 Star History