Seefs
b1842b908e
fix: /v1/responses/compact default billing
2026-01-28 14:22:34 +08:00
Seefs
cc1da72d10
feat: openai response /v1/response/compact (#2644)
...
* feat: openai response /v1/response/compact
* feat: /v1/response/compact bill
* feat: /v1/response/compact
* feat: /v1/responses/compact -> codex channel
* feat: /v1/responses/compact -> codex channel
* feat: /v1/responses/compact -> codex channel
* feat: codex channel default models
* feat: compact model price
* feat: /v1/responses/compact test
2026-01-26 20:20:16 +08:00
CaIon
d06915c30d
feat(ratio): add functions to check for audio ratios and clean up unused code
2025-12-31 21:29:10 +08:00
CaIon
b1bb64ae11
feat(model): add audio ratios for new TTS models and adjust default values
2025-12-31 21:22:33 +08:00
Seefs
559da6362a
fix: revert model ratio
2025-12-25 15:37:54 +08:00
CaIon
689c43143b
feat(model_ratio): add default ratios for gpt-4o-mini-tts
2025-12-13 19:14:27 +08:00
CaIon
e36e2e1b69
feat(audio): enhance audio request handling with token type detection and streaming support
2025-12-13 17:24:23 +08:00
FlowerRealm
c3c119a9b4
feat: add claude-haiku-4-5-20251001 model support
...
- Add model to Claude ModelList
- Add model ratio (0.5, $1/1M input tokens)
- Add completion ratio support (5x, $5/1M output tokens)
- Add cache read ratio (0.1, $0.10/1M tokens)
- Add cache write ratio (1.25, $1.25/1M tokens)
Model specs:
- Context window: 200K tokens
- Max output: 64K tokens
- Release date: October 1, 2025
2025-12-05 18:54:20 +08:00
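The ratio figures in the claude-haiku-4-5-20251001 commit above can be checked with a little arithmetic. The following is a minimal sketch, assuming that a model ratio of 1 corresponds to $2 per 1M tokens (inferred from the commit's "0.5, $1/1M input tokens") and that the completion and cache ratios are multipliers on the input price. The `ModelPricing` type and its methods are hypothetical names for illustration, not the repository's API.

```go
package main

import "fmt"

// usdPerMillionAtRatioOne is an assumption inferred from the commit message:
// a model ratio of 0.5 is described there as $1 per 1M input tokens.
const usdPerMillionAtRatioOne = 2.0

// ModelPricing is a hypothetical container for the ratios listed in the commit.
type ModelPricing struct {
	ModelRatio      float64 // base (input) ratio
	CompletionRatio float64 // multiplier applied to output tokens
	CacheReadRatio  float64 // multiplier applied to cache-read tokens
	CacheWriteRatio float64 // multiplier applied to cache-write tokens
}

// InputUSDPerMillion converts the model ratio to USD per 1M input tokens.
func (p ModelPricing) InputUSDPerMillion() float64 {
	return p.ModelRatio * usdPerMillionAtRatioOne
}

// OutputUSDPerMillion scales the input price by the completion ratio.
func (p ModelPricing) OutputUSDPerMillion() float64 {
	return p.InputUSDPerMillion() * p.CompletionRatio
}

// CacheReadUSDPerMillion scales the input price by the cache read ratio.
func (p ModelPricing) CacheReadUSDPerMillion() float64 {
	return p.InputUSDPerMillion() * p.CacheReadRatio
}

// CacheWriteUSDPerMillion scales the input price by the cache write ratio.
func (p ModelPricing) CacheWriteUSDPerMillion() float64 {
	return p.InputUSDPerMillion() * p.CacheWriteRatio
}

// HaikuPricing returns the ratios quoted in the commit message.
func HaikuPricing() ModelPricing {
	return ModelPricing{
		ModelRatio:      0.5,
		CompletionRatio: 5,
		CacheReadRatio:  0.1,
		CacheWriteRatio: 1.25,
	}
}

func main() {
	p := HaikuPricing()
	fmt.Printf("input:       $%.2f / 1M tokens\n", p.InputUSDPerMillion())
	fmt.Printf("output:      $%.2f / 1M tokens\n", p.OutputUSDPerMillion())
	fmt.Printf("cache read:  $%.2f / 1M tokens\n", p.CacheReadUSDPerMillion())
	fmt.Printf("cache write: $%.2f / 1M tokens\n", p.CacheWriteUSDPerMillion())
}
```

Under these assumptions the four derived prices match the dollar amounts quoted in the commit message, which suggests the cache and completion ratios are relative to the input price rather than absolute.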
Seefs
b6a02d8303
feat: gemini 3 thinking level gemini-3-pro-preview-high
2025-12-01 16:40:46 +08:00
Papersnake
5931d333cb
feat: add claude-opus-4-5-20251101 ratio
2025-11-25 10:49:34 +08:00
CaIon
84745d5ca4
feat: Add ContextKeyLocalCountTokens and update ResponseText2Usage to use context in multiple channels
2025-11-21 18:17:01 +08:00
Xyfacai
06cd774c10
fix: models without a configured price are not fetched unless self-use mode is enabled
2025-11-13 18:44:18 +08:00
Sh1n3zZ
d0c45a01fa
feat: replicate channel flux model
2025-11-08 01:24:45 +08:00
Seefs
e1c7a4f41f
format: package name -> github.com/QuantumNous/new-api (#2017)
2025-10-11 15:30:09 +08:00
Xyfacai
a54baf4998
feat: add parameter validation and billing for sora
2025-10-10 23:56:36 +08:00
creamlike1024
70263e96ab
feat: support gemini-robotics-er-1.5-preview
2025-10-01 17:33:54 +08:00
Seefs
31544405f4
Merge pull request #1924 from prnake/claude-4-5
...
feat: support claude-sonnet-4-5-20250929
2025-09-30 09:26:34 +08:00
papersnake
d7db30a23e
feat: support claude-sonnet-4-5-20250929
2025-09-30 09:14:12 +08:00
dd
8c5b6654cb
Merge branch 'QuantumNous:main' into main
2025-09-29 19:15:43 +08:00
RedwindA
ca0eaa7697
unlock deepseek completion ratio; allow the deepseek channel to fetch models
2025-09-29 18:32:44 +08:00
dd
4c2979bb67
Merge branch 'QuantumNous:main' into main
2025-09-29 14:13:50 +08:00
CaIon
b5d3e87ea2
Merge branch 'alpha'
2025-09-19 14:20:15 +08:00
creamlike1024
9f1ab16aa5
feat: support gemini-embedding-001
2025-09-19 00:24:01 +08:00
DD
cab562276d
Merge branch 'main' of github.com:danding5/new-api
...
# Conflicts:
# relay/relay_adaptor.go
2025-09-15 14:31:06 +08:00
CaIon
6451158680
Revert "feat: gemini-2.5-flash-image-preview text and image output billing"
...
This reverts commit e732c58426.
2025-09-13 12:53:28 +08:00
DD
a12ed5709e
merge
2025-09-10 19:11:58 +08:00
DD
42d29756a0
Merge branches 'main' and 'main' of github.com:danding5/new-api
...
# Conflicts:
# common/api_type.go
# constant/api_type.go
# constant/channel.go
# relay/relay_adaptor.go
# web/src/constants/channel.constants.js
2025-09-10 18:33:42 +08:00
DD
511489db09
add submodel.ai
2025-09-08 16:21:21 +08:00
creamlike1024
d15718a87e
feat: improve ratio update
2025-08-30 23:53:46 +08:00
creamlike1024
da5aace109
feat: configuration for image ratio, audio ratio, and audio completion ratio
2025-08-30 23:28:09 +08:00
creamlike1024
e732c58426
feat: gemini-2.5-flash-image-preview text and image output billing
2025-08-27 21:30:52 +08:00
CaIon
64a6168092
fix(model_ratio): update return value logic for gemini-2.5-flash-lite
2025-08-26 23:01:00 +08:00
CaIon
c9f5b1de1a
fix: improve model ratio handling for reserved models in getHardcodedCompletionModelRatio
2025-08-25 11:59:55 +08:00
CaIon
bef59929db
fix: update model ratio handling for GPT versions
2025-08-25 11:52:45 +08:00
wzxjohn
0edc707657
feat(ratio): add ratio for OpenAI models
2025-08-15 17:12:39 +08:00
CaIon
e2037ad756
refactor: Introduce pre-consume quota and unify relay handlers
...
This commit introduces a major architectural refactoring to improve quota management, centralize logging, and streamline the relay handling logic.
Key changes:
- **Pre-consume Quota:** Implements a new mechanism to check and reserve user quota *before* making the request to the upstream provider. This ensures more accurate quota deduction and prevents users from exceeding their limits due to concurrent requests.
- **Unified Relay Handlers:** Refactors the relay logic to use generic handlers (e.g., `ChatHandler`, `ImageHandler`) instead of provider-specific implementations. This significantly reduces code duplication and simplifies adding new channels.
- **Centralized Logger:** A new dedicated `logger` package is introduced, and all system logging calls are migrated to use it, moving this responsibility out of the `common` package.
- **Code Reorganization:** DTOs are generalized (e.g., `dalle.go` -> `openai_image.go`) and utility code is moved to more appropriate packages (e.g., `common/http.go` -> `service/http.go`) for better code structure.
2025-08-14 20:05:06 +08:00
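The pre-consume quota idea described in the commit above (reserve quota before calling the upstream provider, then reconcile afterwards) can be sketched as follows. This is an illustrative sketch only: `QuotaAccount`, `PreConsume`, and `Settle` are hypothetical names, and the repository's actual implementation may differ substantially, for example by persisting quota in a database rather than in memory.

```go
package main

import (
	"errors"
	"fmt"
	"sync"
)

// ErrInsufficientQuota is returned when a reservation would exceed the balance.
var ErrInsufficientQuota = errors.New("insufficient quota")

// QuotaAccount is a hypothetical in-memory quota balance guarded by a mutex,
// so that concurrent requests cannot both reserve the same remaining quota.
type QuotaAccount struct {
	mu        sync.Mutex
	remaining int
}

// NewQuotaAccount creates an account with the given starting quota.
func NewQuotaAccount(initial int) *QuotaAccount {
	return &QuotaAccount{remaining: initial}
}

// PreConsume reserves an estimated amount before the upstream request is sent.
// It fails without changing the balance if the estimate exceeds what is left.
func (a *QuotaAccount) PreConsume(estimate int) error {
	a.mu.Lock()
	defer a.mu.Unlock()
	if estimate > a.remaining {
		return ErrInsufficientQuota
	}
	a.remaining -= estimate
	return nil
}

// Settle reconciles a reservation with the actual usage once the response is
// known: unused quota is refunded, and any overage is deducted.
func (a *QuotaAccount) Settle(reserved, actual int) {
	a.mu.Lock()
	defer a.mu.Unlock()
	a.remaining += reserved - actual
}

// Remaining reports the current balance.
func (a *QuotaAccount) Remaining() int {
	a.mu.Lock()
	defer a.mu.Unlock()
	return a.remaining
}

func main() {
	acct := NewQuotaAccount(100)

	// Reserve before the (simulated) upstream call.
	if err := acct.PreConsume(60); err != nil {
		fmt.Println("request rejected:", err)
		return
	}

	// A second concurrent request cannot reserve more than what is left.
	if err := acct.PreConsume(60); err != nil {
		fmt.Println("second request rejected:", err)
	}

	// The first request actually used less than estimated; refund the rest.
	acct.Settle(60, 45)
	fmt.Println("remaining quota:", acct.Remaining())
}
```

Reserving up front is what prevents two concurrent requests from each passing a balance check and then jointly overspending; the settle step keeps the final deduction equal to actual usage.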
CaIon
b843bb8286
feat: add support for gpt-5 models and adjust temperature settings
...
- Updated the model list to include various gpt-5 variants.
- Enhanced the ConvertOpenAIRequest function to handle gpt-5 model temperature settings based on specific model prefixes.
- Adjusted default cache and model ratios for new gpt-5 models.
2025-08-08 10:43:07 +08:00
Calcium-Ion
77975529fe
Merge pull request #1525 from HynoR/chore/gpt5
...
feat: sync gpt-5 model ratio and support new reasoning effort
2025-08-08 10:24:13 +08:00
HynoR
4912b1e632
feat: sync gpt-5 model ratio and support new reasoning effort
2025-08-08 09:11:28 +08:00
RedwindA
0257918571
feat: add default model ratio for gemini-2.5-flash-lite-preview-thinking model
2025-08-07 21:39:11 +08:00
RedwindA
1d4e746c4f
feat: update FormatMatchingModelName to handle gemini-2.5-flash-lite model prefix
2025-08-07 21:37:08 +08:00
Xyfacai
0c0caad827
refactor: adjust model matching
2025-08-06 20:09:22 +08:00
neotf
24aa29598a
feat: add support for claude-opus-4-1 model and update ratios
2025-08-06 00:58:46 +08:00
Xyfacai
7aa2972c3f
fix(price): when no price is set, the matched model-price name was returned incorrectly
2025-07-20 10:12:36 +08:00
Calcium-Ion
5ec421d8e6
Merge pull request #1321 from iszcz/main
...
Support Midjourney video tasks and image editing
2025-07-05 15:28:33 +08:00
iszcz
660180ea1b
Support Midjourney video tasks and image editing
2025-06-30 22:31:12 +08:00
CaIon
bfb6fbbac9
🐛 fix: update hardcoded completion model ratio for gemini-2.5-flash-lite
2025-06-27 22:36:23 +08:00
CaIon
3f53e4f53e
🔧 fix(model_ratio): adjust return values for gemini-2.5-pro and gemini-2.5-flash models
2025-06-24 18:08:42 +08:00
Calcium-Ion
4ebf9e35e1
Merge pull request #1248 from RedwindA/update-gemini-ratio
...
feat(model-ratio): add default ratios for new Gemini models and refine flash model handling
2025-06-21 04:51:41 +08:00
RedwindA
c192d07a04
fix gizmo completion ratio
2025-06-19 20:16:04 +08:00