sub2api

Author	SHA1	Message	Date
erio	1af06aed96	feat: shuffle accounts within same sort group to prevent thundering herd Add post-sort shuffle for accounts with identical (priority, loadRate, lastUsedAt) to break deterministic ordering when concurrent requests read the same scheduler snapshot. Applies to both Antigravity and OpenAI scheduling paths, plus the sortAccountsByPriorityAndLastUsed helper. Keeps upstream CallCount/ModelLoadInfo scheduling intact; shuffle is additive and only randomises within equivalent-rank groups.	2026-02-09 07:33:17 +08:00
erio	edb0937024	fix: restore non-failover error passthrough from `7b156489`	2026-02-07 14:24:55 +08:00
erio	5e98445b22	feat(antigravity): comprehensive enhancements - model mapping, rate limiting, scheduling & ops Key changes: - Upgrade model mapping: Opus 4.5 → Opus 4.6-thinking with precise matching - Unified rate limiting: scope-level → model-level with Redis snapshot sync - Load-balanced scheduling by call count with smart retry mechanism - Force cache billing support - Model identity injection in prompts with leak prevention - Thinking mode auto-handling (max_tokens/budget_tokens fix) - Frontend: whitelist mode toggle, model mapping validation, status indicators - Gemini session fallback with Redis Trie O(L) matching - Ops: enhanced concurrency monitoring, account availability, retry logic - Migration scripts: 049-051 for model mapping unification	2026-02-07 12:31:10 +08:00
shaw	7b1564898b	fix: make error passthrough effective for non-failover upstream errors	2026-02-07 10:25:56 +08:00
shaw	39e05a2dad	feat: 新增全局错误透传规则功能支持管理员配置上游错误如何返回给客户端： - 新增 ErrorPassthroughRule 数据模型和 Ent Schema - 实现规则的 CRUD API（/admin/error-passthrough-rules） - 支持按错误码、关键词匹配，支持 any/all 匹配模式 - 支持按平台过滤（anthropic/openai/gemini/antigravity） - 支持透传或自定义响应状态码和错误消息 - 实现两级缓存（Redis + 本地内存）和多实例同步 - 集成到 gateway_handler 的错误处理流程 - 新增前端管理界面组件 - 新增单元测试覆盖核心匹配逻辑优化： - 移除 refreshLocalCache 中的冗余排序（数据库已排序） - 后端 Validate() 增加匹配条件非空校验	2026-02-05 21:52:54 +08:00
Payne Fu	fecfaae8dc	fix: remove unsupported safety_identifier and previous_response_id fields from upstream requests Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-02-04 15:56:01 +08:00
liuxiongfeng	9a48b2e942	fix(openai): 统一 OAuth instructions 处理逻辑，修复 Codex CLI 400 错误 - 修改 applyCodexOAuthTransform 函数签名，增加 isCodexCLI 参数 - 移除 && !isCodexCLI 条件，对所有 OAuth 请求统一处理 - 新增 applyInstructions/applyCodexCLIInstructions/applyOpenCodeInstructions 辅助函数 - 新增 isInstructionsEmpty 函数检查 instructions 字段是否为空 - 添加 Codex CLI 和非 Codex CLI 场景的测试用例逻辑说明： - Codex CLI + 有 instructions: 保持不变 - Codex CLI + 无 instructions: 补充 opencode 指令 - 非 Codex CLI: 使用 opencode 指令覆盖	2026-02-03 21:22:33 +08:00
bayma888	6146be1474	feat(api-key): add independent quota and expiration support This feature allows API Keys to have their own quota limits and expiration times, independent of the user's balance. Backend: - Add quota, quota_used, expires_at fields to api_key schema - Implement IsExpired() and IsQuotaExhausted() checks in middleware - Add ResetQuota and ClearExpiration API endpoints - Integrate quota billing in gateway handlers (OpenAI, Anthropic, Gemini) - Include quota/expiration fields in auth cache for performance - Expiration check returns 403, quota exhausted returns 429 Frontend: - Add quota and expiration inputs to key create/edit dialog - Add quick-select buttons for expiration (+7, +30, +90 days) - Add reset quota confirmation dialog - Add expires_at column to keys list - Add i18n translations for new features (en/zh) Migration: - Add 045_add_api_key_quota.sql for new columns	2026-02-03 19:49:31 +08:00
Wesley Liddick	bb3df5785a	Merge pull request #322 from xlx0852/main fix: 混合渠道警告确认框和过滤 prompt_cache_retention 参数	2026-02-03 15:13:10 +08:00
Wesley Liddick	df4c0adf0b	Merge pull request #463 from DuckyProject/feat/usage-records-codex-reasoning-effort feat(usage): add reasoning effort column	2026-02-03 14:57:45 +08:00
ducky	53ee6383db	feat(usage): add reasoning effort column	2026-02-03 14:36:29 +08:00
cyhhao	a161fcc89b	Merge branch 'main' of github.com:Wei-Shaw/sub2api	2026-01-26 10:44:38 +08:00
shaw	74e05b83ea	fix(ratelimit): 修复 OpenAI 账号限流倒计时计算错误 - 解析 x-codex-* 响应头获取正确的重置时间 - 7d 限制用尽时使用 codex_7d_reset_after_seconds - 提取 Normalize() 方法统一窗口规范化逻辑	2026-01-25 13:32:08 +08:00
cyhhao	65e69738cc	Merge branch 'main' of github.com:Wei-Shaw/sub2api	2026-01-20 22:46:23 +08:00
cyhhao	c8e2f614fa	Merge branch 'main' of github.com:Wei-Shaw/sub2api	2026-01-20 13:53:32 +08:00
yangjianbo	f6ed3d1456	Merge branch 'test' into dev	2026-01-20 11:59:13 +08:00
yangjianbo	91f01309da	fix(调度): 完善粘性会话清理与账号调度刷新 - Update/BulkUpdate 按不可调度字段触发缓存刷新 - GatewayCache 支持多前缀会话键清理 - 模型路由与混合调度优化粘性会话处理 - 补充调度与缓存相关测试覆盖	2026-01-20 11:40:55 +08:00
cyhhao	26298c4a5f	fix(openai): emit OpenAI-compatible SSE error events	2026-01-19 13:53:39 +08:00
cyhhao	6901b64fce	merge: sync upstream changes	2026-01-17 18:30:16 +08:00
nick8802754751	4e75d8fda9	fix: 添加混合渠道警告确认框和过滤 prompt_cache_retention 参数 - 前端: EditAccountModal 和 CreateAccountModal 添加 409 mixed_channel_warning 处理 - 前端: 弹出确认框让用户确认混合渠道风险 - 后端: 过滤 OpenAI 请求中的 prompt_cache_retention 参数（上游不支持） - 添加中英文翻译 Co-authored-by: factory-droid[bot] <138933559+factory-droid[bot]@users.noreply.github.com>	2026-01-17 16:06:44 +08:00
IanShaw027	ae21db77ec	fix(openai): 使用 prompt_cache_key 兜底粘性会话 opencode 请求不带 session_id/conversation_id，导致粘性会话失效。现在按 header 优先、prompt_cache_key 兜底生成 session hash，并补充单测验证优先级。	2026-01-17 02:31:16 +08:00
IanShaw027	539b41f421	feat(openai): 添加Codex工具调用自动修正功能实现了完整的Codex工具调用拦截和自动修正系统，解决OpenCode使用Codex模型时的工具调用兼容性问题。核心功能: 1. 工具名称自动映射 - apply_patch/applyPatch → edit - update_plan/updatePlan → todowrite - read_plan/readPlan → todoread - search_files/searchFiles → grep - list_files/listFiles → glob - read_file/readFile → read - write_file/writeFile → write - execute_bash/executeBash/exec_bash/execBash → bash 2. 工具参数自动修正 - bash: 自动移除不支持的 workdir/work_dir 参数 - edit: 自动将 path 参数重命名为 file_path - 支持 JSON 字符串和对象两种参数格式 3. 流式响应集成 - 在 SSE 数据流中实时修正工具调用 - 支持多种 JSON 结构（tool_calls, function_call, delta, choices等） - 不影响响应性能和用户体验 4. 统计和监控 - 记录每次工具修正的详细信息 - 提供修正统计数据查询 - 便于问题排查和性能优化实现文件: - `openai_tool_corrector.go`: 工具修正核心逻辑（250行） - `openai_tool_corrector_test.go`: 完整的单元测试（380+行） - `openai_gateway_service.go`: 流式响应集成 - `openai_gateway_service_tool_correction_test.go`: 集成测试测试覆盖: - 工具名称映射测试（18个映射规则） - 参数修正测试（bash workdir、edit path等） - SSE数据修正测试（多种JSON结构） - 统计功能测试 - 所有测试通过 ✅ 解决的问题: 修复了 OpenCode 使用 sub2api 中转 Codex 时，因工具名称和参数不兼容导致的工具调用失败问题。 Codex 模型有时会忽略指令文件中的工具映射说明，导致调用不存在的工具（如 apply_patch）。现在通过流式响应拦截，自动将错误的工具调用修正为 OpenCode 兼容的格式。参考文档: - OpenCode 工具规范: https://opencode.ai/docs/ - Codex Bridge 指令: backend/internal/service/prompts/codex_opencode_bridge.txt	2026-01-15 23:52:50 +08:00
cyhhao	c11f14f3a0	fix(gateway): drain upstream after client disconnect	2026-01-15 21:51:14 +08:00
cyhhao	98b65e67f2	fix(gateway): avoid injecting invalid SSE on client cancel	2026-01-15 21:42:13 +08:00
yangjianbo	f862ddc9ff	style: 修复 gofmt 格式化问题 Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-15 19:42:18 +08:00
yangjianbo	1820389a05	feat(网关): 引入 OpenAI/Claude OAuth token 缓存新增 OpenAI/Claude TokenProvider 与缓存键生成扩展 OAuth 缓存失效覆盖更多平台统一 OAuth 缓存前缀与依赖注入	2026-01-15 18:27:06 +08:00
yangjianbo	90bce60b85	feat: merge dev	2026-01-15 15:14:44 +08:00
ianshaw	25b00abca1	fix(网关): 修复账号选择中的调度器快照延迟问题 ## 问题描述调度器快照更新存在0.5-1秒的延迟（Outbox轮询间隔），导致在账号被限流或过载后的短时间窗口内，可能仍会被选中，造成请求失败。 ## 根本原因账号选择逻辑依赖调度器快照（listSchedulableAccounts），但快照更新有延迟： - Outbox轮询: 每1秒检查一次变更事件 - 全量重建: 每300秒重建一次 - 时间窗口: 账号状态变更后0.5-1秒内，快照可能未更新 ## 解决方案在账号选择循环中添加IsSchedulable()实时检查，作为第二道防线： 1. 第一道防线: 调度器快照过滤（可能有延迟） 2. 第二道防线: IsSchedulable()实时检查（本次修复） IsSchedulable()会检查： - RateLimitResetAt: 限流重置时间 - OverloadUntil: 过载持续时间 - TempUnschedulableUntil: 临时不可调度时间 - Status: 账号状态 - Schedulable: 可调度标志 ## 修改范围 ### OpenAI Gateway Service - SelectAccountForModelWithExclusions: 添加IsSchedulable()检查 - SelectAccountWithLoadAwareness: 添加IsSchedulable()检查 ### Gateway Service (Claude/Gemini/Antigravity) - 负载感知选择候选账号筛选: 添加IsSchedulable()检查 - selectAccountForModelWithPlatform: 添加IsSchedulable()检查 - selectAccountWithMixedScheduling: 添加IsSchedulable()检查 ### 测试用例 - OpenAI: 添加2个测试用例验证限流账号过滤 - Gateway: 添加2个测试用例验证限流和过载账号过滤 ### 其他修复 - ops_repo_preagg.go: 修复platform为NULL时的聚合问题 ## 测试结果所有单元测试通过 ✅	2026-01-13 22:49:26 -08:00
yangjianbo	eea6c2d02c	fix(网关): 补齐Codex指令回退与输入过滤	2026-01-13 17:02:31 +08:00
ianshaw	3d6e01a58f	fix(openai): 增强 OpenCode 兼容性和模型规范化 ## 主要改动 1. 模型规范化扩展到所有账号 - 将 Codex 模型规范化（如 gpt-5-nano → gpt-5.1）应用到所有 OpenAI 账号类型 - 不再仅限于 OAuth 非 CLI 请求 - 解决 Codex CLI 使用 ChatGPT 账号时的模型兼容性问题 2. reasoning.effort 参数规范化 - 自动将 `minimal` 转换为 `none` - 解决 gpt-5.1 模型不支持 `minimal` 值的问题 3. Session/Conversation ID fallback 机制 - 从请求体多个字段提取 session_id/conversation_id - 优先级：prompt_cache_key → session_id → conversation_id → previous_response_id - 支持 Codex CLI 的会话保持 4. Tool Call ID fallback - 当 call_id 为空时使用 id 字段作为 fallback - 确保 tool call 输出能正确匹配 - 保留 item_reference 类型的 items 5. Header 优化 - 添加 conversation_id 到允许的 headers - 移除删除 session headers 的逻辑 ## 相关 Issue - 参考 OpenCode issue #3118 关于 item_reference 的讨论	2026-01-12 20:18:53 -08:00
ianshaw	d85288a6c0	Revert "fix(gateway): 修复 base_url 包含 /chat/completions 时路径拼接错误" This reverts commit `7fdc25df3c`.	2026-01-12 13:29:04 -08:00
ianshaw	3402acb606	feat(gateway): 对所有请求（包括 Codex CLI）应用模型映射 - 移除 Codex CLI 的模型映射跳过逻辑 - 添加详细的模型映射日志，包含账号名称和请求类型 - 确保所有 OpenAI 请求都能正确应用账号配置的模型映射	2026-01-12 13:23:05 -08:00
ianshaw	7fdc25df3c	fix(gateway): 修复 base_url 包含 /chat/completions 时路径拼接错误问题： - 当账号的 base_url 配置为 https://example.com/v1/chat/completions 时 - 代码直接追加 /responses，导致路径变成 /v1/chat/completions/responses - 上游返回 404 错误修复： - 在追加 /responses 前，先移除 base_url 中的 /chat/completions 后缀 - 确保最终路径为 https://example.com/v1/responses 影响范围： - OpenAI API Key 账号的测试接口 - OpenAI API Key 账号的实际网关请求 Related-to: #231	2026-01-12 11:39:45 -08:00
ianshaw	fe6a3f4267	fix(gateway): 完善 max_output_tokens 参数处理逻辑根据不同平台和账号类型处理 max_output_tokens 参数： - OpenAI OAuth (Responses API): 保留 max_output_tokens（支持） - OpenAI API Key: 删除 max_output_tokens（不支持） - Anthropic (Claude): 转换 max_output_tokens 为 max_tokens - Gemini: 删除 max_output_tokens（由 Gemini 专用转换处理） - 其他平台: 删除（安全起见）同时处理 max_completion_tokens 参数，仅在 OpenAI OAuth 时保留。修复客户端（如 OpenCode）发送不支持参数导致上游返回 400 错误的问题。 Related-to: #231	2026-01-12 11:08:28 -08:00
yangjianbo	2db34139f0	Merge branch 'main' of https://github.com/mt21625457/aicodex2api	2026-01-12 14:50:53 +08:00
yangjianbo	3141aa5144	feat(scheduler): 引入调度快照缓存与 outbox 回放 - 调度热路径优先读 Redis 快照，保留分组排序语义 - outbox 回放 + 全量重建纠偏，失败重试不推进水位 - 自动 Atlas 基线对齐并同步调度配置示例	2026-01-12 14:19:06 +08:00
ianshaw	2a0758bdfe	feat(gateway): 添加流超时处理机制 - 添加 StreamTimeoutSettings 配置结构体和系统设置 - 实现 TimeoutCounterCache Redis 计数器用于累计超时次数 - 在 RateLimitService 添加 HandleStreamTimeout 方法 - 在 gateway_service、openai_gateway_service、antigravity_gateway_service 中调用超时处理 - 添加后端 API 端点 GET/PUT /admin/settings/stream-timeout - 添加前端配置界面到系统设置页面 - 支持配置：启用开关、超时阈值、处理方式、暂停时长、触发阈值、阈值窗口默认配置： - 启用：true - 超时阈值：60秒 - 处理方式：临时不可调度 - 暂停时长：5分钟 - 触发阈值：3次 - 阈值窗口：10分钟	2026-01-11 21:54:52 -08:00
IanShaw027	4cb7b26f03	fix: 移除未使用的os包导入	2026-01-11 23:18:00 +08:00
IanShaw027	3dfb62e996	merge: 合并main分支最新改动解决冲突： - backend/internal/config/config.go: 合并Ops和Dashboard配置 - backend/internal/server/api_contract_test.go: 合并handler初始化 - backend/internal/service/openai_gateway_service.go: 保留Ops错误追踪逻辑 - backend/internal/service/wire.go: 合并Ops和APIKeyAuth provider 主要合并内容： - Dashboard缓存和预聚合功能 - API Key认证缓存优化 - Codex转换支持 - 使用日志分区表	2026-01-11 23:15:01 +08:00
IanShaw027	7ebca553ef	feat(ops): 实现上游错误事件记录与查询功能新增功能: - 新建ops_upstream_error_events表存储上游服务错误详情 - 支持记录上游429/529/5xx错误的详细上下文信息 - 提供按时间范围查询上游错误事件的API 后端改动: 1. 模型层（ops_models.go, ops_port.go）: - 新增UpstreamErrorEvent结构体 - 扩展Repository接口支持上游错误事件CRUD 2. 仓储层（ops_repo.go）: - 实现InsertUpstreamErrorEvent写入上游错误 - 实现GetUpstreamErrorEvents按时间范围查询 3. 服务层（ops_service.go, ops_upstream_context.go）: - ops_service: 新增GetUpstreamErrorEvents查询方法 - ops_upstream_context: 封装上游错误上下文构建逻辑 4. Handler层（ops_error_logger.go）: - 新增GetUpstreamErrorsHandler处理上游错误查询请求 5. Gateway层集成: - antigravity_gateway_service.go: 429/529错误时记录上游事件 - gateway_service.go: OpenAI 429/5xx错误时记录 - gemini_messages_compat_service.go: Gemini 429/5xx错误时记录 - openai_gateway_service.go: OpenAI 429/5xx错误时记录 - ratelimit_service.go: 429限流错误时记录数据记录字段: - request_id: 关联ops_logs主记录 - platform/model: 上游服务标识 - status_code/error_message: 错误详情 - request_headers/response_body: 调试信息（可选） - created_at: 错误发生时间	2026-01-11 15:30:27 +08:00
IanShaw027	89a725a433	feat(ops): 添加QPS脉搏线图并优化指标布局 - 添加实时QPS/TPS历史数据追踪（最近60个数据点） - 在平均QPS/TPS上方添加SVG脉搏线图（sparkline） - 将延迟和TTFT卡片的指标布局从2列改为3列 - 恢复Max指标显示（P95/P90/P50/Avg/Max）	2026-01-11 11:49:34 +08:00
cyhhao	1a641392d9	Merge up/main	2026-01-10 21:57:57 +08:00
cyhhao	36b817d008	Align OAuth transform with OpenCode instructions	2026-01-10 20:53:16 +08:00
kzw200015	24d19a5f78	fix: 从codex请求参数中移除max_output_tokens (#231 ) 某些客户端比如 opencode 会在请求中附加 max_output_tokens，这会导致上游返回400错误	2026-01-10 19:37:04 +08:00
cyhhao	eb06006d6c	Make Codex CLI passthrough	2026-01-10 03:12:56 +08:00
Edric.Li	0a4641c24e	feat(api-key): 添加 IP 白名单/黑名单限制功能 (#221 ) * feat(api-key): add IP whitelist/blacklist restriction and usage log IP tracking - Add IP restriction feature for API keys (whitelist/blacklist with CIDR support) - Add IP address logging to usage logs (admin-only visibility) - Remove billing_type column from usage logs UI (redundant) - Use generic "Access denied" error message for security Backend: - New ip package with IP/CIDR validation and matching utilities - Database migrations for ip_whitelist, ip_blacklist (api_keys) and ip_address (usage_logs) - Middleware IP restriction check after API key validation - Input validation for IP/CIDR patterns on create/update Frontend: - API key form with enable toggle for IP restriction - Shield icon indicator in table for keys with IP restriction - Removed billing_type filter and column from usage views * fix: update API contract tests for ip_whitelist/ip_blacklist fields Add ip_whitelist and ip_blacklist fields to expected JSON responses in API contract tests to match the new API key schema.	2026-01-09 21:59:32 +08:00
cyhhao	7a06c4873e	Fix Codex OAuth tool mapping	2026-01-09 18:35:58 +08:00
Call White	f6a9a0a45a	Merge pull request #1 from cyhhao/feat/ai-sdk-compatibility feat(openai): add AI SDK content format compatibility for OAuth accounts	2026-01-09 00:47:44 +08:00
cyhhao	5b8d4fb047	feat(openai): add AI SDK content format compatibility for OAuth accounts - Add normalizeInputForCodexAPI function to convert AI SDK multi-part content format to simplified format expected by ChatGPT Codex API - AI SDK sends: {"content": [{"type": "input_text", "text": "..."}]} - Codex API expects: {"content": "..."} - Only applies to OAuth accounts (ChatGPT internal API) - API Key accounts remain unchanged (OpenAI Platform API supports both)	2026-01-09 00:34:49 +08:00
Edric Li	a42105881f	feat(groups): add Claude Code client restriction and session isolation - Add claude_code_only field to restrict groups to Claude Code clients only - Add fallback_group_id for non-Claude Code requests to use alternate group - Implement ClaudeCodeValidator for User-Agent detection - Add group-level session binding isolation (groupID in Redis key) - Prevent cross-group sticky session pollution - Update frontend with Claude Code restriction controls	2026-01-08 23:07:00 +08:00

1 2

85 Commits