sub2api

Author	SHA1	Message	Date
erio	62e80c602d	revert: completely remove all Sora functionality	2026-04-05 17:11:01 +08:00
erio	e59fa8637a	fix: resolve cherry-pick compilation and test issues - Add int64(0) param to SelectAccountWithLoadAwareness callers (signature change from channel scheduling refactor) - Add UsageMapHook type and struct field to StreamingProcessor - Revert Claude Max cache billing code to upstream/main (not part of channel feature) - Revert credits overages logic to upstream/main (non-channel change) - Remove Instructions field reference (non-channel OpenAI feature) - Restore sora_client_handler_test.go from upstream + add channel service nil params	2026-04-04 12:38:50 +08:00
erio	ce41afb756	refactor: move channel model restriction from handler to scheduling phase Move the model pricing restriction check from 8 handler entry points to the account scheduling phase (SelectAccountForModelWithExclusions / SelectAccountWithLoadAwareness), aligning restriction with billing: - requested: check original request model against pricing list - channel_mapped: check channel-mapped model against pricing list - upstream: per-account check using account-mapped model Handler layer now only resolves channel mapping (no restriction). Scheduling layer performs pre-check for requested/channel_mapped, and per-account filtering for upstream billing source.	2026-04-04 11:24:48 +08:00
erio	2555951be4	feat(channel): 渠道管理全链路集成 — 模型映射、定价、限制、用量统计 - 渠道模型映射：支持精确匹配和通配符映射，按平台隔离 - 渠道模型定价：支持 token/按次/图片三种计费模式，区间分层定价 - 模型限制：渠道可限制仅允许定价列表中的模型 - 计费模型来源：支持 requested/upstream 两种计费模型选择 - 用量统计：usage_logs 新增 channel_id/model_mapping_chain/billing_tier/billing_mode 字段 - Dashboard 支持 model_source 维度（requested/upstream/mapping）查看模型统计 - 全部 gateway handler 统一接入 ResolveChannelMappingAndRestrict - 修复测试：同步 SoraGenerationRepository 接口、SQL INSERT 参数、scan 字段	2026-04-04 11:13:58 +08:00
erio	eb385457b2	fix(channel): 全平台渠道映射覆盖 + 公共函数抽取 + 死代码清理 - 4个缺失handler入口添加渠道映射+限制检查(ChatCompletions/Responses/Gemini) - 模型限制错误信息优化，区分"模型不可用"和"无账号" - OpenAI RecordUsage RequestedModel 改用 OriginalModel - ResolveChannelMappingAndRestrict/ReplaceModelInBody 抽取到 ChannelService 消除跨service重复 - validateNoDuplicateModels 按 platform:model 去重 - 删除 Channel.ResolveMappedModel 死代码和 CalculateCostWithChannel Deprecated方法 - 移除冗余nil检查，抽取 validatePricingBillingMode 公共校验	2026-04-04 11:13:56 +08:00
erio	4ea8b4cb4f	refactor(channel): 抽取渠道映射公共函数 + OpenAI映射到body + 空响应修复 + 清理日志 - 抽取 ResolveChannelMappingAndRestrict 统一入口（5处→1个方法） - 抽取 BuildModelMappingChain 到 ChannelMappingResult 方法（5处→1行调用） - OpenAI 三入口 Forward 前应用渠道映射到请求体 - OpenAI Responses/Messages 限制检查添加错误响应 - 清理前端 3 处 console.log 调试日志	2026-04-04 11:13:56 +08:00
erio	91bdcf8994	fix(channel): 模型限制用映射后模型检查 + 平台开关保留配置不删除 - OpenAI 网关三处 IsModelRestricted 改用 channelMapping.MappedModel - 前端平台勾选改为 enabled 开关，取消勾选不清空配置数据 - formToAPI/校验只处理 enabled 的平台	2026-04-04 11:13:56 +08:00
erio	632035aabd	feat(billing): 网关计费迁移到 CalculateCostUnified + 模型限制错误统一 - GatewayService/OpenAIGatewayService 注入 ModelPricingResolver - RecordUsage 从旧路径迁移到 CalculateCostUnified（支持 per_request/image 模式） - 无渠道时自动回退旧路径，保持原有行为 - 长上下文双倍计费仅在无渠道定价时生效 - CostBreakdown 新增 BillingMode 字段，使用日志记录实际计费模式 - 模型限制错误改为与"无可用账号"相同的 503 响应	2026-04-04 11:12:21 +08:00
erio	ebac0dc628	feat(channel): 缓存扁平化 + 网关映射集成 + 计费模式统一 + 模型限制 - 缓存重构为 O(1) 哈希结构 (pricingByGroupModel, mappingByGroupModel) - 渠道模型映射接入网关流程 (Forward 前应用, a→b→c 映射链) - 新增 billing_model_source 配置 (请求模型/最终模型计费) - usage_logs 新增 channel_id, model_mapping_chain, billing_tier 字段 - 每种计费模式统一支持默认价格 + 区间定价 - 渠道模型限制开关 (restrict_models) - 分组按平台分类展示 + 彩色图标 - 必填字段红色星号 + 模型映射 UI - 去除模型通配符支持	2026-04-04 11:09:01 +08:00
Dave King	7c6dc9dda8	fix: add account and proxy details to gateway.forward_failed log The forward_failed error log only included account_id, making it difficult to identify which account and proxy caused the failure without querying the database. Add account_name, account_platform, and proxy details (id, name, host, port) to the log fields. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-25 12:19:17 +00:00
Ethan0x0000	db9021f9c1	feat(ops): propagate endpoint/request-type context in handlers; add UpstreamURL to upstream error events	2026-03-21 23:47:39 +08:00
shaw	01d8286bd9	feat: add max_claude_code_version setting and disable auto-upgrade env var Add maximum Claude Code version limit to complement the existing minimum version check. Refactor the version cache from single-value to unified bounds struct (min+max) with a single atomic.Value and singleflight group. - Backend: new constant, struct field, cache refactor, validation (semver format + cross-validation max >= min), gateway enforcement, audit diff - Frontend: settings UI input, TypeScript types, zh/en i18n - Add CLAUDE_CODE_DISABLE_NONESSENTIAL_TRAFFIC=1 to all Claude Code tutorials on /keys page (unix/cmd/powershell/vscode settings.json)	2026-03-20 09:10:01 +08:00
haruka	1fd1a58a7a	fix: record original upstream status code when failover exhausted (#1128 ) When all failover accounts are exhausted, handleFailoverExhausted maps the upstream status code (e.g. 403) to a client-facing code (e.g. 502) but did not write the original code to the gin context. This caused ops error logs to show the mapped code instead of the real upstream code. Call SetOpsUpstreamError before mapUpstreamError in all failover- exhausted paths so that ops_error_logger captures the true upstream status code and message. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-19 11:15:02 +08:00
Ethan0x0000	7bd1972f94	refactor: migrate all handlers to shared endpoint normalization middleware - Apply InboundEndpointMiddleware to all gateway route groups - Replace normalizedOpenAIInboundEndpoint/normalizedOpenAIUpstreamEndpoint and normalizedGatewayInboundEndpoint/normalizedGatewayUpstreamEndpoint with GetInboundEndpoint/GetUpstreamEndpoint - Remove 4 old constants and 4 old normalization functions (-70 lines) - Migrate existing endpoint normalization test to new API Ultraworked with [Sisyphus](https://github.com/code-yeongyu/oh-my-opencode) Co-authored-by: Sisyphus <clio-agent@sisyphuslabs.ai>	2026-03-15 22:13:42 +08:00
Wesley Liddick	8321e4a647	Merge pull request #1023 from YanzheL/fix/claude-output-effort-logging fix: extract and log Claude output_config.effort in usage records	2026-03-15 16:45:37 +08:00
Elysia	359e56751b	增加测试	2026-03-15 16:21:49 +08:00
YanzheL	1bff2292a6	fix: extract and log Claude output_config.effort in usage records Claude's output_config.effort parameter (low/medium/high/max) was not being extracted from requests or logged in the reasoning_effort column of usage logs. Only the OpenAI path populated this field. Changes: - Extract output_config.effort in ParseGatewayRequest - Add ReasoningEffort field to ForwardResult - Populate reasoning_effort in both RecordUsage and RecordUsageWithLongContext - Guard against overwriting service-set effort values in handler - Update stale comments that described reasoning_effort as OpenAI-only - Add unit tests for extraction, normalization, and persistence	2026-03-15 12:55:37 +08:00
Elysia	0e23732631	fix(gateway): 防止流式 failover 拼接腐化导致客户端收到双 message_start 当上游在 SSE 流中途返回 event:error 时，handleStreamingResponse 已将部分 SSE 事件写入客户端，但原先的 failover 逻辑仍会切换到下一个账号并写入完整流，导致客户端收到两个 message_start 进而产生 400 错误。修复方案：在每次 Forward 调用前记录 c.Writer.Size()，若 Forward 返回 UpstreamFailoverError 后 writer 字节数增加，说明 SSE 内容已不可撤销地发送给客户端，此时直接调用 handleFailoverExhausted 发送 SSE error 事件终止流，而非继续 failover。 Ping-only 场景不受影响：slot 等待期的 ping 字节在 Forward 前后相等，正常 failover 流程照常进行。 Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-14 22:49:23 +08:00
ius	611fd884bd	feat: decouple billing correctness from usage log batching	2026-03-12 16:53:18 +08:00
shaw	00a0a12138	feat: Anthropic平台可配置 anthropic-beta 策略	2026-03-10 11:20:10 +08:00
shaw	c7fcb7a84b	feat: apikey限额支持查询重置时间	2026-03-09 10:22:24 +08:00
shaw	7a353028e7	fix: 修复keys速率限制未自动重置额度的bug	2026-03-07 10:13:51 +08:00
shaw	838dad8759	feat: 重构 /v1/usage 端点，支持 quota_limited 和 unrestricted 双模式 - quota_limited 模式：返回 Key 级别的总额度、速率限制窗口用量和过期时间 - unrestricted 模式：返回订阅限额或钱包余额信息（向后兼容） - 新增 model_stats 字段，支持 start_date/end_date 参数查询按模型用量统计 - 提取 buildUsageData/parseUsageDateRange 等辅助方法，减少主函数复杂度 - 新增 APIKeyService.GetRateLimitData 和 UsageService.GetAPIKeyModelStats	2026-03-03 20:59:12 +08:00
shaw	a80ec5d8bb	feat: apikey支持5h/1d/7d速率控制	2026-03-03 15:01:10 +08:00
QTom	a9285b8a94	feat(gateway): 双模式用户消息队列 — 串行队列 + 软性限速新增 UMQ (User Message Queue) 双模式支持: - serialize: 账号级分布式串行锁 + RPM 自适应延迟（严格限流） - throttle: 仅 RPM 自适应前置延迟，不阻塞并发（软性限速）后端: - config: 新增 Mode 字段，保留 Enabled 向后兼容 - service: 新增 UserMessageQueueService（Lua 锁/延迟算法/清理 worker） - repository: 新增 UserMsgQueueCache（Redis Lua acquire/release/force-release） - handler: 新增 UserMsgQueueHelper（SSE ping + 等待循环 + throttle） - gateway: 按 mode 分支集成 serialize/throttle 逻辑 - lint: 修复 gofmt rewrite rules、errcheck 类型断言、staticcheck QF1012 前端: - 三态选择器 UI（关闭/软性限速/串行队列）替代 toggle 开关 - BulkEdit 支持 null 语义（不修改） - i18n 中英文文案通过 6 轮专家评审（42 次 review）、golangci-lint、单元测试、集成测试。	2026-03-03 01:05:11 +08:00
QTom	4280aca82c	feat(gateway): 添加 Claude Code 客户端最低版本检查功能 - 通过 User-Agent 识别 Claude Code 客户端并提取版本号 - 在网关层验证客户端版本是否满足管理员配置的最低要求 - 在管理后台提供版本要求配置选项（英文/中文双语） - 实现原子缓存 + singleflight 防止并发问题和 thundering herd - 使用 context.WithoutCancel 隔离 DB 查询，避免客户端断连影响缓存 - 双 TTL 策略：60s 正常、5s 错误恢复，保证性能与可用性 - 仅检查 Claude Code 客户端，其他客户端不受影响 - 添加完整单元测试覆盖版本提取、比对、上下文操作	2026-03-01 15:45:44 +08:00
QTom	e63c83955a	fix: address deep code review issues for RPM limiting - Move IncrementRPM after Forward success to prevent phantom RPM consumption during account switch retries - Add base_rpm input sanitization (clamp to 0-10000) in Create/Update - Add WindowCost scheduling checks to legacy path sticky sessions (4 check sites + 4 prefetch sites), fixing pre-existing gap - Clean up rpm_strategy/rpm_sticky_buffer when disabling RPM in BulkEditModal (JSONB merge cannot delete keys, use empty values) - Add json.Number test cases to TestGetBaseRPM/TestGetRPMStickyBuffer - Document TOCTOU race as accepted soft-limit design trade-off	2026-02-28 20:38:06 +08:00
QTom	607237571f	fix: address code review issues for RPM limiting feature - Use TxPipeline (MULTI/EXEC) instead of Pipeline for atomic INCR+EXPIRE - Filter negative values in GetBaseRPM(), update test expectation - Add RPM batch query (GetRPMBatch) to account List API - Add warn logs for RPM increment failures in gateway handler - Reset enableRpmLimit on BulkEditAccountModal close - Use union type 'tiered' \| 'sticky_exempt' for rpmStrategy refs - Add design decision comments for rdb.Time() RTT trade-off	2026-02-28 20:37:37 +08:00
QTom	f648b8e026	feat: increment RPM counter before request forwarding	2026-02-28 20:37:10 +08:00
yangjianbo	bb664d9bbf	feat(sync): full code sync from release	2026-02-28 15:01:20 +08:00
erio	d8d4b0c0c7	fix: enable Gemini model_mapping UI and extend warmup to Antigravity - Remove Gemini platform exclusion from model restriction UI in Create/Edit account modals (Gemini now supports model_mapping) - Remove outdated Gemini model passthrough info cards - Add model_mapping field to GeminiCredentials type - Extend warmup request interception toggle to Antigravity platform - Remove redundant try/catch in API key account creation - Remove noisy gateway.request_completed debug log - Reorganize Gemini model mapping sections in constants.go	2026-02-24 21:30:32 +08:00
erio	09166a52f8	refactor: extract failover error handling into FailoverState - Extract duplicated failover logic from gateway_handler.go (3 places) and gemini_v1beta_handler.go into shared failover_loop.go - Introduce FailoverState with HandleFailoverError and HandleSelectionExhausted - Move helper functions (needForceCacheBilling, sleepWithContext) into failover_loop.go - Add comprehensive unit tests (32+ test cases) - Delete redundant gateway_handler_single_account_retry_test.go	2026-02-24 18:08:04 +08:00
yangjianbo	2ee6c26676	fix(gateway): 修复粘性会话预取分组错配并优化并发等待热路径	2026-02-22 16:43:33 +08:00
yangjianbo	a89477ddf5	perf(gateway): 优化热点路径并补齐高覆盖测试	2026-02-22 13:31:30 +08:00
yangjianbo	33db7a0fb6	feat(gateway): 引入使用量记录有界 worker 池与自动扩缩容 - 新增 UsageRecordWorkerPool，支持有界队列、溢出降级策略与自动扩缩容 - 将 Gateway/OpenAI/Sora/Gemini 使用量记录改为提交到统一任务池执行 - 增加 usage_record 配置默认值与校验规则，并补充配置与任务提交相关测试 - 注入并托管 worker 池生命周期，服务退出时统一 StopAndWait Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-22 12:56:57 +08:00
yangjianbo	d04b47b3ca	feat(backend): 提交后端审计修复与配套测试改动	2026-02-14 11:23:10 +08:00
yangjianbo	abf5de69fb	Merge branch 'main' into test	2026-02-12 23:43:47 +08:00
yangjianbo	584cfc3db2	chore(logging): 完成后端日志审计与结构化迁移 - 将高密度服务与处理器日志迁移到新日志系统（LegacyPrintf/结构化日志） - 增加 stdlog bridge 与兼容测试，保留旧日志捕获能力 - 将 OpenAI 断流告警改为结构化 Warn 并改造对应测试为 sink 捕获 - 补齐后端相关文件 logger 引用并通过全量 go test	2026-02-12 19:01:09 +08:00
yangjianbo	fff1d54858	feat(log): 落地统一日志底座与系统日志运维能力	2026-02-12 16:27:29 +08:00
程序猿MT	8da5fac69e	Merge branch 'Wei-Shaw:main' into main	2026-02-11 18:39:52 +08:00
Edric Li	2d4236f76e	fix: 修复错误透传规则 skip_monitoring 未生效的问题 - ops_error_logger: status < 400 分支增加 OpsSkipPassthroughKey 检查 - ops_upstream_context: 新增 checkSkipMonitoringForUpstreamEvent，中间重试/故障转移事件也能触发跳过标记 - gateway_handler/openai_gateway_handler/gemini_v1beta_handler: handleFailoverExhausted 匹配规则后设置 OpsSkipPassthroughKey - antigravity_gateway_service: writeMappedClaudeError 增加 applyErrorPassthroughRule 调用	2026-02-10 20:56:01 +08:00
yangjianbo	3b0910f664	Merge branch 'main' into test-sora	2026-02-10 18:01:17 +08:00
程序猿MT	1dd3158c7e	Merge branch 'Wei-Shaw:main' into main	2026-02-10 13:55:51 +08:00
Edric Li	7d0a30fa8f	merge: sync upstream main (antigravity single-account 503 retry) 合并上游新增的 Antigravity 单账号 503 退避重试机制，解决与本地 MODEL_CAPACITY_EXHAUSTED 逻辑的冲突，两者共存。	2026-02-10 12:00:21 +08:00
Edric Li	d6c2921f2b	feat: same-account retry before failover for transient errors For retryable transient errors (Google 400 "invalid project resource name" and empty stream responses), retry on the same account up to 2 times (with 500ms delay) before switching to another account. - Add RetryableOnSameAccount field to UpstreamFailoverError - Add same-account retry loop in both Gemini and Claude/OpenAI handler paths - Move temp-unschedule from service layer to handler layer (only after all same-account retries exhausted) - Reduce temp-unschedule cooldown from 30 minutes to 1 minute	2026-02-10 00:53:54 +08:00
yangjianbo	d367d1cde6	Merge branch 'main' into test-sora	2026-02-09 20:40:09 +08:00
yangjianbo	16131c3d3f	Merge branch 'main' of https://github.com/mt21625457/aicodex2api	2026-02-09 20:26:03 +08:00
Rose Ding	021abfca18	fix: 单账号分组首次 503 不设模型限流标记，避免后续请求雪崩单账号 antigravity 分组收到 503 (MODEL_CAPACITY_EXHAUSTED) 时，原逻辑会设置 ~29s 模型限流标记。由于只有一个账号无法切换，后续所有新请求在预检查时命中限流 → 几毫秒内直接返回 503，导致约 30 秒的雪崩窗口。修复：在 Handler 入口处检查分组是否只有单个 antigravity 账号，如果是则提前设置 SingleAccountRetry context 标记，让 Service 层首次 503 就走原地重试逻辑（不设限流标记），避免污染后续请求。	2026-02-09 17:25:36 +08:00
Rose Ding	f6cfab9901	feat: 添加 Antigravity 单账号 503 退避重试机制当分组内只有一个可用账号且上游返回 503 (MODEL_CAPACITY_EXHAUSTED) 时，不再设置模型限流+切换账号（因为切换回来还是同一个账号），而是在 Service 层原地等待+重试，避免双重等待问题。主要变更： - Handler 层：检测单账号 503 场景，清除排除列表并设置 SingleAccountRetry 标记 - Service 层：新增 handleSingleAccountRetryInPlace 原地重试逻辑 - Service 层：预检查跳过单账号模式下的限流检查 - 新增 ctxkey.SingleAccountRetry 上下文标记	2026-02-09 14:26:01 +08:00
erio	72b08f9cc5	fix: ensure sticky session failover triggers cache billing exemption	2026-02-09 06:57:07 +08:00

1 2 3

135 Commits