sub2api

Author	SHA1	Message	Date
erio	10699eeb34	refactor: extract ReadUpstreamResponseBody to deduplicate upstream response read + too-large error handling Consolidates 9 call sites of resolveUpstreamResponseReadLimit + readUpstreamResponseBodyLimited + ErrUpstreamResponseBodyTooLarge error handling into a single ReadUpstreamResponseBody function with TooLargeWriter callback for API-format-specific error responses (Anthropic, OpenAI, countTokens).	2026-04-16 01:53:22 +08:00
ius	265687b56d	fix: 优化调度快照缓存以避免 Redis 大 MGET	2026-04-08 10:39:15 -07:00
Wesley Liddick	276f499c82	Merge pull request #1418 from YanzheL/fix/1161-gemini-google-search-grounding fix(gemini): preserve google search grounding tools	2026-04-08 14:19:57 +08:00
erio	d72ac92694	feat: image output token billing, channel-mapped billing source, credits balance precheck - Parse candidatesTokensDetails from Gemini API to separate image/text output tokens - Add image_output_tokens and image_output_cost to usage_log (migration 089) - Support per-image-token pricing via output_cost_per_image_token from model pricing data - Channel pricing ImageOutputPrice override works in token billing mode - Auto-fill image_output_price in channel pricing form from model defaults - Add "channel_mapped" billing model source as new default (migration 088) - Bills by model name after channel mapping, before account mapping - Fix channel cache error TTL sign error (115s → 5s) - Fix Update channel only invalidating new groups, not removed groups - Fix frontend model_mapping clearing sending undefined instead of {} - Credits balance precheck via shared AccountUsageService cache before injection - Skip credits injection for accounts with insufficient balance - Don't mark credits exhausted for "exhausted your capacity on this model" 429s	2026-04-04 11:15:59 +08:00
YanzheL	dd5978f222	fix(gemini): normalize ai studio google search tools	2026-04-01 00:45:56 +08:00
YanzheL	0ebe0ce585	fix(gemini): preserve google search in Claude compat tools	2026-04-01 00:33:39 +08:00
Ethan0x0000	2c667a159c	fix(provider): retain upstream model for gemini compat and ws Ultraworked with [Sisyphus](https://github.com/code-yeongyu/oh-my-openagent) Co-authored-by: Sisyphus <clio-agent@sisyphuslabs.ai>	2026-03-21 01:24:59 +08:00
IanShaw027	90b3838173	fix: 移除 Gemini 不支持的 patternProperties 字段 #795	2026-03-15 17:46:58 +08:00
QTom	530a16291c	fix(gateway): 分组隔离 — 禁止未分组账号被跨组调度当 API Key 无分组时，调度仅从未分组账号池中选取。修复 isAccountInGroup 在 groupID==nil 时的逻辑，同时补全 scheduler_snapshot_service 和 gemini_compat_service 中的 SimpleMode 保护，确保分组隔离在所有调度路径生效。新增 ListSchedulableUngroupedByPlatform/s 方法，使用 Ent 的 Not(HasAccountGroups()) 谓词实现未分组账号隔离。新增 17 个单元和端到端隔离测试，覆盖所有分支和边界条件。	2026-03-03 13:20:58 +08:00
yangjianbo	bb664d9bbf	feat(sync): full code sync from release	2026-02-28 15:01:20 +08:00
yangjianbo	d04b47b3ca	feat(backend): 提交后端审计修复与配套测试改动	2026-02-14 11:23:10 +08:00
yangjianbo	abf5de69fb	Merge branch 'main' into test	2026-02-12 23:43:47 +08:00
yangjianbo	584cfc3db2	chore(logging): 完成后端日志审计与结构化迁移 - 将高密度服务与处理器日志迁移到新日志系统（LegacyPrintf/结构化日志） - 增加 stdlog bridge 与兼容测试，保留旧日志捕获能力 - 将 OpenAI 断流告警改为结构化 Warn 并改造对应测试为 sink 捕获 - 补齐后端相关文件 logger 引用并通过全量 go test	2026-02-12 19:01:09 +08:00
sususu98	d21d70a5cf	fix: include Gemini thoughtsTokenCount in output token billing Gemini 2.5 Pro/Flash thinking models return thoughtsTokenCount separately from candidatesTokenCount in usageMetadata, but this field was not parsed or included in billing calculations, causing thinking tokens to be unbilled. - Add ThoughtsTokenCount field to GeminiUsageMetadata struct - Include thoughtsTokenCount in OutputTokens across all 3 Gemini usage parsing paths (non-streaming, streaming, compat layer) - Add tests covering thinking token scenarios Closes #554	2026-02-11 15:41:54 +08:00
yangjianbo	3b0910f664	Merge branch 'main' into test-sora	2026-02-10 18:01:17 +08:00
yangjianbo	58912d4ac5	perf(backend): 使用 gjson/sjson 优化热路径 JSON 处理将 API 网关热路径中的 json.Unmarshal+json.Marshal 替换为 gjson 零拷贝查询和 sjson 精准写入： - unwrapV1InternalResponse 性能提升 22x（4009ns→182ns），内存分配减少 28.5x - unwrapGeminiResponse、extractGeminiUsage、estimateGeminiCountTokens、ParseGeminiRateLimitResetTime 改为接收 []byte 使用 gjson 提取 - ParseGatewayRequest 的 model/stream/metadata/thinking/max_tokens 改用 gjson 类型安全提取 - Handler 层（sora/openai）改用 gjson 提取字段、sjson 注入/修改字段，移除 map[string]any 中间变量 - Sora Client 响应解析改用 gjson ForEach 遍历，减少内存分配 - 新增约 100 个单元测试用例，所有改动函数覆盖率 >85% Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-10 08:59:30 +08:00
Edric Li	d6c2921f2b	feat: same-account retry before failover for transient errors For retryable transient errors (Google 400 "invalid project resource name" and empty stream responses), retry on the same account up to 2 times (with 500ms delay) before switching to another account. - Add RetryableOnSameAccount field to UpstreamFailoverError - Add same-account retry loop in both Gemini and Claude/OpenAI handler paths - Move temp-unschedule from service layer to handler layer (only after all same-account retries exhausted) - Reduce temp-unschedule cooldown from 30 minutes to 1 minute	2026-02-10 00:53:54 +08:00
Edric Li	89905ec43d	feat: failover and temp-unschedule on Google "Invalid project resource name" 400 Google 后端间歇性返回 400 "Invalid project resource name" 错误，此前该错误直接透传给客户端且不触发账号切换，导致请求失败。 - 在 Antigravity 和 Gemini 两个平台的所有转发路径中，精确匹配该错误消息后触发 failover 自动换号重试 - 命中后将账号临时封禁 1 小时，避免反复调度到同一故障账号 - 提取共享函数 isGoogleProjectConfigError / tempUnscheduleGoogleConfigError 消除跨 Service 的代码重复	2026-02-09 22:48:32 +08:00
erio	a70d37a676	fix: Gemini error policy check should precede retry logic	2026-02-09 19:55:17 +08:00
erio	6892e84ad2	fix: skip rate limiting when custom error codes don't match upstream status Add ShouldHandleErrorCode guard at the entry of handleGeminiUpstreamError and AntigravityGatewayService.handleUpstreamError so that accounts with custom error codes (e.g. [599]) are not rate-limited when the upstream returns a non-matching status (e.g. 429).	2026-02-09 19:55:05 +08:00
erio	73f455745c	feat: ErrorPolicySkipped returns 500 instead of upstream status code When custom error codes are enabled and the upstream error code is NOT in the configured list, return HTTP 500 to the client instead of transparently forwarding the original status code. Also adds integration test TestCustomErrorCode599 verifying that 429, 500, 503, 401, 403 all return 500 without triggering SetRateLimited or SetError.	2026-02-09 19:54:54 +08:00
erio	a67d9337b8	feat: integrate CheckErrorPolicy into Gemini error handling paths	2026-02-09 06:55:45 +08:00
erio	fb58560d15	refactor(upstream): replace upstream account type with apikey, auto-append /antigravity Upstream accounts now use the standard APIKey type instead of a dedicated upstream type. GetBaseURL() and new GetGeminiBaseURL() automatically append /antigravity for Antigravity platform APIKey accounts, eliminating the need for separate upstream forwarding methods. - Remove ForwardUpstream, ForwardUpstreamGemini, testUpstreamConnection - Remove upstream branch guards in Forward/ForwardGemini/TestConnection - Add migration 052 to convert existing upstream accounts to apikey - Update frontend CreateAccountModal to create apikey type - Add unit tests for GetBaseURL and GetGeminiBaseURL	2026-02-08 13:06:25 +08:00
erio	edb0937024	fix: restore non-failover error passthrough from `7b156489`	2026-02-07 14:24:55 +08:00
erio	5e98445b22	feat(antigravity): comprehensive enhancements - model mapping, rate limiting, scheduling & ops Key changes: - Upgrade model mapping: Opus 4.5 → Opus 4.6-thinking with precise matching - Unified rate limiting: scope-level → model-level with Redis snapshot sync - Load-balanced scheduling by call count with smart retry mechanism - Force cache billing support - Model identity injection in prompts with leak prevention - Thinking mode auto-handling (max_tokens/budget_tokens fix) - Frontend: whitelist mode toggle, model mapping validation, status indicators - Gemini session fallback with Redis Trie O(L) matching - Ops: enhanced concurrency monitoring, account availability, retry logic - Migration scripts: 049-051 for model mapping unification	2026-02-07 12:31:10 +08:00
shaw	7b1564898b	fix: make error passthrough effective for non-failover upstream errors	2026-02-07 10:25:56 +08:00
shaw	39e05a2dad	feat: 新增全局错误透传规则功能支持管理员配置上游错误如何返回给客户端： - 新增 ErrorPassthroughRule 数据模型和 Ent Schema - 实现规则的 CRUD API（/admin/error-passthrough-rules） - 支持按错误码、关键词匹配，支持 any/all 匹配模式 - 支持按平台过滤（anthropic/openai/gemini/antigravity） - 支持透传或自定义响应状态码和错误消息 - 实现两级缓存（Redis + 本地内存）和多实例同步 - 集成到 gateway_handler 的错误处理流程 - 新增前端管理界面组件 - 新增单元测试覆盖核心匹配逻辑优化： - 移除 refreshLocalCache 中的冗余排序（数据库已排序） - 后端 Validate() 增加匹配条件非空校验	2026-02-05 21:52:54 +08:00
song	2220fd18ca	merge upstream main	2026-02-03 15:36:17 +08:00
ianshaw	03e94f9f53	fix(gemini): 为 Gemini 工具调用添加 thoughtSignature 避免 INVALID_ARGUMENT 错误	2026-02-03 06:01:29 +08:00
song	0170d19fa7	merge upstream main	2026-02-02 22:13:50 +08:00
Wesley Liddick	cc7e67b01a	Merge pull request #445 from touwaeriol/fix/gemini-cache-token-billing fix(billing): 修复 Gemini 接口缓存 token 统计	2026-02-02 15:22:46 +08:00
liuxiongfeng	4bfeeecb05	fix(billing): 修复 Gemini 接口缓存 token 统计 extractGeminiUsage 函数未提取 cachedContentTokenCount，导致计费时缓存读取 token 始终为 0。修复： - 提取 usageMetadata.cachedContentTokenCount - 设置 CacheReadInputTokens 字段 - InputTokens 减去缓存 token（与 response_transformer 逻辑一致）	2026-02-02 14:01:17 +08:00
song	7ade9baa15	fix(gateway): 过滤 Gemini 请求中 parts 为空的消息 Gemini API 不接受 contents 数组中 parts 为空的消息，会返回 400 INVALID_ARGUMENT 错误。添加 filterEmptyPartsFromGeminiRequest 函数在转发前过滤这类消息。影响范围：ForwardGemini (antigravity) 和 ForwardNative (gemini)	2026-01-29 21:09:33 +08:00
song	7cea6b6fc9	feat(gemini): 为 Gemini 原生平台添加图片计费支持对齐 Antigravity 平台的图片计费逻辑： - 添加 extractImageSize() 方法提取图片尺寸 - Forward() 和 ForwardNative() 返回 ImageCount/ImageSize - 支持分组自定义图片价格和倍率	2026-01-27 00:33:48 +08:00
song	0059a232a6	feat(gemini): 为 Gemini 原生平台添加图片计费支持对齐 Antigravity 平台的图片计费逻辑： - 添加 extractImageSize() 方法提取图片尺寸 - Forward() 和 ForwardNative() 返回 ImageCount/ImageSize - 支持分组自定义图片价格和倍率	2026-01-26 20:51:40 +08:00
lynoot	909b8a8f9c	fix(gateway): aggregate all text chunks in non-streaming Gemini responses Previously, collectGeminiSSE() only returned the last chunk received from the upstream streaming response when converting to non-streaming. This caused incomplete responses where only the final text fragment was returned to clients. For example, a request asking to "count from 1 to 10" would only return "\n" (the last chunk) instead of "1\n2\n3\n...\n10\n". This was especially problematic for JSON structured output where the opening brace "{" from the first chunk was lost, resulting in invalid JSON like: colors": ["red", "blue"]} The fix: - Collect all text parts from each SSE chunk into a slice - Merge all collected text parts into the final response - Reuse the same pattern as handleGeminiStreamToNonStreaming in antigravity_gateway_service.go Fixes: non-streaming responses returning incomplete text Fixes: structured output (JSON schema) returning invalid JSON	2026-01-23 13:54:09 +00:00
yangjianbo	91f01309da	fix(调度): 完善粘性会话清理与账号调度刷新 - Update/BulkUpdate 按不可调度字段触发缓存刷新 - GatewayCache 支持多前缀会话键清理 - 模型路由与混合调度优化粘性会话处理 - 补充调度与缓存相关测试覆盖	2026-01-20 11:40:55 +08:00
IanShaw027	63711067e6	refactor(ops): 完善gateway服务ops集成	2026-01-14 14:30:00 +08:00
IanShaw027	060699c3b8	refactor(ops): 更新gateway服务集成ops功能	2026-01-14 12:40:49 +08:00
yangjianbo	3141aa5144	feat(scheduler): 引入调度快照缓存与 outbox 回放 - 调度热路径优先读 Redis 快照，保留分组排序语义 - outbox 回放 + 全量重建纠偏，失败重试不推进水位 - 自动 Atlas 基线对齐并同步调度配置示例	2026-01-12 14:19:06 +08:00
IanShaw027	7ebca553ef	feat(ops): 实现上游错误事件记录与查询功能新增功能: - 新建ops_upstream_error_events表存储上游服务错误详情 - 支持记录上游429/529/5xx错误的详细上下文信息 - 提供按时间范围查询上游错误事件的API 后端改动: 1. 模型层（ops_models.go, ops_port.go）: - 新增UpstreamErrorEvent结构体 - 扩展Repository接口支持上游错误事件CRUD 2. 仓储层（ops_repo.go）: - 实现InsertUpstreamErrorEvent写入上游错误 - 实现GetUpstreamErrorEvents按时间范围查询 3. 服务层（ops_service.go, ops_upstream_context.go）: - ops_service: 新增GetUpstreamErrorEvents查询方法 - ops_upstream_context: 封装上游错误上下文构建逻辑 4. Handler层（ops_error_logger.go）: - 新增GetUpstreamErrorsHandler处理上游错误查询请求 5. Gateway层集成: - antigravity_gateway_service.go: 429/529错误时记录上游事件 - gateway_service.go: OpenAI 429/5xx错误时记录 - gemini_messages_compat_service.go: Gemini 429/5xx错误时记录 - openai_gateway_service.go: OpenAI 429/5xx错误时记录 - ratelimit_service.go: 429限流错误时记录数据记录字段: - request_id: 关联ops_logs主记录 - platform/model: 上游服务标识 - status_code/error_message: 错误详情 - request_headers/response_body: 调试信息（可选） - created_at: 错误发生时间	2026-01-11 15:30:27 +08:00
yangjianbo	297f08c683	Merge branch 'test' into dev	2026-01-10 09:39:02 +08:00
yangjianbo	2597fe78ba	fix(分组): 防止降级环并校验上下文分组 - 增加降级链路环检测并拦截配置 - 仅复用合法分组上下文并必要时回退查询 - 标注 GetByIDLite 轻量语义并补充测试	2026-01-10 07:56:50 +08:00
yangjianbo	675543240e	perf(网关): 复用分组上下文减少热路径查询新增 GetByIDLite 并在网关与 Gemini 选择流程复用上下文 group，避免 COUNT 触发更新 API key 中间件注入 group 上下文，减少重复查库补充 gateway/gemini 中间件与仓库层回归测试测试: make test	2026-01-09 23:01:42 +08:00
Song Siyu	7d1fe818be	feat: antigravity 配额域限流 + SSE 上限 (#222 ) * fix: 添加 gemini-3-flash 前缀映射支持 gemini-3-flash-preview * feat(antigravity): 增强请求参数和注入 Antigravity 身份 system prompt * feat: antigravity 配额域限流 * chore: 调整 SSE 单行上限到 25MB * chore: 提升 SSE 单行上限到 40MB	2026-01-09 22:00:14 +08:00
Edric Li	a42105881f	feat(groups): add Claude Code client restriction and session isolation - Add claude_code_only field to restrict groups to Claude Code clients only - Add fallback_group_id for non-Claude Code requests to use alternate group - Implement ClaudeCodeValidator for User-Agent detection - Add group-level session binding isolation (groupID in Redis key) - Prevent cross-group sticky session pollution - Update frontend with Claude Code restriction controls	2026-01-08 23:07:00 +08:00
yangjianbo	fb313356f7	Merge branch 'main' into test-dev	2026-01-05 14:43:08 +08:00
yangjianbo	048ed061c2	fix(安全): 关闭白名单时保留最小校验与默认白名单实现 allow_insecure_http 并在关闭校验时执行最小格式验证 - 关闭 allowlist 时要求 URL 可解析且 scheme 合规 - 响应头过滤关闭时使用默认白名单策略 - 更新相关文档、示例与测试覆盖	2026-01-05 14:41:08 +08:00
yangjianbo	794a9f969b	feat(安全): 添加安全开关并完善测试流程实现安全开关默认关闭与响应头透传逻辑 - URL 校验与响应头过滤支持开关并覆盖流式路径 - 非流式 Content-Type 透传/默认值按配置生效 - 接入 go test、golangci-lint 与前端 lint/typecheck - 补充相关测试与配置/文档说明	2026-01-05 13:54:43 +08:00
IanShaw027	aa6f253374	merge: 合并 upstream/main 并解决冲突解决了以下文件的冲突： - backend/internal/handler/admin/setting_handler.go - 采用 upstream 的字段对齐风格和 Configured 字段名 - 添加 EnableIdentityPatch 和 IdentityPatchPrompt 字段 - backend/internal/handler/gateway_handler.go - 采用 upstream 的 billingErrorDetails 错误处理方式 - frontend/src/api/admin/settings.ts - 采用 upstream 的 _configured 字段名 - 添加 enable_identity_patch 和 identity_patch_prompt 字段 - frontend/src/views/admin/SettingsView.vue - 合并 turnstile_secret_key_configured 字段 - 保留 enable_identity_patch 和 identity_patch_prompt 字段	2026-01-04 23:17:15 +08:00

1 2

79 Commits