eb75ff232fRefactor: Optimize request rate limiting for ModelRequestRateLimitCount. Reason: Steps 1 and 3 of the redisRateLimitHandler method were not executed atomically, so the limit was enforced imprecisely under high concurrency. For example, with a rate limit of 60, sending 200 concurrent requests blocked none of them, whereas about 140 should have been rejected. Solution: Rather than merging steps 1 and 3 into a single Lua script, where one atomic operation covering read, write, and delete could perform poorly under high concurrency, this change implements a token bucket algorithm. The atomic operation is reduced to a read and a write, and the memory footprint drops significantly.
霍雨佳
2025-04-16 10:33:43 +08:00
c4c1099ae5Merge pull request #944 from lamcodes/main
Calcium-Ion
2025-04-10 00:09:54 +08:00
c869455456fix: Update model ratios for gemini-2.5-pro
CaIon
2025-04-10 00:09:11 +08:00
f89d8a0fe5refactor: Remove duplicate model settings initialization in main function
CaIon
2025-04-10 00:07:34 +08:00
3d6d19903brefactor: Update localization keys for API address in English translations and adjust related UI labels
CaIon
2025-04-09 22:22:19 +08:00
c5f1a0c712Add support for grok-2-image. grok-2-image does not currently support the size, quality, or style parameters. Set 'size'='empty' to use grok-2-image.
datehoer
2025-04-09 15:05:00 +08:00
19935ee8acfeat: Enhance ConvertClaudeRequest method to set request model and handle vertex-specific request conversion
1808837298@qq.com
2025-03-17 17:13:33 +08:00
6fef5aaf22feat: Update RerankerInfo structure and modify GenRelayInfoRerank function to accept RerankRequest
1808837298@qq.com
2025-03-17 16:44:53 +08:00
b5aa3c129bMerge pull request #872 from neotf/main
Calcium-Ion
2025-03-17 16:18:11 +08:00
8c7c39550crefactor: Update ClaudeResponse error handling to use pointer for ClaudeError and improve nil checks in response processing
1808837298@qq.com
2025-03-16 23:14:45 +08:00
d1c62a583dfeat: support xinference rerank to jina format
1808837298@qq.com
2025-03-16 21:06:29 +08:00
53b3599827refactor: Enhance Claude response handling
1808837298@qq.com
2025-03-16 19:11:58 +08:00
b3b1c803fcfeat: Introduce JSON decoding utility functions and update error handling in Claude and OpenAI response structures
1808837298@qq.com
2025-03-16 18:34:39 +08:00
ee302c063crefactor: Enhance error handling in AWS and Claude response processing by updating function signatures and improving error propagation
1808837298@qq.com
2025-03-16 16:47:16 +08:00
5a67bdf1b0Merge pull request #851 from HynoR/main
Calcium-Ion
2025-03-16 16:31:48 +08:00
2c81a5f0ccrefactor: Streamline AWS and Claude response handling by consolidating logic and improving error management
1808837298@qq.com
2025-03-16 15:57:01 +08:00
b84b6affe9Merge pull request #874 from HynoR/feat/gemini2
Calcium-Ion
2025-03-15 19:44:37 +08:00
c183c1231crefactor: Replace direct access to ImageUrl with GetImageMedia method across multiple relay channels
1808837298@qq.com
2025-03-15 19:43:37 +08:00
54e738941dfeat: Add warning modal for base URL input and display warning banner for specific channel type in EditChannel component
1808837298@qq.com
2025-03-15 19:38:05 +08:00