new-api/common
霍雨佳 eb75ff232f Refactor: Optimize the request rate limiting for ModelRequestRateLimitCount.
Reason: Steps 1 and 3 of the original redisRateLimitHandler method were not atomic, so the limit was enforced imprecisely under high concurrency. For example, with the rate limit set to 60, sending 200 concurrent requests resulted in none being blocked, whereas in theory about 140 should have been intercepted.
Solution: I chose not to merge steps 1 and 3 into a single Lua script, because one atomic operation that reads, writes, and deletes could become a performance bottleneck under high concurrency. Instead, I implemented a token bucket algorithm, which reduces the atomic operation to a read and a write and significantly decreases the memory footprint.
2025-04-16 10:33:43 +08:00
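The token bucket idea described in the commit message can be sketched in Go as follows. This is a minimal in-memory illustration, not the code from the commit: the commit keeps its state in Redis, whereas this sketch uses a mutex to stand in for the atomic read-and-write step. The names `TokenBucket`, `NewTokenBucket`, and `Allow`, and the millisecond timestamps passed in by the caller, are all assumptions made for the example.

```go
package main

import "sync"

// TokenBucket is a hypothetical in-memory illustration of the algorithm.
// It is not the commit's Redis-backed implementation.
type TokenBucket struct {
	mu       sync.Mutex
	capacity float64 // maximum tokens, i.e. the configured rate limit
	perMs    float64 // tokens refilled per millisecond
	tokens   float64 // tokens currently available
	lastMs   int64   // time of the last refill, in milliseconds
}

// NewTokenBucket returns a full bucket that admits at most `limit`
// requests in a burst and refills `limit` tokens every `windowMs`.
func NewTokenBucket(limit int, windowMs, nowMs int64) *TokenBucket {
	return &TokenBucket{
		capacity: float64(limit),
		perMs:    float64(limit) / float64(windowMs),
		tokens:   float64(limit),
		lastMs:   nowMs,
	}
}

// Allow refills the bucket for the elapsed time, then tries to take one
// token. The read (refill) and the write (take) happen under one lock,
// which plays the role of the single atomic step in this sketch.
func (b *TokenBucket) Allow(nowMs int64) bool {
	b.mu.Lock()
	defer b.mu.Unlock()

	if elapsed := nowMs - b.lastMs; elapsed > 0 {
		b.tokens += float64(elapsed) * b.perMs
		if b.tokens > b.capacity {
			b.tokens = b.capacity // never store more than the burst size
		}
		b.lastMs = nowMs
	}
	if b.tokens < 1 {
		return false // bucket is empty: reject the request
	}
	b.tokens--
	return true
}
```

With a limit of 60 per 60,000 ms, 200 calls to `Allow` at the same timestamp admit 60 requests and reject the other 140, which matches the figure given in the commit message. The bucket stores only a token count and a timestamp per key, rather than one entry per request, which is one plausible reading of the smaller memory footprint the commit mentions.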