Go to file

edxeth 6151888df5 fix: stabilize thinking streams, multimodal parsing, and token accounting (#20 )

* fix: stabilize multimodal image compatibility across OpenCode flows

Advertise vision-capable metadata in /v1/models and make model matching deterministic so OpenCode does not downgrade image support or route 4.6 models incorrectly. Expand request translation to accept OpenCode/OpenAI attachment shapes, sanitize [Image N] placeholders safely, keep image-only follow-up turns non-empty, and improve token accounting so base64 image bytes no longer inflate prompt token usage and trigger premature compaction.

* fix: deduplicate thinking streams and trim injected prompt noise

* fix: align /v1/messages thinking blocks and message_start usage

* fix: reduce repetitive thinking across tool turns

Select a single reasoning stream source, prevent chunk replay, and preserve structured tool-loop context so the model keeps continuity instead of re-planning each turn.

* fix: unify token counting on existing API endpoints

Compute usage deterministically on /v1/messages and /v1/chat/completions even when upstream omits tokenUsage.

- remove roo-only token path and keep behavior on existing endpoints
- add proxy/token_estimator.go with shared Claude/OpenAI estimators (input/system/messages/tools + output/thinking/tool calls)
- wire stream/non-stream handlers to use estimator-derived input/output usage
- update /v1/messages/count_tokens to reuse the same estimator
- keep robust upstream usage parsing/normalization in proxy/kiro.go while dropping parser-level estimate fallback

Why: direct upstream tests show metering/context events frequently arrive without tokenUsage in this environment; this made usage zero or inconsistent. Local deterministic accounting keeps reported usage stable and explicit.

2026-02-23 20:33:53 +08:00

.github/workflows

ci: add latest tag for default branch

2026-02-04 01:26:56 +08:00

auth

feat: add admin logout, 72h session expiry, /v1/stats endpoint, and UI fixes

2026-02-08 19:23:00 +08:00

config

chore: bump version to 1.0.2

2026-02-10 12:36:23 +08:00

pool

feat: Kiro API Proxy - OpenAI/Anthropic compatible API service

2026-02-04 00:37:05 +08:00

proxy

fix: stabilize thinking streams, multimodal parsing, and token accounting (#20 )

2026-02-23 20:33:53 +08:00

web

feat: Extract and export client credentials securely (#17 )

2026-02-19 20:07:27 +08:00

docker-compose.yml

docs: update and fix README errors

2026-02-04 02:15:43 +08:00

Dockerfile

feat: Kiro API Proxy - OpenAI/Anthropic compatible API service

2026-02-04 00:37:05 +08:00

go.mod

feat: Kiro API Proxy - OpenAI/Anthropic compatible API service

2026-02-04 00:37:05 +08:00

go.sum

feat: Kiro API Proxy - OpenAI/Anthropic compatible API service

2026-02-04 00:37:05 +08:00

main.go

feat: add admin logout, 72h session expiry, /v1/stats endpoint, and UI fixes

2026-02-08 19:23:00 +08:00

README_CN.md

feat: add admin logout, 72h session expiry, /v1/stats endpoint, and UI fixes

2026-02-08 19:23:00 +08:00

README.md

feat: add admin logout, 72h session expiry, /v1/stats endpoint, and UI fixes

2026-02-08 19:23:00 +08:00

version.json

chore: bump version to 1.0.2

2026-02-10 12:36:23 +08:00

README.md

Kiro-Go

Convert Kiro accounts to OpenAI / Anthropic compatible API service.

English | 中文

Features

🔄 Anthropic Claude API - Full support for /v1/messages endpoint
🤖 OpenAI Chat API - Compatible with /v1/chat/completions
⚖️ Multi-Account Pool - Round-robin load balancing
🔐 Auto Token Refresh - Seamless token management
📡 Streaming - Real-time SSE responses
🎛️ Web Admin Panel - Easy account management
🔑 Multiple Auth Methods - AWS Builder ID, IAM Identity Center (Enterprise SSO), SSO Token, Local Cache, Credentials
📊 Usage Tracking - Monitor requests, tokens, and credits
📦 Account Export/Import - Compatible with Kiro Account Manager format
🔄 Dynamic Model List - Auto-synced from Kiro API with caching
🔔 Version Update Check - Automatic new version notification
🌐 i18n - Chinese / English admin panel

Quick Start

Docker Compose (Recommended)

git clone https://github.com/Quorinex/Kiro-Go.git
cd Kiro-Go

# Create data directory for persistence
mkdir -p data

docker-compose up -d

Docker Run

# Create data directory
mkdir -p /path/to/data

docker run -d \
  --name kiro-go \
  -p 8080:8080 \
  -e ADMIN_PASSWORD=your_secure_password \
  -v /path/to/data:/app/data \
  --restart unless-stopped \
  ghcr.io/quorinex/kiro-go:latest

📁 The /app/data volume stores config.json with accounts and settings. Mount it for data persistence.

Build from Source

git clone https://github.com/Quorinex/Kiro-Go.git
cd Kiro-Go
go build -o kiro-go .
./kiro-go

Configuration

Config file is auto-created at data/config.json on first run:

{
  "password": "changeme",
  "port": 8080,
  "host": "127.0.0.1",
  "requireApiKey": false,
  "apiKey": "",
  "accounts": []
}

⚠️ Change the default password before production use!

Environment Variables

Variable	Description	Default
`CONFIG_PATH`	Config file path	`data/config.json`
`ADMIN_PASSWORD`	Admin panel password (overrides config)	-

Usage

1. Access Admin Panel

Open http://localhost:8080/admin and login with your password.

2. Add Accounts

Multiple methods available:

Method	Description
AWS Builder ID	Login with AWS Builder ID (personal accounts)
IAM Identity Center (Enterprise SSO)	Login with IAM Identity Center (enterprise accounts)
SSO Token	Import `x-amz-sso_authn` token from browser
Kiro Local Cache	Import from local Kiro IDE cache files
Credentials JSON	Import JSON from Kiro Account Manager

Credentials Format

{
  "refreshToken": "eyJ...",
  "accessToken": "eyJ...",
  "clientId": "xxx",
  "clientSecret": "xxx"
}

3. Call API

Claude API

curl http://localhost:8080/v1/messages \
  -H "Content-Type: application/json" \
  -H "anthropic-version: 2023-06-01" \
  -d '{
    "model": "claude-sonnet-4-20250514",
    "max_tokens": 1024,
    "messages": [{"role": "user", "content": "Hello!"}]
  }'

OpenAI API

curl http://localhost:8080/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer any" \
  -d '{
    "model": "gpt-4o",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'

Model Mapping

Request Model	Actual Model
`claude-sonnet-4-20250514`	claude-sonnet-4-20250514
`claude-sonnet-4.5`	claude-sonnet-4.5
`claude-haiku-4.5`	claude-haiku-4.5
`claude-opus-4.5`	claude-opus-4.5
`claude-opus-4.6`	claude-opus-4.6
`gpt-4o`, `gpt-4`	claude-sonnet-4-20250514
`gpt-3.5-turbo`	claude-sonnet-4-20250514

Thinking Mode

Enable extended thinking by adding a suffix to the model name (default: -thinking).

Usage

# OpenAI API with thinking
curl http://localhost:8080/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
    "model": "claude-sonnet-4.5-thinking",
    "messages": [{"role": "user", "content": "Solve this step by step: 15 * 23"}],
    "stream": true
  }'

# Claude API with thinking
curl http://localhost:8080/v1/messages \
  -H "Content-Type: application/json" \
  -H "anthropic-version: 2023-06-01" \
  -d '{
    "model": "claude-sonnet-4.5-thinking",
    "max_tokens": 4096,
    "messages": [{"role": "user", "content": "Analyze this problem"}]
  }'

Configuration

Configure thinking mode in the Admin Panel under Settings > Thinking Mode Settings:

Setting	Description	Options
Trigger Suffix	Model name suffix to enable thinking	Default: `-thinking` (customizable, e.g., `-think`, `-reason`)
OpenAI Output Format	How thinking content is returned in OpenAI API	`reasoning_content` (DeepSeek compatible), `<thinking>` tag, `<think>` tag
Claude Output Format	How thinking content is returned in Claude API	`<thinking>` tag (default), `<think>` tag, plain text

Output Formats

OpenAI API (/v1/chat/completions):

reasoning_content - Thinking in separate reasoning_content field (DeepSeek compatible)
thinking - Thinking wrapped in <thinking>...</thinking> tags in content
think - Thinking wrapped in <think>...</think> tags in content

Claude API (/v1/messages):

thinking - Thinking wrapped in <thinking>...</thinking> tags (default)
think - Thinking wrapped in <think>...</think> tags
reasoning_content - Plain text output

API Endpoints

Endpoint	Description
`GET /health`	Health check
`GET /v1/models`	List models
`GET /v1/stats`	Statistics
`POST /v1/messages`	Claude Messages API
`POST /v1/messages/count_tokens`	Token counting
`POST /v1/chat/completions`	OpenAI Chat API
`GET /admin`	Admin panel

Project Structure

Kiro-Go/
├── main.go              # Entry point
├── version.json         # Version info for update check
├── config/              # Configuration management
├── pool/                # Account pool & load balancing
├── proxy/               # API handlers & Kiro client
│   ├── handler.go       # HTTP routing & admin API
│   ├── kiro.go          # Kiro API client
│   ├── kiro_api.go      # Kiro REST API (usage, models)
│   └── translator.go    # Request/response conversion
├── auth/                # Authentication
│   ├── builderid.go     # AWS Builder ID login
│   ├── iam_sso.go       # IAM SSO login
│   ├── oidc.go          # OIDC token refresh
│   └── sso_token.go     # SSO token import
├── web/                 # Admin panel frontend
├── Dockerfile
└── docker-compose.yml

Disclaimer

This project is provided for educational and research purposes only.

This software is not affiliated with, endorsed by, or associated with Amazon, AWS, or Kiro in any way
Users are solely responsible for ensuring their use complies with all applicable terms of service and laws
The authors assume no liability for any misuse or violations arising from the use of this software
Use at your own risk

By using this software, you acknowledge that you have read and understood this disclaimer.

License

MIT