Models

Model	Multiplier in Subscription	Context Window	Original Pay-as-you-go Price (input / output, per 1M tokens)	Discounted Pay-as-you-go Price (input / output, per 1M tokens)	Discount
claude-haiku-4-5-20251001	1x	200K tokens	$1.00 / $5.00	$0.20 / $1.00	-80%
claude-sonnet-4-5 (no thinking)	1x	200K tokens	$3.00 / $15.00	$0.60 / $3.00	-80%
claude-sonnet-4-5-20250929	1x	200K tokens	$3.00 / $15.00	$0.60 / $3.00	-80%
claude-opus-4-5-20251101	2x	200K tokens	$5.00 / $25.00	$1.00 / $5.00	-80%
gemini-3-pro	0.8x	1M tokens	$2.00 / $12.00	$0.40 / $2.40	-80%
gemini-3-flash	0.5x	1M tokens	$0.50 / $3.00	$0.10 / $0.60	-80%
gpt-5-mini	0x (free)	128K tokens	-	-	-
glm-4.6	0x (free)	200K tokens	$0.55 / $2.00	$0.11 / $0.40	-80%
glm-4.7	0x (free)	200K tokens	$0.60 / $2.20	$0.12 / $0.44	-80%
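
As a rough illustration of how the per-1M-token prices translate into the cost of a single request, here is a minimal Python sketch. It assumes each price pair in the table is (input price, output price); the function name `estimate_cost` and the token counts are hypothetical and not part of any VibeCodeCheap API.

```python
# Discounted pay-as-you-go prices in USD per 1M tokens, copied from the table.
# Assumption: each pair is (input price, output price).
DISCOUNTED_PRICES = {
    "claude-haiku-4-5-20251001": (0.20, 1.00),
    "claude-sonnet-4-5": (0.60, 3.00),
    "claude-sonnet-4-5-20250929": (0.60, 3.00),
    "claude-opus-4-5-20251101": (1.00, 5.00),
    "gemini-3-pro": (0.40, 2.40),
    "gemini-3-flash": (0.10, 0.60),
    "glm-4.6": (0.11, 0.40),
    "glm-4.7": (0.12, 0.44),
}


def estimate_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Return the estimated USD cost of one request (hypothetical helper)."""
    in_price, out_price = DISCOUNTED_PRICES[model]
    # Prices are quoted per 1M tokens, so scale the token counts accordingly.
    return (input_tokens * in_price + output_tokens * out_price) / 1_000_000


if __name__ == "__main__":
    # 200K input tokens and 50K output tokens on claude-sonnet-4-5.
    cost = estimate_cost("claude-sonnet-4-5", 200_000, 50_000)
    print(f"${cost:.4f}")  # prints "$0.2700"
```

At the original price of $3.00 / $15.00 the same request would cost $1.35, which matches the 80% discount shown in the table.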

Claude Haiku 4.5

Model ID: claude-haiku-4-5-20251001
Multiplier: 1x (base model)
Context Window: 200K tokens
Description: Balances performance and cost; suitable for most daily coding tasks.

Claude Sonnet 4.5 (no thinking)

Model ID: claude-sonnet-4-5
Multiplier: 1x (base model)
Context Window: 200K tokens
Description: Balances performance and cost; suitable for most daily coding tasks.

Claude Sonnet 4.5 Thinking

Model ID: claude-sonnet-4-5-20250929
Multiplier: 1x
Context Window: 200K tokens
Description: Sonnet version with extended thinking, which gives more accurate results on complex problems.

Claude Opus 4.5 Thinking

Model ID: claude-opus-4-5-20251101
Multiplier: 2x
Context Window: 200K tokens
Description: Opus version with extended thinking; the most powerful option for complex multi-step reasoning tasks.

Gemini 3 Pro

Model ID: gemini-3-pro
Multiplier: 0.8x
Context Window: 1M tokens
Description: Google's flagship model with a very large 1M-token context window, suitable for processing large codebases.

Gemini 3 Flash

Model ID: gemini-3-flash
Multiplier: 0.5x
Context Window: 1M tokens
Description: A fast, cost-efficient version of Gemini 3 that keeps the 1M-token context window.

GPT-5 Mini

Model ID: gpt-5-mini
Multiplier: 0x (free, with unlimited usage, when you subscribe to the Coding Plan at VibeCodeCheap)
Context Window: 128K tokens
Description: A recent OpenAI model that offers a good balance between performance and cost.

GLM 4.6

Model ID: glm-4.6
Multiplier: 0x (free, with unlimited usage, when you subscribe to the Coding Plan at VibeCodeCheap)
Context Window: 200K tokens
Description: GLM 4.6 model with good coding capabilities; free to use with the Coding Plan.

GLM 4.7

Model ID: glm-4.7
Multiplier: 0x (free, with unlimited usage, when you subscribe to the Coding Plan at VibeCodeCheap)
Context Window: 200K tokens
Description: The latest GLM model, with significantly improved performance; free to use with the Coding Plan.

Cost Saving

To save cost, choose gemini-3-flash (0.5x) or one of the free models (0x): gpt-5-mini, glm-4.6, or glm-4.7. Among the Claude models, claude-haiku-4-5-20251001 (1x) has the lowest pay-as-you-go price.

Choose claude-opus-4-5-20251101 (2x) or claude-opus-4-5 (1.8x) for complex tasks.

Choose claude-sonnet-4-5 (1x) or gemini-3-pro (0.8x) for daily tasks.

Choose gemini-3-pro or gemini-3-flash (1M-token context window) when working with large codebases.
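
The selection tips above can be approximated in code. The following is a minimal Python sketch, not an official tool: it takes the multipliers and context windows from the Models table, reads "200K", "128K", and "1M" as 200,000, 128,000, and 1,000,000 tokens, and assumes that a lower multiplier means lower subscription usage. The names `Model` and `cheapest_fitting` are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Model:
    model_id: str
    multiplier: float     # subscription multiplier from the Models table
    context_tokens: int   # context window, in tokens


# Values copied from the Models table (assumption: K = 1,000 and M = 1,000,000).
MODELS = [
    Model("claude-haiku-4-5-20251001", 1.0, 200_000),
    Model("claude-sonnet-4-5", 1.0, 200_000),
    Model("claude-sonnet-4-5-20250929", 1.0, 200_000),
    Model("claude-opus-4-5-20251101", 2.0, 200_000),
    Model("gemini-3-pro", 0.8, 1_000_000),
    Model("gemini-3-flash", 0.5, 1_000_000),
    Model("gpt-5-mini", 0.0, 128_000),
    Model("glm-4.6", 0.0, 200_000),
    Model("glm-4.7", 0.0, 200_000),
]


def cheapest_fitting(prompt_tokens: int) -> list[str]:
    """Return the lowest-multiplier model IDs whose context window fits the prompt."""
    fitting = [m for m in MODELS if m.context_tokens >= prompt_tokens]
    if not fitting:
        return []  # nothing in the table can hold a prompt this large
    lowest = min(m.multiplier for m in fitting)
    return [m.model_id for m in fitting if m.multiplier == lowest]


if __name__ == "__main__":
    print(cheapest_fitting(100_000))  # ['gpt-5-mini', 'glm-4.6', 'glm-4.7']
    print(cheapest_fitting(150_000))  # ['glm-4.6', 'glm-4.7']
    print(cheapest_fitting(500_000))  # ['gemini-3-flash']
```

This only ranks by multiplier and context size; task complexity, which the tips also weigh, still calls for a manual choice such as an Opus or Sonnet model.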