Models

The table below lists all available AI models on Coding Plan, including multiplier rates and context windows.
The multiplier is a pricing coefficient relative to the base (1x) model. For example, a model with a 1.8x multiplier consumes 1.8 times as many tokens as a 1x model for the same usage.
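
As a rough illustration of the multiplier arithmetic, the sketch below scales a raw token count by a model's multiplier. The function name `quota_tokens` is hypothetical, and the assumption that the multiplier applies uniformly to every token is ours; this page does not spell out the exact accounting.

```python
def quota_tokens(raw_tokens: int, multiplier: float) -> float:
    """Tokens counted against the plan, ASSUMING usage = raw tokens x multiplier."""
    if raw_tokens < 0 or multiplier < 0:
        raise ValueError("raw_tokens and multiplier must be non-negative")
    return raw_tokens * multiplier


if __name__ == "__main__":
    # Same 10,000-token request on a 1x model and on a 1.8x model.
    print(quota_tokens(10_000, 1.0))
    print(quota_tokens(10_000, 1.8))
    # A 0x (free) model counts nothing under this assumption.
    print(quota_tokens(10_000, 0.0))
```

Under this reading, a 0x model never reduces the plan's allowance, which matches the "free" label used for those models.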

Models Table

| Model | Multiplier in Subscription | Context Window | Original Price, Pay-as-you-go (per 1M tokens) | Discounted Price, Pay-as-you-go (per 1M tokens) | Discount |
|---|---|---|---|---|---|
| claude-haiku-4-5-20251001 | 1x | 200K tokens | 1.00 / 5.00 | 0.20 / 1.00 | -80% |
| claude-sonnet-4-5 (no thinking) | 1x | 200K tokens | 3.00 / 15.00 | 0.60 / 3.00 | -80% |
| claude-sonnet-4-5-20250929 | 1x | 200K tokens | 3.00 / 15.00 | 0.60 / 3.00 | -80% |
| claude-opus-4-5-20251101 | 2x | 200K tokens | 5.00 / 25.00 | 1.00 / 5.00 | -80% |
| gemini-3-pro | 0.8x | 1M tokens | 2.00 / 12.00 | 0.40 / 2.40 | -80% |
| gemini-3-flash | 0.5x | 1M tokens | 0.50 / 3.00 | 0.10 / 0.60 | -80% |
| gpt-5-mini | 0x (free) | 128K tokens | - | - | - |
| glm-4.6 | 0x (free) | 200K tokens | 0.55 / 2.00 | 0.11 / 0.40 | -80% |
| glm-4.7 | 0x (free) | 200K tokens | 0.60 / 2.20 | 0.12 / 0.44 | -80% |
Price format: Input / Output
gpt-5-mini, glm-4.6, and glm-4.7 are free to use, with unlimited usage, when you subscribe to Coding Plan at VibeCodeCheap.
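
To make the "Input / Output, per 1M tokens" price format concrete, here is a minimal sketch that estimates the pay-as-you-go cost of a single request and checks the listed discount. The dictionary holds a few discounted prices copied from the table; `request_cost` and `discount_percent` are hypothetical helper names, and amounts are in whatever currency the table uses, which is not stated here.

```python
from decimal import Decimal

# Discounted pay-as-you-go prices per 1M tokens as (input, output),
# copied from the table for a few models.
DISCOUNTED_PRICES = {
    "claude-haiku-4-5-20251001": (Decimal("0.20"), Decimal("1.00")),
    "claude-sonnet-4-5-20250929": (Decimal("0.60"), Decimal("3.00")),
    "gemini-3-flash": (Decimal("0.10"), Decimal("0.60")),
}

TOKENS_PER_PRICE_UNIT = Decimal(1_000_000)  # prices are quoted per 1M tokens


def request_cost(model: str, input_tokens: int, output_tokens: int) -> Decimal:
    """Cost of one request: input and output tokens are priced separately."""
    in_price, out_price = DISCOUNTED_PRICES[model]
    total = in_price * input_tokens + out_price * output_tokens
    return total / TOKENS_PER_PRICE_UNIT


def discount_percent(original: Decimal, discounted: Decimal) -> Decimal:
    """Relative price change in percent (negative means cheaper)."""
    return (discounted - original) / original * 100


if __name__ == "__main__":
    print(request_cost("claude-sonnet-4-5-20250929", 200_000, 10_000))
    print(discount_percent(Decimal("3.00"), Decimal("0.60")))
```

For example, 200,000 input tokens and 10,000 output tokens on claude-sonnet-4-5-20250929 come to 0.12 + 0.03 = 0.15 at the discounted rate.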

Model Details

  • Model ID: claude-haiku-4-5-20251001
  • Multiplier: 1x (base model)
  • Context Window: 200K tokens
  • Description: Balances performance and cost; suitable for most daily coding tasks.

  • Model ID: claude-sonnet-4-5
  • Multiplier: 1x (base model)
  • Context Window: 200K tokens
  • Description: Balances performance and cost; suitable for most daily coding tasks.

  • Model ID: claude-sonnet-4-5-20250929
  • Multiplier: 1x
  • Context Window: 200K tokens
  • Description: Sonnet version with extended thinking, providing more accurate results on complex problems.

  • Model ID: claude-opus-4-5-20251101
  • Multiplier: 2x
  • Context Window: 200K tokens
  • Description: Opus version with extended thinking; the most powerful option for complex multi-step reasoning tasks.

  • Model ID: gemini-3-pro
  • Multiplier: 0.8x
  • Context Window: 1M tokens
  • Description: Google's flagship model with a very large 1M-token context window, suitable for processing large codebases.

  • Model ID: gemini-3-flash
  • Multiplier: 0.5x
  • Context Window: 1M tokens
  • Description: Fast, cost-efficient version of Gemini 3 that keeps the 1M-token context window.

  • Model ID: gpt-5-mini
  • Multiplier: 0x (free to use, with unlimited usage, when you subscribe to Coding Plan at VibeCodeCheap)
  • Context Window: 128K tokens
  • Description: Latest model from OpenAI, with a good balance between performance and cost.

  • Model ID: glm-4.6
  • Multiplier: 0x (free to use, with unlimited usage, when you subscribe to Coding Plan at VibeCodeCheap)
  • Context Window: 200K tokens
  • Description: GLM 4.6 model with good coding capabilities, completely free.

  • Model ID: glm-4.7
  • Multiplier: 0x (free to use, with unlimited usage, when you subscribe to Coding Plan at VibeCodeCheap)
  • Context Window: 200K tokens
  • Description: Latest GLM model with significantly improved performance, completely free.

Choosing the Right Model

Cost Saving

Choose claude-haiku-4-5-20251001 (1x), gemini-3-flash (0.5x), or the free models gpt-5-mini, glm-4.6, and glm-4.7 (0x).

High Performance

Choose claude-opus-4-5-20251101 (2x) for complex tasks.

Balanced

Choose claude-sonnet-4-5 (1x) or gemini-3-pro (0.8x) for daily tasks.

Large Context

Choose gemini-3-pro or gemini-3-flash (1M tokens) when working with large codebases.
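
The guidance above can be approximated in code. The following sketch picks the lowest-multiplier model whose context window fits a given prompt size. The `Model` class and `cheapest_fitting` function are hypothetical names, the multipliers and context windows are taken from the models table, and reading "200K" as 200,000 tokens (and "1M" as 1,000,000) is an assumption.

```python
from dataclasses import dataclass
from typing import Optional, Sequence


@dataclass(frozen=True)
class Model:
    model_id: str
    multiplier: float
    context_tokens: int  # ASSUMPTION: "K" = 1,000 and "M" = 1,000,000 tokens


# Values taken from the models table on this page.
CATALOG = [
    Model("claude-haiku-4-5-20251001", 1.0, 200_000),
    Model("claude-sonnet-4-5-20250929", 1.0, 200_000),
    Model("claude-opus-4-5-20251101", 2.0, 200_000),
    Model("gemini-3-pro", 0.8, 1_000_000),
    Model("gemini-3-flash", 0.5, 1_000_000),
    Model("gpt-5-mini", 0.0, 128_000),
    Model("glm-4.6", 0.0, 200_000),
    Model("glm-4.7", 0.0, 200_000),
]


def cheapest_fitting(prompt_tokens: int,
                     catalog: Sequence[Model] = CATALOG) -> Optional[Model]:
    """Return the lowest-multiplier model whose context window fits the prompt.

    Ties on multiplier are broken in favour of the larger context window,
    then by catalog order. Returns None when nothing fits.
    """
    fitting = [m for m in catalog if m.context_tokens >= prompt_tokens]
    if not fitting:
        return None
    return min(fitting, key=lambda m: (m.multiplier, -m.context_tokens))


if __name__ == "__main__":
    for size in (50_000, 500_000, 2_000_000):
        choice = cheapest_fitting(size)
        print(size, choice.model_id if choice else "no model fits")
```

This only optimizes for multiplier and context size; it deliberately ignores model quality, so a caller who needs stronger reasoning would filter the catalog first.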