Devforth
Cost-performance analysis

Which Model Is Best for Coding?

A cost-performance leaderboard of the models that coding agents run on: benchmark scores, real token-mix pricing, latency, and decoding speed.

Models used by coding agents

  • OpenAI
    GPT-5.5 OpenAI
    #1
    DF Score 89.3
    Price / 1M out $216
    Latency 118.46s
    Context window 1.05M
  • Google
    Gemini 3.5 Flash Google
    #2
    DF Score 85.0
    Price / 1M out $62.0
    Latency 20.59s
    Context window 1M
  • Anthropic
    Claude Fable 5 (with fallback) Anthropic
    #3
    DF Score 80.3
    Price / 1M out $343
    Latency 239.01s
    Context window 1M
  • OpenAI
    GPT-5.4 OpenAI
    #4
    DF Score 79.2
    Price / 1M out $85.5
    Latency 113.80s
    Context window 1M
  • Google
    Gemini 3.1 Pro (Preview) Google
    #5
    DF Score 77.4
    Price / 1M out $73.6
    Latency 25.74s
    Context window 1M
  • Anthropic
    Claude Opus 4.7 Anthropic
    #6
    DF Score 74.5
    Price / 1M out $140
    Latency
    Context window 1M
  • Anthropic
    Claude Sonnet 5 Anthropic
    #7
    DF Score 71.3
    Price / 1M out $115
    Latency 144.37s
    Context window 1M
  • Anthropic
    Claude Opus 4.8 Anthropic
    #8
    DF Score 69.6
    Price / 1M out $136
    Latency 30.05s
    Context window 1M
  • OpenAI
    GPT-5.4 mini OpenAI
    #9
    DF Score 63.0
    Price / 1M out $16.0
    Latency 11.87s
    Context window 400k
  • OpenAI
    GPT-5.3 Codex OpenAI
    #10
    DF Score 62.4
    Price / 1M out $70.5
    Latency 79.89s
    Context window 400k
  • Google
    Gemini 3 Flash (Preview) Google
    #11
    DF Score 57.2
    Price / 1M out $17.5
    Latency 7.60s
    Context window 1M
  • OpenAI
    GPT-5.4 nano OpenAI
    #12
    DF Score 55.8
    Price / 1M out $7.71
    Latency 4.74s
    Context window 400k
  • Anthropic
    Claude Sonnet 4.6 Anthropic
    #13
    DF Score 53.4
    Price / 1M out $98.1
    Latency 1.16s
    Context window 1M
  • Moonshot AI
    Kimi-K2.7-Code Moonshot AI
    #14
    DF Score 50.0
    Price / 1M out $41.8
    Latency 2.92s
    Context window 256k
  • Anthropic
    Claude Opus 4.6 Anthropic
    #15
    DF Score 45.7
    Price / 1M out $140
    Latency
    Context window 1M
  • Anthropic
    Claude Opus 4.5 Anthropic
    #16
    DF Score 37.5
    Price / 1M out $140
    Latency 15.04s
    Context window 200k
  • Anthropic
    Claude Sonnet 4.5 Anthropic
    #17
    DF Score 34.7
    Price / 1M out $83.8
    Latency 1.36s
    Context window 1M
  • Anthropic
    Claude Haiku 4.5 Anthropic
    #18
    DF Score 30.5
    Price / 1M out $31.8
    Latency 0.81s
    Context window 200k
  • OpenAI
    GPT-5 mini OpenAI
    #19
    DF Score 26.9
    Price / 1M out $10.1
    Latency 85.30s
    Context window 400k
  • OpenAI
    GPT-OSS 120B OpenAI
    #20
    DF Score 16.2
    Price / 1M out $31.5
    Latency 0.94s
    Context window 131k
  • Google
    Gemini 2.5 Pro Google
    #21
    DF Score 12.5
    Price / 1M out $48.5
    Latency 22.54s
    Context window 1M
  • OpenAI
    GPT-5.3 Codex Spark OpenAI
    #22
    DF Score
    Price / 1M out
    Latency
    Context window
  • Anthropic
    Claude Opus 4.8 Fast Anthropic
    #23
    DF Score
    Price / 1M out $280
    Latency
    Context window 1M
  • Microsoft
    MAI-Code-1-Flash Microsoft
    #24
    DF Score
    Price / 1M out $22.0
    Latency
    Context window
  • GitHub
    Raptor mini GitHub
    #25
    DF Score
    Price / 1M out $7.85
    Latency
    Context window 400k
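
To compare rows on cost rather than on rank alone, a rough sketch like the one below can help. It takes five rows copied from the list above and computes two things: DF Score points per dollar of output price, and the set of models that no other model beats on both score and price. The names `Entry`, `score_per_dollar`, `rank_by_value`, and `pareto_front` are hypothetical, and the ratio is only an illustrative metric; it is not how the DF Score or the ranking on this page is calculated.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Entry:
    model: str
    df_score: float
    price_usd_per_1m_out: float  # "Price / 1M out" column, in USD


# Five rows copied from the leaderboard above (illustrative subset).
ENTRIES = [
    Entry("GPT-5.5", 89.3, 216.0),
    Entry("Gemini 3.5 Flash", 85.0, 62.0),
    Entry("Claude Fable 5 (with fallback)", 80.3, 343.0),
    Entry("GPT-5.4 mini", 63.0, 16.0),
    Entry("GPT-5.4 nano", 55.8, 7.71),
]


def score_per_dollar(entry: Entry) -> float:
    """DF Score points per USD of output price (assumed metric, not the site's)."""
    return entry.df_score / entry.price_usd_per_1m_out


def rank_by_value(entries: list[Entry]) -> list[Entry]:
    """Sort entries from best to worst score-per-dollar."""
    return sorted(entries, key=score_per_dollar, reverse=True)


def pareto_front(entries: list[Entry]) -> list[Entry]:
    """Keep entries that no other entry matches or beats on both score and price."""
    front = []
    for e in entries:
        dominated = any(
            o.df_score >= e.df_score
            and o.price_usd_per_1m_out <= e.price_usd_per_1m_out
            and (
                o.df_score > e.df_score
                or o.price_usd_per_1m_out < e.price_usd_per_1m_out
            )
            for o in entries
        )
        if not dominated:
            front.append(e)
    return front


if __name__ == "__main__":
    for e in rank_by_value(ENTRIES):
        print(f"{e.model}: {score_per_dollar(e):.2f} points per $")
    print("Pareto front:", [e.model for e in pareto_front(ENTRIES)])
```

On this subset the cheapest models lead on points per dollar, while the Pareto front drops only the rows that are both more expensive and lower-scoring than some alternative. Latency and context window are ignored here and would need their own trade-off.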

Help us detect changes in plan values

The prices and token mixes above are derived from real coding usage. Share yours via letmecode so that we can detect changes in plan values.

$ npx letmecode@latest

FAQ