Kilo Code Model Selection Guide

Last updated: September 3, 2025.

The AI model landscape evolves rapidly, so this guide focuses on what's delivering excellent results with Kilo Code right now. We update this regularly as new models emerge and performance shifts.

Kilo Code Top Performers

Model	Context Window	SWE-Bench Verified	Human Eval	LiveCodeBench	Input Price*	Output Price*	Best For
GPT-5	400K tokens	74.9%	96.3%	68.2%	$1.25	$10	Latest capabilities, multi-modal coding
Claude Sonnet 4	1M tokens	72.7%	94.8%	65.9%	$3-6	$15-22.50	Enterprise code generation, complex systems
Grok Code Fast 1	256K tokens	70.8%	92.1%	63.4%	$0.20	$1.50	Rapid development, cost-performance balance
Qwen3 Coder	256K tokens	68.4%	91.7%	61.8%	$0.20	$0.80	Pure coding tasks, rapid prototyping
Gemini 2.5 Pro	1M+ tokens	67.2%	89.9%	59.3%	TBD	TBD	Massive codebases, architectural planning

*Per million tokens
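Because prices are quoted per million tokens, the cost of a single request follows from simple arithmetic. The sketch below is illustrative only: the function name `estimate_cost` and the 200K-input / 20K-output workload are assumptions, and the prices are copied from the table above.

```python
def estimate_cost(input_tokens, output_tokens, input_price, output_price):
    """Return the USD cost of one request, given prices per million tokens."""
    input_cost = (input_tokens / 1_000_000) * input_price
    output_cost = (output_tokens / 1_000_000) * output_price
    return input_cost + output_cost


# Prices from the table above, as (input, output) in USD per million tokens.
PRICES = {
    "GPT-5": (1.25, 10.00),
    "Grok Code Fast 1": (0.20, 1.50),
    "Qwen3 Coder": (0.20, 0.80),
}

# Hypothetical workload: 200K input tokens and 20K output tokens per request.
for model, (input_price, output_price) in PRICES.items():
    cost = estimate_cost(200_000, 20_000, input_price, output_price)
    print(f"{model}: ${cost:.3f}")
```

For this assumed workload, GPT-5 comes to about $0.45 per request, while the two lower-priced models stay under $0.10.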

Budget-Conscious Options

Model	Context Window	SWE-Bench Verified	Human Eval	LiveCodeBench	Input Price*	Output Price*	Notes
DeepSeek V3	128K tokens	64.1%	87.3%	56.7%	$0.14	$0.28	Exceptional value for daily coding
DeepSeek R1	128K tokens	62.8%	85.9%	54.2%	$0.55	$2.19	Advanced reasoning at budget prices
Qwen3 32B	128K tokens	60.3%	83.4%	52.1%	Varies	Varies	Open source flexibility
Z AI GLM 4.5	128K tokens	58.7%	81.2%	49.8%	TBD	TBD	MIT license, hybrid reasoning system

*Per million tokens

Comprehensive Evaluation Framework

Latency Performance

Response times significantly impact development flow and productivity:

Ultra-Fast (< 2s): Grok Code Fast 1, Qwen3 Coder
Fast (2-4s): DeepSeek V3, GPT-5
Moderate (4-8s): Claude Sonnet 4, DeepSeek R1
Slower (8-15s): Gemini 2.5 Pro, Z AI GLM 4.5

Impact on Development: Ultra-fast models enable real-time coding assistance and immediate feedback loops. Models with 8+ second latency can disrupt flow state but may be acceptable for complex architectural decisions.
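If you measure response times yourself, the tiers above can be applied with a small helper. This is a minimal sketch: the function name `latency_tier` is hypothetical, and treating each lower bound as inclusive is an assumption, since the listed ranges share their endpoints.

```python
def latency_tier(seconds: float) -> str:
    """Map a measured response time (in seconds) to the tiers listed above."""
    if seconds < 0:
        raise ValueError("latency cannot be negative")
    if seconds < 2:
        return "Ultra-Fast"
    if seconds < 4:
        return "Fast"
    if seconds < 8:
        return "Moderate"
    if seconds <= 15:
        return "Slower"
    # The guide does not name a tier beyond 15 seconds.
    return "Outside listed tiers"


# Hypothetical measurements, in seconds.
for measured in (1.2, 3.5, 6.0, 12.0):
    print(f"{measured}s -> {latency_tier(measured)}")
```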

Throughput Analysis

Token generation rates affect large codebase processing:

High Throughput (150+ tokens/s): GPT-5, Grok Code Fast 1
Medium Throughput (100-150 tokens/s): Claude Sonnet 4, Qwen3 Coder
Standard Throughput (50-100 tokens/s): DeepSeek models, Gemini 2.5 Pro
Variable Throughput: Open source models depend on infrastructure

Scaling Factors: High throughput models excel when generating extensive documentation, refactoring large files, or batch processing multiple components.
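To see what these rates mean in practice, divide the expected output size by the generation rate. The sketch below assumes a hypothetical 3,000-token response and uses the lower bound of each tier; `generation_seconds` is an illustrative name, and the result ignores the initial latency before the first token arrives.

```python
def generation_seconds(output_tokens: int, tokens_per_second: float) -> float:
    """Estimate how long it takes to stream a response of the given size."""
    if tokens_per_second <= 0:
        raise ValueError("tokens_per_second must be positive")
    return output_tokens / tokens_per_second


# Lower bound of each throughput tier listed above, in tokens per second.
TIER_RATES = {
    "High Throughput": 150,
    "Medium Throughput": 100,
    "Standard Throughput": 50,
}

# Hypothetical response size: 3,000 output tokens (for example, a large refactor).
for tier, rate in TIER_RATES.items():
    print(f"{tier}: {generation_seconds(3_000, rate):.0f}s")
```

Under these assumptions the same response takes 20 seconds at 150 tokens/s and 60 seconds at 50 tokens/s, which is why throughput matters most for large outputs.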

Reliability & Availability

Enterprise considerations for production environments:

Enterprise Grade (99.9%+ uptime): Claude Sonnet 4, GPT-5, Gemini 2.5 Pro
Production Ready (99%+ uptime): Qwen3 Coder, Grok Code Fast 1
Developing Reliability: DeepSeek models, Z AI GLM 4.5
Self-Hosted: Qwen3 32B (reliability depends on your infrastructure)

Success Rates: Enterprise models maintain consistent output quality and handle edge cases more gracefully, while budget options may require additional validation steps.
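The uptime percentages above can be converted into a yearly downtime budget, which makes the gap between tiers easier to compare. This is a back-of-the-envelope sketch: `downtime_hours_per_year` is an illustrative name, and a year is assumed to have 8,760 hours (365 days).

```python
HOURS_PER_YEAR = 365 * 24  # 8,760 hours, ignoring leap years


def downtime_hours_per_year(uptime_percent: float) -> float:
    """Return the maximum hours of downtime per year allowed by an uptime level."""
    if not 0 <= uptime_percent <= 100:
        raise ValueError("uptime_percent must be between 0 and 100")
    return (100 - uptime_percent) / 100 * HOURS_PER_YEAR


for uptime in (99.9, 99.0):
    hours = downtime_hours_per_year(uptime)
    print(f"{uptime}% uptime allows about {hours:.2f} hours of downtime per year")
```

At 99.9% the budget is roughly 8.76 hours per year; at 99% it grows tenfold to roughly 87.6 hours.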

Context Window Strategy

Optimizing for different project scales:

Size	Word Count	Typical Use Case	Recommended Models	Strategy
32K tokens	~24,000 words	Individual components, scripts	DeepSeek V3, Qwen3 Coder	Focus on single-file optimization
128K tokens	~96,000 words	Standard applications, most projects	All budget models, Grok Code Fast 1	Multi-file context, moderate complexity
256K tokens	~192,000 words	Large applications, multiple services	Qwen3 Coder, Grok Code Fast 1	Full feature context, service integration
400K+ tokens	~300,000+ words	Enterprise systems, full stack apps	GPT-5, Claude Sonnet 4, Gemini 2.5 Pro	Architectural overview, system-wide refactoring

Performance Degradation: Model effectiveness typically drops significantly beyond 400-500K tokens, regardless of advertised limits. Plan context usage accordingly.
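The table above implies a ratio of roughly 0.75 words per token (32K tokens is about 24,000 words). The sketch below uses that ratio to estimate token counts and pick the smallest listed tier that fits. The names `estimate_tokens` and `smallest_tier` are hypothetical, and real token counts depend on the model's tokenizer and on how much of the content is code.

```python
import math

WORDS_PER_TOKEN = 0.75  # ratio implied by the table above; an approximation
TIERS = [32_000, 128_000, 256_000, 400_000]  # context sizes from the table


def estimate_tokens(word_count: int) -> int:
    """Approximate the token count for a given number of words."""
    return math.ceil(word_count / WORDS_PER_TOKEN)


def smallest_tier(word_count: int):
    """Return the smallest listed context size that fits, or None if none does."""
    tokens = estimate_tokens(word_count)
    for tier in TIERS:
        if tokens <= tier:
            return tier
    return None  # larger than 400K tokens: consider splitting the context


# Hypothetical project sizes, in words.
for words in (20_000, 150_000, 400_000):
    print(f"{words} words -> ~{estimate_tokens(words)} tokens, tier: {smallest_tier(words)}")
```

A result of `None` means the estimate exceeds 400K tokens, the range where the guide warns that effectiveness tends to drop.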

Community Choice

The AI model landscape changes quickly. To stay up to date, 👉 check the Kilo Code Community Favorites on OpenRouter.
