A Note on Cline's Diff Evaluation Setup

Hey there, this note explains what we're doing with Cline's diff evaluation (evals) system. It's all about checking how well various AI models (which users connect to Cline via their own API keys), prompts, and diffing tools can handle file changes.

What We're Trying to Figure Out

The main idea here is to figure out which AI models (configured by users) are best at making replace_in_file tool calls that work correctly. This helps us understand model capabilities and also speeds up our own experiments with prompts and diffing algorithms to make Cline better over time. We want to know a few key things.

First, can the model create diffs, which are just sets of SEARCH and REPLACE blocks, that apply cleanly to a file? This is what we call diffEditSuccess.
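For a sense of what that looks like, here is an illustrative diff with a single SEARCH/REPLACE block (the marker syntax shown is an approximation of the format, not necessarily the exact one the tool uses):

    <<<<<<< SEARCH
    function greet(name) {
      return "Hello " + name
    }
    =======
    function greet(name: string): string {
      return `Hello ${name}`
    }
    >>>>>>> REPLACE

Roughly speaking, the SEARCH text has to line up with what's actually in the original file for the block to apply; if the match fails, the diff edit fails.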

Second, how do different LLMs, like Claude or Grok, stack up against each other when they try to make these diff edits? We use a standard set of real-world test cases for this.

Third, do different system prompts, say our basicSystemPrompt versus the claude4SystemPrompt, change how well a model does at diff editing?

Fourth, we're also looking at different ways to apply the diffs themselves. We have a few algorithms like constructNewFileContentV1, V2, and V3, and we want to see which ones are more robust when fed model-generated diffs.
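As a mental model (the real signatures live alongside the diff-apply code and may differ), each of these algorithms can be treated as a function with the same shape, which is what makes them swappable in the eval:

    // Hypothetical shape of a diff-apply algorithm; see the diff-apply folder for the real implementations.
    type ConstructNewFileContent = (
      diffContent: string,     // the raw SEARCH/REPLACE blocks produced by the model
      originalContent: string, // the file as it existed in the recorded session
      isFinal: boolean,        // whether the streamed diff content is complete
    ) => Promise<string>       // resolves to the new file content, or rejects if a SEARCH block can't be matched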

Fifth, we track how fast the model starts making an edit. The timeToFirstEditMs metric gives us a hint about how quickly a user would see changes happening in their editor.

And finally, we keep an eye on how many tokens are used and what it costs for each model and each try. This helps us compare how efficient they are.

Right now, these evals are mostly about whether the diff applies correctly. That means, do the SEARCH blocks find a match, and can the REPLACE blocks be put in without an error? We're not yet deeply analyzing if the change is valid code or matches what the user wanted semantically. That's a problem for another day, and will require a lot more scaffolding.

How We Run These Tests

Two prerequisites:

  1. Make sure you have an evals/.env file with OPENROUTER_API_KEY=<your-openrouter-key>

  2. Make sure you add an evals/diff-edits/cases folder with all the conversation JSONs before running anything.
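
For example, a minimal setup from the repo root might look like this (paths taken from the prerequisites above; adjust if your checkout differs):

    echo "OPENROUTER_API_KEY=<your-openrouter-key>" > evals/.env
    mkdir -p evals/diff-edits/cases
    # then copy your recorded conversation JSON files into evals/diff-edits/cases/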

Our testing strategy is based on replaying situations from actual user sessions where diff edits were tried.

It starts with our test cases. Each one is a JSON file in ./cases that has the conversation history that led to a diff edit, the original file content and its path, and the info needed to rebuild the system prompt from that original session.
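The exact schema lives in types.ts; as a loose illustration (the field names here are made up), a case file carries something like:

    {
      "conversationHistory": ["...the messages that led up to the edit..."],
      "originalFile": {
        "path": "src/components/Button.tsx",
        "content": "...full file contents at the time of the edit..."
      },
      "systemPromptDetails": { "note": "whatever is needed to rebuild the original system prompt" }
    }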

Then, for every test run, we set up a specific configuration. This includes which LLM we're testing, which system prompt it gets, which function we use to parse the model's raw output, and which function we use to actually apply the diff. Here's the command I've been using:

npm run diff-eval -- --model-ids "anthropic/claude-3-5-sonnet,x-ai/grok-3-beta,anthropic/claude-3.7-sonnet,anthropic/claude-sonnet-4,google/gemini-2.5-pro-preview,google/gemini-2.5-flash" --max-cases 5 --valid-attempts-per-case 5 --parallel --diff-edit-function diff-06-26-25 --verbose 

This will build the eval script, run it, and then open the Streamlit dashboard to show the results.

The TestRunner.ts script is the main coordinator. For each test case and setup, ClineWrapper.ts takes over and sends the conversation and system prompt to the LLM. We then watch the model's response as it streams in and parse it to find any tool calls.

We're specifically looking for the model to make a single replace_in_file tool call. Multiple edits within that one tool call are allowed and recorded (in case you want to filter results by the number of edits per tool call and compare success rates for that slice across different models, system prompts, and so on). If it does make that call, and it's for the correct file, we grab the diff content it produced. Then the chosen diff application algorithm tries to apply that diff to the original file. We record whether this worked or not as diffEditSuccess.
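In TypeScript terms, the check and the success determination boil down to something like this (a simplified sketch reusing the illustrative ConstructNewFileContent type from earlier; parseToolCalls and the other helper names are hypothetical, not the real API):

    // Sketch of how a single attempt is scored.
    async function scoreAttempt(
      rawModelOutput: string,
      originalFilePath: string,
      originalFileContent: string,
      applyDiff: ConstructNewFileContent, // the diff-apply algorithm under test
    ) {
      const call = parseToolCalls(rawModelOutput).find((c) => c.name === "replace_in_file")
      // "Valid attempt": the model used replace_in_file against the originally targeted file.
      const isValidAttempt = call !== undefined && call.params.path === originalFilePath
      if (!isValidAttempt) {
        return { isValidAttempt: false, diffEditSuccess: false }
      }
      try {
        await applyDiff(call.params.diff, originalFileContent, true)
        return { isValidAttempt: true, diffEditSuccess: true }
      } catch {
        return { isValidAttempt: true, diffEditSuccess: false }
      }
    }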

We record a bunch of data for every attempt into a database. This includes details about the model and prompt, token counts, costs, the raw output from the model, the parsed tool calls, whether it succeeded or failed, any error messages, and timing info. For a detailed explanation of the database schema, see database.md.
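The real schema is documented in database.md; as a loose illustration, each attempt record covers roughly this information:

    // Illustrative shape of a per-attempt record; the actual column names are in database.md.
    interface AttemptRecord {
      runId: string
      caseId: string
      modelId: string
      systemPromptName: string
      rawModelOutput: string // stored verbatim, which is what makes replays possible
      parsedToolCalls: unknown[]
      isValidAttempt: boolean
      diffEditSuccess: boolean
      errorMessage?: string
      inputTokens: number
      outputTokens: number
      costUsd: number
      timeToFirstEditMs?: number
      totalDurationMs: number
    }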

A big part of this is how we handle "valid attempts," which I'll explain next.

Keeping it Fair with "Valid Attempts"

LLMs can be unpredictable. If we replay an old scenario, a new model (or even the same model on a later run) might do something completely different from what happened originally. It might call another tool or ask a question instead of trying a diff edit.

Since we really want to test the diff editing part, we need a way to make sure we're comparing fairly. That's why we have this idea of "valid attempts."

An attempt is "valid" for this benchmark if the model actually tries to do what we're interested in. This means two things. One, it must call the replace_in_file tool. Two, it must target the same file path that was targeted in the original recorded conversation for that test case.

If the model does something else, like calling a different tool or picking the wrong file, we don't count that attempt against its diff editing score. Instead, we consider it an "invalid attempt" for this specific benchmark and simply re-run that test case with that model. We keep doing this until we've collected a set number of these "valid attempts."

For example, if we ask for 5 valid attempts per test case, the system will keep re-rolling for that case until the model has tried to edit the correct file using the replace_in_file tool 5 times. Only then do we look at how many of those 5 valid attempts actually resulted in a successful diff application (diffEditSuccess).

This way, if we're comparing two models and one gets a 10% success rate on its valid diff edit attempts, and another gets 90%, we have a much clearer picture of their actual diff-generating capabilities. It avoids muddying the waters with attempts where the model didn't even try to perform the specific action we're evaluating. This approach helps us isolate and measure the diff-editing skill more directly, despite the non-deterministic nature of these models.
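Put as code, the per-case loop looks roughly like this (a sketch built on the illustrative AttemptRecord above; runSingleAttempt is a hypothetical stand-in for one fresh LLM call plus the scoring step):

    // Keep re-rolling a case until we have the requested number of valid attempts.
    async function collectValidAttempts(modelId: string, caseId: string, target: number) {
      const validAttempts: AttemptRecord[] = []
      while (validAttempts.length < target) {
        const attempt = await runSingleAttempt(modelId, caseId)
        if (!attempt.isValidAttempt) {
          continue // wrong tool or wrong file: doesn't count for or against the model, just re-roll
        }
        validAttempts.push(attempt)
      }
      const successRate = validAttempts.filter((a) => a.diffEditSuccess).length / target
      return { validAttempts, successRate }
    }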

Replays

You can also use the replay argument to replay a previous benchmark run. This is super useful for iterating on our diffing algorithms without having to re-run expensive and time-consuming LLM calls.

When you run an evaluation, every detail is stored in the database—including the raw, unmodified output from the model. The replay feature takes advantage of this by pulling that raw output and feeding it into a different diffing algorithm. This lets you isolate the performance of the diffing logic itself. We can see if a new algorithm is better at applying the exact same set of diffs that a model generated in a previous run.

This process is blazingly fast and free, as it completely bypasses the need to make new API calls. It ensures a true apples-to-apples comparison between diffing strategies, since the model's output—the "ground truth" for the evaluation—remains identical.

Here’s an example of how you would replay a previous run with a new diffing algorithm:

cd evals && npm run diff-eval -- --replay-run-id 9902189e-63a8-4210-a4fc-fe59e2eaf2c2 --diff-apply-file diff-06-23-25 --verbose

In this command:

  • --replay-run-id specifies the original run we want to use as our ground truth.
  • --diff-apply-file tells the script to use the new diffing logic from the diff-06-23-25.ts file.

The script will then create a new run in the database that mirrors the original, but with the results of applying the new diffing algorithm. This allows for a direct comparison in the dashboard, helping us quickly see which of our diffing strategies is the most robust.
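Conceptually, the replay path boils down to something like this (a simplified sketch reusing the illustrative ConstructNewFileContent type; the db helpers, loadCase, and extractReplaceInFileDiff are hypothetical names, not the real schema or API):

    // Re-apply stored model output with a different diff algorithm, without any new API calls.
    async function replayRun(originalRunId: string, applyDiff: ConstructNewFileContent) {
      const newRunId = await db.createRun({ replayOf: originalRunId })
      for (const attempt of await db.getAttempts(originalRunId)) {
        const testCase = await loadCase(attempt.caseId)               // re-read the case JSON from ./cases
        const diff = extractReplaceInFileDiff(attempt.rawModelOutput) // reuse the stored model output as-is
        let diffEditSuccess = false
        try {
          await applyDiff(diff, testCase.originalFile.content, true)
          diffEditSuccess = true
        } catch {
          // the new algorithm couldn't apply one of the stored diffs
        }
        await db.saveAttempt({ ...attempt, runId: newRunId, diffEditSuccess })
      }
      return newRunId
    }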