CLAUDE.md, AGENTS.md

Guidance for Claude Code when working with this repository. Instructions here override default behavior.

Design philosophy

Three tenets govern every change in this repo — to code, MCP tools, or prompts. Read docs/PHILOSOPHY.md (mirrored as mcp-steroid://skill/design-philosophy for runtime fetch via steroid_fetch_resource) before proposing any of:

a new steroid_* MCP tool
a new method on McpScriptContext
a "helper" that wraps an IntelliJ API

Short version: the MCP tool surface (10 today) stays narrow on purpose; the IntelliJ capability surface stays full, exposed via steroid_execute_code plus prompt resources. The strategy page's "Give AI the whole IDE, not just the files" is delivered through that combination — steroid_execute_code reaches every IDE API, and the mcp-steroid:// prompt corpus teaches the agent how. New tools and new context methods are not the lever.

Recursive context lookup (do this before sub-folder work)

Before acting on any task that touches files in a sub-folder, walk the directory tree from the changed file's folder up to the project root and read every CLAUDE.md and AGENTS.md you find on the way (including this one). Sub-folder guides take precedence over the root for their own scope; the root only holds project-wide rules.

Recipe (run this in your head, or with a one-liner). Normalize the starting directory to an absolute path first — relative paths converge on . and never match the repo root, infinite-looping:

# from any file path (relative or absolute), walk parents to repo root, collecting CLAUDE.md / AGENTS.md
file="<changed-file>"
dir=$(cd "$(dirname "$file")" && pwd)
root=$(git rev-parse --show-toplevel)
while [ "$dir" != "$root" ] && [ "$dir" != "/" ]; do
  for f in CLAUDE.md AGENTS.md; do [ -f "$dir/$f" ] && echo "$dir/$f"; done
  dir=$(dirname "$dir")
done
for f in CLAUDE.md AGENTS.md; do [ -f "$root/$f" ] && echo "$root/$f"; done

When changing files across multiple sub-folders, read the guides for each.

Sub-folder guides

Folder	Guide	Scope
`ij-plugin/`	ij-plugin/CLAUDE.md	IntelliJ plugin code, services, threading, build, deployment, sandbox/index troubleshooting, registry keys
`prompts/`	prompts/AGENTS.md	Prompt file format, IDE conditionals, `mcp-steroid://` resources, KtBlocks
`test-integration/`	test-integration/AGENTS.md	Stable Docker IDE smoke tests, shared infra, hung-test diagnosis, multi-version compat tests, playgrounds, Rider/.NET, Linux Docker CI gotchas
`test-experiments/`	test-experiments/CLAUDE.md	DPAIA arena suite, debugger demos, prompt-quality comparisons, IMPROVEMENTS.md harness
`docs/`	docs/CLAUDE.md	Autoresearch / prompt-optimization working notes, DPAIA history
`website/`	website/CLAUDE.md	Hugo site sources, GitHub Pages deployment
`installer-gen/`	installer-gen/CLAUDE.md	Build-tooling: computed JDK data model (Corretto/Azul, PGP-verified, pinned fingerprints), on-disk download cache, install.sh/install.ps1 generation
`website-gen/`	website-gen/CLAUDE.md	Build-tooling generator: version.json + updatePlugins.xml (depends on `:installer-gen` for shared HTTP)

MUST DO

Use IntelliJ MCP for everything where you can — see ij-plugin/CLAUDE.md for the API patterns.
Never ignore warnings or errors — fix them properly.
No test-only branches (isUnitTestMode) — use correct IntelliJ actions (writeIntentReadAction, writeCommandAction).
Tests must show reality. Never remove, disable, or weaken a failing test; fix the underlying issue.
No @Suppress("DEPRECATION") — find the non-deprecated replacement.
Prefer JSON libraries for JSON parsing/manipulation; only static final JSON constants may be hand-written as raw strings.
Log new ideas/tasks in TODO* files (TODO.md, TODO-*.md).
Atomic commits with descriptive messages (what and why). Test and build before committing.
Never include AI as co-author or mention AI in commit messages.

Banned patterns

runCatching{}.onFailure{} — use try { } catch (e: Exception) { } instead. Other runCatching uses (.getOrNull(), .getOrDefault()) are fine.
The internal visibility modifier. Prefer plain public (no modifier). Don't add internal to declarations — including test-visible helpers.
Returning a (value, errorFlag) pair/tuple from a call that can fail. Return the value (or a domain value object) and signal failure by throwing or returning null — not Pair<Result, Boolean> where the boolean is an error/isError flag.
Empty catch / catch (_: Exception) {}. Fail fast and log: every catch must rethrow, log via System.err.println / logger.error, or both. Silent failure hides root causes.
run-agent.sh references in production code or tests. It is a manual dev/peer-review tool only. Never COPY or chmod +x it inside Dockerfiles. Implement agent integrations directly via CLI flags.
Cross-subproject build/ directory access in Gradle build files. Use Gradle dependency configurations. Fail fast with require()/error() — no silent fallbacks.
append("\n") tricks to bypass the NoLargeInlineStringsTest lint rule. When a buildString { } exceeds the consecutive-appendLine limit, move the content to prompts/src/main/prompts/ and reference it via the article URI.
Hardcoded mcp-steroid://... URI literals in production Kotlin. Use the generated article class: XxxPromptArticle().uri (from com.jonnyzzz.mcpSteroid.prompts.generated.*). Enforced by NoHardcodedMcpSteroidUriUsageTest. See FetchResourceToolHandler.kt.
Infrastructure workarounds in tests. When a test fails due to missing Docker socket, missing CLI, wrong JDK, or missing native library, fix the infrastructure — never add detection-and-skip code.
Detecting failures and skipping tests at runtime (try { } catch { skip() }, Assumptions.assumeTrue(isAvailable), TestAbortedException on error). The only acceptable skip is at the Gradle task level (enabled = !condition) when an entire suite is structurally incompatible with the platform.
- Single documented exception: Gemini API key on CI. TC has no Gemini token and there is no plan to add one. DockerGeminiSession.Companion opts into skipTestWhenKeyMissing = true (see test-helper/.../AISessionBase.kt), so requireApiKey() throws AssumptionViolatedException instead of IllegalStateException when the key is missing. Constraints when working in this area:
  1. Session creation must stay lazy — called from inside test method bodies, never from setUp() / class init / @ClassRule. BasePlatformTestCase-backed tests route every Throwable through JUnit38ClassRunner to fireTestFailure, so an early init failure shows up against the wrong test.
  2. Do NOT add excludeTestsMatching or other test-class-level filters — that hides the test from reports.
  3. Unresolved %credentialsJSON:…% must still fail hard with IllegalStateException — that branch indicates a real TC misconfiguration and must stay visible. The contract is unit-tested in :test-helper:test AIAgentCompanionApiKeyTest.
  4. Do NOT extend the opt-in to other agents. Anthropic / OpenAI keys ARE configured on TC; their tests must keep failing if the key disappears.
Java threading primitives (CountDownLatch, Semaphore, Object.wait()) in coroutine code. Use CompletableDeferred<T> + withTimeout(d) { deferred.await() }, Channel<T>, or suspendCancellableCoroutine.
./gradlew test at the repo root. It fans out to every module and can take hours. Always scope: ./gradlew :ij-plugin:test, ./gradlew :kotlin-cli:test, ./gradlew :prompts:test --tests '<pattern>'. See per-module guidance in ij-plugin/CLAUDE.md.
Literal /* inside KDoc bodies. Kotlin doc comments support nested /* */, so a string like "`7z/win-x64/*`" in a /** */ block starts an inner comment; the next */ closes the INNER, leaving the outer open. The compiler reports Unclosed comment at the end of the file plus a cascade of unresolved-reference errors. Rewrite as // line comments or quote the substring to avoid the /* sequence.
MCP stdio scripts writing to stdout. Any shell/PowerShell wrapper invoked by an agent CLI as a stdio MCP server (devrig mcp, etc.) must emit only stderr before exec-ing the inner binary. Stdout is the JSON-RPC channel — a single stray byte corrupts the protocol. Use >&2 (POSIX) or Write-Error / [Console]::Error.WriteLine (PowerShell).
claude mcp add without --scope user. The Claude CLI defaults to --scope local, which writes to claude.json.projects.<cwd>.mcpServers instead of the top-level user-scope mcpServers. Registration is then invisible from any other project. All user-wide Claude mcp add calls must pass --scope user. Codex and Gemini default to global/user-wide and do not need the flag.
Materializing files for Gradle's daemon classpath at CONFIG phase. Anything that needs to be on the gradle daemon's classloader during config phase (e.g., resources read by :ij-plugin's IPGP local(provider) at task-graph time) must be pre-staged in settings.gradle.kts — buildSrc is chicken-and-egg (its tasks don't run until after settings + buildSrc itself), and main-project task outputs are too late. See gradle/seven-zip-bootstrap.settings.gradle.kts for the canonical example (commit 0b7bbe78).

Test execution discipline

NEVER run :test-integration or :test-experiments tests in parallel. Each test starts a full Docker IntelliJ container. Two concurrent runs exhaust RAM/CPU and OOM-kill both. Wait for completion before starting the next. See test-integration/AGENTS.md for the full Docker-test playbook.
Diagnose stuck/slow tests with JDK tooling BEFORE killing. jps -l | grep GradleWorkerMain → jcmd <pid> Thread.print > /tmp/dump.txt while the JVM is alive; then grep '<YourTest>Test' /tmp/dump.txt -A 5. Killing throws away evidence and forces guess-and-retry. Once you have the stuck test's name, iterate on just that test (--tests 'com.example.StuckTest'
- --rerun-tasks).
Prose-only prompt edits need only the contract test. When a change under prompts/src/main/prompts/** touches no ```kotlin fence, run ./gradlew :prompts:test --tests '*MarkdownArticleContract*' (seconds). The *KtBlock* compilation matrix recompiles every fence against every unpacked IDE (60–120 min) and is only needed when kotlin fences change — never run it casually (a workflow agent once hung 37 min on it for a prose edit).
1-minute rule for integration tests. Any :test-integration / :test-experiments case that hasn't printed PASS/FAIL within ~60 s of > Task :*:test is suspicious — usually a modal dialog, indexing stall, or background task that won't finish. Capture the latest screenshot (ls -t test-integration/build/test-logs/test/run-*/screenshot/*.png | head -1) and an in-container thread dump (docker exec <id> jcmd <PID> Thread.print) before deciding. Full recipe and symptom→cause table in test-integration/AGENTS.md → "Debugging a stuck/hung Docker test".

Project Overview

MCP Steroid — IntelliJ Platform plugin that exposes a standalone MCP server letting LLM agents drive the IDE via Kotlin code execution.

Public repo: https://github.com/jonnyzzz/mcp-steroid
Docs: README.md, docs/guides/AGENT-STEROID-GUIDE.md
Modules: see settings.gradle.kts. Plugin code lives in ij-plugin/; prompt resources in prompts/; Docker IDE smoke tests in test-integration/; experimental/long-running tests in test-experiments/.

Technology Stack

Gradle 9.5.1 / Kotlin 2.3.20 / Java 25 toolchain / IntelliJ Platform 2026.1+ / Ktor 3.3.2 (CIO+SSE) / kotlinx.serialization

Bytecode targets Java 21 (class-file v65) while the toolchain stays JDK 25: Android Studio 2026.1 bundles JBR 21 (IDEA bundles JBR 25), so the plugin must load on both. Set via the root subprojects {} convention (jvmTarget=21 + -Xjdk-release=21 + options.release=21); enforced by verifyClassFileVersions on the plugin/devrig distributions; regression-gated by AndroidStudioRuntimeCompatTest. See issue #157.

The Gradle Daemon is pinned to JDK 25 via gradle/gradle-daemon-jvm.properties (matches IDEA 2026.1's bundled JBR — see docs/262-EAP-PLAN.md). The foojay-resolver-convention plugin in settings.gradle.kts is the auto-download fallback if no JDK 25 is present locally. To change the daemon JVM: edit gradle-daemon-jvm.properties directly (one-line toolchainVersion=N).

Workflow

Read requirements; ask if ambiguous.
Add a failing test, then implement (test-first; integration tests preferred; never fake tests).
Run Gradle build/test via the IDE's MCP, not shell — see ij-plugin/CLAUDE.md for the run-config recipe.
Deploy: ./gradlew deployPlugin.
Test with IntelliJ MCP. Validate full Docker scenarios via :test-integration:test / :test-experiments:test.
Use steroid_execute_code to verify warnings/errors are gone before declaring done.
Update TODO* and commit.

CI

Root build.gradle.kts defines ci-prefixed aggregator tasks for TeamCity and GitHub Actions. ./gradlew tasks --group ci lists them.

Task	Subprojects	Notes
`buildPluginOnCI`	`:ij-plugin` (builds + publishes ZIP)	Entry point for both GH Actions and TC
`ciBuildPluginTests`	All plugin modules except prompts + non-plugin	Per-OS matrix on TC
`ciBuildPromptsTests`	`prompt-generator`, `prompts`, `prompts-api`	Linux only; full matrix takes 60–120+ min
`ciIntegrationTests`	`:test-helper:test` → `:ij-plugin:integrationTest` → `:test-integration:test`	Strict sequential ordering via `mustRunAfter`; needs Docker + API keys

:test-integration:test and :test-experiments:test have an onlyIf guard — plain root ./gradlew test silently skips both. Direct ./gradlew :test-integration:test --tests '...' still works.

TeamCity DSL lives in a separate repo (~/Work/mcp-steroid-teamcity). See its own CLAUDE.md for the generate→edit→regenerate→commit workflow. The TC VCS root pulls from jb, not origin — see "Git remotes" below.

GitHub Actions (.github/workflows/): builds the publishable plugin ZIP and deploys the website to GitHub Pages. Plugin tests are intentionally NOT mirrored — full coverage stays on TC (3–5× faster internal agents). Trigger PR builds via workflow_dispatch on the PR's head branch.

Website deploys from the website branch, NOT main. GitHub Pages builds on a push to the long-lived website branch (github-pages.yml). website tracks main (advance via git merge main → website, normally often) but can deliberately lag it so website changes that depend on an unreleased devrig binary — e.g. a new install.sh/install.ps1 CLI contract — stay off the live site until a matching GitHub release exists. The release process advances website AFTER publishing the release (release/release-instructions.md → "Stage 7c"). website is origin-only (never synced to jb, which runs TeamCity only). A push to main no longer deploys the website.

Git Remotes: `origin` vs `jb`

Remote	URL	Role
`origin`	`git@github.com:jonnyzzz/mcp-steroid`	Day-to-day development fork; source of truth for new commits
`jb`	`git@github.com:JetBrains/mcp-steroid.git`	JetBrains-org mirror; consumed by TeamCity (`mcp_steroid` project)

Sync direction:

origin → jb: always via merge (the jb-merge procedure below).
jb → origin: always via cherry-pick (individual commits, manual conflict resolution).
Never fast-forward-push main to jb — jb/main carries org-specific commits that would be lost.

git fetch jb
git checkout -b jb-merge jb/main
git merge main --no-ff -m "Merge remote-tracking branch 'origin/main' into jb-merge"
git push jb jb-merge:main
git checkout main && git branch -D jb-merge

--no-ff preserves jb/main's existing head as the merge's first parent so jb-only history stays reachable. Why this matters for CI: TC pulls from jb. If your commit isn't on jb/main, TC builds stale code.

No GitHub Actions on jb. The JetBrains-org mirror runs TeamCity only — it must carry no .github/workflows/ at all (those are origin/jonnyzzz-only: the GitHub Pages website deploy, PR compile gate, etc.). jb/main intentionally deletes every workflow file (e.g. commit "Delete .github/workflows/github-pages.yml"); that deletion is org-specific history to preserve. So during jb-merge, a modify/delete conflict on any .github/workflows/* file is expected whenever origin edits a workflow — always resolve by keeping it deleted on jb (git rm .github/workflows/<file> then commit the merge). Never resurrect a workflow onto jb.

IntelliJ Source Research

The IntelliJ project at ~/Work/intellij is open in the IDE for research. Use steroid_execute_code with project_name="intellij" and PSI APIs (FilenameIndex, JavaPsiFacade) — faster and more accurate than grep. See test-integration/AGENTS.md → "Researching IntelliJ APIs" for the recipe.

run-agent.sh from ~/Work/jonnyzzz-ai-coder/ launches AI agents (Claude/Codex/Gemini) for peer reviews, research, and consensus checks. Encouraged from agent sessions — the BANNED rule applies only to production code/tests referencing it.

Environment Constraints

timeout / gtimeout are not available on this Mac. Use Gradle's own timeout mechanisms or the Bash tool's timeout parameter.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

CLAUDE.md, AGENTS.md

Design philosophy

Recursive context lookup (do this before sub-folder work)

Sub-folder guides

MUST DO

Banned patterns

Test execution discipline

Project Overview

Technology Stack

Workflow

CI

Git Remotes: `origin` vs `jb`

IntelliJ Source Research

Environment Constraints

FilesExpand file tree

CLAUDE.md

Latest commit

History

CLAUDE.md

File metadata and controls

CLAUDE.md, AGENTS.md

Design philosophy

Recursive context lookup (do this before sub-folder work)

Sub-folder guides

MUST DO

Banned patterns

Test execution discipline

Project Overview

Technology Stack

Workflow

CI

Git Remotes: origin vs jb

IntelliJ Source Research

Environment Constraints

Git Remotes: `origin` vs `jb`