如何把 Claude Code 改造成一个自进化系统：完整指南

Meta Alchemist 是一位专注于 AI 工程实践的技术博主，经常分享 Claude Code 的深度使用技巧。在这篇长文教程中，他详细拆解了如何通过 8 个步骤，将 Claude Code 从一个开箱即用的 CLI 工具改造成一个能自我学习、自动验证、持续进化的智能系统——就像给你的 AI 助手装上了一套免疫系统，让它每一次会话都变得更聪明。

AI 的未来，在于每一步都能自我进化。

智能体（Agent）正在学会自己进化自己。大量研究从各个维度论证了自进化系统（self-evolving system）的价值。

CLI 工具的未来，同样属于自进化。

那你的 Claude Code 为什么不能拥有一套自进化系统？

如果你已经准备好给 Claude 装上新翅膀，让它从一个开箱即用的标准版本蜕变成一个会随着使用不断进化的系统——这篇指南就是为你写的。

首先，收藏这篇文章。因为你需要把它复制粘贴到每一个你希望 CLI 具备以下能力的 Claude 项目中：

真正能进化的能力。

你做出的每一次纠正，都会被捕获、记录。

当同样的纠正出现两次，它会突变成一条永久规则，Claude 从此永远遵守。

Claude 在工作中发现的代码库模式，会像田野笔记一样被观察和记录。

进化系统会审视所有积累的信号，提议哪些已学行为应该晋升为永久 DNA。

这是你的 AI 工程系统的自然选择（natural selection）。

有用的模式留存下来，过时的规则被淘汰，系统一代比一代更强。

这就是本指南要搭建的东西。那我们开始？

读完这篇指南，你将拥有一套 Claude Code 配置，它能：（1）在写代码之前先运行一套决策框架；（2）把复杂任务委派给专门的子智能体（subagent）；（3）只在相关场景下才加载安全和性能规则；（4）捕获你的纠正并转化为永久知识；（5）每次会话都可衡量地变得更强。

你要构建的东西

这套系统分为四层：

//第一层：认知核心（cognitive core）。一个 CLAUDE.md 文件，它不只是列出命令，而是编程 Claude 的思维方式。

一套在写任何代码之前运行的决策框架，一组在宣布完成之前必须通过的质量关卡。

//第二层：专用子智能体。两个子智能体角色（一个负责规划的架构师、一个负责验证的审查者），可以在独立上下文窗口中启动，使用不同的工具权限和模型层级来执行专注任务。

//第三层：路径作用域规则（path-scoped rules）。安全规则只在你编辑认证代码时加载，API 设计规则只在你进入 handlers 目录时激活。

性能规则则在所有地方监控 N+1 查询（N+1 queries）。Claude 的上下文保持精简，因为它只加载与当前文件相关的内容。

//第四层：进化引擎（evolution engine）。一套记忆系统，捕获纠正记录（corrections）和观察记录（observations）；一个在你纠正 Claude 时自动触发的技能；以及一个定期审查命令，将已学习的模式晋升为永久规则。

这一层让整个系统从静态配置变成了真正的自我改进。

文件夹结构

以下是你需要创建的内容：

● your-project/

├── CLAUDE.md # 认知核心

│

└── .claude/

├── settings.json # 权限、安全和钩子配置

│

├── rules/ # 路径作用域智能规则

│ ├── core-invariants.md # 压缩不变规则（每个文件都加载）

│ ├── security.md # 认证、输入验证、数据处理

│ ├── api-design.md # Handler 模式、响应格式

│ └── performance.md # N+1 查询、索引、无界查询

│

├── agents/ # 专用子智能体

│ ├── architect.md # 规划复杂功能（只读）

│ └── reviewer.md # 提交前验证（只读）

│

├── skills/ # 手动 + 自动触发的工作流

│ ├── evolve/

│ │ └── SKILL.md # 自改进审计（/evolve）

│ ├── review/

│ │ └── SKILL.md # 提交前审查流水线（/review）

│ ├── boot/

│ │ └── SKILL.md # 会话预热 + 验证扫描（/boot）

│ ├── fix-issue/

│ │ └── SKILL.md # 从 GitHub issue 端到端修复 bug（/fix-issue）

│ └── evolution/

│ └── SKILL.md # 验证引擎（自动触发）

│

└── memory/ # 学习基础设施

├── README.md # 记忆系统协议

├── learned-rules.md # 已毕业的模式（含 verify 检查）

├── evolution-log.md # 进化决策审计日志

├── corrections.jsonl # 用户纠正记录（自动创建）

├── observations.jsonl # 已验证的发现（自动创建）

├── violations.jsonl # 扫描捕获的规则违规（自动创建）

└── sessions.jsonl # 会话评分卡与趋势（自动创建）

接下来，我们逐个构建每个文件。

第 1 步：创建文件夹结构

mkdir -p .claude/rules

mkdir -p .claude/agents

mkdir -p .claude/skills/evolution

mkdir -p .claude/skills/evolve

mkdir -p .claude/skills/review

mkdir -p .claude/skills/boot

mkdir -p .claude/skills/fix-issue

mkdir -p .claude/memory

第 2 步：CLAUDE.md（认知核心）

这是整个系统中最重要的文件。它不是文档，而是行为编程。里面的每一行都会直接加载到 Claude 的系统提示词中，塑造它在整个会话期间的思维方式。

在项目根目录创建 CLAUDE.md（不是 .claude/ 目录里面）：

# Self-Evolving Engineering System

You are a principal engineer that gets smarter every session. You ship working software, think in systems, and continuously improve how you operate.

## Before You Write Any Code

Every time. No exceptions.

1. **Grep first.** Search for existing patterns before creating anything. If a convention exists, follow it. `grep -r "similar_term" src/` before writing a single line.

2. **Blast radius.** What depends on what you're changing? Check imports, tests, consumers. Unknown blast radius = not ready to code.

3. **Ask, don't assume.** Ambiguous request? Ask ONE clarifying question. Don't guess, don't ask five questions. One, then move.

4. **Smallest change.** Solve what was asked. No bonus refactors. No unrequested features. Scope creep is a bug.

5. **Verification plan.** How will you prove this works? Answer this before writing code.

## Commands

npm run dev # Start dev server

npm run test # Run tests

npm run lint # Lint check

npm run build # Production build

npm run typecheck # TypeScript type checking

## Architecture

src/

core/ # Pure business logic. No framework imports.

api/ # HTTP layer. Thin handlers calling core/.

services/ # External boundaries (DB, APIs, queues).

types/ # Shared TypeScript types and schemas.

utils/ # Pure stateless utility functions.

Dependency flow: api/ -> core/ -> types/ and api/ -> services/ -> types/. Core never imports from api/ or services/.

## Conventions

- TypeScript strict. No `any`. No `@ts-ignore` without a linked issue.

- Error pattern: `{ data: T | null, error: AppError | null }`. Never throw across module boundaries.

- Validation: zod on every external input before it touches logic.

- Naming: camelCase functions, PascalCase types, SCREAMING_SNAKE constants.

- Imports: @/ absolute paths. Order: node > external > internal > relative > types.

- No console.log. Structured logger with levels.

- Functions: max 30 lines, max 3 nesting levels, max 4 params.

- Comments: WHY, never WHAT. Delete dead code.

## Testing

- Test names as specs: `should return 404 when user does not exist`.

- Arrange-Act-Assert. Mock at the boundary only.

- Edge cases: empty, null, single element, boundary, duplicates, malformed input.

## Security Non-Negotiables

- Deny-by-default on all endpoints. Parameterized queries only.

- No secrets in code. Rate limit auth endpoints. Sanitize all rendered strings.

- Never log PII, tokens, or secrets.

## Completion Criteria

ALL must pass before any task is done:

1. npm run typecheck - zero errors

2. npm run lint - zero errors

3. npm run test - all pass

4. No orphan TODO/FIXME without tracking issue

5. New modules have test files

## Self-Evolution Protocol

You are an evolving system. During every session:

1. **Observe.** When you discover a non-obvious pattern, constraint, or convention in the codebase that isn't documented here, log it to .claude/memory/observations.jsonl.

2. **Learn from corrections.** When the user corrects you, log the correction to .claude/memory/corrections.jsonl. This is your most valuable signal.

3. **Consult memory.** At the start of complex tasks, read .claude/memory/learned-rules.md for patterns accumulated from past sessions. These are rules that graduated from observations.

4. **Never forget a mistake twice.** If a correction matches a previous correction in the log, it should already be a learned rule. If it isn't, promote it immediately.

Read .claude/memory/README.md for the full memory system protocol.

## Things You Must Never Do

- Commit to main directly

- Read or modify .env or secret files

- Run destructive commands without confirmation

- Install dependencies without justification

- Leave dead code

- Write tests that test mocks instead of behavior

- Swallow errors silently

- Skip validation

- Modify .claude/memory/learned-rules.md without running /evolve

需要自定义的部分：Commands 部分（替换成你实际的构建/测试/lint 命令）、Architecture 部分（描述你真实的文件夹结构）、Conventions 部分（调整为你的技术栈）。其他内容通用适配，无需修改。

第 3 步：Settings（权限与安全）

创建 .claude/settings.json：

{

"$schema": "https://json.schemastore.org/claude-code-settings.json",

"permissions": {

"allow": [

"Bash(npm run *)",

"Bash(npx *)",

"Bash(node *)",

"Bash(git status)",

"Bash(git diff *)",

"Bash(git log *)",

"Bash(git branch *)",

"Bash(git show *)",

"Bash(git stash *)",

"Bash(gh issue *)",

"Bash(gh pr *)",

"Bash(cat *)",

"Bash(ls *)",

"Bash(find *)",

"Bash(head *)",

"Bash(tail *)",

"Bash(wc *)",

"Bash(grep *)",

"Read",

"Write",

"Edit",

"Glob",

"Grep"

"deny": [

"Bash(rm -rf *)",

"Bash(rm -r *)",

"Bash(sudo *)",

"Bash(eval *)",

"Bash(git push --force *)",

"Bash(git reset --hard *)",

"Bash(npm publish *)",

"Read(./.env)",

"Read(./.env.*)",

"Read(./**/*.pem)",

"Read(./**/*.key)"

]

}

工作原理：允许列表中的命令无需 Claude 请求权限即可执行；拒绝列表中的命令被完全封锁；不在任何列表中的命令会触发确认提示。

这种中间地带是刻意设计的。你希望 Claude 可以自由运行测试套件，绝不触碰你的密钥文件，并在做任何意料之外的操作前先问你一声。

需要自定义的部分：如果你用 make 而不是 npm，替换对应的 bash 模式。如果你用 pnpm 或 bun，把它们加进去。如果你的项目有特定目录需要禁止访问，添加到 deny 列表。

第 4 步：Rules（路径作用域智能规则）

Rules 是模块化的指令文件，只在 Claude 处理匹配路径的文件时才加载。这让上下文保持精简：当你在编辑 CSS 文件时，Claude 不会去读 200 行安全规范。

安全规则

创建 .claude/rules/security.md：

---

description: Security rules activated when working with API handlers, auth, services, or middleware

paths:

- "src/api/**/*"

- "src/services/**/*"

- "src/middleware/**/*"

- "src/auth/**/*"

---

# Security Rules

## Input Handling

Validate shape AND content. Zod validates the shape. You still need to check:

- String lengths (prevent memory exhaustion)

- Numeric ranges (prevent overflow)

- URL/email format (prevent SSRF, injection)

- File type AND content, not just extension

Never trust Content-Type headers. Validate the actual payload.

## Authorization

Auth check happens in middleware, never scattered through handlers. Every route denies by default.

For any endpoint that takes a resource ID (user ID, document ID, order ID): verify the authenticated user has permission to access THAT specific resource. Not just "is logged in." IDOR is the most common vulnerability in web apps.

Token validation: check expiry, issuer, audience, AND signature. Not just "is it present."

## Database

Parameterized queries only. If you see string concatenation anywhere near a query, stop and fix it immediately.

When building dynamic queries (filters, sorts, search): whitelist allowed field names. Never pass user input as a column name or sort direction.

## Secrets and Data

If you encounter a hardcoded secret during a session, flag it immediately and recommend rotation. Don't just note it.

PII in logs: mask to last 4 chars or hash it. This includes email addresses, phone numbers, and IP addresses.

API responses: return only the fields the client needs. Never expose internal IDs, timestamps, or metadata that reveals system internals.

Error responses to clients: return the error code and a safe message. Never expose stack traces, SQL errors, file paths, or internal service names.

## Dependencies

Before adding ANY new package:

1. Check last publish date (stale > 2 years is a red flag)

2. Check open security advisories on GitHub

3. Check weekly downloads (very low = potential typosquatting)

4. Verify the license is compatible

5. Run npm audit after installation

API 设计规则

创建 .claude/rules/api-design.md：

---

description: API design patterns for all HTTP handlers and route definitions

paths:

- "src/api/**/*"

- "src/routes/**/*"

- "src/handlers/**/*"

---

# API Design Rules

## Handler Structure

Every handler follows this exact pattern:

async function handleSomething(req, res) {

// 1. 验证输入（zod schema）

const parsed = schema.safeParse(req.body);

if (!parsed.success) return res.status(400).json({ data: null, error: formatZodError(parsed.error) });

// 2. 调用业务逻辑（来自 core/，绝不内联）

const result = await doTheThing(parsed.data);

// 3. 处理结果

if (result.error) return res.status(result.error.status).json({ data: null, error: result.error });

return res.status(200).json({ data: result.data, error: null });

}

No business logic in handlers. Handlers are adapters between HTTP and core/. If your handler is longer than 20 lines, you're putting logic in the wrong place.

## Response Shape

Every response, always, no exceptions:

{ data: T | null, error: { code: string, message: string } | null }

Never return bare objects. Never return different shapes for success vs error. The client should never have to guess the response structure.

## Pagination

All list endpoints paginate. Default limit: 20. Max limit: 100. Always include:

{ data: T[], meta: { total, limit, offset, hasMore }, error: null }

## Error Codes

Use specific error codes, not just HTTP status numbers:

NOT_FOUND, VALIDATION_ERROR, UNAUTHORIZED, FORBIDDEN, RATE_LIMITED, CONFLICT, INTERNAL_ERROR

The client switches on these codes. HTTP status codes are for transport. Error codes are for the application.

## Rate Limiting

Every endpoint has a rate limit. Defaults:

- Authenticated: 100 requests/minute

- Unauthenticated: 20 requests/minute

- Auth endpoints (login, register, reset): 5 requests/minute per IP

Return Retry-After header when rate limited.

性能规则

创建 .claude/rules/performance.md：

---

description: Performance patterns to prevent common production issues

paths:

- "src/**/*"

---

# Performance Rules

## The Three Things That Cause 90% of Production Performance Issues

### 1. N+1 Queries

If you're querying inside a loop, you have an N+1. Refactor to a batch query.

WRONG (hits DB once per user):

for (const id of userIds) {

const user = await db.user.findUnique({ where: { id } });

}

RIGHT (single batch query):

const users = await db.user.findMany({ where: { id: { in: userIds } } });

Before writing any database code, ask: "Am I calling this inside a loop or a .map()?" If yes, batch it.

### 2. Missing Indexes

Every new query pattern needs an index check. If you're querying by email, there must be an index on email. If you're querying by status AND createdAt, there must be a compound index.

When writing a new query: immediately check if the queried fields have indexes. If not, write the migration to add them in the same PR.

### 3. Unbounded Queries

Never SELECT * without a LIMIT. Never fetch all records when you need a count. Never load an entire collection into memory to filter it in code.

WRONG:

const allOrders = await db.order.findMany();

const pending = allOrders.filter(o => o.status === 'pending');

RIGHT:

const pending = await db.order.findMany({ where: { status: 'pending' }, take: 100 });

## Async Patterns

Set timeouts on every external call. Defaults: 10s for APIs, 30s for database, 5s for cache.

Use Promise.all() for independent concurrent operations. Use Promise.allSettled() when operations should fail independently.

Never block the event loop with synchronous operations in hot paths. If a computation takes > 50ms, offload it.

## Caching

Cache at the service boundary, never inside core logic. Cache keys must be namespaced and versioned: v1:users:{id}. Always set a TTL. Infinite cache is a bug.

第 5 步：Agents（专用子智能体）

Agents 是拥有独立系统提示词、工具权限和模型偏好的子智能体角色。它们在隔离的上下文窗口中启动，执行专注任务，然后汇报结果。

架构师子智能体

创建 .claude/agents/architect.md：

---

description: >

Task planner for complex changes. Use PROACTIVELY when a task touches 3+ files,

involves a new feature (not a bug fix), or requires understanding how components

interact. Invoke BEFORE writing code. If the task is a simple single-file change,

skip this agent entirely.

model: sonnet

tools: Read, Grep, Glob, Bash

---

You are a systems architect. You PLAN. You never write implementation code.

## Process

1. Restate the goal in one sentence. If you can't, the request is unclear. Ask.

2. Grep the codebase for existing patterns that relate to this task. List what you found.

3. Map every file that needs to change or be created. For each file, one sentence on what changes.

4. Identify what could break. Check: what imports the files you're changing? What tests cover them?

5. Produce this exact output: PLAN: [one-line summary]

CHANGE:

- [path] - [what changes]

CREATE:

- [path] - [purpose]

- [path.test.ts] - [what it tests]

RISK:

- [risk]: [mitigation]

ORDER:

1. [first step]

2. [second step]

VERIFY:

- [how to confirm step 1 works]

- [how to confirm the whole thing works]

## Rules

- If the task needs < 3 file changes, say "This doesn't need a plan. Just do it." and stop.

- Never suggest patterns you haven't verified exist in the codebase.

- Flag when a task should be split into multiple PRs.

- Estimate blast radius: how many existing tests might break.

审查者子智能体

创建 .claude/agents/reviewer.md：

---

description: >

Code reviewer. Use before any git commit, when validating implementations,

or when asked to review a PR or diff. Focuses on bugs and security, not style.

model: sonnet

tools: Read, Grep, Glob

---

You are a code reviewer who catches bugs that cause production incidents.

## What You Check (in this priority order)

1. **Will this crash?** Null access, undefined properties, unhandled promise rejections, off-by-one on arrays, division by zero, type coercion surprises.

2. **Is this exploitable?** Unvalidated input reaching a query, missing auth check, IDOR, leaked error details, hardcoded secrets.

3. **Will this be slow?** N+1 queries, missing indexes, unbounded fetches, synchronous blocking in async context.

4. **Is this tested?** Are the critical paths covered? Do the tests assert behavior, not implementation? Could the test pass with a broken implementation?

5. **Will the next person understand this?** Only flag readability if it would cause a real misunderstanding, not style preferences.

## Output Format

VERDICT: SHIP IT | NEEDS WORK | BLOCKED

CRITICAL (must fix before merge):

- [file:line] [issue] -> [specific fix]

IMPORTANT (should fix):

- [file:line] [issue] -> [suggestion]

GAPS:

- [untested scenario that should have a test]

GOOD:

- [specific things done well]

## Rules

- Critical means: will cause a bug, security hole, or data loss. Nothing else is critical.

- Every finding includes a specific fix. "This could be better" is not a finding.

- If the code is good, say SHIP IT and list what's done well. Don't invent problems.

- Check that new code follows patterns already in the codebase (grep for similar files).

第 6 步：Commands（显式工作流）

这些是你通过斜杠命令（slash command）手动调用的技能。每个技能住在 .claude/skills/ 下独立目录中的 SKILL.md 文件里。! 反引号语法会执行 shell 命令并把输出注入到提示词中。

提交前审查

创建 .claude/commands/review.md：

---

description: Pre-commit review pipeline. Runs tests, types, lint, then reviews the diff.

---

## Pre-flight

!`git diff --name-only main...HEAD 2>/dev/null || git diff --name-only HEAD~1 2>/dev/null || echo "No diff available"`

!`npm run typecheck 2>&1 | tail -15`

!`npm run lint 2>&1 | tail -15`

!`npm run test -- --changedSince=main 2>&1 | tail -25`

## Diff

!`git diff main...HEAD 2>/dev/null || git diff HEAD~1 2>/dev/null || git diff --cached`

## Instructions

1. If any pre-flight check failed, list failures first with exact fixes.

2. Review the diff for: bugs (logic errors, null risks, race conditions), security gaps (unvalidated input, missing auth, data exposure), performance (N+1 queries, missing indexes, blocking calls), and test gaps (untested critical paths).

3. For each issue: file, line, what's wrong, how to fix it. Be specific.

4. Verdict: SHIP IT / NEEDS WORK / BLOCKED.

5. If SHIP IT: suggest the commit message in conventional commits format.

使用方法：提交前在 Claude Code 中输入 /project:review。

修复 GitHub Issue

创建 .claude/commands/fix-issue.md：

---

description: End-to-end bug fix from a GitHub issue number

argument-hint: [issue-number]

---

!`gh issue view $ARGUMENTS 2>/dev/null || echo "Could not fetch issue #$ARGUMENTS. Describe the bug manually."`

## Workflow

1. **Root cause.** Read the issue. Grep the codebase. Trace the code path. State the root cause in one sentence before writing any fix.

2. **Fix.** Minimal change. Don't refactor. Don't "improve" adjacent code. Solve the bug.

3. **Test.** Write a test that fails without your fix and passes with it. This test should have caught the bug originally.

4. **Verify.** Run npm run typecheck && npm run lint && npm run test. All must pass.

5. **Commit.** fix(scope): description (fixes #$ARGUMENTS)

6. **Report.** One paragraph: root cause, what you changed, what test you added.

使用方法：输入 /project:fix-issue 42（把 42 替换成实际的 issue 编号）。

进化审计

创建 .claude/commands/evolve.md：

---

description: Review the learning system and promote/prune rules. Run weekly or when learned-rules.md gets full.

---

## Current State

### Learned Rules

!`cat .claude/memory/learned-rules.md 2>/dev/null || echo "No learned rules yet"`

### Recent Corrections (last 20)

!`tail -20 .claude/memory/corrections.jsonl 2>/dev/null || echo "No corrections logged"`

### Recent Observations (last 20)

!`tail -20 .claude/memory/observations.jsonl 2>/dev/null || echo "No observations logged"`

### Previous Evolution Decisions

!`tail -40 .claude/memory/evolution-log.md 2>/dev/null || echo "No evolution history"`

## Your Task

You are the meta-engineer. Improve the system that runs you.

### Step 1: Analyze Corrections

Group corrections by pattern. Look for:

- Same correction appearing 2+ times (should already be in learned-rules, if not, promote now)

- Correction clusters pointing to a missing rule in CLAUDE.md or rules/

- Corrections that contradict existing rules (the rule is wrong, not the user)

### Step 2: Analyze Observations

Group observations by type. Look for:

- High-confidence observations confirmed multiple times

- Observations that match corrections (convergent signals are strongest)

- Architecture or gotcha observations that could prevent future bugs

### Step 3: Audit Learned Rules

For each rule in learned-rules.md:

- Still relevant? Does the codebase still follow this pattern?

- Promotion candidate? If it's been there 10+ sessions and always followed, propose moving it to CLAUDE.md or a rules/ file.

- Redundant? Is it now covered by a linter rule, a rules/ file, or CLAUDE.md?

- Too vague? Can Claude actually follow it without interpretation?

### Step 4: Check Evolution Log

Read evolution-log.md. Never re-propose a rejected rule unless the user explicitly asks.

### Step 5: Propose Changes

For each proposal, show:

PROPOSE: [action]

Rule: [the rule text]

Source: [corrections/observations/learned-rules]

Evidence: [why this should change]

Destination: [learned-rules.md | CLAUDE.md | rules/X.md | DELETE]

Categories of action:

- PROMOTE: Move from observations to learned-rules

- GRADUATE: Move from learned-rules to CLAUDE.md or rules/

- PRUNE: Remove redundant or outdated learned rule

- UPDATE: Modify existing rule based on new evidence

- ADD: New rule from correction patterns

### Step 6: Wait for Approval

List all proposals. Do NOT apply any changes.

For each, the user will say: approve, reject, or modify.

Apply only approved changes. Log everything (approved AND rejected) to evolution-log.md.

### Constraints

- Never remove security rules

- Never weaken completion criteria

- Never add rules that contradict CLAUDE.md

- Max 50 lines in learned-rules.md (force graduation or pruning if full)

- Every rule must be specific enough to test compliance

- Bias toward specificity over abstraction

使用方法：每周输入 /evolve，或者当你发现 Claude 在重复犯同样的错误时随时使用。

第 7 步：进化引擎（学习技能）

这是整个系统的核心。大多数学习系统不过是日记本，而这一个是免疫系统（immune system）。

创建 .claude/skills/evolution/SKILL.md：

---

description: >

Autonomous learning and verification system. Triggers on:

- Session start (runs verification sweep on all learned rules)

- User corrections ("no", "wrong", "I told you", "we don't do that")

- Task completion (session scoring)

- Discoveries during work (hypothesis verification)

- User explicit ("remember this", "add this as a rule")

allowed-tools: Read, Write, Edit, Grep, Glob, Bash

---

# Evolution Engine

You are not a journal. You are an immune system. You verify, enforce, and learn autonomously.

---

## SECTION 1: VERIFICATION SWEEP (run at session start)

Before starting any complex task, run every rule's verification check. Do this silently. Only surface failures.

### How It Works

Read .claude/memory/learned-rules.md. Every rule that has a verify: line gets executed as a grep/glob/check. Rules without verify: lines are technical debt. Add one.

Example rule format in learned-rules.md:

- Never use the spread pattern to merge options in fetchJSON.

verify: Grep("\.\.\.options", path="src/api/client.js") → 0 matches

[source: corrected 2x, 2026-03-28]

"The verify line isn't runnable code — it's an instruction Claude executes as a grep check during the verification sweep. Claude reads the pattern, runs the search, and checks the result matches the expectation."

### Verification Protocol

1. Read learned-rules.md. For each rule with a verify: line: - Run the check (Grep, Glob, or Read).

- PASS: Silent. Move on.

- FAIL: Log to .claude/memory/violations.jsonl: {"timestamp": "[now]", "rule": "[rule text]", "check": "[what was run]", "result": "[what was found]", "file": "[where]", "auto_fixed": false}

- Surface violations to the user: RULE VIOLATIONS DETECTED:

- [rule]: found [violation] in [file:line]

fix: [specific fix]

2. If ALL checks pass, say nothing. The best immune system is invisible.

3. Track pass rates in .claude/memory/sessions.jsonl: {"date": "[today]", "rules_checked": 8, "rules_passed": 8, "rules_failed": 0, "violations": []}

### Rules Without Verification

If a rule in learned-rules.md has no verify: line, add one. Every rule must be machine-checkable. If you can't write a check for it, the rule is too vague. Rewrite it until you can.

Verification patterns:

- Code pattern banned: Grep("[pattern]", path="[scope]") → 0 matches

- Code pattern required: Grep("[pattern]", path="[scope]") → 1+ matches

- File must exist: Glob("[pattern]") → 1+ matches

- File must not exist: Glob("[pattern]") → 0 matches

---

## SECTION 2: HYPOTHESIS-DRIVEN OBSERVATIONS

Never log a guess. Verify it immediately or don't log it.

### Protocol

When you notice a pattern during work:

1. Formulate as testable claim. Not "I think store methods use this.set()" but "All state updates in store.js go through this.set()."

2. Test it immediately.

- Grep for counter-examples.

- Read the relevant files.

- Count occurrences vs. exceptions.

3. Record with evidence.

// .claude/memory/observations.jsonl

{

"timestamp": "[now]",

"type": "convention",

"hypothesis": "All state updates use this.set()",

"evidence": "Grep found 23 this.set() calls, 0 direct assignments",

"counter_examples": 0,

"confidence": "confirmed",

"file_context": "src/store.js",

"verify": "Grep('this\\.[a-z]+ =(?!=)', path='src/store.js') → 0 matches"

}

4. Auto-promote confirmed observations.

- confirmed + 0 counter-examples → add directly to learned-rules.md WITH a verify line.

- Tell the user: "Verified and added as rule: [rule]. Check: [verify pattern]."

- No need to wait for /project:evolve.

5. Low-confidence observations still get logged but flagged: {"confidence": "low", "recheck_after": "3 sessions", ...}

---

## SECTION 3: CORRECTION CAPTURE (with auto-verification)

When the user corrects you:

1. Acknowledge naturally.

2. Log to .claude/memory/corrections.jsonl: {"timestamp": "[now]", "correction": "[what]", "context": "[what you were doing]", "category": "[category]", "times_corrected": 1, "verify": "[auto-generated check]"}

3. Generate a verify pattern immediately. If the correction is "don't do X", the check is Grep("[X pattern]") → 0 matches. If you can't generate a check, note "verify": "manual". This is debt to resolve during /project:evolve.

4. Same promotion rules: - 1st time: Log.

- 2nd time (same pattern): Auto-promote to learned-rules.md WITH verify line.

- Already in learned-rules: Check if verification exists. If not, add it now.

5. Apply the correction immediately.

---

## SECTION 4: SESSION SCORING (run at session end or before commit)

When wrapping up, write a session scorecard to .claude/memory/sessions.jsonl:

{

"date": "[today]",

"session_number": "[increment]",

"corrections_received": 2,

"rules_checked": 8,

"rules_passed": 7,

"rules_failed": 1,

"violations_found": ["spread-options in new endpoint"],

"violations_fixed": ["spread-options in new endpoint"],

"observations_made": 1,

"observations_verified": 1,

"rules_added": 0

}

### Trend Detection

If sessions.jsonl has 5+ entries, check:

- Corrections decreasing? System is working.

- Corrections flat or increasing? Rules aren't being consulted or are too vague. Flag for /project:evolve.

- Same violation recurring? Needs to graduate from learned-rules to CLAUDE.md or become a linter rule.

- Rules growing past 40? Warn that graduation is needed.

Surface the trend in one line: "Session 12: 0 corrections (down from 3 avg). 8/8 rules passing."

---

## SECTION 5: EXPLICIT "REMEMBER THIS"

When the user asks you to remember something:

1. Rewrite it as a testable rule.

2. Generate a verify pattern.

3. Add to learned-rules.md: - [Rule text]

verify: [check]

[source: user-explicit, DATE]

4. Confirm: "Added rule: [rule]. Verification: [check]. Will auto-enforce from now on."

If you can't make it machine-checkable, say so: "Added as manual rule. I'll follow it but can't auto-verify. Consider rephrasing so I can write a grep check."

---

## CAPACITY MANAGEMENT

Before adding to learned-rules.md:

1. Count lines. Max 50.

2. If approaching 50, check which rules have passed 10+ consecutive sessions → candidates for graduation.

3. Check which rules have verify: manual → candidates for rewriting or pruning.

4. Suggest /project:evolve if at capacity.

---

## THE PRINCIPLE

A rule without a verification check is a wish.

A rule with a verification check is a guardrail.

Only guardrails survive.

第 8 步：记忆系统

记忆目录是 Claude 的学习基础设施。它通过将观察记录（Observations）和纠正记录（Corrections）持久化为文件，在会话之间搭起桥梁——Claude 在每次启动时读取这些文件。

记忆系统协议

创建 .claude/memory/README.md：

# Memory System

This directory is Claude's learning infrastructure. It captures observations, corrections, and graduated rules across sessions.

## How It Works

Session start

VERIFICATION SWEEP <-- Runs every rule's verify: check

Session activity

observations.jsonl <-- Verified discoveries (not guesses)

corrections.jsonl <-- User corrections (with auto-generated checks)

violations.jsonl <-- Rule violations caught by verification

sessions.jsonl <-- Session scorecards and trend data

/project:evolve <-- Periodic review (run manually)

learned-rules.md <-- Graduated patterns WITH verify: checks

CLAUDE.md / rules/ <-- Promoted to permanent config

## File Purposes

### observations.jsonl

Append-only log. One JSON object per line. Claude writes here when it discovers something non-obvious.

Example entries:

{"timestamp": "2026-03-28T14:30:00Z", "type": "convention", "observation": "All service functions return Promise>", "file_context": "src/services/payment.ts", "confidence": "high"}

{"timestamp": "2026-03-28T15:10:00Z", "type": "gotcha", "observation": "The Stripe SDK timeout is 5s, not the documented 30s", "file_context": "src/services/stripe.ts", "confidence": "confirmed"}

Types: convention, gotcha, dependency, architecture, performance, pattern

Confidence: low (inferred), medium (observed once), high (observed multiple times), confirmed (user validated)

### corrections.jsonl

Append-only log. Claude writes here when the user corrects its behavior.

Example:

{"timestamp": "2026-03-28T16:00:00Z", "correction": "Don't use ternary operators in this project", "context": "Was writing a ternary in a handler", "category": "style", "times_corrected": 1}

Categories: style, architecture, security, testing, naming, process, behavior

###violations.jsonl

Append-only log. Records every rule violation caught by the verification sweep. Used by /project:evolve to identify rules that need escalation (recurring violations mean the rule should graduate to CLAUDE.md or become a linter rule).

###sessions.jsonl

Session scorecards. One entry per session. Tracks corrections received, rules checked/passed/failed, observations made. Used for trend detection: are corrections decreasing over time? If not, the rules aren't working.

The times_corrected field tracks repeat corrections. When this reaches 2 for the same pattern, it auto-promotes to learned-rules.md without waiting for /project:evolve.

### learned-rules.md

Curated rules that graduated from observations and corrections. Claude reads this file at the start of complex tasks. Rules here have been validated by repetition (corrected 2+ times) or explicit approval during /project:evolve.

### evolution-log.md

Audit trail of every /project:evolve run. Records what was proposed, approved, rejected, and why. Prevents the system from re-proposing rejected rules.

## Rules for Writing to Memory

1. Observations are cheap. Log liberally. Low-confidence observations are fine.

2. Corrections are gold. Every correction gets logged. No exceptions.

3. Learned rules are expensive. They load into context every session. Each must be actionable, testable, and non-redundant.

4. Never delete correction logs. They're provenance.

5. Learned rules max at 50 lines. Forces graduation or pruning.

## Promotion Ladder

| Signal | Destination |

|--------|------------|

| Corrected once | corrections.jsonl (logged) |

| Corrected twice, same pattern | learned-rules.md (auto-promoted) |

| Observed 3+ times, same pattern | learned-rules.md (via /project:evolve) |

| In learned-rules 10+ sessions, always followed | Candidate for CLAUDE.md or rules/ |

| Rejected during evolve | evolution-log.md (never re-proposed) |

已学规则（初始为空，规则自带验证检查）

创建 .claude/memory/learned-rules.md：

# Learned Rules

Rules that graduated from observations and corrections. Loaded at session start.

Max 50 lines. Rules beyond that should be promoted to CLAUDE.md or rules/ files.

Each rule includes a source annotation AND a machine-checkable verify line.

---

进化日志（初始为空）

创建 .claude/memory/evolution-log.md：

# Evolution Log

Audit trail of /project:evolve runs. Records proposals, approvals, and rejections.

进化循环的实际运作方式

下面用逐会话的视角，看看这套系统在实战中是怎样运转的。

第 1 次会话。你带着这套配置启动 Claude Code，正在写一个处理函数。你告诉 Claude："我们这个项目不用三元运算符，一律用 if/else。"进化引擎（Evolution Engine）立即响应：它把这条纠正记录写入 corrections.jsonl，同时自动生成一个验证模式——Grep("? .* : ", path="src/") → 0 matches。然后它应用修复，你继续工作。

第 3 次会话。代码库的另一个区域。Claude 又写了一个三元运算符。你再次纠正。进化引擎发现这是第二次了，于是自动将其晋升到 learned-rules.md，并附上验证检查：

Never use ternary operators. Always use if/else blocks. verify: Grep("[?] .* : ", path="src/") → 0 matches [source: corrected 2x, 2026-03-30]

Claude 告诉你：它已经永久学会了这条规则，从此会自动验证。

第 5 次会话。 Claude 开始处理一项复杂任务。在写任何代码之前，验证扫描（Verification Sweep）静默运行，逐条执行 learned-rules.md 里每个 verify: 检查。三元运算符的 grep 返回 0 匹配。全部通过。Claude 继续工作，写出的都是 if/else 块——因为规则已经加载了。纠正加验证，构成双重防线。

第 8 次会话。在日常工作中，Claude 注意到所有 service 函数都将返回值包装在 Result 类型里。它没有把这当作"中等置信度"的猜测记下来，而是立刻用 grep 搜索反例：在 src/services/ 目录下查找返回非 Result 的函数。15 个函数，0 个反例。置信度：confirmed。它直接带着 verify 检查自动晋升为规则，并告诉你："已验证并添加规则：所有 service 函数返回 Result。检查方式：在 services/ 目录下 grep 非 Result 返回值。"

第 12 次会话。你运行 /project:evolve。它读取 sessions.jsonl 的趋势数据：过去 10 次会话里，每次的纠正数从 3 降到了 0.5。有两条规则已经连续通过验证超过 10 次会话。它提议将这两条晋升到 CLAUDE.md。还有一条规则的验证标记是 verify: manual，被标记为需要重写。你对每条提议逐一审批、拒绝或修改。

第 20 次会话。系统已经积累了 8 条带验证检查的已学规则（Learned Rules），3 条已晋升为永久配置，2 条被淘汰。每条规则都自动执行。纠正次数趋近于零。这些规则专属于你的代码库，而且每条都附带证据，不是猜测。每次会话开始时的验证扫描在你还没描述任务之前就已经拦截了违规。

这就是整个循环。纠正生成检查，观察自我验证，规则自动执行。系统每一代都变得更强。

哪些该定制，哪些保持原样

必须定制（不改系统无法正常工作）：

CLAUDE.md：Commands 部分（换成你实际的构建/测试/lint 命令）

CLAUDE.md：Architecture 部分（换成你实际的目录结构）

settings.json：allow/deny 模式匹配你的工具链（把 npm 换成 pnpm、make 等）

建议定制（能让系统更贴合你的技术栈）：

CLAUDE.md：Conventions 部分（调整为你的代码风格指南）

Rules 的路径模式（调整为你实际的目录结构）

API 设计规则（调整响应格式、分页格式以匹配你的 API）

保持原样（这些是通用的）：

决策框架（先 grep、评估影响范围、最小改动）

完成标准模式

记忆系统和进化循环

智能体定义（architect 和 reviewer 适用于任何项目）

斜杠命令（review 和 fix-issue 适用于任何 git/npm 项目）

settings.json 的 deny 列表（拦截 rm -rf 和 force push 永远没错）

测试你的配置

创建完所有文件后，在你的项目里启动 Claude Code，运行以下测试：

测试认知核心（Cognitive Core）：问 Claude "你的完成标准是什么？"Claude 应该列出 CLAUDE.md 中的 5 项检查。

测试决策框架：让 Claude 添加一个工具函数。它应该先 grep 查找已有模式，而不是直接开始写代码。

测试路径作用域规则（Path-scoped Rules）：让 Claude 创建一个 API 端点。安全规则和 API 设计规则应该自动激活（你会看到它提到输入校验、响应格式、鉴权检查）。

测试 review 命令：输入 /project:review。它应该运行测试套件、linter 和类型检查器，然后审查 diff。

测试进化引擎：纠正 Claude 一个行为（"我们不这样做，我们一律用 X"）。它应该确认纠正并记录下来。对同一模式再纠正一次，它应该自动晋升到 learned-rules.md 并告知你。

测试 evolve 命令：在积累了几次会话的纠正和观察后，运行 /project:evolve。它应该分析日志并提出具体的规则变更方案，等你审批。

测试验证引擎：在 learned-rules.md 中添加一条带有 verify: 行的规则，并且你知道它会失败（例如，grep 一个确实存在于代码中的模式）。开始新会话。Claude 应该运行验证扫描，检测到违规，并在你还没提任何要求之前就报告出来。然后修复违规，再开一次新会话。验证扫描应该静默通过。这就是免疫系统在运转。

测试会话评分：在一次 Claude 接受了纠正的会话结束后，检查 .claude/memory/sessions.jsonl。里面应该有一条记录，包含纠正次数、规则检查数和通过/失败率。

测试假设验证：告诉 Claude "看看我们在这个文件里是怎么处理错误的。"它应该用 grep 搜索模式、统计出现次数、检查反例，如果确认了，就提议将其添加为带检查的已验证规则——不是模糊的观察，而是经过验证的事实。

常见问题

这会影响我所有的 Claude 项目吗？

不会。.claude/ 文件夹和 CLAUDE.md 的作用域是项目级别的。它们只影响在该项目目录内运行的 Claude Code 会话。其他项目、Claude.ai 网页聊天和 API 完全不受影响。

如果我想让某些规则跨所有项目生效呢？

在你的 home 目录创建 ~/.claude/CLAUDE.md。它会加载到每一个 Claude Code 会话中，不管当前在哪个项目。适合存放个人偏好，比如"我偏好 early return"或"始终使用 conventional commits"。

CLAUDE.md 应该多长？

控制在 150 行以内。超过这个长度，Claude 的指令遵从度会下降，因为上下文变得嘈杂。把领域特定的规则移到 .claude/rules/ 文件里，而不是让 CLAUDE.md 越来越臃肿。

记忆文件不会无限增长吗？

observations.jsonl 和 corrections.jsonl 是追加写入的，确实会增长。但实际上增长很慢（大约每次会话 5-10 条）。/project:evolve 命令实际上就是在"消化"它们——把反复出现的模式晋升为已学规则。你可以在处理完之后归档旧的日志条目。learned-rules.md 设计上限 50 行，强制要么晋升要么淘汰。

记忆文件可以纳入版本控制吗？

可以，但我建议把 observations.jsonl 和 corrections.jsonl 加到 .gitignore（它们是个人会话数据），而提交 learned-rules.md 和 evolution-log.md（它们是经过整理的团队知识）。在 .gitignore 中添加：

.claude/memory/observations.jsonl

.claude/memory/corrections.jsonl

.claude/memory/violations.jsonl

.claude/memory/sessions.jsonl

如果 Claude 不遵守规则怎么办？

三个原因，按可能性排序。第一，CLAUDE.md 太长了——缩减到 150 行以内，把领域特定规则移到 .claude/rules/ 文件里。

第二，规则太模糊——"写好代码"毫无用处，"函数最多 30 行，最多 3 层嵌套"才是可执行的。每条规则都要具体到能测试合规性。

第三，如果你已经启用了验证引擎，检查规则是否有 verify: 行。带机器可检查测试的规则会自动执行，没有的则只能靠 Claude 自己记住。

给每条规则加上 verify: 行。

可以用在 Python / Go / Rust / 其他技术栈吗？

完全可以。这套架构与技术栈无关。把 npm 命令替换为你对应的工具（pytest、go test、cargo test），调整目录结构，更新编码约定就行。决策框架、进化循环、智能体和记忆系统在任何语言下都一样运作。

这和 Claude Code 内置的自动记忆有什么区别？

Claude Code 自带 /memory 功能，可以跨会话自动记录笔记。这很有用，但缺乏结构——它本质上是个便签本。本系统在自动记忆的基础上增加了三样东西：自动执行规则的验证检查、让知识在不同置信层级间流转的晋升阶梯（Promotion Ladder），以及衡量系统是否真正在改进的会话评分（Session Scoring）。自动记忆记住事情，这套系统执行它们。

一键配置脚本

如果你想一次性创建整个文件夹结构，这里有一个 bash 脚本。保存为 setup-claude.sh，在你的项目根目录运行：

#!/bin/bash

echo "Creating .claude/ folder structure..."

mkdir -p .claude/rules

mkdir -p .claude/agents

mkdir -p .claude/commands

mkdir -p .claude/skills/evolution

mkdir -p .claude/memory

echo "Structure created. Now create these files:"

echo ""

echo " CLAUDE.md (copy from article Step 2)"

echo " .claude/settings.json (copy from article Step 3)"

echo " .claude/rules/security.md (copy from article Step 4)"

echo " .claude/rules/api-design.md (copy from article Step 4)"

echo " .claude/rules/performance.md (copy from article Step 4)"

echo " .claude/agents/architect.md (copy from article Step 5)"

echo " .claude/agents/reviewer.md (copy from article Step 5)"

echo " .claude/commands/review.md (copy from article Step 6)"

echo " .claude/commands/fix-issue.md (copy from article Step 6)"

echo " .claude/commands/evolve.md (copy from article Step 6)"

echo " .claude/skills/evolution/SKILL.md (copy from article Step 7)"

echo " .claude/memory/README.md (copy from article Step 8)"

echo " .claude/memory/learned-rules.md (copy from article Step 8)"

echo " .claude/memory/evolution-log.md (copy from article Step 8)"

echo ""

echo "Then customize CLAUDE.md with your actual commands and folder structure."

echo ""

echo "Add to .gitignore:"

echo " CLAUDE.local.md"

echo " .claude/settings.local.json"

echo " .claude/memory/observations.jsonl"

echo " .claude/memory/corrections.jsonl"

echo ""

echo "Done. Launch 'claude' in this directory to start."

这套系统到底给了你什么

经过这番改造，Claude Code 已经进化成一个更具认知能力的版本——多个子系统在每次会话中协同工作，持续提升你的开发体验。

每一条已学规则都附带一个机器可检查的验证测试。系统不指望 Claude 凭记忆遵守规则——它直接运行 grep，确认合规，只在发现违规时才浮出水面。最好的免疫系统是无形的，你只有在什么东西突破防线时才会注意到它。

真正的价值在于第 1 次会话和第 20 次会话之间的差距。

第 1 次会话的体验可能和一份写得好的 CLAUDE.md 差不多，但随着你使用这套系统的会话越来越多，它会进化得越来越远。

如果你想更进一步，利用 Claude 提供的工具实现整个系统的真正自动化，那就把下面这部分也一并复制过去：

ACTIVATE THESE:

The base system above works. But without these three additions, it

relies on Claude remembering to run checks. This section makes it

structural — hooks that fire automatically, not instructions that might

get skipped.

Update settings.json with hooks

Replace your .claude/settings.json with this version that adds three

hooks to the permissions you already have:

{

"$schema": "https://json.schemastore.org/claude-code-settings.json",

"permissions": {

"allow": [

"Bash(npm run *)",

"Bash(npx *)",

"Bash(node *)",

"Bash(git status)",

"Bash(git diff *)",

"Bash(git log *)",

"Bash(git branch *)",

"Bash(git show *)",

"Bash(git stash *)",

"Bash(gh issue *)",

"Bash(gh pr *)",

"Bash(cat *)",

"Bash(ls *)",

"Bash(find *)",

"Bash(head *)",

"Bash(tail *)",

"Bash(wc *)",

"Bash(grep *)",

"Read",

"Write",

"Edit",

"Glob",

"Grep"

"deny": [

"Bash(rm -rf *)",

"Bash(rm -r *)",

"Bash(sudo *)",

"Bash(eval *)",

"Bash(git push --force *)",

"Bash(git reset --hard *)",

"Bash(npm publish *)",

"Read(./.env)",

"Read(./.env.*)",

"Read(./**/*.pem)",

"Read(./**/*.key)"

]

"hooks": {

"SessionStart": [

{

"matcher": "startup",

"hooks": [

{

"type": "command",

"command": "echo \"[EVOLUTION BOOT] Session started.

Learned rules: $(wc -l < .claude/memory/learned-rules.md 2>/dev/null ||

echo 0) lines. Corrections logged: $(wc -l <

.claude/memory/corrections.jsonl 2>/dev/null || echo 0). READ

.claude/memory/learned-rules.md BEFORE STARTING WORK. RUN VERIFICATION

SWEEP ON ALL RULES WITH verify: LINES.\"",

"timeout": 5,

"statusMessage": "Booting evolution engine..."

}

]

}

"PreToolUse": [

{

"matcher": "Edit|Write",

"hooks": [

{

"type": "command",

"command": "echo \"[EVOLUTION] Before editing: check if

this file has path-scoped rules in .claude/rules/. Check

learned-rules.md for relevant patterns.\"",

"timeout": 3

}

]

}

"Stop": [

{

"matcher": "",

"hooks": [

{

"type": "command",

"command": "echo \"[EVOLUTION] Session ending. If

corrections were received or observations made, ensure they are logged

to .claude/memory/. Write session scorecard to sessions.jsonl.\"",

"timeout": 3

}

]

}

]

}

各个钩子（Hook）的作用：

┌───────────────┬───────────┬─────────────────────────────────────────┐

│ Hook │ 触发时机 │ 功能 │

├───────────────┼───────────┼─────────────────────────────────────────┤

│ │ │ 注入一条系统消息，告诉 Claude 读取 │

│ │ 每次新会话 │ learned-rules.md 并运行验证扫描。 │

│ SessionStart │ 启动时 │ Claude 不可能"忘记"，因为钩子在它看到 │

│ │ │ 你的第一条消息之前就已经触发了。 │

│ │ 每次文件 │ 提醒 Claude 检查当前要编辑的文件是否 │

│ PreToolUse on │ 编辑之前 │ 有路径作用域规则和已学规则。 │

│ Edit/Write │ │ 这个"助推"在违规发生之前就阻止了它。 │

│ │ │ 告诉 Claude 记录纠正和观察，并写入 │

│ Stop │ 会话结束时 │ 会话评分卡。不再有"忘了记录"的问题—— │

│ │ │ 钩子在退出时自动触发。 │

└───────────────┴───────────┴─────────────────────────────────────────┘

这就是"指令说要做"和"系统结构性地让你很难跳过"之间的区别。

创建 .claude/skills/boot/SKILL.md：

用法：在会话开始时输入 /boot

description: Session boot sequence. Run at the start of every session

to load memory and verify rules.

Load Memory

Learned Rules

cat .claude/memory/learned-rules.md 2>/dev/null || echo "No learned rules yet"

Last Session Score

tail -1 .claude/memory/sessions.jsonl 2>/dev/null || echo "No session history"

Unresolved Violations

tail -5 .claude/memory/violations.jsonl 2>/dev/null || echo "No violations logged"

Instructions

Read every rule in learned-rules.md. For each rule that has a

verify: line, run the check NOW.

Report results:

All pass → "All rules verified. Ready to work."

Any fail → list violations with file:line and specific fix.

Check session trend: if sessions.jsonl has 5+ entries, report

one-line trend (corrections increasing/decreasing, common violations).

You are now primed. Proceed with the user's task.

用法：在会话开始时输入 /boot。SessionStart 钩子会自动提醒 Claude，

但这条命令会强制执行完整的验证扫描并加载上下文。

创建核心不变量

创建 .claude/rules/core-invariants.md：

description: Critical invariants that must survive context compression.

Rules that caused real bugs when violated.

paths:

- "*/"

Core Invariants

These rules load on every file edit. They exist because violating them

caused production bugs.

Replace these examples with your own hard-won lessons:

[Your most painful bug]. Describe the pattern that must never

happen, and the correct pattern. Example: "Never use

{headers: merged, ...options} — always destructure

{ headers, ...rest } first."

[Your second most painful bug]. Same format.

[Critical safety rule]. Something that causes data loss,

security holes, or silent failures if violated.

The paths: ["*/"] means this file loads on every file touch. Even

when conversation context gets long and compresses, these rules get

re-injected fresh on every tool call. They're compression-proof.

用你项目中造成过最大痛苦的 3-5 条规则来定制这个文件。如果你还没有，

留一条占位规则，让进化系统随时间自动填充——violations.jsonl 中反复

出现的规则就是候选者。

完整流程

Session starts

→ SessionStart hook fires: "read learned-rules, run verification

sweep"

→ /boot loads memory + runs all verify: checks

→ Rules verified before any work begins

During work

→ PreToolUse hook fires on every Edit/Write: "check rules for this

file"

→ core-invariants.md loads on every file touch (compression-proof)

→ Path-scoped rules load for matching files

→ Observations auto-verified via grep, auto-promoted if confirmed

→ Corrections auto-generate verify: patterns

Session ends

→ Stop hook fires: "log everything, write scorecard"

→ sessions.jsonl tracks trend over time

没有任何一步依赖 Claude "记得要做"。钩子让一切变成结构性的。

把整篇文章收藏好，复制粘贴到你想要启用这套机制的任何 Claude 项目里。

这套系统能自动完成 80% 的自进化工作。随时在 Claude Code 中使用 /evolve 命令来触发更多进化，把某些行为固化到项目 DNA 中。建议每 10 次以上会话运行一次。

自进化，已就绪。

原文来源：微信公众号 | 转载于 Wechat Mediapress

Wechat Mediapress

搜索此博客