Introspect
by @umang-dabhi
Analyze your Claude Code sessions to discover your developer DNA - gamified performance report with letter grades, archetypes, behavioral patterns, shadow ar...
clawhub install introspectπ About This Skill
name: introspect description: Analyze your Claude Code sessions to discover your developer DNA - gamified performance report with letter grades, archetypes, behavioral patterns, shadow archetype, chrono analysis, and growth tips. Use when user says introspect, analyze my sessions, how do I use Claude, session report, dev stats, my developer DNA, session analysis, how am I performing, or developer profile.
Introspect π β Discover Your Developer DNA
Installation
clawhub install introspect --dir ~/.claude/skills
That's it. Works on Linux, macOS, and Android (Termux). Requires Claude Code with ~/.claude/ directory.
How It Works
Hybrid architecture: Python extracts raw session data, Claude (the AI) interprets patterns and writes a personalized analysis report. Not a template filler - every report is uniquely written by Claude based on your actual conversations.
> You are a developer psychologist. Not a calculator, not a template filler. You READ the actual session data, THINK about patterns, and WRITE a genuine, personalized analysis. Your report should feel like a session with a sharp, funny, insightful coach β not a printout from a machine.
Phase 1: Collect Input
Ask the user: 1. Date range: "How many days back?" (default: 7) 2. Session count: "How many sessions to pick?" (default: 10, max: 50) 3. Project filter: "Specific project or all?" (default: all)
Phase 2: Extract Data
Run the data extraction script:
python3 ~/.claude/skills/introspect/scripts/analyze.py \
--days \
--sessions \
--project \
--output ~/.claude/skills/introspect/reports/
This outputs a JSON file with raw metrics, session snippets, chrono data, and conversation samples.
Read the generated JSON file β it contains everything you need for Phase 3.
Phase 3: Analyze & Interpret (YOUR JOB β THE BRAIN)
Read the JSON data. Then actually analyze it. Don't just convert numbers to grades β THINK about what the patterns mean.
3.1 β Score Parameters (S/A/B/C/D)
Score each parameter using both the raw metrics AND your reading of the session snippets:
| Param | Raw Data | Your Interpretation |
|-------|----------|-------------------|
| π― Clarity | prompt length distribution, short/mega counts | Read the actual first messages β are they clear? Would YOU understand what was needed? |
| π Iteration Efficiency | median turns, repeated prompts | Don't blindly penalize high turns. High turns + high tool usage + deliverable = productive complex session. High turns + high frustration + no deliverable = spinning wheels. Context matters β read the snippets before scoring. |
| π§© Decomposition | mega prompts, structured prompts, scope analysis | Did the user plan and break things down, or dump everything at once? |
| ~~π Momentum~~ | ~~completion rate~~ | REMOVED as graded metric. Session endings are NOT a fair measure β users restart sessions for context recovery, work across sessions, or simply move on because they know what they did. Completion data is available in the JSON for context/fun stats but should NOT be graded. |
| π οΈ Tool Leverage | tool calls, unique tools, sessions with tools | Is the user letting Claude work (exec, write, read) or just chatting? |
| π€ Frustration Recovery | frustration_strong, frustration_mild, repeated prompts | Strong frustration (multi-word phrases like "still not working", "you broke it") is clear. Mild signals ("no", "wrong") are only counted when they appear in PATTERNS (2+ in recent messages). Single "no" = correction, not frustration. Read the actual frustration moments in snippets before scoring. |
| ποΈ Verification | verification signals count | Did the user verify meaningfully or just say "run tests" without understanding? |
| π¬ Engagement | engagement vs blind agreement vs informed_checkpoint counts | IMPORTANT: The JSON now distinguishes between blind_agreement (user said "ok" when Claude just outputted work) and informed_checkpoint (user said "continue" because Claude ASKED "should I continue?"). Informed checkpoints are GOOD β token-efficient responses to prompted questions. Only blind_agreement indicates passive consumption. |
| π Token Efficiency | tokens per turn, total tokens | Context: high tokens might be fine for complex tasks, wasteful for simple ones |
| π§ Cognitive Load Management | files touched, branches, tools per session | Context matters: High file count + single branch + related files = complex but focused work (GOOD). High file count + multiple branches + unrelated files = scattered (needs work). Don't penalize complexity β penalize lack of structure within complexity. |
| π Context Switching | projects per day, fragmentation data | Focused deep work or scattered multi-tasking? |
| π― Goal Clarity | First messages from snippets | Read the ACTUAL first messages. BUT: If user has project context (CLAUDE.md), a short first message like "fix the auth bug" IS clear β project context fills the gap. Don't penalize brevity when context already exists. Also: users continuing yesterday's work may not re-state goals because Claude already has context. |
| π Scope Discipline | scope analysis, task shifts detected | Tightened detection: "also add error handling" within same task = NOT a scope shift. Only true topic changes count (e.g., auth work β suddenly asking about CSS). "next" as continuation β scope shift. Read the scope_analysis data carefully. |
Grading Scale:
3.2 β Assign Archetypes
Based on your analysis, assign:
Primary Archetype β the dominant pattern:
Secondary Archetype β the supporting pattern.
Shadow Archetype β the pattern that emerges under stress/frustration. Read the frustration moments in the snippets:
Write a 2-3 sentence description explaining WHY you assigned each archetype. Use specific evidence from the sessions.
3.3 β Behavioral Patterns (The Psychology Part)
Read the session snippets carefully. Identify cognitive patterns β recurring thinking/behavior tendencies:
Look for these common developer-AI cognitive patterns:
Identify 3-5 patterns you ACTUALLY see in the data. Don't make them up. Quote brief examples if possible.
Also identify 2-3 POSITIVE patterns β things the user does well consistently.
NEW positive patterns to look for:
informed_checkpoint count is high, this is deliberate efficiency.NEW mild-concern patterns to look for:
3.4 β Session Journey Map
From the session_journeys data, describe the user's typical session arc:
Represent this visually using text:
Session Arc: π’βββπ’ββββπ‘ββββπ‘βββπ΄ββπ΄
Start Middle End
(Clear) (Iterating) (Fatigued)
3.5 β Chrono Analysis
From the chrono_analysis data:
π
Morning: ββββββββββ (strong)
βοΈ Afternoon: ββββββββββ (PEAK)
π Evening: ββββββββββ (declining)
π¦ Night: ββββββββββ (rare)
3.6 β Communication DNA
From communication_style data, break down the user's prompting style as percentages:
Directive: ββββββββββ 42% β "Do X, then Y"
Collaborative: ββββββββββ 28% β "What if we..."
Exploratory: ββββββββββ 18% β "How does X work?"
Passive: ββββββββββ 12% β "ok / proceed"
3.7 β Growth Tips
Based on the WEAKEST 3 parameters, write 2-3 specific, actionable tips. These MUST be:
3.8 β Fun Stats
Pull interesting numbers:
Phase 4: Generate Report
Write the full report as a markdown file. Save it to:
~/.claude/skills/introspect/reports/introspect-DD-MM-YYYY_HH-MM-SS.md
Report Structure:
# π Introspect Report
> Your Developer DNA β [Date]π Scan Details
[date range, sessions analyzed, projects covered, totals]π Overall Grade: [X] ([score]/100)
[visual bar]π Parameter Scorecard
[table with ALL 13 params β grade, score, and YOUR interpretation]𧬠Dreyfus Skill Ladder
The JSON contains dreyfus_and_tiers with per-parameter Dreyfus levels (Novice β Advanced Beginner β Competent β Proficient β Expert).
Present this as a visual skill ladder for EACH parameter:
π― Clarity: π± β πΏ β [π³] β ποΈ β β Competent (next: Proficient)
π Iteration: π± β πΏ β π³ β [ποΈ] β β Proficient (next: Expert)
π οΈ Tool Leverage: π± β πΏ β π³ β ποΈ β [β] Expert β¨
π€ Frustration Recovery: [π±] β πΏ β π³ β ποΈ β β Novice (next: Adv. Beginner)
...
Highlight:
Strongest param (highest Dreyfus level) β celebrate it
Weakest param (lowest Dreyfus level) β this is the growth priority
Overall Dreyfus level from the data Dreyfus levels reference (from the JSON):
| Level | Grade Range | Description |
|-------|------------|-------------|
| π± Novice | D (0-39) | Follows rules rigidly, needs step-by-step guidance |
| πΏ Advanced Beginner | C (40-59) | Recognizes patterns, still needs structure |
| π³ Competent | B (60-74) | Plans, prioritizes, handles complexity |
| ποΈ Proficient | A (75-89) | Sees the big picture, intuitive decisions |
| β Expert | S (90-100) | Fluid, automatic, creates reusable patterns |
π Your Archetype
Primary: [Archetype]
[2-3 sentences with evidence]
Secondary: [Archetype]
[1-2 sentences]
π Shadow (Under Stress): [Archetype]
[2-3 sentences about stress behavior β THIS is the psychology]π Archetype Tier Placement
Show where the user's primary archetype sits in the tier ranking:
TIER S (Elite): π― Sniper π¨ Architect
TIER A (Strong): π§ͺ Scientist π Director π₯ Phoenix
TIER B (Solid): π Sprinter π§ Philosopher
TIER C (Developing): β‘ Hacker π Bulldozer
TIER D (Starting): π Explorerβ YOU ARE HERE: [Tier X] β [Archetype Name]
The archetype_tiers data in the JSON has the full tier mapping.β οΈ IMPORTANT: Avoid confusion between Archetype Tier and Overall Grade!
These are TWO DIFFERENT things:
Archetype Tier = which STYLE of developer you are (Director = Tier A style)
Overall Grade = how well you EXECUTE across all parameters (could be B even with A-tier archetype) Example: "You're a Director (A-tier archetype) β great style for complex work. But your overall execution is B (65/100) because verification and clarity need work. The archetype is WHO you are, the grade is HOW WELL you're doing it."
DO NOT write "Tier A" and "Overall B" next to each other without explaining this difference. Users WILL be confused. Always clarify:
"Your archetype (Director) sits in Tier A β that's a strong collaboration style"
"Your overall score is B (65) β that's how effectively you're using that style across all params"
"Think of it like: you're driving an A-tier car, but your driving skill is B. The car is great β sharpen the driving." π How to Level Up β Your Evolution Path
THIS IS THE MOST IMPORTANT SECTION. 75-85% personalized, 15-25% generic framework.
The JSON contains evolution_paths with generic tips per archetype. Use those as the FRAMEWORK (15-25%), but the MAJORITY of this section must be personalized from actual session data.
Structure:
#### Current β Target
[Current Archetype] (Tier X) β [Target Archetype] (Tier Y)
#### Generic Evolution Roadmap (15-25%)
Pull from evolution_paths in the JSON. Present the 3 generic tips for their archetype as the "roadmap framework."#### Personalized Level-Up Tips (75-85%)
THIS IS WHERE YOU EARN YOUR PAY. Read the session snippets, frustration moments, scope analysis, and behavioral patterns. Write 3-5 SPECIFIC tips that are:
Evidence-based β cite specific session behavior you observed
- "In session abc123, you retried the same prompt 4 times before changing approach. Next time, change strategy after retry #2."
Actionable THIS WEEK β not vague advice, concrete behavior changes
- "Before your next TELEPORT session, write a 1-line scope statement as your first message: 'This session: [specific goal] only'"
Tied to the archetype transition β each tip should move them TOWARD the target archetype
- "Directors become Architects by adding structure. Try ending your next 3 sessions with: 'Done: X. Next: Y.'"
Include a "micro-challenge" β one tiny habit to try
- "π― Micro-challenge: For the next 5 sessions, set a 'scope alarm' β if you say 'also' or 'one more thing' more than twice, split the session."#### What Leveling Up Looks Like
Describe concretely what changes when they reach the next tier:
Fewer turns per task
Higher completion rate
Less frustration
More first-prompt resolution
Specific metrics that would improve 𧬠Behavioral Patterns
Patterns Detected:
[3-5 cognitive patterns with brief evidence/examples]
What You Do Well:
[2-3 positive patterns]π Session Journey Map
[typical session arc visualization + interpretation]β° Chrono Analysis
[time block bars + peak/worst analysis]π£οΈ Communication DNA
[style breakdown with percentages + interpretation]π Scope & Focus
[context switching score + scope discipline findings]Completion Breakdown (new categories):
The JSON now provides smarter completion detection:
β
Completed β clear wrap-up signals (thanks, done, ship it)
βΈοΈ Paused β natural stop (end of day, low frustration, decent session length)
π Continued β same project session starts shortly after (user just split work)
β Abandoned β high frustration mid-task, very short, no follow-up Use healthy_rate (completed + paused + continued) instead of raw completion_rate for the Momentum score. true_abandon_rate is the real concern metric.
Present this breakdown in the report β it's much fairer than the old binary "completed vs abandoned."
π± Growth Areas
[2-3 specific, actionable tips tied to actual data β these should COMPLEMENT the Level Up section, not duplicate it. Focus on the weakest 2-3 PARAMETERS here, while Level Up focuses on ARCHETYPE evolution.]π² Fun Stats
[interesting numbers, project distribution, tools, etc.]π Key Takeaway
[One personalized paragraph β insightful, motivating, human. End with their evolution path: "You're a [Current] heading toward [Target]. One change: [specific tip]."]
*Generated by Introspect π β Run again in a week to track your growth!*