1. Design Principle — Output First
Every question exists because a specific output element needs the data. If a question doesn’t feed a visible output, it doesn’t belong.
FREE Tier Outputs → Data Needed
| Output Element | Data Required |
| Archetype assignment | Top 2 superpowers → maps to 1 of 6 archetypes |
| Top 3 superpowers (visual bars) | 5 dimension scores (relative ranking) |
| AI Potential | Mindset Index + peak dimension + learning velocity proxy |
| Best Percentile stat | Industry, role, company type, seniority (multi-dimensional benchmarking) |
| Growth edge | Weakest dimension (positive framing) |
| Mode profile (Card C) | Primary + secondary mode + agentic orientation |
| Adaptive card layout | Overall percentile tier (determines visibility) |
PAID Tier Outputs → Additional Data Needed
| Output Element | Data Required |
| Superpowered Score (0–100) | 4 component indices: Mindset 35%, Skills 30%, Domain 25%, Technical 10% |
| 20 sub-competency scores | Layer 2 behavioral evidence for all 20 sub-competencies |
| Full radar chart | All 5 dimension scores (refined with Layer 2) |
| Mode depth chart | Per-mode depth 0–5 + agentic depth per mode |
| Self-perception gap | Layer 1 vs Layer 2 score comparison per dimension |
| AI Potential (refined) | Learning velocity from conversation + consistency gap |
| 3 learning path recs | Weakest sub-competencies + domain-specific growth areas |
| Certificate | Name, archetype, score, percentile, framework version, date |
2. Assessment Flow
FREE Tier — Spark Profile (~7–8.5 min to result)
PHASE 0: AI WELCOME CHAT (2.5-3 min)
│ Conversational onboarding + mode profiling
│ Msgs 1-4: name, work context, AI usage, mindset hint
│ Msgs 5-6: mode selection (multi-select) + depth per mode
│ NLP extracts: industry, role, company type
│ Direct capture: modes used, depth per mode
▼
PHASE 2: FORCED-CHOICE ITEMS (3-4 min)
│ 22 items: tap one of 2 statements (~8-10 sec each)
▼
PHASE 3: ANCHORING + AGENTIC + SKILL (1 min)
│ 5 anchoring (Likert) + 1 agentic + 3 skill items
▼
LAYER 1 SCORING ENGINE
│ → 5 dimensions, archetype, AI Potential, mode profile
│ + Phase 0 signals (adoption, mindset, mode depth)
▼
★ EMAIL GATE ★
│ "Your profile is ready! Where should I send it?"
▼
★ SPARK PROFILE + ADAPTIVE CARD ★
│ Mode Profile labeled "your self-assessment"
▼
PHASE 4: MINI AI CHAT (2-3 min)
│ References what user said in Phase 0
│ Creates curiosity gap → upgrade trigger
▼
UPGRADE WALL
Flow note: Phase 0 combines conversational onboarding (Msgs 1–4) with interactive mode profiling (Msgs 5–6), replacing old form fields AND old MODE1/MODE2 items from Phase 3. Mode depth is self-reported with compression scoring (capped at 70/100). Email Gate captures leads at peak curiosity. Mode Profile in free tier labeled “your self-assessment” with upgrade CTA.
PAID Tier — Full Profile (~25–35 min)
EVERYTHING FROM FREE (7-8.5 min)
▼
PHASE 5: FULL AI CONVERSATION (15-20 min)
│ 5a: Experience Exploration (5-8 min)
│ 5b: Scenario Challenges (5-8 min)
│ 5c: Reflection & Closing (2-3 min)
▼
FULL SCORING ENGINE
│ L1 (30%) + L2 (70%)
│ 20 sub-competencies, 4 indices
▼
FULL PROFILE + REPORT + CERTIFICATE
3. Phase 0: AI Welcome Chat
Phase 0 combines conversational onboarding (Messages 1–4) with interactive mode profiling (Messages 5–6). It captures context, adoption level, mindset hint, AND full mode selection + depth — replacing both the old form fields and the old MODE1/MODE2 items.
Conversation Script (6–7 messages, ~2.5–3 min)
Message 1 — Name
AI: “Hi! I’m here to map your AI superpowers. Let’s start simple — what’s your name?”
User: [free text] → extracts Name
Message 2 — Work Context
AI: “Nice to meet you, [Name]! What do you do? Tell me your role and industry — just in your own words.”
User: [free text] → NLP extracts Role, Industry, Company type
Message 3 — AI Adoption
AI: “Got it. Now I’m curious — how does AI show up in your work today?”
User: [free text] → extracts AI Adoption Level + usage patterns
Message 4 — Mindset Hint
AI: “When AI gives you a result you didn’t expect, what’s your first instinct?”
User: [free text] → extracts Mindset hint
Message 5 — Mode Selection (interactive cards)
AI: “Now let me understand HOW you use AI. Which of these do you do? Tap all that apply.”
4 tapable mode cards (multi-select):
💬 Chatuju s AI
🎨 Tvorim obsah s AI
🔧 Buduji s AI
⚡ Automatizuju s AI
Message 6 — Mode Depth (per selected mode)
AI: “For each one — tap the description that best matches what you ACTUALLY DO:”
For each selected mode, show 5 descriptive cards (single-select, no numbers visible). See depth card tables below.
Message 7 — Transition
AI: “Love it. I have a great picture of how you work with AI. Now let’s find out where your real superpowers are.”
→ Animated transition to Phase 2 (forced-choice UI)
Mode Depth Cards (behavioral framing: “what do you DO?”)
User sees descriptive cards only — no numbers. Internal depth values are for scoring backend.
💬 Conversational
| Internal | Card text |
| C-1 | I occasionally ask AI something |
| C-2 | I regularly use AI to solve work tasks |
| C-3 | I lead a dialogue with AI — I give context, iterate, refine |
| C-4 | AI knows my context — I know how to prompt it for my work |
| C-5 | AI is my daily partner for strategy and decision-making |
🎨 Creative
| Internal | Card text |
| R-1 | I occasionally generate something — an image, text, a presentation |
| R-2 | I regularly create content with AI — I have my own workflows |
| R-3 | I create at a professional level — deep in one or more formats |
| R-4 | I have my own production system — multiple AI tools in a coordinated process |
| R-5 | I produce at scale — campaigns, series, dozens of outputs |
🔧 Builder
| Internal | Card text |
| B-1 | I tried, but nothing I actually use |
| B-2 | I built a few simple things — pages, tools |
| B-3 | I regularly build things I use at work |
| B-4 | I build apps that others use too |
| B-5 | I manage AI agents that code for me |
⚡ Orchestration
| Internal | Card text |
| O-1 | I tried connecting a few tools |
| O-2 | I have a few simple automations that save me time |
| O-3 | I build workflows where AI processes data by rules |
| O-4 | I have systems that run on their own — I check them occasionally |
| O-5 | My workflows adapt and make decisions on their own |
Mode depth design principles (expert panel): Behavioral framing (“what do you DO”), no numbers visible, max 8–10 words per card, cumulative levels, compression scoring (capped at 70/100), free tier labeled “your self-assessment.” Expected L1 vs L2 gap: 30–40% of users self-report 1–2 levels higher than L2 confirms.
NLP Extraction Logic
| Field | Extraction Method | Fallback |
| Name | Direct from Message 1 | Ask again |
| Industry | NLP classification → 12 categories | Quick picker: 12 chips |
| Role | NLP classification → 7 levels | Quick picker: 7 chips |
| Company type | NLP inference from context | Quick picker: 5 chips |
| AI Adoption | NLP → High / Medium / Low / None | Default Medium |
| Mindset Hint | NLP → Analytical / Trusting / Adaptive | Soft signal, no fallback |
| Mode Selection | Direct from Message 5 card taps | — |
| Mode Depth | Direct from Message 6 card taps | — |
Signals from Phase 0
| Signal | Source | Type | Usage |
| AI Adoption | Msg 3 | Bonus | Contextualizes L1 scores. Does NOT affect dimension scores. |
| Mindset Hint | Msg 4 | Bonus | Tiebreaker for archetype. Does NOT override L1. |
| Mode Selection | Msg 5 | Primary | Which modes appear in Mode Profile. Feeds scenario selection. |
| Mode Depth | Msg 6 | Primary (compressed) | Self-assessed depth. Capped at 70/100. Free tier: “self-assessment.” Paid: replaced 70% by L2. |
Deferred Fields
| Field | When Collected | Purpose |
| Seniority | Post-result (optional) or NLP-inferred | Seniority percentile |
| Country | Auto-detect from IP/locale | Geographic percentile |
Why conversational + interactive onboarding: Phase 0 combines natural conversation (Msgs 1–4) with interactive card-based UI (Msgs 5–6). This gives both qualitative context and structured mode data from the first 3 minutes. The user’s first impression is “I’m talking to an AI that gets me.”
4. Phase 2: Forced-Choice Item Bank (22 items)
Each item presents two positive statements. The user selects which resonates more. There is no wrong answer — the choice reveals which superpower is stronger. Items are tagged as Mindset (M) or Skill (S) to enable separate Mindset Index and Capability Index calculations.
Item Design Principles
- Both options are positive. No “wrong” choice.
- Each item contrasts two different dimensions.
- Language is natural and work-contextual.
- No tool names in items.
- Mindset items = attitude/orientation. Skill items = capability/action.
- Social desirability balance. Both options must sound equally “good” to a business professional. Validation required: present items to 20 people without context — if one option is chosen >65% as “more impressive,” rewrite it.
Perception Items (5)
P1 — AI Curiosity (M) vs. Critical Trust (M)
“When I hear about a new AI tool, I’m more likely to...”
(a) Try it right away to see what it can do → Perception: AI Curiosity
(b) Research its reliability and limitations before investing time → Intelligence: Critical Trust
P2 — Opportunity Recognition (S) vs. Prompt Mastery (S)
“When facing a new work challenge, my first instinct is to...”
(a) Think about which parts of this problem AI could handle → Perception: Opportunity Recognition
(b) Think about how to structure my input so AI gives me the best result → Intelligence: Prompt Mastery
P3 — AI Curiosity (M) vs. Iterative Learning (M)
“What excites me more about AI:”
(a) Discovering capabilities I didn’t know existed → Perception: AI Curiosity
(b) Seeing how fast my own workflow evolves because of it → Knowledge: Iterative Learning
P4 — Problem Reframing (S) vs. Process Decomposition (S)
“When given a complex task, I naturally...”
(a) Question whether the task itself is the right thing to do → Perception: Problem Reframing
(b) Break it down into smaller steps and assign each to the best approach → Integration: Process Decomposition
P5 — Opportunity Recognition (S) vs. Workflow Orchestration (S)
“I add the most value when I...”
(a) Spot where AI can be used in places nobody else thought of → Perception: Opportunity Recognition
(b) Design a system that connects multiple tools into a smooth workflow → Integration: Workflow Orchestration
Intelligence Items (4)
I1 — Critical Trust (M) vs. Creative Courage (M)
“When AI gives me an unexpected result, I’m more likely to...”
(a) Investigate why — I want to understand the reasoning → Intelligence: Critical Trust
(b) See if I can use it creatively — unexpected results can be valuable → Creation: Creative Courage
I2 — Prompt Mastery (S) vs. The 90/10 Craft (S)
“I spend more energy on...”
(a) Getting the right input into AI — the prompt, the context, the structure → Intelligence: Prompt Mastery
(b) Refining what comes out of AI — editing, iterating, polishing → Creation: The 90/10 Craft
I3 — Critical Trust (M) vs. Augmentation Vision (M)
“I trust AI most when...”
(a) I can verify its work against clear criteria → Intelligence: Critical Trust
(b) I’ve designed a system where AI and I each play to our strengths → Integration: Augmentation Vision
I4 — Strategic AI Dialogue (S) vs. Knowledge Architecture (S)
“My strength with AI is more about...”
(a) Using AI as a thinking partner — asking the right questions to get deep insights → Intelligence: Strategic AI Dialogue
(b) Organizing and structuring knowledge so it’s always ready to use → Knowledge: Knowledge Architecture
Knowledge Items (4)
K1 — Iterative Learning (M) vs. Critical Trust (M)
“When an AI tool I rely on starts giving worse results, I...”
(a) See it as a chance to find something better — I adapt fast → Knowledge: Iterative Learning
(b) Investigate what changed and whether the outputs can still be trusted → Intelligence: Critical Trust
K2 — Knowledge Architecture (S) vs. Opportunity Recognition (S)
“After a successful AI project, I’m more likely to...”
(a) Document what worked and create a reusable template → Knowledge: Knowledge Architecture
(b) Look for other situations where the same approach could apply → Perception: Opportunity Recognition
K3 — Iterative Learning (M) vs. Creative Courage (M)
“My relationship with change:”
(a) I actively seek it — I’m always looking for better ways to work → Knowledge: Iterative Learning
(b) I embrace it because it opens possibilities that didn’t exist before → Creation: Creative Courage
K4 — Knowledge Compounding (S) vs. Collaboration Design (S)
“When I discover a great AI workflow, I naturally...”
(a) Write it down so I or others can replicate it → Knowledge: Knowledge Compounding
(b) Show it to my team and design how we can use it together → Integration: Collaboration Design
Creation Items (4)
C1 — Creative Courage (M) vs. Augmentation Vision (M)
“I’m most proud when I...”
(a) Create something ambitious with AI that I couldn’t have done alone → Creation: Creative Courage
(b) Design a system where AI handles the heavy lifting while I focus on strategy → Integration: Augmentation Vision
C2 — The 90/10 Craft (S) vs. Workflow Orchestration (S)
“My work with AI is better described as...”
(a) I push AI outputs to a level of quality that impresses people → Creation: The 90/10 Craft
(b) I connect multiple AI tools into workflows that produce consistent results → Integration: Workflow Orchestration
C3 — Creative Courage (M) vs. Critical Trust (M)
“When considering an AI project I’ve never attempted before, I think...”
(a) “Let’s try it — worst case, I learn something” → Creation: Creative Courage
(b) “Let me first understand the risks and limitations” → Intelligence: Critical Trust
C4 — Building (S) vs. Context Engineering (S)
“My AI versatility shows in...”
(a) Building tools, apps, or prototypes — I make things that work → Creation: Building
(b) Preparing the right context for AI — the data, examples, and structure that make outputs excellent → Knowledge: Context Engineering
Integration Items (5)
G1 — Augmentation Vision (M) vs. Iterative Learning (M)
“I see AI’s biggest impact as...”
(a) Transforming how humans and technology work together as a system → Integration: Augmentation Vision
(b) Accelerating how fast we can learn, adapt, and improve → Knowledge: Iterative Learning
G2 — Workflow Orchestration (S) vs. Knowledge Architecture (S)
“My strongest contribution with AI is...”
(a) Designing multi-step processes where the right tool does each job → Integration: Workflow Orchestration
(b) Building organized knowledge systems that make information accessible and reusable → Knowledge: Knowledge Architecture
G3 — Augmentation Vision (M) vs. AI Curiosity (M)
“When I think about AI’s future, I focus on...”
(a) How human-AI collaboration will reshape how organizations work → Integration: Augmentation Vision
(b) What new capabilities will become possible that we can’t imagine today → Perception: AI Curiosity
G4 — Process Decomposition (S) vs. Problem Reframing (S)
“When I approach a big project with AI, I’m known for...”
(a) Breaking it into perfectly-sized pieces that each tool can handle → Integration: Process Decomposition
(b) Stepping back and redefining what we’re actually trying to achieve → Perception: Problem Reframing
G5 — Collaboration Design (S) vs. The 90/10 Craft (S)
“I create more value by...”
(a) Designing how my team uses AI together effectively → Integration: Collaboration Design
(b) Ensuring every AI-assisted deliverable meets the highest quality standard → Creation: The 90/10 Craft
Coverage Matrix
Ipsative measurement note: Forced-choice items produce ipsative (relative) scores — choosing one dimension suppresses another. A person cannot score high on all 5 simultaneously from forced-choice alone. The Dimension Anchoring Items in Phase 3 provide normative (absolute-level) calibration to complement the ipsative profile.
| Dimension | Appearances | Mindset (M) | Skill (S) |
| Perception | 8 | 3 (P1, P3, G3) | 5 (P2, P4, P5, K2, G4) |
| Intelligence | 8 | 4 (P1, I1, I3, K1, C3) | 3 (P2, I2, I4) |
| Knowledge | 9 | 3 (K1, K3, G1) | 5 (I4, K2, K4, G2, C4) |
| Creation | 8 | 4 (I1, K3, C1, C3) | 4 (I2, C2, C4, G5) |
| Integration | 11 | 4 (I3, C1, G1, G3) | 7 (P4, P5, K4, C2, G2, G4, G5) |
Balance note: Integration has the most appearances (11), Perception/Intelligence/Creation the fewest (8 each). Acceptable — Integration is the most cross-cutting dimension by design.
5. Phase 3: Anchoring, Agentic & Skill Items (12 items)
Dimension Anchoring Items (5 — Likert scale 1–5)
These items establish absolute levels for each dimension, solving the ipsative measurement trap. Rated 1–5 (“Not at all like me” to “Exactly like me”).
ANCHOR1 — Perception
“I actively look for new ways AI could be used in my work — even in areas where nobody else is using it yet.”
1 · 2 · 3 · 4 · 5
ANCHOR2 — Intelligence
“I have a systematic approach to evaluating AI outputs — I know what to trust, what to verify, and what to reject.”
1 · 2 · 3 · 4 · 5
ANCHOR3 — Knowledge
“I organize my AI knowledge — prompts, templates, workflows — so I can reuse and build on what I’ve learned.”
1 · 2 · 3 · 4 · 5
ANCHOR4 — Creation
“I use AI to tackle ambitious creative projects I wouldn’t attempt on my own.”
1 · 2 · 3 · 4 · 5
ANCHOR5 — Integration
“I design workflows where AI handles repeatable tasks while I focus on judgment and strategy.”
1 · 2 · 3 · 4 · 5
Dimension_L1_Final = (Ipsative_Rank_Normalized × 0.6) + (Anchor_Score_Normalized × 0.4)
Agentic Orientation (1 item — single select)
Note: Mode selection and mode depth are now captured in Phase 0 (Section 3, Messages 5–6). Old MODE1/MODE2 items removed — the new Phase 0 design provides richer mode data (multi-select modes + 5-level depth per mode).
MODE3 — Agentic Orientation
“Which best describes how AI operates in your daily work?”
(a) I start every AI interaction myself → Non-agentic (0)
(b) I have AI assistants with instructions I return to regularly → Low agentic (1)
(c) Some of my AI processes run on their own, I check in periodically → Moderate agentic (2)
(d) I have AI systems that make decisions and take actions autonomously → High agentic (3)
Skill Signal Items (3 — single select)
SKILL1 — AI Usage Frequency
“How often do you use AI tools in your work?”
(a) Rarely — a few times a month → Score: 1
(b) Weekly — it’s part of my toolkit → Score: 2
(c) Daily — I use AI for multiple tasks every day → Score: 3
(d) Constantly — AI is running in the background of most of my work → Score: 4
SKILL2 — AI Tool Breadth
“How many different AI tools do you use regularly?”
(a) 1 (mostly ChatGPT or similar) → Score: 1
(b) 2–3 different tools → Score: 2
(c) 4–6 tools across different categories → Score: 3
(d) 7+ tools — I have a full AI toolkit → Score: 4
SKILL3 — Building Depth
“Have you built anything with AI? (apps, automations, custom tools)”
(a) No, I haven’t tried → Score: 0
(b) I’ve experimented but nothing I use regularly → Score: 1
(c) Yes, I’ve built tools or automations I actually use → Score: 2
(d) I regularly build apps, scripts, or systems with AI → Score: 3
6. Layer 1 Scoring Model
Dimension Scoring
Each forced-choice item assigns +1 to the chosen dimension and 0 to the other. Raw dimension scores are the sum of all choices for that dimension across 22 items.
Dimension_Normalized = (Raw_Score / Max_Possible_Score) × 100
Mindset Index (for AI Potential)
Mindset_Index_L1 = average(
Perception_Mindset_Items, // AI Curiosity signals
Intelligence_Mindset_Items, // Critical Trust signals
Knowledge_Mindset_Items, // Iterative Learning signals
Creation_Mindset_Items, // Creative Courage signals
Integration_Mindset_Items // Augmentation Vision signals
) × normalization_factor
AI Potential (Layer 1 estimate)
AI_Potential_L1 = min(100, weighted_average(
Mindset_Index_L1 × 1.4, // High mindset = high ceiling
Peak_Dimension × 1.1, // Strongest dimension → potential
SKILL1_frequency × 10, // Usage frequency as velocity proxy
100 - |Mindset - Capability| × 0.5 // Gap = untapped potential
))
Archetype Assignment
Top 2 dimensions by normalized score → Archetype lookup:
| Top 2 Dimensions | Archetype |
| Integration + Intelligence | AI Architect |
| Perception + Intelligence | AI Navigator |
| Creation + Integration | AI Builder |
| Knowledge + Perception | AI Catalyst |
| Creation + Knowledge | AI Amplifier |
| Perception + Creation | AI Pioneer |
7. Email Gate & Lead Capture
The Email Gate sits between Layer 1 scoring and result display — the moment of peak curiosity. The user has invested 6+ minutes, scoring is complete, and they’re about to see their profile. This is the optimal point for email capture.
UX Copy & Design
Scoring complete → Loading: “Analyzing your responses…”
“Your AI Superpower Profile is ready!”
Enter your email to see your results and get personalized tips for growth.
[ your@email.com ] [ Show my profile → ]
We’ll send your profile + 3 personalized growth tips. No spam, unsubscribe anytime.
Design principles: Show a blurred preview of the Spark Profile card behind the email form (curiosity amplifier). Loading animation before the gate builds anticipation. Single field only (email) — name already captured in Phase 0. No skip option: users invested 6+ min and WILL enter email (sunk cost + curiosity). Target capture rate: 75–85%.
What Happens on Submit
- Email validated (format + disposable email detection)
- Spark Profile displayed immediately — zero delay
- Welcome email sent within 60 seconds (profile card, top 3 superpowers, 3 growth tips, CTA to Full Profile)
- Lead captured in CRM with tags: archetype, top superpower, industry, role, company type, AI adoption level
Email Follow-Up Sequence
| Day | Email | Content | CTA |
| Day 0 | Welcome + Profile | Spark Profile card, top 3 superpowers, 3 growth tips | “Go deeper → Full Profile” |
| Day 3 | Growth Insight | Deep-dive on #1 superpower: what it means, how top performers use it | “See how you compare →” |
| Day 7 | Curiosity Nudge | Tease Full Profile reveals (sub-competencies, radar, learning path). Reference Growth Edge. | “Unlock your full potential →” |
| Day 14 | Social Proof | Aggregated stats: “3,200+ professionals mapped. [Archetype] types like you tend to…” | “Get your complete profile →” |
Sequence principle: Each email adds NEW value (not just “buy now”). The user should feel smarter after reading each email, even if they never upgrade. This builds trust and brand — some upgrade on day 3, some on day 14, some forward it to colleagues (organic growth).
Expected Metrics
| Metric | Target |
| Email capture rate | 75–85% |
| Welcome email open rate | 65–75% |
| Day 3 open rate | 35–45% |
| Day 7 open rate | 25–35% |
| Paid conversion from sequence | 5–12% |
8. Phase 4: Mini AI Chat (FREE Tier)
2–3 personalized follow-up questions based on Layer 1 results. Gives users a taste of the AI conversation and creates the upgrade trigger.
Q1: Strongest Superpower Probe (always asked first)
| Top Superpower | Question |
| Perception | “You seem to naturally spot AI opportunities. Can you give me a quick example — the last time you saw an AI use case that others missed?” |
| Intelligence | “You strike me as someone who really thinks about how they use AI. What’s an example of a time your careful approach paid off?” |
| Knowledge | “You have a knack for making things reusable. What’s the most valuable AI template or system you’ve built for yourself?” |
| Creation | “You seem like someone who ships ambitious things with AI. What’s the most creative project you’ve tackled?” |
| Integration | “You think in systems. What’s the most sophisticated AI workflow you’ve designed?” |
Q2: Growth Area Hint (asked second)
| Weakest Superpower | Question |
| Perception | “If you had to find ONE new area where AI could help that you haven’t explored yet — what would it be?” |
| Intelligence | “When AI gives you something 80% right, what’s your process for getting it to 100%?” |
| Knowledge | “After a great AI session, do you save what worked — or start fresh next time?” |
| Creation | “If you could build anything with AI this week — no constraints — what would it be?” |
| Integration | “Imagine you could automate one repetitive part of your weekly work. What would it be?” |
Upgrade Wall
There’s much more here.
In 3 quick answers, I can already see your [archetype_name] profile forming. But to really understand your superpowers — and show you where your biggest growth opportunity is — I need about 20 more minutes.
Unlock your Full Profile →
Deep AI conversation · Full superpower radar · Personalized learning path · Certificate
Curiosity Gap Principle (expert panel)
After each mini-chat response, the AI references something it noticed but can’t explore:
- After Q1: “The way you described that tells me something about your [second_superpower] that I’d need more time to unpack properly.”
- After Q2: “I can see a gap between what you know is possible and what you’re doing. That gap is actually your biggest asset — but I’d need to go deeper to show you why.”
Scoring note: Mini AI chat responses are NOT scored for Layer 2. They are purely for user experience, creating informational curiosity gaps, and optional qualitative flagging.
9. Phase 5: Full AI Conversation (PAID Tier)
Conversation Architecture
| Phase | Duration | Purpose |
| 5a: Experience Exploration | 5–8 min | Deep-dive into real AI usage, 5 Observable Differentiators |
| 5b: Scenario Challenges | 5–8 min | 3–5 adaptive scenarios, domain-specific + universal |
| 5c: Reflection & Closing | 2–3 min | Forward-looking mindset signals, identity framing |
5a: Experience Exploration
Opening:
“Let’s go deeper. Tell me about a recent project where AI played a significant role. Walk me through it — what was the task, what did you do, and how did it turn out?”
The AI probes the 2 weakest dimensions and 1 strongest (to confirm) using these probes:
| Probe | Tests | Question |
| A: Decomposition | Process Decomposition, Opportunity Recognition | “When you face a complex problem — how do you decide what to hand to AI and what to handle yourself?” |
| B: Multi-Tool | Workflow Orchestration, Building | “Did you use just one tool, or did you combine several? How did they work together?” |
| C: Iteration | The 90/10 Craft, Prompt Mastery | “When AI gives you something close but not quite right, what’s your process?” |
| D: System Thinking | Knowledge Architecture, Augmentation Vision | “Do you have any AI workflows or templates you use repeatedly?” |
| E: Reframing | Problem Reframing, AI Curiosity | “Has there been a time when the question itself needed to be different?” |
5b: Scenario Challenges
Selection logic: Focus on weakest 2 dimensions, include domain-specific scenario, include 1 outside primary mode, escalate complexity.
Universal Scenario Bank (select 2–3)
| ID | Scenario | Primary Dimensions |
| S1 | The Competitive Analysis Sprint — 15 companies, 3 days, no analyst | Perception, Intelligence, Integration |
| S2 | The AI Failure — hallucinated data already sent to client | Intelligence, Knowledge, Creation |
| S3 | The Reluctant Team — AI tool automates 40%, half resist | Integration, Creation, Knowledge |
| S4 | The Urgent Presentation — 2 hours, unexpected | Creation, Intelligence, Perception |
| S5 | Knowledge Overflow — 200 pages, extract 10 insights | Knowledge, Integration, Intelligence |
| S6 | The Creative Brief — full campaign in one day | Creation, Perception, Knowledge |
| S7 | Process Audit — find top 3 AI opportunities in department | Integration, Knowledge, Perception |
| S8 | The Learning Challenge — master a new AI tool | Knowledge, Intelligence, Creation |
5c: Reflection & Closing
Three final questions that capture mindset signals:
- Forward-looking: “Looking ahead 12 months — how do you expect your work with AI to change?”
- Temperature check: “Gut feeling — how much of your work will involve AI two years from now?”
- Identity: “If you had to describe your relationship with AI in one sentence?”
“Is there anything about how you use AI that we haven’t covered? Anything you’re particularly proud of, or struggling with?”
10. Layer 2 Scoring Model
Per-Response Scoring
| Score | Meaning |
| 0 | No signal (dimension not relevant) |
| 1 | Weak negative signal (absence of competency) |
| 2 | Weak positive signal (slight indication) |
| 3 | Clear positive signal (competency demonstrated) |
| 4 | Strong positive signal (exceptional depth) |
Confidence Weighting
| Confidence | Weight | When |
| High | 1.0 | Direct behavioral evidence — specific example, detailed process |
| Medium | 0.7 | Indirect evidence — general description, hypothetical |
| Low | 0.4 | Ambiguous — could indicate this or another dimension |
5 Observable Differentiators
| Differentiator | Score 1 | Score 3 | Score 5 | Maps to |
| Decomposition | Dumps whole problems | Sometimes breaks down | Reflexive decomposition | 5.3 + 1.2 |
| Multi-Tool | Single tool only | 2–3 tools linked | Complex pipelines | 5.2 + 4.3 |
| Iteration | 1–2 tries | 3–4 intentional rounds | Until quality bar met | 4.2 + 2.2 |
| System Thinking | Everything one-off | Some templates saved | Builds reusable systems | 3.2 + 5.1 |
| Reframing | Executes as given | Occasionally questions | Redefines before executing | 1.4 + 1.1 |
Layer 2 Technical Architecture
Single-call LLM design (expert panel): Use one LLM call per user turn that both continues conversation AND outputs structured scoring JSON. This halves API cost and eliminates scoring latency vs. running a separate “judge LLM.”
// Single LLM call returns both:
{
"conversation_response": "Natural language reply...",
"scoring": {
"sub_competencies": { "1.1_ai_curiosity": { "score": 3, "confidence": "high" } },
"language_signals": ["exploration", "system_thinking"],
"next_probe_priority": "knowledge"
}
}
Observable Differentiator reliability: Score each differentiator at 3 separate points during conversation (not just once at end), average the 3 scores, calibrate with 50+ human-expert-scored transcripts before launch.
11. Combined Scoring (Layer 1 + Layer 2)
Sub-Competency Score
Sub_Competency_Final = (L1_Score × 0.30) + (L2_Score × 0.70)
Dimension Score
Dimension_Score =
(Mindset_Sub × 0.30) +
(Skill_Sub × 0.30) +
(Additional_1 × 0.20) +
(Additional_2 × 0.20)
Normalized to 0-100.
Superpowered Score
Superpowered_Score =
(Mindset_Index × 0.35) +
(Capability_Index × 0.30) +
(Application_Index × 0.25) +
(Technical_Depth × 0.10)
Range: 0-100
AI Potential (refined with Layer 2)
AI_Potential_raw = weighted_average(
Mindset_Index × 1.4,
Learning_Velocity × 1.2,
Strongest_Dimension × 1.1,
100 - Consistency_Gap × 0.5
)
// Sigmoid normalization to distribute across 40-100 range
AI_Potential_Final = sigmoid_map(AI_Potential_raw,
pilot_mean, pilot_sd, target_range=[40, 100])
Display threshold: Only show AI Potential when AI_Potential − Score ≥ 10. For top performers where they’re nearly equal, omit it — it adds no insight at that level.
Self-Perception Gap
Consistency = 1 - |L1_normalized - L2_normalized| / 100
> 0.8: "Highly consistent self-awareness"
0.5-0.8: "Some gaps between self-perception and practice"
< 0.5: "Significant self-perception gap"
Self-report inflation detection: If Layer 1 exceeds Layer 2 by >20 points for any dimension: “Your self-perception in [dimension] is stronger than what your practices show. This often means untapped potential — you know what good looks like but haven’t fully applied it yet.”
12. Data-to-Output Mapping
| Output | Phase 0 (AI Chat) | Phase 2 (FC) | Phase 3 (Mode/Skill) | Email Gate | Phase 4 (Mini) | Phase 5 (Full) |
| Archetype | Mindset hint (tiebreaker) | Top 2 dimensions | — | — | — | Refined |
| Top 3 Superpowers | — | Dimension ranking | — | — | — | Refined |
| AI Potential | Adoption level (context) | Mindset + peak dim | SKILL1 velocity | — | — | Learning velocity + gap |
| Best Percentile | Industry, role, company type | Dimension scores | — | — | — | Refined scores |
| Growth Edge | — | Weakest dimension | — | — | — | Weakest sub-competency |
| Mode Profile | Mode selection + depth (self-report, compressed) | — | MODE3 (agentic) | — | — | Behavioral depth (70% weight) |
| SP Score | — | L1 estimate | Skill signals | — | — | Full 4-component |
| 20 Sub-competencies | — | L1 signals | — | — | — | Full L2 scoring |
| Learning Path | Industry context | — | — | — | — | Weakest 3 + domain |
| Certificate | Name | — | — | — | — | Score + archetype |
| Lead Capture | Name, industry, role, company type, adoption | — | — | Email | — | — |
| Email Sequence | Growth Edge + adoption for tips | Top superpower for content | — | Triggers sequence | Curiosity gap for Day 3 | Full data for emails |
13. UX Design Principles
Progress & Pacing
- Show progress bar throughout (“Step 2 of 4” or percentage)
- Forced-choice: one per screen on mobile, 2–3 visible on desktop
- Estimated time remaining always visible
- Mode/Skill items feel like a break (different visual format)
Visual Design
- Clean, minimal interface — the assessment itself should feel “superpowered”
- Superpower colors: ■ Perception ■ Intelligence ■ Knowledge ■ Creation ■ Integration
- Forced-choice items: two cards side by side, tap to select
- Back button: Allowed within Phase 2 and Phase 3, not between phases. Phase 0 chat is forward-only. Going back is tracked as a signal
- Phase 0 → Phase 2 transition: Animated morph from chat bubbles to forced-choice cards (1–2 sec)
- Post-result → Phase 4 transition: Animated shift back to chat UI — already familiar from Phase 0
Tone
- Phase 0 welcome: Friendly, curious, human — “Hi! I’m here to map your AI superpowers.”
- Phase 0 → Phase 2: “Love it. Now let’s find out where your real superpowers are — this next part is quick.”
- Progress: “Great — you’re almost through the quick calibration”
- Results tease: “Your profile is ready. Let me show you what I found.”
- Phase 4 mini-chat opener: References Phase 0 context — “Earlier you mentioned [X]. That tells me something interesting…”
Mobile-First
- Full-width cards, large tap targets
- Chat: familiar messaging UI (bubbles, typing indicator)
- Results: swipeable card format
- Share: native share sheet integration
14. Anti-Gaming & Quality Controls
Time-Per-Item Tracking
If 22 items completed in <60 seconds (<3s per item), flag as “speed-through.” Don’t invalidate — apply confidence: low modifier to all L1 scores, increasing L2 weight.
Semantic Consistency Check
The Consistency Score (L1 vs L2 comparison) detects both self-perception gaps (coaching insight) and gaming (reliability concern):
- > 0.8: High confidence in all scores
- 0.5–0.8: Flag dimensions with biggest gaps for extra L2 probing
- < 0.5: Mark as “low confidence” — recommend re-assessment
No explicit attention checks (“select option B”). They break immersion and feel insulting to professionals.
Social Desirability Mitigation
- Pre-launch validation: Present all 22 items to 20 people without context — if one option chosen >65%, rewrite it
- Both-positive design: Both options must sound equally desirable
- Ipsative format helps: Forced-choice is inherently more resistant to gaming than agree/disagree scales
15. Re-Assessment Strategy
Hybrid Item Rotation (60/40)
| Component | Strategy | Rationale |
| Phase 0 AI Chat | Same structure, different phrasing | AI varies wording naturally; Q3 (AI usage) captures genuine change over time |
| Forced-choice | 13 anchor items identical + 9 rotated | Anchors track change; rotation reduces memory gaming |
| Anchoring items | Always identical (all 5) | Stable absolute-level measurement |
| Mode/Skill items | Always identical | Track actual behavioral change |
| Mini AI chat | Different questions each time | Natural variation |
| Layer 2 scenarios | Always different | Adaptive nature ensures different paths |
Expanded bank needed: 35 total items (13 permanent + 22 rotatable, drawing 9 per session). Target: Q3 2026.
16. Alternative Path — Conversation Upload Assessment
The Idea
Instead of answering questions about how you use AI, you show how you use AI. The user uploads 30–50 of their past AI conversations — from any tool (ChatGPT, Claude, Cursor, Copilot, Gemini, or any export) — and the system analyzes their actual behavior to generate a full SP™ AI Score profile.
This is a 100% behavioral assessment — no self-report, no forced-choice, no “how often do you…” questions. Just real evidence of how you work with AI.
Why This Path Exists
| Problem with traditional assessment | How conversation upload solves it |
| People overestimate their skills | Conversations don’t lie — they show what you actually do |
| Assessment fatigue (“not another quiz”) | Zero questions — you just share what you already have |
| Hard to assess advanced users | Power users leave the richest behavioral traces |
| Self-report misses nuance | Real conversations reveal patterns the user doesn’t even notice |
| Takes 25–35 minutes | Upload takes 2 minutes — analysis happens in the background |
What the User Provides
Input: 30–50 past conversations with any AI tool.
Accepted formats:
- Cursor sessions — exported from session history (agent transcripts)
- ChatGPT exports — via Settings → Data Controls → Export Data (JSON)
- Claude exports — via conversation history export
- Plain text / Markdown — copy-pasted conversation logs
- Any AI tool — as long as the format shows user messages and AI responses
Minimum requirement: 30 conversations spanning at least 2 weeks. Recommended: 50+ conversations across 30+ days to avoid recency bias.
What the System Analyzes
1. AI Interaction Modes
| Signal | What it reveals |
| Questions, brainstorming, back-and-forth dialogue | Conversational mode — do you think WITH AI? |
| Content creation, drafts, creative requests | Creative mode — do you produce, iterate, and refine? |
| Code, tools, apps, prototypes, technical builds | Builder mode — do you build things that work? |
| Automation, pipelines, multi-step workflows, agents | Orchestration mode — do you design systems, not just tasks? |
2. Five Superpowers
| Superpower | What we look for |
| Perception | Do you spot opportunities? Do you reframe problems? Do you use AI for things others wouldn’t think of? |
| Intelligence | Do you think strategically with AI? Do you evaluate outputs critically? Do you simulate perspectives? |
| Knowledge | Do you build systems that remember? Do you manage context? Do you compound knowledge across sessions? |
| Creation | Do you produce at quality? Do you iterate until it’s right? Do you maintain your own voice and standards? |
| Integration | Do you connect AI into your real workflows? Do you design processes, not just tasks? Do you build for others? |
3. AI Skills
| What we observe | Skill it maps to |
| How you structure requests, give context, iterate | Prompting |
| How many tools, models, and integrations you combine | Tool Selection |
| Whether you create reusable agents, instructions, or templates | AI Assistants |
| Whether you build apps, scripts, prototypes, or tools | Vibe Coding |
| Whether you create content — posts, articles, copy, docs | Content Creation |
| Whether you analyze, clean, or enrich data | Data Work |
| Whether things run without you | Automation |
| Whether you design multi-step or multi-agent workflows | Autonomous Workflows |
| Whether you build systems that persist and reuse knowledge | Knowledge Management |
| Whether you use AI to accelerate entire projects | Project Acceleration |
4. Mindset — Observable Through Behavior
| Mindset component | Behavioral signal |
| Curiosity & Openness | How often do you try new approaches? Do you experiment? |
| Critical Trust | Do you push back on AI? Do you evaluate outputs or just accept them? |
| Augmentation Vision | Do you ask AI to DO things, or to help you THINK about things? |
| Learning Velocity | Are you visibly better in recent conversations than older ones? |
| Collaboration Design | Do you design AI interactions for others, or only for yourself? |
How the Output Looks
The output is a full SP™ AI Score profile — identical in structure to the Paid Full Profile:
- SP™ AI Score (0–100) — composite score based on the four weighted components
- Archetype — derived from top 2 superpowers
- Narrative — 2–3 sentences describing who you are as an AI professional
- Radar chart — 5 superpower dimensions
- Mode profile — which modes you use and at what depth
- Your Stack (Skills) — all 10 skills ranked and scored in human language
- Behavioral Evidence — the strongest patterns observed, written as insights
- What Would Make You Faster — 3–5 growth recommendations, each with a “Solve with AI” button that opens the prompt directly in Claude or ChatGPT
Scoring Methodology
Scoring follows the standard SP™ framework with one key difference: Layer 2 (behavioral evidence) carries 100% of the weight — there is no Layer 1 self-report.
SP™ AI Score = weighted_average(
Mindset Index × 35% // observed via behavioral proxies
Applied AI Skills × 30% // observed via tool usage and task types
Domain Integration × 25% // observed via workflow complexity
Technical Depth × 10% // observed via builder-mode activity
)
Comparison to Standard Assessment
| Aspect | Standard (questionnaire) | Conversation Upload |
| Input | User answers questions about behavior | User shows actual behavior |
| Time to complete | 7–35 minutes | 2 minutes + async processing |
| Self-report bias | Present (people overestimate) | Absent (behavior doesn’t lie) |
| Works for beginners | Yes — questions are accessible | Limited — needs 30+ AI conversations |
| Works for power users | May feel reductive | Excellent — richer signals |
| Scoring layers | L1 (30%) + L2 (70%) or L1 only | L2 only (100% behavioral) |
| Privacy model | Answers stored | Conversations processed and discarded |
Target Audience
Ideal for:
- Power users who use AI daily and have rich conversation history
- Professionals who hate quizzes — “don’t ask me, look at what I do”
- Teams / enterprise — bulk upload from shared tools, no scheduling
- Developers and builders — Cursor/Copilot sessions contain the richest data
- Repeat assessment — upload latest 50 conversations quarterly
Not ideal for:
- AI beginners — fewer than 30 conversations won’t produce a meaningful profile
- Casual users — shallow conversations produce shallow profiles
- Privacy-sensitive users — requires sharing actual conversation content
Pilot Results
Tested with 50+ Cursor session transcripts (Jan–Feb 2026):
- 20 sessions → usable but recency-biased profile
- 50 sessions → stable, accurate, and rich profile
- Behavioral signals strong enough to score all 20 sub-competencies including mindset
- The “What Would Make You Faster” section had the highest perceived value
17. Open Questions
Resolved (from expert panel)
| # | Question | Resolution |
4 | AI Potential clusters 85–95 | Added sigmoid normalization + display threshold (≥ 10 gap) |
7 | Anti-gaming | Time-per-item tracking + semantic consistency. No attention checks. |
8 | Re-assessment items | Hybrid rotation (60% anchor / 40% rotated) |
Still Open
- Item validation (NON-NEGOTIABLE): 22 FC items + 5 anchoring items need validation with 50–100 pilot users. Required before public launch.
- Phase 0 + Mini chat scoring weight: Phase 0 bonus signals (AI Adoption, Mindset Hint) are used for personalization and tiebreaking, NOT primary scoring. Mini chat (Phase 4) remains UX/conversion only. Revisit if pilot shows conversational responses are highly diagnostic.
- Scenario bank size: Target 20 universal by Q2 2026, 35+ rotatable items by Q3 2026.
- Layer 1 strategic accuracy: Business decision — make L1 directionally accurate (archetype correct 70%+) but numerically imprecise. Show archetype confidently; blur dimension scores/sub-competencies in free tier.
- Chat technology: Start with text; add voice (ElevenLabs) as v2 feature.
- Observable Differentiator calibration: 50+ human-expert-scored transcripts needed. Target: Cohen’s kappa > 0.7.
- Enterprise path: Add “share with your team” prompt post-assessment.