08 — Assessment Design — Documentation

1. Design Principle — Output First

Every question exists because a specific output element needs the data. If a question doesn’t feed a visible output, it doesn’t belong.

FREE Tier Outputs → Data Needed

Output Element	Data Required
Archetype assignment	Top 2 superpowers → maps to 1 of 6 archetypes
Top 3 superpowers (visual bars)	5 dimension scores (relative ranking)
AI Potential	Mindset Index + peak dimension + learning velocity proxy
Best Percentile stat	Industry, role, company type, seniority (multi-dimensional benchmarking)
Growth edge	Weakest dimension (positive framing)
Mode profile (Card C)	Primary + secondary mode + agentic orientation
Adaptive card layout	Overall percentile tier (determines visibility)

PAID Tier Outputs → Additional Data Needed

Output Element	Data Required
Superpowered Score (0–100)	4 component indices: Mindset 35%, Capability (skills) 30%, Application (domain) 25%, Technical Depth 10%
20 sub-competency scores	Layer 2 behavioral evidence for all 20 sub-competencies
Full radar chart	All 5 dimension scores (refined with Layer 2)
Mode depth chart	Per-mode depth 0–5 + agentic depth per mode
Self-perception gap	Layer 1 vs Layer 2 score comparison per dimension
AI Potential (refined)	Learning velocity from conversation + consistency gap
3 learning path recs	Weakest sub-competencies + domain-specific growth areas
Certificate	Name, archetype, score, percentile, framework version, date

2. Assessment Flow

FREE Tier — Spark Profile (~7–8.5 min to result)

PHASE 0: AI WELCOME CHAT (2.5-3 min)
│ Conversational onboarding + mode profiling
│ Msgs 1-4: name, work context, AI usage, mindset hint
│ Msgs 5-6: mode selection (multi-select) + depth per mode
│ NLP extracts: industry, role, company type
│ Direct capture: modes used, depth per mode
▼
PHASE 2: FORCED-CHOICE ITEMS (3-4 min)
│ 22 items: tap one of 2 statements (~8-10 sec each)
▼
PHASE 3: ANCHORING + AGENTIC + SKILL (1 min)
│ 5 anchoring (Likert) + 1 agentic + 3 skill items
▼
LAYER 1 SCORING ENGINE
│ → 5 dimensions, archetype, AI Potential, mode profile
│ + Phase 0 signals (adoption, mindset, mode depth)
▼
★ EMAIL GATE ★
│ "Your profile is ready! Where should I send it?"
▼
★ SPARK PROFILE + ADAPTIVE CARD ★
│ Mode Profile labeled "your self-assessment"
▼
PHASE 4: MINI AI CHAT (2-3 min)
│ References what user said in Phase 0
│ Creates curiosity gap → upgrade trigger
▼
UPGRADE WALL

Flow note: Phase 0 combines conversational onboarding (Msgs 1–4) with interactive mode profiling (Msgs 5–6), replacing old form fields AND old MODE1/MODE2 items from Phase 3. Mode depth is self-reported with compression scoring (capped at 70/100). Email Gate captures leads at peak curiosity. Mode Profile in free tier labeled “your self-assessment” with upgrade CTA.

PAID Tier — Full Profile (~25–35 min)

EVERYTHING FROM FREE (7-8.5 min)
▼
PHASE 5: FULL AI CONVERSATION (15-20 min)
│ 5a: Experience Exploration (5-8 min)
│ 5b: Scenario Challenges (5-8 min)
│ 5c: Reflection & Closing (2-3 min)
▼
FULL SCORING ENGINE
│ L1 (30%) + L2 (70%)
│ 20 sub-competencies, 4 indices
▼
FULL PROFILE + REPORT + CERTIFICATE

3. Phase 0: AI Welcome Chat

Phase 0 combines conversational onboarding (Messages 1–4) with interactive mode profiling (Messages 5–6). It captures context, adoption level, mindset hint, AND full mode selection + depth — replacing both the old form fields and the old MODE1/MODE2 items.

Conversation Script (6–7 messages, ~2.5–3 min)

Message 1 — Name

AI: “Hi! I’m here to map your AI superpowers. Let’s start simple — what’s your name?”

User: [free text] → extracts Name

Message 2 — Work Context

AI: “Nice to meet you, [Name]! What do you do? Tell me your role and industry — just in your own words.”

User: [free text] → NLP extracts Role, Industry, Company type

Message 3 — AI Adoption

AI: “Got it. Now I’m curious — how does AI show up in your work today?”

User: [free text] → extracts AI Adoption Level + usage patterns

Message 4 — Mindset Hint

AI: “When AI gives you a result you didn’t expect, what’s your first instinct?”

User: [free text] → extracts Mindset hint

Message 5 — Mode Selection (interactive cards)

AI: “Now let me understand HOW you use AI. Which of these do you do? Tap all that apply.”

4 tappable mode cards (multi-select):

💬 I chat with AI
🎨 I create content with AI
🔧 I build with AI
⚡ I automate with AI

Message 6 — Mode Depth (per selected mode)

AI: “For each one — tap the description that best matches what you ACTUALLY DO:”

For each selected mode, show 5 descriptive cards (single-select, no numbers visible). See depth card tables below.

Message 7 — Transition

AI: “Love it. I have a great picture of how you work with AI. Now let’s find out where your real superpowers are.”

→ Animated transition to Phase 2 (forced-choice UI)

Mode Depth Cards (behavioral framing: “what do you DO?”)

User sees descriptive cards only — no numbers. Internal depth values are for scoring backend.

💬 Conversational

Internal	Card text
C-1	I occasionally ask AI something
C-2	I regularly use AI to solve work tasks
C-3	I lead a dialogue with AI — I give context, iterate, refine
C-4	AI knows my context — I know how to prompt it for my work
C-5	AI is my daily partner for strategy and decision-making

🎨 Creative

Internal	Card text
R-1	I occasionally generate something — an image, text, a presentation
R-2	I regularly create content with AI — I have my own workflows
R-3	I create at a professional level — deep in one or more formats
R-4	I have my own production system — multiple AI tools in a coordinated process
R-5	I produce at scale — campaigns, series, dozens of outputs

🔧 Builder

Internal	Card text
B-1	I tried, but nothing I actually use
B-2	I built a few simple things — pages, tools
B-3	I regularly build things I use at work
B-4	I build apps that others use too
B-5	I manage AI agents that code for me

⚡ Orchestration

Internal	Card text
O-1	I tried connecting a few tools
O-2	I have a few simple automations that save me time
O-3	I build workflows where AI processes data by rules
O-4	I have systems that run on their own — I check them occasionally
O-5	My workflows adapt and make decisions on their own

Mode depth design principles (expert panel): Behavioral framing (“what do you DO”), no numbers visible, max 8–10 words per card, cumulative levels, compression scoring (capped at 70/100), free tier labeled “your self-assessment.” Expected L1 vs L2 gap: 30–40% of users self-report 1–2 levels higher than L2 confirms.
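A minimal sketch of compression scoring, assuming a linear mapping from the five depth levels onto a 0–70 scale. Only the 70/100 cap comes from this document; the linear mapping and the function name `compress_mode_depth` are illustrative assumptions.

```python
MAX_DEPTH = 5          # five depth cards per mode (levels 1-5)
SELF_REPORT_CAP = 70   # self-reported depth is capped at 70/100


def compress_mode_depth(depth_level: int) -> float:
    """Map a self-reported depth level (1-5) onto a 0-70 scale.

    Assumption: the mapping is linear; the design only fixes the cap.
    """
    if not 1 <= depth_level <= MAX_DEPTH:
        raise ValueError(f"depth_level must be 1..{MAX_DEPTH}, got {depth_level}")
    return depth_level / MAX_DEPTH * SELF_REPORT_CAP


# Example: a user who tapped Conversational level 3 and Builder level 5
profile = {"conversational": 3, "builder": 5}
compressed = {mode: compress_mode_depth(level) for mode, level in profile.items()}
```

With this mapping, even a level-5 self-report tops out at 70, leaving the remaining headroom for Layer 2 behavioral evidence.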

NLP Extraction Logic

Field	Extraction Method	Fallback
Name	Direct from Message 1	Ask again
Industry	NLP classification → 12 categories	Quick picker: 12 chips
Role	NLP classification → 7 levels	Quick picker: 7 chips
Company type	NLP inference from context	Quick picker: 5 chips
AI Adoption	NLP → High / Medium / Low / None	Default Medium
Mindset Hint	NLP → Analytical / Trusting / Adaptive	Soft signal, no fallback
Mode Selection	Direct from Message 5 card taps	—
Mode Depth	Direct from Message 6 card taps	—

Signals from Phase 0

Signal	Source	Type	Usage
AI Adoption	Msg 3	Bonus	Contextualizes L1 scores. Does NOT affect dimension scores.
Mindset Hint	Msg 4	Bonus	Tiebreaker for archetype. Does NOT override L1.
Mode Selection	Msg 5	Primary	Which modes appear in Mode Profile. Feeds scenario selection.
Mode Depth	Msg 6	Primary (compressed)	Self-assessed depth, capped at 70/100. Free tier: labeled “self-assessment.” Paid tier: L2 behavioral evidence carries 70% of the weight.

Deferred Fields

Field	When Collected	Purpose
Seniority	Post-result (optional) or NLP-inferred	Seniority percentile
Country	Auto-detect from IP/locale	Geographic percentile

Why conversational + interactive onboarding: Phase 0 combines natural conversation (Msgs 1–4) with interactive card-based UI (Msgs 5–6). This gives both qualitative context and structured mode data from the first 3 minutes. The user’s first impression is “I’m talking to an AI that gets me.”

4. Phase 2: Forced-Choice Item Bank (22 items)

Each item presents two positive statements. The user selects which resonates more. There is no wrong answer — the choice reveals which superpower is stronger. Items are tagged as Mindset (M) or Skill (S) to enable separate Mindset Index and Capability Index calculations.

Item Design Principles

Both options are positive. No “wrong” choice.
Each item contrasts two different dimensions.
Language is natural and work-contextual.
No tool names in items.
Mindset items = attitude/orientation. Skill items = capability/action.
Social desirability balance. Both options must sound equally “good” to a business professional. Validation required: present items to 20 people without context — if one option is chosen >65% as “more impressive,” rewrite it.

Perception Items (5)

P1 — AI Curiosity (M) vs. Critical Trust (M)

“When I hear about a new AI tool, I’m more likely to...”

(a) Try it right away to see what it can do → Perception: AI Curiosity

(b) Research its reliability and limitations before investing time → Intelligence: Critical Trust

P2 — Opportunity Recognition (S) vs. Prompt Mastery (S)

“When facing a new work challenge, my first instinct is to...”

(a) Think about which parts of this problem AI could handle → Perception: Opportunity Recognition

(b) Think about how to structure my input so AI gives me the best result → Intelligence: Prompt Mastery

P3 — AI Curiosity (M) vs. Iterative Learning (M)

“What excites me more about AI:”

(a) Discovering capabilities I didn’t know existed → Perception: AI Curiosity

(b) Seeing how fast my own workflow evolves because of it → Knowledge: Iterative Learning

P4 — Problem Reframing (S) vs. Process Decomposition (S)

“When given a complex task, I naturally...”

(a) Question whether the task itself is the right thing to do → Perception: Problem Reframing

(b) Break it down into smaller steps and assign each to the best approach → Integration: Process Decomposition

P5 — Opportunity Recognition (S) vs. Workflow Orchestration (S)

“I add the most value when I...”

(a) Spot where AI can be used in places nobody else thought of → Perception: Opportunity Recognition

(b) Design a system that connects multiple tools into a smooth workflow → Integration: Workflow Orchestration

Intelligence Items (4)

I1 — Critical Trust (M) vs. Creative Courage (M)

“When AI gives me an unexpected result, I’m more likely to...”

(a) Investigate why — I want to understand the reasoning → Intelligence: Critical Trust

(b) See if I can use it creatively — unexpected results can be valuable → Creation: Creative Courage

I2 — Prompt Mastery (S) vs. The 90/10 Craft (S)

“I spend more energy on...”

(a) Getting the right input into AI — the prompt, the context, the structure → Intelligence: Prompt Mastery

(b) Refining what comes out of AI — editing, iterating, polishing → Creation: The 90/10 Craft

I3 — Critical Trust (M) vs. Augmentation Vision (M)

“I trust AI most when...”

(a) I can verify its work against clear criteria → Intelligence: Critical Trust

(b) I’ve designed a system where AI and I each play to our strengths → Integration: Augmentation Vision

I4 — Strategic AI Dialogue (S) vs. Knowledge Architecture (S)

“My strength with AI is more about...”

(a) Using AI as a thinking partner — asking the right questions to get deep insights → Intelligence: Strategic AI Dialogue

(b) Organizing and structuring knowledge so it’s always ready to use → Knowledge: Knowledge Architecture

Knowledge Items (4)

K1 — Iterative Learning (M) vs. Critical Trust (M)

“When an AI tool I rely on starts giving worse results, I...”

(a) See it as a chance to find something better — I adapt fast → Knowledge: Iterative Learning

(b) Investigate what changed and whether the outputs can still be trusted → Intelligence: Critical Trust

K2 — Knowledge Architecture (S) vs. Opportunity Recognition (S)

“After a successful AI project, I’m more likely to...”

(a) Document what worked and create a reusable template → Knowledge: Knowledge Architecture

(b) Look for other situations where the same approach could apply → Perception: Opportunity Recognition

K3 — Iterative Learning (M) vs. Creative Courage (M)

“My relationship with change:”

(a) I actively seek it — I’m always looking for better ways to work → Knowledge: Iterative Learning

(b) I embrace it because it opens possibilities that didn’t exist before → Creation: Creative Courage

K4 — Knowledge Compounding (S) vs. Collaboration Design (S)

“When I discover a great AI workflow, I naturally...”

(a) Write it down so I or others can replicate it → Knowledge: Knowledge Compounding

(b) Show it to my team and design how we can use it together → Integration: Collaboration Design

Creation Items (4)

C1 — Creative Courage (M) vs. Augmentation Vision (M)

“I’m most proud when I...”

(a) Create something ambitious with AI that I couldn’t have done alone → Creation: Creative Courage

(b) Design a system where AI handles the heavy lifting while I focus on strategy → Integration: Augmentation Vision

C2 — The 90/10 Craft (S) vs. Workflow Orchestration (S)

“My work with AI is better described as...”

(a) I push AI outputs to a level of quality that impresses people → Creation: The 90/10 Craft

(b) I connect multiple AI tools into workflows that produce consistent results → Integration: Workflow Orchestration

C3 — Creative Courage (M) vs. Critical Trust (M)

“When considering an AI project I’ve never attempted before, I think...”

(a) “Let’s try it — worst case, I learn something” → Creation: Creative Courage

(b) “Let me first understand the risks and limitations” → Intelligence: Critical Trust

C4 — Building (S) vs. Context Engineering (S)

“My AI versatility shows in...”

(a) Building tools, apps, or prototypes — I make things that work → Creation: Building

(b) Preparing the right context for AI — the data, examples, and structure that make outputs excellent → Knowledge: Context Engineering

Integration Items (5)

G1 — Augmentation Vision (M) vs. Iterative Learning (M)

“I see AI’s biggest impact as...”

(a) Transforming how humans and technology work together as a system → Integration: Augmentation Vision

(b) Accelerating how fast we can learn, adapt, and improve → Knowledge: Iterative Learning

G2 — Workflow Orchestration (S) vs. Knowledge Architecture (S)

“My strongest contribution with AI is...”

(a) Designing multi-step processes where the right tool does each job → Integration: Workflow Orchestration

(b) Building organized knowledge systems that make information accessible and reusable → Knowledge: Knowledge Architecture

G3 — Augmentation Vision (M) vs. AI Curiosity (M)

“When I think about AI’s future, I focus on...”

(a) How human-AI collaboration will reshape how organizations work → Integration: Augmentation Vision

(b) What new capabilities will become possible that we can’t imagine today → Perception: AI Curiosity

G4 — Process Decomposition (S) vs. Problem Reframing (S)

“When I approach a big project with AI, I’m known for...”

(a) Breaking it into perfectly-sized pieces that each tool can handle → Integration: Process Decomposition

(b) Stepping back and redefining what we’re actually trying to achieve → Perception: Problem Reframing

G5 — Collaboration Design (S) vs. The 90/10 Craft (S)

“I create more value by...”

(a) Designing how my team uses AI together effectively → Integration: Collaboration Design

(b) Ensuring every AI-assisted deliverable meets the highest quality standard → Creation: The 90/10 Craft

Coverage Matrix

Ipsative measurement note: Forced-choice items produce ipsative (relative) scores — choosing one dimension suppresses another. A person cannot score high on all 5 simultaneously from forced-choice alone. The Dimension Anchoring Items in Phase 3 provide normative (absolute-level) calibration to complement the ipsative profile.

Dimension	Appearances	Mindset (M)	Skill (S)
Perception	8	3 (P1, P3, G3)	5 (P2, P4, P5, K2, G4)
Intelligence	8	5 (P1, I1, I3, K1, C3)	3 (P2, I2, I4)
Knowledge	9	4 (P3, K1, K3, G1)	5 (I4, K2, K4, G2, C4)
Creation	8	4 (I1, K3, C1, C3)	4 (I2, C2, C4, G5)
Integration	11	4 (I3, C1, G1, G3)	7 (P4, P5, K4, C2, G2, G4, G5)

Balance note: Integration has the most appearances (11), Perception/Intelligence/Creation the fewest (8 each). Acceptable — Integration is the most cross-cutting dimension by design.
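The tallies in the matrix can be recomputed directly from the item bank, which is a quick way to catch drift when items are rewritten. The sketch below transcribes the 22 items from Section 4; the tuple layout and the `coverage` helper are illustrative, not part of the spec.

```python
from collections import Counter

# (item id, tag, dimension of option a, dimension of option b)
# M = Mindset, S = Skill. Transcribed from the item bank above.
ITEMS = [
    ("P1", "M", "Perception", "Intelligence"),
    ("P2", "S", "Perception", "Intelligence"),
    ("P3", "M", "Perception", "Knowledge"),
    ("P4", "S", "Perception", "Integration"),
    ("P5", "S", "Perception", "Integration"),
    ("I1", "M", "Intelligence", "Creation"),
    ("I2", "S", "Intelligence", "Creation"),
    ("I3", "M", "Intelligence", "Integration"),
    ("I4", "S", "Intelligence", "Knowledge"),
    ("K1", "M", "Knowledge", "Intelligence"),
    ("K2", "S", "Knowledge", "Perception"),
    ("K3", "M", "Knowledge", "Creation"),
    ("K4", "S", "Knowledge", "Integration"),
    ("C1", "M", "Creation", "Integration"),
    ("C2", "S", "Creation", "Integration"),
    ("C3", "M", "Creation", "Intelligence"),
    ("C4", "S", "Creation", "Knowledge"),
    ("G1", "M", "Integration", "Knowledge"),
    ("G2", "S", "Integration", "Knowledge"),
    ("G3", "M", "Integration", "Perception"),
    ("G4", "S", "Integration", "Perception"),
    ("G5", "S", "Integration", "Creation"),
]


def coverage(items):
    """Count appearances per dimension, overall and split by M/S tag."""
    total, by_tag = Counter(), Counter()
    for _item_id, tag, dim_a, dim_b in items:
        for dim in (dim_a, dim_b):
            total[dim] += 1
            by_tag[(dim, tag)] += 1
    return total, by_tag


total, by_tag = coverage(ITEMS)
for dim in sorted(total):
    print(dim, total[dim], by_tag[(dim, "M")], by_tag[(dim, "S")])
```

Each item contributes two appearances, so the per-dimension totals should always sum to 44.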

5. Phase 3: Anchoring, Agentic & Skill Items (9 items)

Dimension Anchoring Items (5 — Likert scale 1–5)

These items establish absolute levels for each dimension, solving the ipsative measurement trap. Rated 1–5 (“Not at all like me” to “Exactly like me”).

ANCHOR1 — Perception

“I actively look for new ways AI could be used in my work — even in areas where nobody else is using it yet.”

1 · 2 · 3 · 4 · 5

ANCHOR2 — Intelligence

“I have a systematic approach to evaluating AI outputs — I know what to trust, what to verify, and what to reject.”

1 · 2 · 3 · 4 · 5

ANCHOR3 — Knowledge

“I organize my AI knowledge — prompts, templates, workflows — so I can reuse and build on what I’ve learned.”

1 · 2 · 3 · 4 · 5

ANCHOR4 — Creation

“I use AI to tackle ambitious creative projects I wouldn’t attempt on my own.”

1 · 2 · 3 · 4 · 5

ANCHOR5 — Integration

“I design workflows where AI handles repeatable tasks while I focus on judgment and strategy.”

1 · 2 · 3 · 4 · 5

Dimension_L1_Final = (Ipsative_Rank_Normalized × 0.6) + (Anchor_Score_Normalized × 0.4)
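A minimal sketch of this blend, assuming the ipsative score arrives already normalized to 0–100 and that the 1–5 Likert anchor is rescaled linearly to 0–100. The rescaling rule and the function names are assumptions; only the 0.6/0.4 weights come from the formula above.

```python
IPSATIVE_WEIGHT = 0.6
ANCHOR_WEIGHT = 0.4


def normalize_anchor(likert: int) -> float:
    """Rescale a 1-5 Likert rating to 0-100 (assumed linear rescale)."""
    if not 1 <= likert <= 5:
        raise ValueError("Likert rating must be between 1 and 5")
    return (likert - 1) / 4 * 100


def dimension_l1_final(ipsative_normalized: float, anchor_likert: int) -> float:
    """Blend the relative (ipsative) score with the absolute anchor rating."""
    return (IPSATIVE_WEIGHT * ipsative_normalized
            + ANCHOR_WEIGHT * normalize_anchor(anchor_likert))


# Example: mid-ranked dimension (50/100) with the highest anchor rating (5)
example = dimension_l1_final(50, 5)
```

The anchor term is what lets two users with the same relative ranking end up at different absolute levels.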

Agentic Orientation (1 item — single select)

Note: Mode selection and mode depth are now captured in Phase 0 (Section 3, Messages 5–6). Old MODE1/MODE2 items removed — the new Phase 0 design provides richer mode data (multi-select modes + 5-level depth per mode).

MODE3 — Agentic Orientation

“Which best describes how AI operates in your daily work?”

(a) I start every AI interaction myself → Non-agentic (0)

(b) I have AI assistants with instructions I return to regularly → Low agentic (1)

(d) I have AI systems that make decisions and take actions autonomously → High agentic (3)

Skill Signal Items (3 — single select)

SKILL1 — AI Usage Frequency

“How often do you use AI tools in your work?”

(a) Rarely — a few times a month → Score: 1

(b) Weekly — it’s part of my toolkit → Score: 2

(d) Constantly — AI is running in the background of most of my work → Score: 4

SKILL2 — AI Tool Breadth

“How many different AI tools do you use regularly?”

(a) 1 (mostly ChatGPT or similar) → Score: 1

(b) 2–3 different tools → Score: 2

(d) 7+ tools — I have a full AI toolkit → Score: 4

SKILL3 — Building Depth

“Have you built anything with AI? (apps, automations, custom tools)”

(a) No, I haven’t tried → Score: 0

(b) I’ve experimented but nothing I use regularly → Score: 1

(d) I regularly build apps, scripts, or systems with AI → Score: 3

6. Layer 1 Scoring Model

Dimension Scoring

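Each forced-choice item assigns +1 to the chosen dimension and 0 to the other. A dimension’s raw score is the sum of its points across all 22 items. Max_Possible_Score is the number of items in which that dimension appears (see the Coverage Matrix), since a dimension can earn at most one point per appearance.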

Dimension_Normalized = (Raw_Score / Max_Possible_Score) × 100
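As a rough illustration, the normalization can be sketched as follows. It assumes Max_Possible_Score equals each dimension's appearance count from the Coverage Matrix; the input format (one chosen dimension name per item) is a hypothetical convenience.

```python
# Appearances per dimension, taken from the Coverage Matrix
MAX_POSSIBLE = {
    "Perception": 8,
    "Intelligence": 8,
    "Knowledge": 9,
    "Creation": 8,
    "Integration": 11,
}


def normalize_dimensions(choices):
    """choices: the dimension chosen on each forced-choice item (22 entries)."""
    raw = {dim: 0 for dim in MAX_POSSIBLE}
    for dim in choices:
        raw[dim] += 1  # +1 to the chosen dimension, 0 to the other
    return {dim: raw[dim] / MAX_POSSIBLE[dim] * 100 for dim in MAX_POSSIBLE}


# Example: a user who picked Integration every time it was offered
choices = (["Integration"] * 11 + ["Perception"] * 4 + ["Knowledge"] * 3
           + ["Creation"] * 2 + ["Intelligence"] * 2)
scores = normalize_dimensions(choices)
```

Dividing by appearances rather than by 22 keeps dimensions with fewer items from being penalized.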

Mindset Index (for AI Potential)

Mindset_Index_L1 = average(
  Perception_Mindset_Items,   // AI Curiosity signals
  Intelligence_Mindset_Items, // Critical Trust signals
  Knowledge_Mindset_Items,    // Iterative Learning signals
  Creation_Mindset_Items,     // Creative Courage signals
  Integration_Mindset_Items   // Augmentation Vision signals
) × normalization_factor

AI Potential (Layer 1 estimate)

AI_Potential_L1 = min(100, weighted_average(
  Mindset_Index_L1 × 1.4,      // High mindset = high ceiling
  Peak_Dimension × 1.1,         // Strongest dimension → potential
  SKILL1_frequency × 10,        // Usage frequency as velocity proxy
  100 - |Mindset - Capability| × 0.5  // Gap = untapped potential
))

Archetype Assignment

Top 2 dimensions by normalized score → Archetype lookup:

Top 2 Dimensions	Archetype
Integration + Intelligence	AI Architect
Perception + Intelligence	AI Navigator
Creation + Integration	AI Builder
Knowledge + Perception	AI Catalyst
Creation + Knowledge	AI Amplifier
Perception + Creation	AI Pioneer
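The lookup can be sketched with unordered pairs as keys. Note that the table lists 6 of the 10 possible dimension pairs; this sketch returns `None` for an unlisted pair and does not implement the Mindset Hint tiebreaker, since neither rule is specified here. The function name is illustrative.

```python
ARCHETYPES = {
    frozenset({"Integration", "Intelligence"}): "AI Architect",
    frozenset({"Perception", "Intelligence"}): "AI Navigator",
    frozenset({"Creation", "Integration"}): "AI Builder",
    frozenset({"Knowledge", "Perception"}): "AI Catalyst",
    frozenset({"Creation", "Knowledge"}): "AI Amplifier",
    frozenset({"Perception", "Creation"}): "AI Pioneer",
}


def assign_archetype(scores):
    """Return the archetype for the two highest-scoring dimensions.

    Returns None when the pair is not in the lookup table (assumed
    behavior; the fallback rule is not defined in this sketch).
    """
    top_two = sorted(scores, key=scores.get, reverse=True)[:2]
    return ARCHETYPES.get(frozenset(top_two))


scores = {"Perception": 80, "Creation": 70, "Knowledge": 40,
          "Intelligence": 30, "Integration": 20}
archetype = assign_archetype(scores)
```

Using `frozenset` keys makes the lookup independent of which of the two dimensions ranks first.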

7. Email Gate & Lead Capture

The Email Gate sits between Layer 1 scoring and result display — the moment of peak curiosity. The user has invested 6+ minutes, scoring is complete, and they’re about to see their profile. This is the optimal point for email capture.

UX Copy & Design

Scoring complete → Loading: “Analyzing your responses…”

“Your AI Superpower Profile is ready!”
Enter your email to see your results and get personalized tips for growth.

[ your@email.com ] [ Show my profile → ]

We’ll send your profile + 3 personalized growth tips. No spam, unsubscribe anytime.

Design principles: Show a blurred preview of the Spark Profile card behind the email form (curiosity amplifier). Loading animation before the gate builds anticipation. Single field only (email) — name already captured in Phase 0. No skip option: users invested 6+ min and WILL enter email (sunk cost + curiosity). Target capture rate: 75–85%.

What Happens on Submit

Email validated (format + disposable email detection)
Spark Profile displayed immediately — zero delay
Welcome email sent within 60 seconds (profile card, top 3 superpowers, 3 growth tips, CTA to Full Profile)
Lead captured in CRM with tags: archetype, top superpower, industry, role, company type, AI adoption level

Email Follow-Up Sequence

Day	Email	Content	CTA
Day 0	Welcome + Profile	Spark Profile card, top 3 superpowers, 3 growth tips	“Go deeper → Full Profile”
Day 3	Growth Insight	Deep-dive on #1 superpower: what it means, how top performers use it	“See how you compare →”
Day 7	Curiosity Nudge	Tease Full Profile reveals (sub-competencies, radar, learning path). Reference Growth Edge.	“Unlock your full potential →”
Day 14	Social Proof	Aggregated stats: “3,200+ professionals mapped. [Archetype] types like you tend to…”	“Get your complete profile →”

Sequence principle: Each email adds NEW value (not just “buy now”). The user should feel smarter after reading each email, even if they never upgrade. This builds trust and brand — some upgrade on day 3, some on day 14, some forward it to colleagues (organic growth).

Expected Metrics

Metric	Target
Email capture rate	75–85%
Welcome email open rate	65–75%
Day 3 open rate	35–45%
Day 7 open rate	25–35%
Paid conversion from sequence	5–12%

8. Phase 4: Mini AI Chat (FREE Tier)

2–3 personalized follow-up questions based on Layer 1 results. Gives users a taste of the AI conversation and creates the upgrade trigger.

Q1: Strongest Superpower Probe (always asked first)

Top Superpower	Question
Perception	“You seem to naturally spot AI opportunities. Can you give me a quick example — the last time you saw an AI use case that others missed?”
Intelligence	“You strike me as someone who really thinks about how they use AI. What’s an example of a time your careful approach paid off?”
Knowledge	“You have a knack for making things reusable. What’s the most valuable AI template or system you’ve built for yourself?”
Creation	“You seem like someone who ships ambitious things with AI. What’s the most creative project you’ve tackled?”
Integration	“You think in systems. What’s the most sophisticated AI workflow you’ve designed?”

Q2: Growth Area Hint (asked second)

Weakest Superpower	Question
Perception	“If you had to find ONE new area where AI could help that you haven’t explored yet — what would it be?”
Intelligence	“When AI gives you something 80% right, what’s your process for getting it to 100%?”
Knowledge	“After a great AI session, do you save what worked — or start fresh next time?”
Creation	“If you could build anything with AI this week — no constraints — what would it be?”
Integration	“Imagine you could automate one repetitive part of your weekly work. What would it be?”

Upgrade Wall

There’s much more here.

In 3 quick answers, I can already see your [archetype_name] profile forming. But to really understand your superpowers — and show you where your biggest growth opportunity is — I need about 20 more minutes.

Unlock your Full Profile →
Deep AI conversation · Full superpower radar · Personalized learning path · Certificate

Curiosity Gap Principle (expert panel)

After each mini-chat response, the AI references something it noticed but can’t explore:

After Q1: “The way you described that tells me something about your [second_superpower] that I’d need more time to unpack properly.”
After Q2: “I can see a gap between what you know is possible and what you’re doing. That gap is actually your biggest asset — but I’d need to go deeper to show you why.”

Scoring note: Mini AI chat responses are NOT scored for Layer 2. They are purely for user experience, creating informational curiosity gaps, and optional qualitative flagging.

9. Phase 5: Full AI Conversation (PAID Tier)

Conversation Architecture

Phase	Duration	Purpose
5a: Experience Exploration	5–8 min	Deep-dive into real AI usage, 5 Observable Differentiators
5b: Scenario Challenges	5–8 min	3–5 adaptive scenarios, domain-specific + universal
5c: Reflection & Closing	2–3 min	Forward-looking mindset signals, identity framing

5a: Experience Exploration

Opening:

“Let’s go deeper. Tell me about a recent project where AI played a significant role. Walk me through it — what was the task, what did you do, and how did it turn out?”

The AI probes the 2 weakest dimensions and 1 strongest (to confirm) using these probes:

Probe	Tests	Question
A: Decomposition	Process Decomposition, Opportunity Recognition	“When you face a complex problem — how do you decide what to hand to AI and what to handle yourself?”
B: Multi-Tool	Workflow Orchestration, Building	“Did you use just one tool, or did you combine several? How did they work together?”
C: Iteration	The 90/10 Craft, Prompt Mastery	“When AI gives you something close but not quite right, what’s your process?”
D: System Thinking	Knowledge Architecture, Augmentation Vision	“Do you have any AI workflows or templates you use repeatedly?”
E: Reframing	Problem Reframing, AI Curiosity	“Has there been a time when the question itself needed to be different?”

5b: Scenario Challenges

Selection logic: Focus on weakest 2 dimensions, include domain-specific scenario, include 1 outside primary mode, escalate complexity.

Universal Scenario Bank (select 2–3)

ID	Scenario	Primary Dimensions
S1	The Competitive Analysis Sprint — 15 companies, 3 days, no analyst	Perception, Intelligence, Integration
S2	The AI Failure — hallucinated data already sent to client	Intelligence, Knowledge, Creation
S3	The Reluctant Team — AI tool automates 40%, half resist	Integration, Creation, Knowledge
S4	The Urgent Presentation — 2 hours, unexpected	Creation, Intelligence, Perception
S5	Knowledge Overflow — 200 pages, extract 10 insights	Knowledge, Integration, Intelligence
S6	The Creative Brief — full campaign in one day	Creation, Perception, Knowledge
S7	Process Audit — find top 3 AI opportunities in department	Integration, Knowledge, Perception
S8	The Learning Challenge — master a new AI tool	Knowledge, Intelligence, Creation

5c: Reflection & Closing

Three final questions that capture mindset signals:

Forward-looking: “Looking ahead 12 months — how do you expect your work with AI to change?”
Temperature check: “Gut feeling — how much of your work will involve AI two years from now?”
Identity: “If you had to describe your relationship with AI in one sentence?”

“Is there anything about how you use AI that we haven’t covered? Anything you’re particularly proud of, or struggling with?”

10. Layer 2 Scoring Model

Per-Response Scoring

Score	Meaning
0	No signal (dimension not relevant)
1	Weak negative signal (absence of competency)
2	Weak positive signal (slight indication)
3	Clear positive signal (competency demonstrated)
4	Strong positive signal (exceptional depth)

Confidence Weighting

Confidence	Weight	When
High	1.0	Direct behavioral evidence — specific example, detailed process
Medium	0.7	Indirect evidence — general description, hypothetical
Low	0.4	Ambiguous — could indicate this or another dimension
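One plausible way to combine per-response scores with these weights is a confidence-weighted mean. The aggregation rule is an assumption (this section defines only the scores and weights), as is the choice to skip score 0 because it means "no signal" rather than a low score.

```python
CONFIDENCE_WEIGHTS = {"high": 1.0, "medium": 0.7, "low": 0.4}


def weighted_sub_competency(observations):
    """Confidence-weighted mean of (score, confidence) pairs.

    Scores of 0 carry no signal and are skipped (assumption).
    Returns None when there is no usable evidence.
    """
    weighted_sum = 0.0
    weight_total = 0.0
    for score, confidence in observations:
        if score == 0:
            continue
        weight = CONFIDENCE_WEIGHTS[confidence]
        weighted_sum += weight * score
        weight_total += weight
    if weight_total == 0:
        return None
    return weighted_sum / weight_total


# Example: one detailed behavioral example plus one ambiguous remark
result = weighted_sub_competency([(3, "high"), (1, "low")])
```

Here the high-confidence observation dominates, pulling the result toward 3 rather than the plain average of 2.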

5 Observable Differentiators

Differentiator	Score 1	Score 3	Score 5	Maps to
Decomposition	Dumps whole problems	Sometimes breaks down	Reflexive decomposition	5.3 + 1.2
Multi-Tool	Single tool only	2–3 tools linked	Complex pipelines	5.2 + 4.3
Iteration	1–2 tries	3–4 intentional rounds	Until quality bar met	4.2 + 2.2
System Thinking	Everything one-off	Some templates saved	Builds reusable systems	3.2 + 5.1
Reframing	Executes as given	Occasionally questions	Redefines before executing	1.4 + 1.1

Layer 2 Technical Architecture

Single-call LLM design (expert panel): Use one LLM call per user turn that both continues conversation AND outputs structured scoring JSON. This halves API cost and eliminates scoring latency vs. running a separate “judge LLM.”

// Single LLM call returns both:
{
  "conversation_response": "Natural language reply...",
  "scoring": {
    "sub_competencies": { "1.1_ai_curiosity": { "score": 3, "confidence": "high" } },
    "language_signals": ["exploration", "system_thinking"],
    "next_probe_priority": "knowledge"
  }
}
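A minimal sketch of how a backend might split and validate such a response, assuming the model returns strict JSON with exactly the fields shown above. The function name `parse_turn` and the validation rules (0–4 score range, three confidence levels) are illustrative and drawn from the scoring tables in this section.

```python
import json

VALID_CONFIDENCE = {"high", "medium", "low"}


def parse_turn(raw: str):
    """Split one LLM response into the user-facing reply and scoring data."""
    payload = json.loads(raw)
    reply = payload["conversation_response"]
    scoring = payload["scoring"]
    for name, entry in scoring["sub_competencies"].items():
        if not 0 <= entry["score"] <= 4:
            raise ValueError(f"{name}: score out of range")
        if entry["confidence"] not in VALID_CONFIDENCE:
            raise ValueError(f"{name}: unknown confidence level")
    return reply, scoring


raw = json.dumps({
    "conversation_response": "Natural language reply...",
    "scoring": {
        "sub_competencies": {"1.1_ai_curiosity": {"score": 3, "confidence": "high"}},
        "language_signals": ["exploration", "system_thinking"],
        "next_probe_priority": "knowledge",
    },
})
reply, scoring = parse_turn(raw)
```

Validating before storing keeps a malformed model output from silently corrupting a user's scores.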

Observable Differentiator reliability: Score each differentiator at 3 separate points during conversation (not just once at end), average the 3 scores, calibrate with 50+ human-expert-scored transcripts before launch.

11. Combined Scoring (Layer 1 + Layer 2)

Sub-Competency Score

Sub_Competency_Final = (L1_Score × 0.30) + (L2_Score × 0.70)

Dimension Score

Dimension_Score =
  (Mindset_Sub × 0.30) +
  (Skill_Sub × 0.30) +
  (Additional_1 × 0.20) +
  (Additional_2 × 0.20)

Normalized to 0-100.

Superpowered Score

Superpowered_Score =
  (Mindset_Index × 0.35) +
  (Capability_Index × 0.30) +
  (Application_Index × 0.25) +
  (Technical_Depth × 0.10)

Range: 0-100
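The weighted sum above can be sketched as follows, assuming each index is already on a 0–100 scale. The dictionary keys are illustrative names for the four components.

```python
WEIGHTS = {
    "mindset_index": 0.35,
    "capability_index": 0.30,
    "application_index": 0.25,
    "technical_depth": 0.10,
}


def superpowered_score(indices):
    """Weighted sum of the four component indices (each assumed 0-100)."""
    missing = WEIGHTS.keys() - indices.keys()
    if missing:
        raise KeyError(f"missing indices: {sorted(missing)}")
    return sum(WEIGHTS[name] * indices[name] for name in WEIGHTS)


example = superpowered_score({
    "mindset_index": 80,
    "capability_index": 60,
    "application_index": 40,
    "technical_depth": 20,
})
```

Because the weights sum to 1.0, the result stays within 0–100 whenever the inputs do.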

AI Potential (refined with Layer 2)

AI_Potential_raw = weighted_average(
  Mindset_Index × 1.4,
  Learning_Velocity × 1.2,
  Strongest_Dimension × 1.1,
  100 - Consistency_Gap × 0.5
)

// Sigmoid normalization to distribute across 40-100 range
AI_Potential_Final = sigmoid_map(AI_Potential_raw,
  pilot_mean, pilot_sd, target_range=[40, 100])

Display threshold: Only show AI Potential when AI_Potential − Score ≥ 10. For top performers where they’re nearly equal, omit it — it adds no insight at that level.
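The exact shape of `sigmoid_map` is not defined here. One plausible reading, sketched below, standardizes the raw value against the pilot distribution and passes it through a logistic curve scaled to the target range. The logistic form and the clamp are assumptions; the display rule follows the threshold stated above.

```python
import math


def sigmoid_map(raw, pilot_mean, pilot_sd, target_range=(40.0, 100.0)):
    """Squash a raw value into target_range with a logistic curve.

    Assumption: a standard logistic centred on pilot_mean, so the pilot
    mean lands at the midpoint of the range.
    """
    if pilot_sd <= 0:
        raise ValueError("pilot_sd must be positive")
    low, high = target_range
    z = (raw - pilot_mean) / pilot_sd
    z = max(-60.0, min(60.0, z))  # clamp to keep math.exp within float range
    return low + (high - low) / (1.0 + math.exp(-z))


def should_display_potential(ai_potential, superpowered_score, min_gap=10.0):
    """Show AI Potential only when it exceeds the score by at least min_gap."""
    return ai_potential - superpowered_score >= min_gap
```

Under this reading, a user at the pilot mean maps to 70, and extreme values approach 40 or 100 without crossing them.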

Self-Perception Gap

Consistency = 1 - |L1_normalized - L2_normalized| / 100

> 0.8: "Highly consistent self-awareness"
0.5-0.8: "Some gaps between self-perception and practice"
< 0.5: "Significant self-perception gap"

Self-report inflation detection: If Layer 1 exceeds Layer 2 by >20 points for any dimension: “Your self-perception in [dimension] is stronger than what your practices show. This often means untapped potential — you know what good looks like but haven’t fully applied it yet.”
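The consistency bands and the inflation check can be sketched together. Treating a value of exactly 0.8 as part of the middle band is an assumption, since the bands above do not say which side the boundary falls on; function names are illustrative.

```python
def consistency(l1_normalized, l2_normalized):
    """1.0 means L1 and L2 agree exactly; both inputs are on a 0-100 scale."""
    return 1 - abs(l1_normalized - l2_normalized) / 100


def consistency_label(value):
    if value > 0.8:
        return "Highly consistent self-awareness"
    if value >= 0.5:
        return "Some gaps between self-perception and practice"
    return "Significant self-perception gap"


def inflated_dimensions(l1_scores, l2_scores, threshold=20):
    """Dimensions where Layer 1 exceeds Layer 2 by more than threshold points."""
    return [dim for dim in l1_scores
            if l1_scores[dim] - l2_scores[dim] > threshold]


l1 = {"Creation": 80, "Knowledge": 60}
l2 = {"Creation": 50, "Knowledge": 55}
flagged = inflated_dimensions(l1, l2)
label = consistency_label(consistency(l1["Creation"], l2["Creation"]))
```

Note that the inflation check is one-directional: a user whose Layer 2 evidence exceeds their self-report is not flagged.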

12. Data-to-Output Mapping

Output	Phase 0 (AI Chat)	Phase 2 (FC)	Phase 3 (Mode/Skill)	Email Gate	Phase 4 (Mini)	Phase 5 (Full)
Archetype	Mindset hint (tiebreaker)	Top 2 dimensions	—	—	—	Refined
Top 3 Superpowers	—	Dimension ranking	—	—	—	Refined
AI Potential	Adoption level (context)	Mindset + peak dim	SKILL1 velocity	—	—	Learning velocity + gap
Best Percentile	Industry, role, company type	Dimension scores	—	—	—	Refined scores
Growth Edge	—	Weakest dimension	—	—	—	Weakest sub-competency
Mode Profile	Mode selection + depth (self-report, compressed)	—	MODE3 (agentic)	—	—	Behavioral depth (70% weight)
SP Score	—	L1 estimate	Skill signals	—	—	Full 4-component
20 Sub-competencies	—	L1 signals	—	—	—	Full L2 scoring
Learning Path	Industry context	—	—	—	—	Weakest 3 + domain
Certificate	Name	—	—	—	—	Score + archetype
Lead Capture	Name, industry, role, company type, adoption	—	—	Email	—	—
Email Sequence	Growth Edge + adoption for tips	Top superpower for content	—	Triggers sequence	Curiosity gap for Day 3	Full data for emails

13. UX Design Principles

Progress & Pacing

Show progress bar throughout (“Step 2 of 4” or percentage)
Forced-choice: one per screen on mobile, 2–3 visible on desktop
Estimated time remaining always visible
Mode/Skill items feel like a break (different visual format)

Visual Design

Clean, minimal interface — the assessment itself should feel “superpowered”
Superpower colors: ■ Perception ■ Intelligence ■ Knowledge ■ Creation ■ Integration
Forced-choice items: two cards side by side, tap to select
Back button: Allowed within Phase 2 and Phase 3, not between phases. Phase 0 chat is forward-only. Going back is tracked as a signal.
Phase 0 → Phase 2 transition: Animated morph from chat bubbles to forced-choice cards (1–2 sec)
Post-result → Phase 4 transition: Animated shift back to chat UI — already familiar from Phase 0

Tone

Phase 0 welcome: Friendly, curious, human — “Hi! I’m here to map your AI superpowers.”
Phase 0 → Phase 2: “Love it. Now let’s find out where your real superpowers are — this next part is quick.”
Progress: “Great — you’re almost through the quick calibration”
Results tease: “Your profile is ready. Let me show you what I found.”
Phase 4 mini-chat opener: References Phase 0 context — “Earlier you mentioned [X]. That tells me something interesting…”

Mobile-First

Full-width cards, large tap targets
Chat: familiar messaging UI (bubbles, typing indicator)
Results: swipeable card format
Share: native share sheet integration

14. Anti-Gaming & Quality Controls

Time-Per-Item Tracking

If all 22 items are completed in under 60 seconds (under 3 s per item on average), flag the session as “speed-through.” Don’t invalidate it; instead apply a “confidence: low” modifier to all L1 scores, which increases the L2 weight.
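A minimal sketch of the speed-through flag, assuming per-item durations are recorded in seconds. The function name and the "normal" label are illustrative. The spec does not say by how much the L2 weight increases, so the sketch stops at producing the confidence flag.

```python
EXPECTED_ITEMS = 22          # forced-choice items in Phase 2
SPEED_THROUGH_SECONDS = 60   # total time below which the run is flagged


def l1_confidence(item_durations_s: list[float]) -> str:
    """Return "low" for a speed-through run, otherwise "normal".

    The run is never invalidated; the flag is meant to travel with the L1 scores.
    """
    if len(item_durations_s) != EXPECTED_ITEMS:
        raise ValueError(f"expected {EXPECTED_ITEMS} item timings, got {len(item_durations_s)}")
    return "low" if sum(item_durations_s) < SPEED_THROUGH_SECONDS else "normal"


print(l1_confidence([2.5] * 22))  # 55 s total: low
print(l1_confidence([9.0] * 22))  # 198 s total: normal
```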

Semantic Consistency Check

The Consistency Score (L1 vs L2 comparison) detects both self-perception gaps (coaching insight) and gaming (reliability concern):

> 0.8: High confidence in all scores
0.5–0.8: Flag dimensions with biggest gaps for extra L2 probing
< 0.5: Mark as “low confidence” — recommend re-assessment

No explicit attention checks (“select option B”). They break immersion and feel insulting to professionals.
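The consistency thresholds above can be turned into a quality-control decision roughly as follows. This is a sketch under stated assumptions: the spec does not define how per-dimension consistency is aggregated, so the mean is used here, and `max_flags` (how many dimensions to probe) as well as the action names are hypothetical.

```python
def review_consistency(l1: dict[str, float], l2: dict[str, float], max_flags: int = 2) -> dict:
    """Pick a quality-control action from L1 vs L2 scores (both on a 0-100 scale)."""
    per_dim = {dim: 1 - abs(l1[dim] - l2[dim]) / 100 for dim in l1}
    overall = sum(per_dim.values()) / len(per_dim)  # assumption: mean across dimensions

    if overall > 0.8:
        return {"overall": overall, "action": "high_confidence", "probe": []}
    if overall >= 0.5:
        # Lowest consistency means the biggest L1/L2 gap.
        widest = sorted(per_dim, key=per_dim.get)[:max_flags]
        return {"overall": overall, "action": "extra_l2_probing", "probe": widest}
    return {"overall": overall, "action": "recommend_reassessment", "probe": []}


# Made-up example scores.
l1 = {"Perception": 80, "Intelligence": 70, "Knowledge": 60, "Creation": 90, "Integration": 50}
l2 = {"Perception": 60, "Intelligence": 30, "Knowledge": 40, "Creation": 40, "Integration": 50}
result = review_consistency(l1, l2)
print(result["action"], result["probe"])  # extra_l2_probing ['Creation', 'Intelligence']
```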

Social Desirability Mitigation

Pre-launch validation: Present all 22 items to 20 people without context; if either option of an item is chosen by more than 65% of them, rewrite that item
Both-positive design: Both options must sound equally desirable
Ipsative format helps: Forced-choice is inherently more resistant to gaming than agree/disagree scales
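The pre-launch check can be sketched as a simple tally per item. The item IDs and the "A"/"B" response encoding below are hypothetical; only the 65% threshold and the panel size of 20 come from the text above.

```python
from collections import Counter


def items_to_rewrite(responses: dict[str, list[str]], max_share: float = 0.65) -> list[str]:
    """Return IDs of forced-choice items where one option exceeds max_share of responses."""
    flagged = []
    for item_id, choices in responses.items():
        top_count = max(Counter(choices).values())
        if top_count / len(choices) > max_share:
            flagged.append(item_id)
    return flagged


# Made-up pilot data: 20 respondents per item.
pilot = {
    "FC01": ["A"] * 14 + ["B"] * 6,   # 70% chose A: unbalanced
    "FC02": ["A"] * 12 + ["B"] * 8,   # 60% chose A: acceptable
}
print(items_to_rewrite(pilot))  # ['FC01']
```

With 20 respondents, the rule triggers once 14 or more people pick the same option.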

15. Re-Assessment Strategy

Hybrid Item Rotation (60/40)

Component	Strategy	Rationale
Phase 0 AI Chat	Same structure, different phrasing	AI varies wording naturally; Q3 (AI usage) captures genuine change over time
Forced-choice	13 anchor items identical + 9 rotated	Anchors track change; rotation reduces memory gaming
Anchoring items	Always identical (all 5)	Stable absolute-level measurement
Mode/Skill items	Always identical	Track actual behavioral change
Mini AI chat	Different questions each time	Natural variation
Layer 2 scenarios	Always different	Adaptive nature ensures different paths

Expanded bank needed: 35 total items (13 permanent + 22 rotatable, drawing 9 per session). Target: Q3 2026.
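Assembling the forced-choice set for one re-assessment session could look like the sketch below. The item IDs are placeholders and `build_session_items` is a hypothetical helper; the counts (13 anchors, 9 drawn from 22 rotatable items) follow the numbers above.

```python
import random
from typing import Optional


def build_session_items(anchor_ids: list[str], rotatable_ids: list[str],
                        n_rotated: int = 9, seed: Optional[int] = None) -> list[str]:
    """Return all anchor items plus n_rotated items drawn without replacement."""
    rng = random.Random(seed)
    rotated = rng.sample(rotatable_ids, n_rotated)
    return list(anchor_ids) + rotated


anchors = [f"A{i:02d}" for i in range(1, 14)]     # 13 permanent items
rotatable = [f"R{i:02d}" for i in range(1, 23)]   # 22 rotatable items
session = build_session_items(anchors, rotatable, seed=1)
print(len(session))  # 22
```

This draw is purely random. Excluding items the same user saw in a previous session would need per-user history, which the sketch does not model.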

16. Alternative Path — Conversation Upload Assessment

The Idea

Instead of answering questions about how you use AI, you show how you use AI. The user uploads 30–50 of their past AI conversations — from any tool (ChatGPT, Claude, Cursor, Copilot, Gemini, or any export) — and the system analyzes their actual behavior to generate a full SP™ AI Score profile.

This is a 100% behavioral assessment — no self-report, no forced-choice, no “how often do you…” questions. Just real evidence of how you work with AI.

Why This Path Exists

Problem with traditional assessment	How conversation upload solves it
People overestimate their skills	Conversations don’t lie — they show what you actually do
Assessment fatigue (“not another quiz”)	Zero questions — you just share what you already have
Hard to assess advanced users	Power users leave the richest behavioral traces
Self-report misses nuance	Real conversations reveal patterns the user doesn’t even notice
Takes 25–35 minutes	Upload takes 2 minutes — analysis happens in the background

What the User Provides

Input: 30–50 past conversations with any AI tool.

Accepted formats:

Cursor sessions — exported from session history (agent transcripts)
ChatGPT exports — via Settings → Data Controls → Export Data (JSON)
Claude exports — via conversation history export
Plain text / Markdown — copy-pasted conversation logs
Any AI tool — as long as the format shows user messages and AI responses

Minimum requirement: 30 conversations spanning at least 2 weeks. Recommended: 50+ conversations across 30+ days to avoid recency bias.

What the System Analyzes

1. AI Interaction Modes

Signal	What it reveals
Questions, brainstorming, back-and-forth dialogue	Conversational mode — do you think WITH AI?
Content creation, drafts, creative requests	Creative mode — do you produce, iterate, and refine?
Code, tools, apps, prototypes, technical builds	Builder mode — do you build things that work?
Automation, pipelines, multi-step workflows, agents	Orchestration mode — do you design systems, not just tasks?

2. Five Superpowers

Superpower	What we look for
Perception	Do you spot opportunities? Do you reframe problems? Do you use AI for things others wouldn’t think of?
Intelligence	Do you think strategically with AI? Do you evaluate outputs critically? Do you simulate perspectives?
Knowledge	Do you build systems that remember? Do you manage context? Do you compound knowledge across sessions?
Creation	Do you produce at quality? Do you iterate until it’s right? Do you maintain your own voice and standards?
Integration	Do you connect AI into your real workflows? Do you design processes, not just tasks? Do you build for others?

3. AI Skills

What we observe	Skill it maps to
How you structure requests, give context, iterate	Prompting
How many tools, models, and integrations you combine	Tool Selection
Whether you create reusable agents, instructions, or templates	AI Assistants
Whether you build apps, scripts, prototypes, or tools	Vibe Coding
Whether you create content — posts, articles, copy, docs	Content Creation
Whether you analyze, clean, or enrich data	Data Work
Whether things run without you	Automation
Whether you design multi-step or multi-agent workflows	Autonomous Workflows
Whether you build systems that persist and reuse knowledge	Knowledge Management
Whether you use AI to accelerate entire projects	Project Acceleration

4. Mindset — Observable Through Behavior

Mindset component	Behavioral signal
Curiosity & Openness	How often do you try new approaches? Do you experiment?
Critical Trust	Do you push back on AI? Do you evaluate outputs or just accept them?
Augmentation Vision	Do you ask AI to DO things, or to help you THINK about things?
Learning Velocity	Are you visibly better in recent conversations than older ones?
Collaboration Design	Do you design AI interactions for others, or only for yourself?

How the Output Looks

The output is a full SP™ AI Score profile — identical in structure to the Paid Full Profile:

SP™ AI Score (0–100) — composite score based on the four weighted components
Archetype — derived from top 2 superpowers
Narrative — 2–3 sentences describing who you are as an AI professional
Radar chart — 5 superpower dimensions
Mode profile — which modes you use and at what depth
Your Stack (Skills) — all 10 skills ranked and scored in human language
Behavioral Evidence — the strongest patterns observed, written as insights
What Would Make You Faster — 3–5 growth recommendations, each with a “Solve with AI” button that opens the prompt directly in Claude or ChatGPT

Scoring Methodology

Scoring follows the standard SP™ framework with one key difference: Layer 2 (behavioral evidence) carries 100% of the weight — there is no Layer 1 self-report.

SP™ AI Score = weighted_average(
  Mindset Index      × 35%   // observed via behavioral proxies
  Applied AI Skills  × 30%   // observed via tool usage and task types
  Domain Integration × 25%   // observed via workflow complexity
  Technical Depth    × 10%   // observed via builder-mode activity
)
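As a runnable illustration of the weighted average above, assuming each component index is already on a 0–100 scale (the dictionary keys and the function name are illustrative, and the example values are made up):

```python
WEIGHTS = {
    "mindset_index": 0.35,
    "applied_ai_skills": 0.30,
    "domain_integration": 0.25,
    "technical_depth": 0.10,
}


def sp_ai_score(components: dict[str, float]) -> float:
    """Weighted average of the four component indices; the weights sum to 1."""
    missing = WEIGHTS.keys() - components.keys()
    if missing:
        raise ValueError(f"missing components: {sorted(missing)}")
    return sum(weight * components[name] for name, weight in WEIGHTS.items())


example = {
    "mindset_index": 80,
    "applied_ai_skills": 70,
    "domain_integration": 60,
    "technical_depth": 50,
}
print(round(sp_ai_score(example), 1))  # 69.0  (28 + 21 + 15 + 5)
```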

Comparison to Standard Assessment

Aspect	Standard (questionnaire)	Conversation Upload
Input	User answers questions about behavior	User shows actual behavior
Time to complete	7–35 minutes	2 minutes + async processing
Self-report bias	Present (people overestimate)	Absent (behavior doesn’t lie)
Works for beginners	Yes — questions are accessible	Limited — needs 30+ AI conversations
Works for power users	May feel reductive	Excellent — richer signals
Scoring layers	L1 (30%) + L2 (70%) or L1 only	L2 only (100% behavioral)
Privacy model	Answers stored	Conversations processed and discarded

Target Audience

Ideal for:

Power users who use AI daily and have rich conversation history
Professionals who hate quizzes — “don’t ask me, look at what I do”
Teams / enterprise — bulk upload from shared tools, no scheduling
Developers and builders — Cursor/Copilot sessions contain the richest data
Repeat assessment — upload latest 50 conversations quarterly

Not ideal for:

AI beginners — fewer than 30 conversations won’t produce a meaningful profile
Casual users — shallow conversations produce shallow profiles
Privacy-sensitive users — requires sharing actual conversation content

Pilot Results

Tested with 50+ Cursor session transcripts (Jan–Feb 2026):

20 sessions → usable but recency-biased profile
50 sessions → stable, accurate, and rich profile
Behavioral signals strong enough to score all 20 sub-competencies including mindset
The “What Would Make You Faster” section had the highest perceived value

17. Open Questions

Resolved (from expert panel)

#	Question	Resolution
4	AI Potential clusters 85–95	Added sigmoid normalization + display threshold (≥ 10 gap)
7	Anti-gaming	Time-per-item tracking + semantic consistency. No attention checks.
8	Re-assessment items	Hybrid rotation (60% anchor / 40% rotated)

Still Open

Item validation (NON-NEGOTIABLE): 22 FC items + 5 anchoring items need validation with 50–100 pilot users. Required before public launch.
Phase 0 + Mini chat scoring weight: Phase 0 bonus signals (AI Adoption, Mindset Hint) are used for personalization and tiebreaking, NOT primary scoring. Mini chat (Phase 4) remains UX/conversion only. Revisit if pilot shows conversational responses are highly diagnostic.
Scenario bank size: Target 20 universal by Q2 2026, 35+ rotatable items by Q3 2026.
Layer 1 strategic accuracy: Business decision — make L1 directionally accurate (archetype correct 70%+) but numerically imprecise. Show archetype confidently; blur dimension scores/sub-competencies in free tier.
Chat technology: Start with text; add voice (ElevenLabs) as v2 feature.
Observable Differentiator calibration: 50+ human-expert-scored transcripts needed. Target: Cohen’s kappa > 0.7.
Enterprise path: Add “share with your team” prompt post-assessment.
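For the Observable Differentiator calibration target, agreement between two raters (for example, a human expert and the automated scorer) can be checked with Cohen’s kappa. The spec does not say which variant is intended; the sketch below assumes plain unweighted kappa over categorical labels. For ordinal scores, a weighted variant may be more appropriate.

```python
from collections import Counter


def cohens_kappa(rater_a: list, rater_b: list) -> float:
    """Unweighted Cohen's kappa: (p_o - p_e) / (1 - p_e)."""
    if len(rater_a) != len(rater_b) or not rater_a:
        raise ValueError("both raters must label the same non-empty set of transcripts")
    n = len(rater_a)
    observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    freq_a, freq_b = Counter(rater_a), Counter(rater_b)
    # Chance agreement: product of each rater's marginal share, summed over labels.
    expected = sum(freq_a[label] * freq_b[label] for label in freq_a.keys() | freq_b.keys()) / (n * n)
    if expected == 1:
        return 1.0  # both raters used a single identical label throughout
    return (observed - expected) / (1 - expected)


print(cohens_kappa([1, 1, 0, 0], [1, 1, 0, 0]))  # 1.0 (perfect agreement)
print(cohens_kappa([1, 1, 0, 0], [1, 0, 1, 0]))  # 0.0 (agreement at chance level)
```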
