The Reference Framework

This document consolidates the maturity model, the operating principle, and the two scales that structure the AI transformation.


The Universal Translation Rule

The operating principle of the entire transformation fits in one sentence:

Replace "the human produces the artifact" with "the human defines the spec → the system produces the artifact."

What This Means by Department

| Department | The human defines... | The system produces... |
|---|---|---|
| Engineering | Architecture, constraints, tests | The implementation |
| Marketing | Strategy, positioning, hypotheses | Campaigns, variants, reports |
| Sales | Qualification logic, deal rules | Outreach, follow-ups, proposals |
| Customer Service | Escalation logic, success criteria | Responses, health scoring, actions |
| Finance | Financial models, rules | Reports, forecasts, anomaly detection |
| HR | Hiring profiles, evaluation grids | Sourcing, screening, summaries |
| Product | Problem, constraints | Specs, test cases, drafts |
| Leadership | Direction, trade-offs | Scenarios, analyses |

The Litmus Test

If this person disappeared, could a system execute 80% of their tasks?

  • If no → the role is still execution-based
  • If yes → the role is AI-native
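
The 80% threshold lends itself to a mechanical check. A minimal sketch, assuming a role is audited as a task list with automatable yes/no flags (the helper name, the task names, and the flags are all illustrative, not a prescribed audit format):

```python
# Hypothetical sketch of the litmus test: given a role's tasks, each
# flagged as "a system could execute this today" or not, classify the role.

def litmus(tasks: dict[str, bool]) -> str:
    """tasks maps task name -> could a system execute it today?"""
    coverage = sum(tasks.values()) / len(tasks)
    # 80% threshold from the litmus test above
    return "AI-native" if coverage >= 0.8 else "execution-based"

# Illustrative sales role: 4 of 5 tasks automatable -> exactly 80%
role = {
    "draft outreach emails": True,
    "qualify inbound leads": True,
    "negotiate enterprise contracts": False,
    "log CRM activity": True,
    "write follow-up proposals": True,
}
print(litmus(role))  # AI-native
```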

This isn't "AI adoption." It's the shift from a labor-based company to a systems-based company.


Organizational Scale — Levels 1 to 3

This scale applies across the entire group — engineering, marketing, sales, finance, customer service.

Level 1 — AI-Assisted

What it looks like:

  • AI is a tool that individuals choose to use
  • Same structures, same processes, same roles
  • If AI disappeared tomorrow, nothing structural would change

Typical behaviors:

  • Using ChatGPT/Claude like Google or a spell checker
  • Isolated prompts, no iteration
  • AI outputs manually pasted into work
  • No shared prompts, no documentation
  • Adoption is uneven and optional

The gap is measurable: in technical roles, AI has 94% theoretical task coverage but only 33% actual usage. Level 1 organizations leave most of AI's capability untouched.

Upside: 10-30% efficiency gains for those who adopt
Risk: competitors at Level 3 get 10x leverage and make Level 1 non-viable

Level 2 — AI-Integrated

What it looks like:

  • AI is integrated into workflows and systems
  • Some processes redesigned around AI capabilities
  • Roles start shifting from "doing" to "directing" (see role evolution patterns)
  • If AI disappeared tomorrow, some workflows would break

Typical behaviors:

  • Saved prompts, templates, prompt libraries
  • AI used across multiple steps of a task, not just one
  • Tools like Copilot, Notion AI, Zapier, n8n in active use
  • Prompts and workflows shared among colleagues
  • AI usage is expected, not optional

Upside: 2-3x output with the same headcount
Risk: half-measures create confusion; uneven adoption limits gains
Level 3 — AI-Native

What it looks like:

  • Organizational design assumes AI as a first-class resource
  • Roles are defined by judgment and direction, not execution
  • Headcount is a fraction of a traditional company at the same output
  • If AI disappeared tomorrow, the company couldn't function

Typical behaviors:

  • The starting question is: "What part should be automated?"
  • Agents, pipelines, and decision systems built (code or no-code)
  • Processes designed so humans handle judgment, AI handles execution
  • AI impact is measured (time saved, costs reduced, quality improved)
  • AI literacy is a condition of employment

Upside: 10x leverage, structural cost advantage, speed that competitors can't match
Risk: requires people who are hard to find; no room for passengers

Engineering Scale — Rungs 0 to 5

Engineering needs finer granularity. This scale, based on Dan Shapiro's framework, describes the progression of AI-assisted software development; the AI Lab documents the scale and how it operates in practice.

| Rung | Human's role | Who writes the code | Who reviews the code |
|---|---|---|---|
| 0 — Assisted coding | Human codes, AI suggests | Human | Human |
| 1 — Scoped delegation | Human assigns scoped tasks | AI | Human (everything) |
| 2 — Supervised generation | Human supervises multi-file changes | AI | Human (everything) |
| 3 — Directed development | Human directs, reviews at feature/PR level | AI | Human (PR) |
| 4 — Spec-driven development | Human writes the spec, verifies results | AI | Nobody (tests verify) |
| 5 — Autonomous production | Spec goes in, software comes out | AI | Nobody (scenarios verify) |
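
At Rungs 4-5 the spec itself does the reviewing: the human writes executable checks, and the generated code is accepted only if they pass. A minimal sketch of the Rung 4 pattern; the function name, its behavior, and the spec cases are invented for illustration:

```python
# Rung 4 sketch: the human-authored spec is a set of executable checks;
# nobody reads the generated code -- the spec verifies it.

# --- Human-written spec: what "normalize_discount" must do ---
SPEC = [
    (("10%",), 0.10),   # percentage string -> fraction
    ((0.25,), 0.25),    # already a fraction -> unchanged
    ((25,), 0.25),      # whole number -> treated as a percent
]

# --- AI-generated implementation (treated as a black box) ---
def normalize_discount(value) -> float:
    if isinstance(value, str) and value.endswith("%"):
        return float(value[:-1]) / 100
    if isinstance(value, float):
        return value
    return value / 100

# --- Verification: the spec, not a human, reviews the code ---
for args, expected in SPEC:
    assert normalize_discount(*args) == expected
print("spec passed")
```

The design point is the ownership split from the table: the human's artifact is `SPEC`; the implementation is disposable and can be regenerated as long as the spec keeps passing.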

Mapping

| Organizational scale | Engineering scale |
|---|---|
| Level 1 — AI-Assisted | Rungs 0-1 |
| Level 2 — AI-Integrated | Rungs 2-3 |
| Level 3 — AI-Native | Rungs 4-5 |

Diagnostic Questions

For the organization

"If AI disappeared tomorrow, what would change?"

  • Nothing structural → Level 1
  • Some workflows break → Level 2
  • The company can't function → Level 3

For leaders

"What would you remove from the org chart if AI were fully utilized?"

  • Can't answer → Tier 1
  • Mentions tasks → Tier 2
  • Mentions roles or processes → Tier 3

For individuals

"Show me something you've built or changed because AI exists."

  • Talks about prompts used → Tier 1
  • Shows workflows or templates → Tier 2
  • Shows systems or process changes → Tier 3

Acceptance Criteria

Level 2 — Achieved when ALL these criteria are met:

  • AI usage is a documented expectation for every role, not optional
  • Every department maintains a structured context file loaded before AI tasks
  • Shared prompt libraries or workflow templates exist and are in use
  • At least 1 workflow per department has been redesigned around AI (before/after documented)
  • KPIs include AI output metrics (not just activity)
  • "How did AI help?" is asked in reviews and retrospectives
  • If AI disappeared tomorrow, at least some workflows would break

Level 3 — Achieved when ALL these criteria are met:

  • Roles are defined by judgment and direction, not execution
  • Agents, pipelines, or decision systems are in production (not prototypes)
  • Non-trivial tasks have written specifications conforming to the execution standards
  • Every AI system in production has an assigned Spec Owner, Context Owner, and Evaluation Owner
  • AI impact is measured by department (time saved, costs reduced, quality improved)
  • Hiring profiles require Tier 2+ minimum
  • If AI disappeared tomorrow, the department couldn't function
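
The all-or-nothing gating above can be expressed mechanically. A sketch, assuming a department's audit is recorded as a set of satisfied criteria; the abbreviated criterion names are invented stand-ins for the bullets, not a prescribed format:

```python
# Sketch of the acceptance gates: a level is achieved only when ALL of
# its criteria are met. Names abbreviate the checklist bullets above.

LEVEL_2 = [
    "ai_usage_is_documented_expectation",
    "context_file_per_department",
    "shared_prompt_libraries_in_use",
    "one_workflow_redesigned_per_department",
    "kpis_include_ai_output_metrics",
    "ai_asked_about_in_reviews",
    "some_workflows_break_without_ai",
]

LEVEL_3 = [
    "roles_defined_by_judgment_not_execution",
    "agent_systems_in_production",
    "specs_for_nontrivial_tasks",
    "owners_assigned_per_ai_system",
    "ai_impact_measured_per_department",
    "hiring_requires_tier_2_plus",
    "department_cannot_function_without_ai",
]

def achieved_level(met: set[str]) -> int:
    """Return the highest level whose full checklist is satisfied."""
    if all(c in met for c in LEVEL_3):
        return 3
    if all(c in met for c in LEVEL_2):
        return 2
    return 1

print(achieved_level(set(LEVEL_2)))  # 2: every Level 2 box ticked, Level 3 not yet
```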

The Transformation Path

Level 1 → Level 2

Prerequisites:

  • Leadership commits to AI as an operational standard, not optional
  • Investment in shared AI infrastructure (tools, templates, training)
  • Processes audited and redesigned for AI integration
  • KPIs updated to measure AI output
  • "How did AI help?" becomes a standard question

Timeline: 3-6 months with committed leadership

Level 2 → Level 3

Level 2 is the operational floor: every department must reach it. Level 3 is the organizational target. Non-engineering departments aim for Level 2 as their first milestone; engineering aims directly for Level 3 via the AI Lab.

Prerequisites:

  • Leadership is willing to eliminate roles, not just tasks (see the Role Decision Matrix)
  • Hiring profiles change to require Tier 2+ minimum
  • Product/service is redesigned assuming AI execution
  • Organizational structure flattens significantly

Timeline: 6-12 months

For engineering, the AI Lab lifecycle defines the specific phase sequence from Rung 3 to Rung 5.


Leadership Tiers

The company can't exceed the tier of its leadership. Leadership is the ceiling.

Tier 1: Publicly endorses AI. Uses it personally. Doesn't push adoption.

Tier 2: Sets expectations by role. Asks "how did AI help?". Funds automation before hiring.

Tier 3: Redesigns the organizational structure. Rewrites roles and KPIs. Makes AI literacy a condition of leadership.


Individual Tiers

Tier 1: "AI helps me do my job faster."

Tier 2: "AI helps us do this task better and more systematically."

Tier 3: "This role should exist differently because AI exists."

The difference between tiers is operational, not attitudinal.

  • Tier 1: uses AI tools but has no current sense of where the human-agent boundary sits for their domain; they calibrated once (or never) and haven't updated.
  • Tier 2: designs clean handoffs between human and agent work, maintains an accurate model of how agents fail for their specific tasks, and restructures workflows as capabilities shift.
  • Tier 3: does all of this, plus forecasts where the boundary will move next and allocates attention where it creates the most value, treating human attention as the scarcest resource in an agent-rich environment.

