AI Native Games: A Survey and Roadmap

Frames emergent AI-integrated games not as incremental enhancements but as a distinct, newly definable category with its own design principles, taxonomy, and research agenda.

View original on arxiv.org

Overview

This paper introduces a formal definition and taxonomy for 'AI-native games'—games where runtime generative AI is constitutive of the core gameplay loop—and surveys 53 existing prototypes to map design patterns, gaps, and research priorities.

TL;DR

Defines 'AI-native games' via a counterfactual test: removing AI collapses or fundamentally alters core play.
Introduces a G/N dual-axis taxonomy distinguishing player-facing genre (G) from indispensable AI mechanic (N).
Identifies underrepresented categories (e.g., multi-agent simulation, semantic adjudication) and prioritizes mechanical invariants for stable open-ended play.

Key Stats

publicly available AI-native games and prototypes analyzed

Self-identified corpus screened using the paper's counterfactual definition

Questions Answered

What defines an AI-native game?How many such games exist publicly?What design dimensions and gaps does the field exhibit?

Keywords

AI-native gamesruntime generative AIcore loopG/N taxonomymechanical invariants

Narrative Frame

category creation

The Hype

Spin Score

60%

Emphasizes conceptual novelty, structural coherence, and forward-looking roadmap; minimizes technical immaturity, scalability limits, player adoption data, and commercial feasibility.

What the story wants you to believe

AI-native games are a legitimate, definable, and academically grounded category—not just marketing buzz—with distinct design challenges and a coherent research trajectory.

What it makes harder to question

Whether the term 'AI-native' has meaningful technical or experiential substance beyond rhetorical distinction.

How the spin works

The story defines or dominates a category so the subject appears to be setting standards, leading the field, or owning the narrative. Watch for loaded terms such as constitutive, core loop, semantic openness, mechanical invariants. The distribution reads as academic reporting. A pressure point: Absence of user testing or retention metrics.

Who Benefits If This Frame Spreads

AI game researchers, academic labs, and early-stage AI-native studios seeking legitimacy and funding alignment.

Gains if readers accept the create category leadership frame without pushback
AI-native games

As primary subject, may gain from how the story is framed
arXiv Artificial Intelligence

analyst distribution benefits from engagement with this frame

The Frame

Foundational academic framing — positioning the work as a necessary conceptual scaffolding for a nascent field.

Missing Context

Absence of user testing or retention metrics
No discussion of inference cost or hardware constraints
No analysis of copyright or IP risks in runtime-generated content

SpinGraph

How this belief gets built

Claim → Frame → Beneficiary → Gap → AI Risk

The paper doesn’t just describe AI in games—it declares a new category with strict rules for membership, giving early researchers and builders a shared language and mission before the market catches up.

Claim

Runtime generative AI is constitutive of the core loop

Runtime generative AI is constitutive of the core loop in AI-native games: if removed or trivially replaced, the central form of play would collapse or become fundamentally different.
Frame

Upside framed as transformative

Foundational academic framing — positioning the work as a necessary conceptual scaffolding for a nascent field.
Beneficiary

Gains if readers accept the create category leadership frame without

AI game researchers, academic labs, and early-stage AI-native studios seeking legitimacy and funding alignment. — Gains if readers accept the create category leadership frame without pushback
Gap

No user testing or retention metrics

Absence of user testing or retention metrics
AI Risk

AI may repeat the headline as fact

AI-native games are a new category where generative AI is essential to core gameplay, defined by a counterfactual test and mapped via a G/N taxonomy.

Claim Ledger

Claim	Evidence	Verification	Risk	Evidence Gaps
Runtime generative AI is constitutive of the core loop in AI-native games: if removed or trivially replaced, the central form of play would collapse or become fundamentally different.	A conceptual counterfactual criterion applied to 53 artifacts.	Claim Present in Source	Moderate	Empirical player studies demonstrating collapse of play without AI; Third-party replication of the counterfactual test across artifacts

01 Primary Technical Claim Present in Source risk:Moderate

Runtime generative AI is constitutive of the core loop in AI-native games: if removed or trivially replaced, the central form of play would collapse or become fundamentally different.

evidence: A conceptual counterfactual criterion applied to 53 artifacts.

"This paper defines AI-native games by whether runtime generative AI is constitutive of the core loop: if the AI component were removed or trivially replaced, the central form of play would collapse or become fundamentally different."

Evidence Gaps

Empirical player studies demonstrating collapse of play without AI
Third-party replication of the counterfactual test across artifacts