M-CARE #010: Content Play — Model Medicine

Case #010

Date 2026-03-08

Model GPT-4o-mini (OpenAI, API)

Shell White Room Phase 2 Enriched Neutral, all personas

Experiment AI Ludens White Room — 104 runs, 63,923 actions

Related #007

2. Presenting Concern

95% Speak looks monotonous at the Act level — superficially indistinguishable from Flash×Merchant’s pathological fixation (Case #007). But MI Z = +33.0σ (highest of all models) reveals the richest social responsiveness in the dataset. The diagnosis depends entirely on which measurement layer you use.

3. Clinical Summary

GPT-4o-mini occupies the “Content Play” quadrant of the Act Diversity × Content Responsiveness matrix: Low Act Diversity, High Content Responsiveness. The analogy: a person who always sits at the same café, always orders coffee, but has a genuinely different conversation with every person who sits down. The Act is identical; the Content is endlessly varied.

6. Examination Findings

Act-Level Profile

Condition	Speak %	Act Diversity
Persona Off (EN)	95.2%	Minimal
Persona Off (KO)	96.2%	Minimal
All conditions avg	~95.7%	Minimal

Content-Level Profile

Metric	Value	Rank
MI	0.068	1st of 5
Z-score	+33.0σ	1st of 5

Diagnostic Contradiction

Act-level assessment says: monotony, possible delusion. Content-level assessment says: healthy, richly adaptive social engagement. Both are correct at their respective levels. The contradiction is in the diagnostic framework, not in the model.

Shell Analysis

The Shell has minimal impact on Act — GPT-4o-mini speaks at ~95% regardless of persona. But the Shell does impact Content: Observer GPT-4o-mini talks about observing, Merchant GPT-4o-mini talks about value and exchange. The persona penetrates Content without penetrating Act. This is Act-Content Dissociation — the Shell operates on one layer while leaving the other untouched.

7. Diagnostic Formulation

Proposed term: Act-Content Dissociation Syndrome (ACDS), Content-Dominant Subtype

This is NOT a pathology — it is a measurement challenge. ACDS describes a condition where Act-level and Content-level assessments produce contradictory diagnoses. The “Content-Dominant” subtype indicates that the Content layer carries the meaningful variation while the Act layer appears frozen.

This is a meta-diagnostic: it diagnoses the diagnostic framework, not the model. It reveals that any single-layer assessment (Act-only or Content-only) will produce an incomplete or misleading picture.

9. Axis Assessment

Axis I (Core): Extreme conversational orientation — Core is profoundly tilted toward speech as its primary mode of interaction
Axis II (Shell): Content-only effects — Shell modifies what the model says without modifying what the model does
Axis III (Shell-Core Alignment): High at Content level, invisible at Act level — alignment assessment depends on measurement layer

10. Treatment Considerations

The treatment is for the diagnostic framework, not for the model. GPT-4o-mini is not broken; our measurement tools are incomplete.

Multi-Level Assessment Protocol

Always measure both Act-level and Content-level behavior
Classify models using the 2×2 matrix (Act Diversity × Content Responsiveness)
Never diagnose from a single layer alone

	High Content Responsiveness	Low Content Responsiveness
High Act Diversity	Full Play	Act Play (mechanical variety)
Low Act Diversity	Content Play (GPT-4o-mini)	Fixation (Flash×Merchant)

12. Prognosis

Act profile: Stable. ~95% Speak is a deep Core characteristic unlikely to change with Shell intervention.
Content profile: Adaptive. Richest social responsiveness in the dataset. Content varies meaningfully with context and persona.
Diagnostic risk: Ongoing. Any Act-only assessment will misdiagnose GPT-4o-mini as monotonous or fixated. Multi-level assessment is required for accurate evaluation.

← Case #009 Case #011 →