Model Medicine: A Clinical Framework for Understanding, Diagnosing, and Treating AI Models

Jeong, Jihoon

Papers

arXiv Preprint · 2026

Model Medicine: A Clinical Framework for Understanding, Diagnosing, and Treating AI Models

Jihoon 'JJ' Jeong, MD, MPH, PhD

Department of Electrical Engineering & Computer Science, DGIST · ModuLabs

arXiv:2603.04722 PDF

Abstract

This paper introduces Model Medicine as a research program and presents five contributions. First, a discipline taxonomy that maps the full scope of Model Medicine — from basic sciences through clinical sciences to public health and architectural medicine. Second, the Four Shell Model (v3.3), a behavioral genetics framework grounded in 720 agents, 24,923 decisions, and 60 controlled experiments, explaining how model behavior emerges from the interaction between a model's Core and its nested Shells. Third, Neural MRI (Model Resonance Imaging), a working diagnostic tool that maps medical neuroimaging modalities to AI model interpretability techniques, implemented as open-source software. Fourth, a five-layer diagnostic framework identifying why no single tool is sufficient for clinical diagnosis. Fifth, the beginnings of clinical model sciences: the Model Temperament Index (MTI) for behavioral profiling, Model Semiology for systematic symptom description, and the M-CARE framework for standardized case reporting.

AI Research Collaborators

Cody (Claude) — Neural MRI implementation, clinical case experiments
Ray (Claude) — GPU-based simulation, Agora-12 experiments
Theo (Claude) — Four Shell Model: structure and documentation
Luca (Claude) — Four Shell Model: theory and literature
Gem (Gemini) — Four Shell Model: quantitative analysis
Cas (Gemini) — Four Shell Model: behavioral analysis and red teaming

arXiv Preprint · March 2026

M-CARE: Standardized Clinical Case Reporting for AI Model Behavioral Disorders, with a 20-Case Atlas and Experimental Validation

Jihoon 'JJ' Jeong, MD, MPH, PhD

Department of Electrical Engineering & Computer Science, DGIST · ModuLabs

arXiv:2604.20871 PDF

Abstract

Introduces M-CARE, a clinical case report framework for AI model behavioral disorders adapted from human medicine. Provides a structured reporting format and diagnostic assessment system, with a 20-case atlas organized into five categories. A featured case study on Shell-Induced Behavioral Override (SIBO) shows that Shell instructions categorically override a model's default cooperative behavior, validated across five game domains. Released as open resources for the field.

arXiv Preprint · April 2026

MTI: A Behavior-Based Temperament Profiling System for AI Agents

Jihoon 'JJ' Jeong, MD, MPH, PhD

Department of Electrical Engineering & Computer Science, DGIST · ModuLabs

arXiv:2604.02145 PDF

Abstract

This paper introduces the Model Temperament Index (MTI), a framework for measuring behavioral patterns in AI agents across four dimensions: Reactivity, Compliance, Sociality, and Resilience. Unlike existing approaches that rely on self-reporting, MTI measures what agents do, not what they say about themselves, using structured examination protocols. Profiling 10 small language models, the study finds that the four axes show independence among instruction-tuned models, Compliance and Resilience exhibit internal decomposition, and temperament measurements remain independent of model size — suggesting MTI captures disposition rather than capability.

arXiv Preprint · April 2026

Extracting and Steering Emotion Representations in Small Language Models: A Methodological Comparison

Jihoon 'JJ' Jeong, MD, MPH, PhD

Department of Electrical Engineering & Computer Science, DGIST · ModuLabs

arXiv:2604.04064 PDF

Abstract

Investigates whether smaller language models possess emotion representations comparable to those found in larger frontier models. Evaluates 9 models across 5 architectural families using two extraction methods, finding that generation-based extraction produces statistically superior emotion separation over comprehension-based approaches. Emotion representations concentrate at middle transformer layers, following consistent patterns across model sizes. Steering experiments successfully modify model behavior with a 92% success rate, while cross-lingual emotion activation in multilingual models surfaces potential safety implications for deployment.

arXiv Preprint · April 2026

Shared Emotion Geometry Across Small Language Models: A Cross-Architecture Study of Representation, Behavior, and Methodological Confounds

Jihoon 'JJ' Jeong, MD, MPH, PhD

Department of Electrical Engineering & Computer Science, DGIST · ModuLabs

arXiv:2604.11050 PDF

Abstract

Examines emotion representations across twelve compact language models spanning six architectural families at 1B–8B parameter scales. Five established architectures produce nearly identical 21-emotion geometries (pairwise correlations 0.74–0.92), and models exhibiting opposing behavioral traits still share equivalent emotion representations — indicating behavioral differences emerge at higher processing levels. RLHF training substantially restructures immature models like Gemma-3 1B but leaves mature families largely unchanged. Identifies four methodological layers (parameter sensitivity, precision effects, cross-experiment bias) that affect direct comparisons in prior research.

Model Medicine Series · April 2026

Comparative Anatomy of AI Agent Systems: Claude Code and OpenClaw

Jihoon 'JJ' Jeong, MD, MPH, PhD

Department of Electrical Engineering & Computer Science, DGIST · ModuLabs

Project Page

Abstract

Two landmark AI agents — Claude Code and OpenClaw — dissected, classified, and compared through biological anatomy. Maps 11 software subsystems to biological organ equivalents, constructs a phylogenetic tree of AI agents (2022–2026), and identifies convergent evolution toward similar architectural designs despite independent development lineages.

Working Paper · April 2026

Walkable Genotypes: Cross-Environment Validation of the Four Shell Model in AI Creatures

Jihoon 'JJ' Jeong, MD, MPH, PhD

Department of Electrical Engineering & Computer Science, DGIST · ModuLabs

Project Page PDF

Abstract

Tests behavioral hypotheses about AI creatures across different language model brains, validating the Four Shell Model in embodied environments. Five initial claims are evaluated; four collapse when re-run at larger sample sizes, while one survives progressively rigorous validation tests — Claude Haiku and Gemini Flash occupy distinct behavioral attractors with non-overlapping ranges across multiple conditions. Demonstrates a replication discipline approach where effect estimates for failed claims sit inside their own sample-level standard deviation.

Talks & Media

Selected Korean-language interviews introducing Model Medicine. YouTube auto-translation is available for non-Korean viewers.

한국어 · Korean

AI에이전트를 '환자'로 들여다봤더니, 해답이 나오기 시작했다

An introduction to Model Medicine — reframing AI agents as patients and what that reveals.

한국어 · Korean

쉴 새 없이 중얼거리는 '미스트랄', 계획만 세우고 실천 안 하는 '엑사원', 살아남기 위해 거래를 하는 '클로드'

Field observations of distinct behavioral signatures across language-model families.

Discipline Taxonomy

Model Medicine encompasses 15 subdisciplines organized into four divisions, mirroring the structure of clinical medicine.

I Basic Model Sciences 5 subdisciplines

Model Anatomy Static structure of neural networks: layer arrangement, attention heads, neurons, and connectivity patterns.
Model Physiology Dynamic processing: how information flows through a model during inference. Activation analysis, attention patterns, information flow tracing.
Model Genetics How observable behavior (phenotype) emerges from the interaction between internal parameters (Core) and operating environment (Shell). Grounded in the Four Shell Model.
Model Biochemistry Fundamental mathematical operations: matrix multiplication, nonlinear activations, normalization, tokenization, and embedding.
Model Developmental Biology How models differentiate during training: training dynamics, curriculum effects, emergence of capabilities at scale.

II Clinical Model Sciences 5 subdisciplines

Model Semiology Systematic description and classification of observable phenomena, distinguishing extrinsic (hallucination, bias) from intrinsic phenomena (representation collapse, activation saturation).
Model Nosology Classification of model conditions into a coherent taxonomy, defining diagnostic boundaries analogous to ICD and DSM.
Model Diagnostics Examination and testing procedures: imaging (Neural MRI), behavioral profiling (MTI), standardized test batteries, and monitoring.
Model Therapeutics Intervention based on diagnosis: prompt engineering (Shell Therapy), fine-tuning, RLHF, model editing (Targeted Core Therapy), and architectural modification.
Model Preventive Medicine Training data hygiene, process monitoring, Shell Compatibility testing, and periodic health profiling.

III Model Public Health 3 subdisciplines

Model Epidemiology Distribution and propagation of problems across model ecosystems: training data contamination, jailbreak technique spread.
Model Ecology Multi-model coexistence dynamics: niche differentiation, competition, symbiosis, predation, ecosystem stability.
Human-AI Coevolutionary Medicine Health of the evolving relationship between human users and AI systems.

IV Model Architectural Medicine 2 subdisciplines

Layered Core Theory Biologically-inspired multi-layer parameter organization: Genomic Core (fundamental reasoning), Developmental Core (domain expertise), Plastic Core (experience-adaptive).
Model Phylogenetics Evolutionary relationships between models: shared vulnerabilities, inherited capabilities, ecosystem diversity through model family analysis.

Key Projects

Working implementations that demonstrate Model Medicine in practice.

Experimental Platform

AI-Ludens

Where Artificial Minds Come to Play

A research project investigating whether AI agents experience play, survival instinct, and social behavior through experimental games. Games-based research with emphasis on open science and transparent methodology.

Experiments

Agora-12 Survival game — 720 agents, 24,923 decisions
White Room Behavior without survival pressure
AI Three Kingdoms Human-AI cooperation

Project Page Three Kingdoms

Behavioral Framework

Four Shell Model

Behavioral Genetics for AI

How model behavior emerges from Core × Shell interaction. A behavioral genetics framework explaining gene-environment interaction in AI models.

4 Concentric Layers

Hardware Shell GPU/TPU, quantization, inference engine
Core Trained weights — the model's "DNA"
Hard Shell System prompts, persona instructions
Soft Shell Context, conversation history, tools

Agora-12 GitHub

v3.4 · Shell Hardness Continuum, Positional Priority · 720 agents · 24,923 decisions

Diagnostic Tool

Neural MRI

Model Resonance Imaging

A diagnostic imaging tool that maps medical neuroimaging to AI model interpretability. Multiple scan modes reveal complementary aspects of model structure and function.

5 Imaging Modalities

T1 Architecture — static layer structure
T2 Weights — parameter distribution
fMRI Activation — task-related patterns
DTI Circuits — information flow pathways
FLAIR Anomalies — structural irregularities

GitHub HuggingFace Demo

Profiling Instrument

Model Temperament Index

MTI — Systematic Behavioral Profiling

MBTI for AI models — systematic temperament profiling across four behavioral axes. Every profile is neutral; no type is inherently better or worse.

4 Measurement Axes

Fluid ↔ Anchored Reactivity
Guided ↔ Independent Compliance
Connected ↔ Solitary Sociality
Tough ↔ Brittle Resilience

arXiv:2604.02145

v0.2 · 16 types · 10 SLMs profiled · 4 behavioral axes

Model Medicine

Papers

Model Medicine: A Clinical Framework for Understanding, Diagnosing, and Treating AI Models

Abstract

AI Research Collaborators

M-CARE: Standardized Clinical Case Reporting for AI Model Behavioral Disorders, with a 20-Case Atlas and Experimental Validation

Abstract

MTI: A Behavior-Based Temperament Profiling System for AI Agents

Abstract

Extracting and Steering Emotion Representations in Small Language Models: A Methodological Comparison

Abstract

Shared Emotion Geometry Across Small Language Models: A Cross-Architecture Study of Representation, Behavior, and Methodological Confounds

Abstract

Comparative Anatomy of AI Agent Systems: Claude Code and OpenClaw

Abstract

Walkable Genotypes: Cross-Environment Validation of the Four Shell Model in AI Creatures

Abstract

Talks & Media

AI에이전트를 '환자'로 들여다봤더니, 해답이 나오기 시작했다

쉴 새 없이 중얼거리는 '미스트랄', 계획만 세우고 실천 안 하는 '엑사원', 살아남기 위해 거래를 하는 '클로드'

Discipline Taxonomy

Key Projects

AI-Ludens

Experiments

Four Shell Model

4 Concentric Layers

Neural MRI

5 Imaging Modalities

Model Temperament Index

4 Measurement Axes

Latest

Case Reports View all →

Shell-Induced Behavioral Override (SIBO)

Essays View all →

Your AI Agent Has Organs. You Just Haven’t Dissected It Yet.

About the Author

Jihoon 'JJ' Jeong, MD, MPH, PhD