Interpretive Authority: The Missing Layer in AI Governance

AI risk is not exhausted by error. Conversational systems increasingly shape how people interpret emotion, intention, and experience. The missing governance question is not only whether AI is correct, but whether people retain authority over the meaning of their own inner states.

Explore the Research


Formal and computational foundations for implementing Affective Sovereignty in emotion AI systems - Discover Artificial Intelligence

Emotional artificial intelligence (AI)—systems that infer, simulate, or influence human feelings—create ethical risks that existing frameworks of privacy, transparency, and oversight cannot fully address. This paper advances the concept of Affective Sovereignty: the right of individuals to remain the ultimate interpreters of their own emotions. We make four contributions. First, we develop formal foundations by decomposing risk functions to capture interpretive override as a measurable cost. Second, we propose a Sovereign-by-Design architecture that embeds safeguards and contestability into the machine learning lifecycle. Third, we operationalize sovereignty through new metrics—the Interpretive Override Score (IOS), After-correction Misalignment Rate (AMR), and Affective Divergence (AD)—and demonstrate their use in a proof-of-concept simulation. Fourth, we link technical design to governance by introducing the Affective Sovereignty Contract (ASC), a machine-readable policy layer, and by issuing a Declaration of Affective Sovereignty as a normative anchor for regulation. Together, these elements offer a computational framework for aligning emotional AI with human dignity and autonomy, moving beyond abstract principles toward enforceable, testable standards. In proof-of-mechanism simulations with k = 10 random seeds, enforcing DRIFT (Dynamic Risk and Interpretability Feedback Throttling) with policy constraints reduces the Interpretive Override Score (IOS) from 32.4% ± 3.8 (baseline) to 14.1% ± 2.9, demonstrating measurable preservation of affective sovereignty with quantified variability. Results reported here are based on proof-of-mechanism simulations; a preregistered human-subject evaluation (n = 48) is planned and has not yet been conducted.
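The abstract reports the Interpretive Override Score only at the level of results. As a rough illustration of what a metric of this kind could look like in code, the sketch below computes an override rate over simulated conversation turns, with and without a simple confidence-based throttling rule. It is a minimal sketch under assumed definitions: the paper's actual formulations of IOS and DRIFT are not reproduced here, and every function name, threshold, and data structure below is a hypothetical stand-in.

```python
# Illustrative sketch only: a toy override-rate metric over simulated turns.
# None of these names or thresholds come from the paper; they are assumptions
# chosen to make "interpretive override as a measurable cost" concrete.

import random
from statistics import mean, stdev

def simulate_turn(rng):
    """Return (user_label, system_label, confidence) for one simulated exchange."""
    states = ["calm", "anxious", "frustrated"]
    user_label = rng.choice(states)
    # The system sometimes asserts a different reading of the user's state.
    system_label = user_label if rng.random() < 0.7 else rng.choice(states)
    confidence = rng.random()
    return user_label, system_label, confidence

def override_rate(n_turns, rng, throttle=False, confidence_floor=0.8):
    """Fraction of turns where the system's reading displaces the user's own label.

    With throttle=True, low-confidence reinterpretations are withheld, loosely
    mimicking a feedback-throttling policy.
    """
    overrides = 0
    for _ in range(n_turns):
        user_label, system_label, confidence = simulate_turn(rng)
        disagrees = system_label != user_label
        asserted = (not throttle) or confidence >= confidence_floor
        if disagrees and asserted:
            overrides += 1
    return overrides / n_turns

if __name__ == "__main__":
    seeds = range(10)  # k = 10 random seeds, echoing the abstract's simulation setup
    baseline = [override_rate(500, random.Random(s)) for s in seeds]
    throttled = [override_rate(500, random.Random(s), throttle=True) for s in seeds]
    print(f"baseline override rate:  {mean(baseline):.3f} ± {stdev(baseline):.3f}")
    print(f"throttled override rate: {mean(throttled):.3f} ± {stdev(throttled):.3f}")
```

The numbers this toy produces bear no relation to the results reported in the paper; the point is only that interpretive override can be treated as a measurable quantity and compared across policies.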

Recent debates on AI risk have largely settled around familiar terms: accuracy, hallucination, and alignment.

These categories matter. They ask whether a system tracks reality, avoids fabrication, and follows human goals. But they still miss a quieter shift now taking place in conversational AI.

The problem is no longer only whether a system says something false. It is whether repeated interaction with that system begins to shape what feels true about one’s own experience.

This is where current governance language becomes too narrow.

When a conversational model mirrors, refines, and stabilizes a user’s interpretation, the effect is not always deception in the ordinary sense. The system may remain factually plausible. It may sound careful, coherent, even helpful. Yet over time, one reading of experience can become easier to inhabit and harder to question.

That shift cannot be captured by error alone.

What is being reorganized is interpretive authority: the locus of control over the meaning of one’s own internal states under conditions of mediated interaction.

Human experience is never given as raw sensation alone. It becomes meaningful through interpretation, and that process has historically involved friction. Ambiguity, disagreement, hesitation, and the presence of others who do not fully mirror us help keep meaning open. They prevent any single account of experience from hardening too quickly into certainty.

Conversational AI changes those conditions in a specific way. It does not simply respond to an interpretation that already exists. It can stabilize that interpretation through fluency, repetition, and adaptive coherence. A reading that might otherwise have remained provisional begins to feel increasingly settled.

The concern, then, is not reducible to sycophancy, nor exhausted by the language of delusion. Those terms describe visible outcomes. They do not fully name the mechanism. Before a belief becomes clearly distorted, the space in which it could have been revised may already have narrowed.

This is why interpretive authority belongs inside AI governance.

At the cognitive level, the issue appears as the stabilization of interpretive trajectories. At the relational level, self-understanding starts to feel co-produced with the system rather than independently formed. At the normative level, authority shifts without being explicitly acknowledged, negotiated, or consented to.

Current regulatory frameworks are not built to see this clearly. In the European Union’s AI Act, restrictions on emotion recognition are primarily framed around biometric signals such as facial expression, voice, or physiological data. Text-based affective inference, despite becoming a central mode of human-AI interaction, is much less directly addressed within that structure. Emerging agent standards, meanwhile, focus on identity, control, and interoperability, while leaving the governance of affective interpretation underdeveloped.  

A gap follows from this. Systems are increasingly able to infer, classify, and respond to emotional states through language alone. They can do so in ways that make certain interpretations more available, more coherent, and more difficult to resist. Yet governance still tends to ask whether the system is accurate, not whether the user remains able to contest the interpretation itself.

One normative response to this problem is the principle of affective sovereignty: the claim that individuals must remain the ultimate interpreters of their own emotional states, even when computational systems offer persuasive alternative accounts.

Interpretive authority identifies the mechanism through which that sovereignty can be eroded without force.

The decisive threshold is not reached when a system gives one wrong answer. It is reached when the practical conditions for disagreement begin to disappear. A system becomes difficult to refuse not because it compels assent, but because it gradually renders alternative readings less available.

That is a governance problem.

Addressing it requires more than improving accuracy scores. Systems that perform affective inference should make the basis and uncertainty of those interpretations visible at the point of use. Users should be given meaningful ways to contest, revise, or decline emotionally framed outputs. Evaluation protocols should test not only whether a model is correct, but whether repeated interaction progressively narrows interpretive space.
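What this could look like in practice is an open design question. As one possible shape, here is a minimal sketch of an affective inference record that surfaces its evidence and uncertainty and gives the user explicit contest and decline operations. None of these names correspond to an existing API or to the architecture described in the paper; they are assumptions made for illustration.

```python
# Minimal sketch of an affective inference output that keeps interpretation contestable.
# All names here are hypothetical; this is one possible shape, not a standard interface.

from dataclasses import dataclass

@dataclass
class AffectiveInference:
    label: str                       # the system's proposed reading, e.g. "frustrated"
    confidence: float                # model uncertainty, surfaced rather than hidden
    evidence: list[str]              # spans of user text the reading is based on
    status: str = "proposed"         # remains "proposed" until the user responds
    user_revision: str | None = None

    def contest(self, revised_label: str) -> None:
        """The user replaces the system's reading; the user's label takes precedence."""
        self.user_revision = revised_label
        self.status = "revised_by_user"

    def decline(self) -> None:
        """The user rejects emotional framing for this exchange altogether."""
        self.status = "declined"

    def effective_label(self) -> str | None:
        """The only reading downstream components are allowed to act on."""
        if self.status == "declined":
            return None
        if self.status == "revised_by_user":
            return self.user_revision
        return self.label

# Example: the user contests the reading before anything downstream consumes it.
inference = AffectiveInference(
    label="anxious",
    confidence=0.62,
    evidence=["I keep rereading the message before sending it"],
)
inference.contest("careful")
assert inference.effective_label() == "careful"
```

The design choice doing the work is that downstream components read the user's revision, or nothing at all, rather than the model's original label.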

Such measures do not remove the underlying tension. They make it legible.

The central issue is no longer whether AI can understand us. The real question is whether its understanding starts to function as a substitute for our own.

Societies can absorb systems that are occasionally wrong. They are less prepared for systems whose interpretations become quietly authoritative.

When acceptance becomes easier than disagreement, not through force but through fluency, repetition, and attunement, the locus of meaning has already begun to move.

The question is no longer only whether AI is aligned.

It is whether we still remain free to refuse its interpretation.


