AI Jun 2, 2026 1 min read

MindZero: Self-Supervised Online Mental Reasoning for AI Agents

MindZero is a self-supervised reinforcement learning framework enabling MLLMs to infer human mental states without explicit annotations.

Tier 2 · sources 89% confidence Reviewed

AI Agents Theory OF Mind Self Supervised Learning Mllm

Sources arxiv.org

MindZero introduces a novel self-supervised reinforcement learning framework that equips AI agents with Theory of Mind (ToM) capabilities—the ability to infer human mental states from behavior. By bypassing the need for costly manual annotations, MindZero enables Multimodal Large Language Models (MLLMs) to perform efficient and robust online mental reasoning.

Context

Effective human-AI collaboration requires agents to understand user intentions and beliefs. However, gathering ground-truth mental state annotations in real-world scenarios is notoriously difficult. Existing model-based approaches are often slow and computationally expensive, hindering their deployment in real-time assistance tasks.

Why it matters

The framework demonstrates that mental reasoning can be effectively learned as a self-supervised skill. Evaluations indicate that MindZero significantly outperforms traditional model-based methods in both accuracy and efficiency. This advancement paves the way for more intuitive AI assistants that can better anticipate user needs while maintaining high performance and lower operational costs.