COMPASS is a framework that utilizes MCTS (Monte Carlo Tree Search) to safely align LLM-powered search agents. It helps detect malicious intents disguised as seemingly harmless sub-queries while monitoring every step of the agent's execution.
Why It Matters
Prevents "stealthy attacks" on AI-powered search systems while maintaining overall performance.