Tag

#AI

48 English Kalera News articles tagged AI — source-backed.

AI

AI: Human-AI Interaction Trends in Clinical Trials

An analysis of ClinicalTrials.gov data reveals a sharp increase in AI-related trials, particularly in China and the US. The study used GPT-5.5 together with human experts to classify the trials. The results indicate that hybrid human-AI models hold great potential, but that the interaction between human and AI must be defined more clearly before they can reach high accuracy.

Sources arxiv.org
AI

AI: Lecturer Beliefs and Behaviors in AI-Integrated Education

The study surveyed 72 university lecturers about AI integration, using the DOT framework. The results indicate strong support for AI as a pedagogical tool, along with an emphasis on the importance of human oversight. The main barriers remain the lack of institutional policies, training, and infrastructure.

Sources arxiv.org
AI · tools-ai

AI: Speeding Up Guardrails 12x via "Latent Reasoning"

The new COLAGUARD model addresses the safety-speed trade-off in guardrailing large language models. Instead of requiring explicit reasoning, which causes high latency, COLAGUARD shifts the multi-step reasoning process into the latent space during inference. Results show that the model significantly improves F1 scores compared to Llama Guard 3 while running 12.9x faster and consuming 22.4x fewer tokens.

Sources arxiv.org
AI · tools-ai

AI: Mitigating Hallucination with Agentic AI and Nested Learning

A new study proposes a Nested Learning architecture combined with Continuum Memory Systems (CMS) to mitigate hallucination in multi-agent systems. The three-stage pipeline reduces the Total Hallucination Score (THS) by 31.3% to 35.9%. Semantic caching eliminates 47.3% of LLM calls, which lowers energy and operating costs at production scale.

Sources arxiv.org
AI · tools-ai

AI: Securing Autonomous Agents with Out-of-Band Data

Redpanda introduces the Agentic Data Plane (ADP), an architecture that utilizes out-of-band metadata channels to manage security for autonomous AI agents. Instead of relying on agents to handle access policies directly, ADP pushes security contexts and audit trails out of their control. This helps prevent risks from agent hallucinations or manipulation, ensuring compliance with data rights and execution policies even in complex tasks like financial portfolio management.

Sources arxiv.org
AI · tools-ai

AI: BEAMS - A Framework for Evaluating AI in Modeling and Simulation

The BEAMS initiative establishes standards for the responsible and ethical use of AI in modeling and simulation. Experimental results show that current AI tools are strong at discussion and qualitative tasks but still struggle with causal reasoning and quantitative debugging. The open-source sd-ai project helps make the evaluation more transparent.

Sources arxiv.org
AI · tools-ai

AI: VFEAgent - Automating Finite Element Analysis with Multi-Agent Systems

VFEAgent is an end-to-end multi-agent system that automates the Finite Element Analysis (FEA) workflow from images and descriptions. The system combines ReAct reasoning capabilities with a self-correcting code generation framework. Experiments demonstrate that VFEAgent outperforms traditional LLM approaches in terms of reliability and physical validity.

Sources arxiv.org
AI · tools-ai

AI: LLM Agents Can Break the "Bottleneck" of Biological Phenotype Annotation

New research shows that LLM-based AI agents from Anthropic and OpenAI can annotate biological phenotype data with accuracy comparable to that of human experts. Phenotype annotation has traditionally been a highly specialized, time-consuming process, which creates a bottleneck in evolutionary biology research. Agents equipped with a self-contained workspace (research PDFs, annotation guidelines, ontologies) far exceeded the performance of traditional NLP tools.

Sources arxiv.org