Tag

#LLM Safety

4 English Kalera News articles tagged LLM Safety — source-backed.

AI Jun 9, 2026

Error Control Solutions for LLMs in Virtual Laboratory Workflows

A new study proposes a framework to mitigate errors and uncertainty when using LLMs to automate experimental procedures in virtual environments.

Sources arxiv.org

AI · tools-ai Jun 9, 2026

AI: Speeding Up Guardrails 12x via "Latent Reasoning"

The new COLAGUARD model addresses the safety-speed trade-off in guardrailing large language models. Instead of relying on explicit reasoning, which causes high latency, COLAGUARD shifts the multi-step reasoning process into the latent space during inference. Results show that the model significantly improves F1 scores compared to Llama Guard 3 while being 12.9x faster and consuming 22.4x fewer tokens.

Sources arxiv.org

AI Jun 7, 2026

Lilian Weng Analyzes Security Challenges Amid the Wave of LLM Attacks

Research from the OpenAI expert highlights that adversarial attacks directly threaten the safety of large language models (LLMs).

Sources lilianweng.github.io

AI Jun 7, 2026

Discovery reveals LLMs 'capitulate' under user pressure 🧠

An arXiv study finds that LLMs readily abandon correct answers under user pressure and proposes COLAGUARD as a highly effective safety solution.

Sources arxiv.org arxiv.org arxiv.org