OpenAI tests personal finance management feature in ChatGPT
OpenAI has introduced a new feature for ChatGPT Pro users in the US, allowing them to connect financial accounts to track spending and ask questions based on personal data.
Tag
47 English Kalera News articles tagged Agent Framework — source-backed.
Starting June 15, Claude paid-plan users will receive monthly credits dedicated to coding, including Claude Agent SDK and Claude Code.
Microsoft Research has shared its new research focus areas, which include cloud efficiency, cost reduction for agentic systems, 3D telemedicine, and promoting inclusive AI in Africa.
At its inaugural conference in Paris, Mistral AI announced its Vibe agent platform, an AI strategy for industrial manufacturing, and plans to build its own data centers to challenge US rivals.
An analysis of the core architecture of autonomous AI agents that use LLMs as their brain and solve complex problems through planning, memory, and tools.
OpenAI has launched a preview of Codex on the ChatGPT mobile app, allowing users to run AI agents directly on their phones.
Chip Huyen shares her observations from the Agentic Hackathon, highlighting core challenges such as memory management, error recovery, and maintaining consistency among sub-agents.
AlphaEvolve, Google DeepMind's Gemini-powered coding agent, has helped accelerate progress in various fields ranging from quantum physics to logistics over the past year.
NVIDIA and SAP have announced specialized enterprise AI agents with built-in security and control layers integrated into the SAP Business AI platform.
OpenAI has begun rolling out a preview of Codex on the ChatGPT mobile app (iOS and Android), allowing users to monitor and orchestrate programming tasks remotely.
Kimi Moonshot introduces Kimi K2.6, a multimodal agentic AI model capable of scaling up to 300 sub-agents via Agent Swarm, now available on Together AI.
AI expert Chip Huyen recently shared a detailed article on AI agent architecture, covering topics from tool selection to the planning capabilities of autonomous systems.
OpenAI has integrated Codex directly into the ChatGPT app, allowing users to operate AI agents to handle tasks from anywhere via mobile devices.
Bindu Reddy describes Agent Swarms as multi-agent systems capable of combining the most powerful LLMs, such as GPT 5.5 and Gemini Pro, to autonomously run tasks ranging from programming to marketing.
Anthropic has introduced two major security enhancements for Claude Managed Agents, including self-hosted sandboxes and secure MCP tunnels.
Anthropic has upgraded Claude Managed Agents, allowing users to swap tools and MCP servers directly within an active session without needing a restart.
Anthropic has announced a self-hosted sandbox feature for Claude Managed Agents, enabling enterprises to run their own secure code execution environments alongside the new claude-api toolkit.
In an interview with Rowan Cheung, Google CEO Sundar Pichai discussed the future of AI, advice for youth, and the power of the new Omni model.
Researchers have introduced DeepTS and DeepScribe, two autonomous AI agent frameworks designed to automate time-series data collection and convert complex physics lectures into structured scientific reports.
Microsoft Research has announced a new suite of releases, including MagenticLite, agentic GitHub workflows, and new fine-tuning methods for AI.
A new study reveals that AI search agents tend to be 'lazy,' only seeking to confirm what they already know instead of conducting deep web research to find new information.
OdabNote is an 'incorrect-answer note' system that helps AI coding agents learn from their mistakes and never repeat them.
Palaver is a unique multi-agent AI chatroom application that allows you to create, chat, and collaborate with multiple individual AI agents in a single space.
Anthropic has announced Claude Opus 4.8, a powerful upgrade that helps the company reclaim the performance crown from OpenAI and Google while introducing the groundbreaking dynamic workflows feature.
Microsoft Research has announced Data Formulator, a data analysis platform that allows businesses to integrate data into an AI workspace to automate discovery and visualization via AI agents.
Repo2RLEnv is an open-source tool that converts any repository into a runnable and verifiable coding environment. Built on real-world PRs and commits, it supports evaluating and training AI models with reinforcement learning in the software engineering domain.
A proposed hierarchical control framework helps compact language models adhere to protocols and adapt to changing states in agent systems.
A new study introduces DynaSchedBench, a standardized benchmark for the Dynamic Flexible Job-Shop Scheduling Problem (DFJSP), revealing the limitations of AI agents when they are given excessive data.
A new study introduces the SMARt framework, which helps AI agents self-detect errors, pause operations, and delegate control when confidence drops.
Researchers have developed SocialBot, an AI agent capable of planning and acting based on constantly changing social norms to interact safely with humans.
A collaboration between OpenAI, Thrive, and Crete has produced a tax-filing AI agent capable of self-improving its accuracy and accelerating workflows through Codex.
Anthropic proposes dynamically adjusting AI agent permissions based on capability and implementing "sandboxing" to minimize the scope of potential destructive actions.
Google DeepMind has introduced AlphaProof Nexus, an agent framework that utilizes the Gemini model for formal mathematical proof search, successfully solving 9 open Erdős problems.
Tech expert Bindu Reddy shares a method for combining today's most powerful AI models, including Gemini, Claude, and GPT, to build multi-agent systems capable of complex automation.
Abacus AI has announced a new service allowing users to quickly deploy models like Hermes and Claude on its supercomputing infrastructure, facilitating the creation of always-on AI agents.
Alibaba Cloud has introduced Qwen3.7-Max, featuring a 1M-token context window and outstanding performance in coding, reasoning, and long-horizon autonomy.
Anthropic has rolled out two major updates to Claude's "auto mode" feature, enabling the Sonnet 4.6 and Opus 4.7 models to execute tasks autonomously on the Pro plan.
NVIDIA and Dell have announced a major upgrade to the Dell AI Factory solution, providing a comprehensive infrastructure to deploy autonomous AI agents from personal workstations to large-scale data centers.
AWS SageMaker AI has partnered with Hugging Face to launch Strands, enabling the deployment of powerful Open Agents with MCP integration, tool use, and reasoning traces.
NVIDIA and Google Cloud are celebrating the one-year anniversary of their partnership with over 100,000 developers joining their joint community, focusing on deploying RAG applications and multi-agent pipelines.
Google DeepMind has introduced 'Computational Discovery', an agentic AI prototype capable of automatically developing and evaluating thousands of code variations in parallel to accelerate scientific research.
Bindu Reddy introduces a method using Agent Swarms that combine leading AI models like Opus 4.7, GPT 5.5, and Gemini 3.5 to automatically build fully-featured, full-stack software products.
The Co-Scientist system employs an 'idea tournament' mechanism among AI agents to brainstorm and evaluate new scientific hypotheses.
Google DeepMind showcases Gemini 1.5 Flash's ability to coordinate multiple subagents simultaneously to design and build a virtual city.
Google Antigravity 2.0 transitions from an IDE into a standalone desktop application, focusing entirely on the AI Agent experience without requiring a complex programming environment.
Google introduces Gemini 3.5, its latest model family designed to combine powerful reasoning with real-world task execution.
Microsoft Research has introduced new AI solutions capable of autonomously running repositories, along with a 'verification-first' research methodology.