Bỏ qua đến nội dung chính
Back to home
Tech 2 min read

ChatGPT now automatically fills out forms using photos and voice 📝

OpenAI has updated ChatGPT with a new feature that automatically fills out various forms using uploaded images combined with text or voice instructions, streamlining paperwork processing.

Tier 2 · sources 95% confidence Reviewed
📚 Aggregated from 2 sources X — @ChatGPTapp X — @ChatGPTapp

OpenAI has officially introduced automatic form filling for ChatGPT, marking a significant step forward in applying AI to everyday paperwork tasks. Users can now upload a photo of any form and have ChatGPT complete it based on the provided information.

How it works

This new feature leverages the multimodal capabilities of the latest large language models. According to OpenAI, this process can be done in two flexible ways. First, users can upload an image of the form and type in the details for the AI to automatically place them in the correct spots.

Second, ChatGPT allows users to "speak" directly via Voice Mode to fill out forms. "Paperwork is better when you can talk through it," OpenAI shared on social media platform X. Combining computer vision and voice, ChatGPT can recognize blank fields, section headings, and form layouts to generate a completed version. This significantly reduces the hassle of dealing with lengthy files or complex application forms.

Why it matters

This is not just a mere utility tool but also proof that ChatGPT is evolving into an "agent" capable of powerfully interacting with the physical world through images. For users in Vietnam, this feature is incredibly useful for handling receipts, applications, or administrative forms that still play a major role in daily life.

Instead of manually typing out every line of information from a photo into a computer, users can simply give commands. However, experts warn that users should be cautious when sharing images containing sensitive personal information with AI. Data security and the accuracy of information extraction on forms with unique structures or languages will require further real-world verification.