OpenAI's new AI agent struggled with basic tasks

futurism.com — July 20, 2025 at 02:01 PM UTC

OpenAI's new ChatGPT Agent, designed to automate tasks, took an hour to order food and recommended a baseball stadium in the ocean, highlighting its limitations. The AI agent, which uses a "virtual computer," requires human approval for significant actions, slowing down its performance. It struggled with simple tasks and made factual errors, such as suggesting a stadium location in the Gulf of Mexico. The agent is initially available to Pro users with a prompt limit, and will be rolled out to other subscribers later. The release underscores ongoing challenges in AI reliability and the need for human oversight.

With a significance score of 3.4, this news ranks in the top 9% of today's 33133 analyzed articles.

Get summaries of news with significance over 5.5 (usually ~10 stories per week). Read by 10,000+ subscribers: