Microsoft AI models struggle with long, multi-step tasks

theregister.com — May 11, 2026 at 10:01 PM UTC

Microsoft researchers found that AI models introduce significant errors during long, multi-step tasks. Even advanced models lost an average of 25 percent of document content over 20 interactions and failed to meet readiness benchmarks in most of the professional domains tested. The study suggests that users must monitor AI systems closely: tools did not improve performance, and errors can appear suddenly, which has implications for current investments in AI automation.

With a significance score of 4.1, this news ranks in the top 4.9% of today's 34,005 analyzed articles.
