Modern Australian

OpenAI’s new ‘deep research’ agent is still just a fallible tool – not a human-level expert

Written by Raffaele F Ciriello, Senior Lecturer in Business Information Systems, University of Sydney
OpenAI’s “deep research” is the latest artificial intelligence (AI) tool making waves and promising to do in minutes what would take hours for a human expert to complete.

Bundled as a feature in ChatGPT Pro and marketed as a research assistant that can match a trained analyst, it autonomously searches the web, compiles sources and delivers structured reports. It even scored 26.6% on Humanity’s Last Exam (HLE), a tough AI benchmark, outperforming many models.

But deep research doesn’t quite live up to the hype. While it produces polished reports, it also has serious flaws. According to journalists who’ve tried it, deep research can miss key details, struggle with recent information and sometimes invent facts.

OpenAI flags this when listing the limitations of its tool. The company also says it “can sometimes hallucinate facts in responses or make incorrect inferences, though at a notably lower rate than existing ChatGPT models, according to internal evaluations”.

It’s no surprise that unreliable data can slip in, since AI models don’t “know” things in the same way humans do.

The idea of an AI “research analyst” also raises a slew of questions. Can a machine – no matter how powerful – truly replace a trained expert? What would be the implications for knowledge work? And is AI really helping us think better, or just making it easier to stop thinking altogether?

What is ‘deep research’ and who is it for?

Marketed towards professionals in finance, science, policy, law and engineering, as well as academics, journalists and business strategists, deep research is the latest “agentic experience” OpenAI has rolled out in ChatGPT. It promises to do the heavy lifting of research in minutes.

Currently, deep research is only available to ChatGPT Pro users in the United States, at a cost of US$200 per month. OpenAI says it will roll out to Plus, Team and Enterprise users in the coming months, with a more cost-effective version planned for the future.

Unlike a standard chatbot that provides quick responses, deep research follows a multi-step process to produce a structured report:

  1. The user submits a request. This could be anything from a market analysis to a legal case summary.
  2. The AI clarifies the task. It may ask follow-up questions to refine the research scope.
  3. The agent searches the web. It autonomously browses hundreds of sources, including news articles, research papers and online databases.
  4. It synthesises its findings. The AI extracts key points, organises them into a structured report and cites its sources.
  5. The final report is delivered. Within five to 30 minutes, the user receives a multi-page document – potentially even a PhD-level thesis – summarising the findings.

At first glance, it sounds like a dream tool for knowledge workers. A closer look reveals significant limitations.

Many early tests have exposed shortcomings:

  • It lacks context. AI can summarise, but it doesn’t fully understand what’s important.
  • It ignores new developments. It has missed major legal rulings and scientific updates.
  • It makes things up. Like other AI models, it can confidently generate false information.
  • It can’t tell fact from fiction. It doesn’t distinguish authoritative sources from unreliable ones.

While OpenAI claims its tool rivals human analysts, AI inevitably lacks the judgement, scrutiny and expertise that make good research valuable.

What AI can’t replace

ChatGPT isn’t the only AI tool that can scour the web and produce reports with just a few prompts. Notably, a mere 24 hours after OpenAI’s release, Hugging Face released a free, open-source version that nearly matches its performance.

The biggest risk of deep research and other AI tools marketed for “human-level” research is the illusion that AI can replace human thinking. AI can summarise information, but it can’t question its own assumptions, highlight knowledge gaps, think creatively or understand different perspectives.

And AI-generated summaries don’t match the depth of a skilled human researcher.

Any AI agent, no matter how fast, is still just a tool, not a replacement for human intelligence. For knowledge workers, it’s more important than ever to invest in skills that AI can’t replicate: critical thinking, fact-checking, deep expertise and creativity.

If you do want to use AI research tools, there are ways to do so responsibly. Thoughtful use of AI can enhance research without sacrificing accuracy or depth. You might use AI for efficiency, like summarising documents, but retain human judgement for making decisions.

Always verify sources, as AI-generated citations can be misleading. Don’t trust conclusions blindly, but apply critical thinking and cross-check information with reputable sources. For high-stakes topics, such as health, justice and democracy, supplement AI findings with expert input.

Despite prolific marketing that tries to tell us otherwise, generative AI still has plenty of limitations. Humans who can creatively synthesise information, challenge assumptions and think critically will remain in demand – AI can’t replace them just yet.

Read more https://theconversation.com/openais-new-deep-research-agent-is-still-just-a-fallible-tool-not-a-human-level-expert-249496
