Techurz

    Can AI outdiagnose doctors? Microsoft’s tool is 4 times better for complex cases

    By Techurz, July 4, 2025 (4 min read)
    Microsoft AI system diagnoses complex cases better than human doctors - and for less money

    Research on AI for medicine looks increasingly promising: the tech already speeds up drug development, Google is using AI to improve its medical advice, and wearable companies are leveraging the technology for predictive health features. Now, Microsoft is the latest to move the goalposts.

    On Monday, the company announced in a blog post that Microsoft AI Diagnostic Orchestrator (MAI-DxO), its medical AI system, correctly diagnosed 85% of a set of cases from the New England Journal of Medicine (NEJM). That rate is more than four times higher than that of the human physicians it was compared against. NEJM cases are particularly complex and often require several specialists.


    Given how inaccessible, complex, and confusing healthcare systems continue to be, it’s no surprise people are seeking help from technology wherever possible. 

    “Across Microsoft’s AI consumer products like Bing and Copilot, we see over 50 million health-related sessions every day,” Microsoft said in the announcement. “From a first-time knee-pain query to a late-night search for an urgent-care clinic, search engines and AI companions are quickly becoming the new front line in healthcare.”

    How it works 

    Human physicians must pass the US Medical Licensing Examination (USMLE) to practice medicine, a test that’s also used to evaluate how AI systems perform in medical contexts, both model-to-model and when compared with humans. 

    Currently, AI scores well on the USMLE. Microsoft said this is a side effect of the models memorizing (rather than understanding) answers to multiple-choice questions, which won’t produce the soundest medical analysis. Most industry-standard AI benchmarks have been saturated for a while, meaning AI models are evolving too quickly for the tests to be usefully challenging.

    To combat this issue, Microsoft created the Sequential Diagnosis Benchmark (SD Bench). Sequential diagnosis is the process real clinicians use to diagnose patients: they begin with how the symptoms present and proceed with questions and tests from there. The benchmark turns 304 NEJM cases into diagnostic challenges in which humans and AI models ask questions and request tests to work toward a diagnosis.


    Microsoft then paired the diagnostic agent, MAI-DxO, with several frontier models, including GPT, Llama, Claude, Gemini, Grok, and DeepSeek, and put the agent to the SD Bench test. MAI-DxO turns whatever LLM it is using into a “virtual panel of physicians with diverse diagnostic approaches collaborating to solve diagnostic cases,” Microsoft explained.

    In a video demo, MAI-DxO also shows its reasoning as it queries the benchmark, develops possible diagnoses, and tracks the cost of each requested test. As the agent receives information about the case from the benchmark, it revises its diagnoses and asks for different scans, displaying a diagnostic process much more familiar to human physicians.
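The loop described above (start from the presentation, request one thing at a time, track cost, revise) can be sketched in a few lines of Python. This is only an illustration of sequential diagnosis under stated assumptions, not Microsoft's implementation: the `Case`, `Episode`, `request`, and `run_episode` names, the toy case, the dollar costs, and the one-rule `toy_diagnose` function are all hypothetical and carry no medical authority.

```python
from dataclasses import dataclass, field
from typing import Callable, Optional


@dataclass
class Case:
    """A toy case: an opening presentation plus findings hidden until requested."""
    presentation: str
    hidden_findings: dict[str, str]  # request name -> result revealed on request
    costs: dict[str, float]          # request name -> made-up cost in dollars


@dataclass
class Episode:
    """Running record of what has been revealed and how much has been spent."""
    revealed: dict[str, str] = field(default_factory=dict)
    total_cost: float = 0.0


def request(case: Case, episode: Episode, name: str) -> str:
    """Reveal one finding and add its cost to the running total."""
    result = case.hidden_findings.get(name, "not available")
    episode.revealed[name] = result
    episode.total_cost += case.costs.get(name, 0.0)
    return result


def run_episode(
    case: Case,
    plan: list[str],
    diagnose: Callable[[str, dict[str, str]], Optional[str]],
) -> tuple[Optional[str], Episode]:
    """Make requests one at a time and stop as soon as a diagnosis is committed."""
    episode = Episode()
    for name in plan:
        request(case, episode, name)
        guess = diagnose(case.presentation, episode.revealed)
        if guess is not None:  # enough evidence to commit, so stop spending
            return guess, episode
    return None, episode


def toy_diagnose(presentation: str, revealed: dict[str, str]) -> Optional[str]:
    """Hypothetical single-rule diagnoser, used only to exercise the loop."""
    if revealed.get("chest_xray") == "lobar consolidation":
        return "pneumonia"
    return None


toy_case = Case(
    presentation="fever and cough",
    hidden_findings={
        "ask_duration": "five days",
        "chest_xray": "lobar consolidation",
        "ct_scan": "consistent with x-ray",
    },
    costs={"ask_duration": 0.0, "chest_xray": 100.0, "ct_scan": 1000.0},
)

guess, episode = run_episode(
    toy_case, ["ask_duration", "chest_xray", "ct_scan"], toy_diagnose
)
print(guess, episode.total_cost, sorted(episode.revealed))
```

In this toy run the loop commits after the x-ray, so the more expensive scan in the plan is never ordered. A real agent would choose its next request dynamically rather than follow a fixed plan.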

    Correct diagnoses that cost less

    “MAI-DxO boosted the diagnostic performance of every model we tested,” said Microsoft’s blog post, noting that the system performed best when paired with OpenAI’s o3 model. The company compared the results to those of 21 physicians from the UK and the US with experience ranging from five to 20 years, who reached a mean accuracy of just 20%.
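As a quick check on the "four times" claim, taking the 85% figure reported for MAI-DxO and the 20% mean accuracy reported for the physicians at face value, the ratio works out as follows:

```latex
\[
\frac{\text{MAI-DxO accuracy}}{\text{mean physician accuracy}}
  = \frac{85\%}{20\%} = 4.25
\]
```

That is consistent with "more than four times," though it compares a single reported system figure against an average across 21 individual physicians.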


    Microsoft noted that MAI-DxO is also configurable, meaning it can run within cost limits set by a user or organization. That lets the agent weigh the cost and benefit of each test, which is highly relevant given the astronomical pricing of US medical care and is something human doctors and patients have to consider as well.

    This feature is also a guardrail, of sorts — without it, the AI might “default to ordering every possible test — regardless of cost, patient discomfort, or delays in care,” the blog post explained. MAI-DxO also returned higher accuracy and lower costs than individual models or human physicians. 
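To make the guardrail idea concrete, here is a minimal sketch of choosing tests under a cost cap. It uses a simple greedy "value per dollar" heuristic, which is an assumption for illustration and not the method Microsoft describes; the `choose_tests` function, the test names, the costs, and the `value` scores are all hypothetical.

```python
def choose_tests(candidates, budget):
    """Greedily pick tests by estimated value per dollar without exceeding budget."""
    ranked = sorted(candidates, key=lambda t: t["value"] / t["cost"], reverse=True)
    chosen, spent = [], 0.0
    for test in ranked:
        if spent + test["cost"] <= budget:  # skip anything that would break the cap
            chosen.append(test["name"])
            spent += test["cost"]
    return chosen, spent


# Made-up candidates: "value" stands in for an estimated diagnostic benefit.
candidates = [
    {"name": "blood_panel", "cost": 50, "value": 0.30},
    {"name": "chest_xray", "cost": 100, "value": 0.40},
    {"name": "ct_scan", "cost": 1000, "value": 0.50},
    {"name": "mri", "cost": 2000, "value": 0.55},
]

capped, capped_cost = choose_tests(candidates, budget=200)
uncapped, uncapped_cost = choose_tests(candidates, budget=float("inf"))

print(capped, capped_cost)      # cheaper, higher-yield tests only
print(uncapped, uncapped_cost)  # with no cap, every test gets ordered
```

With no budget the heuristic orders everything, which mirrors the "every possible test" failure mode quoted above. With a cap of 200 it keeps only the two inexpensive tests.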

    Will AI replace your doctor?

    Probably not anytime soon, though Microsoft’s blog post noted that because of its breadth of knowledge, AI can demonstrate “clinical reasoning capabilities that, across many aspects of clinical reasoning, exceed those of any individual physician.”

    The company believes systems like this one can “reshape healthcare” by giving patients a reliable way to check themselves and by helping doctors with complex cases. The cost savings would be another plus for an industry constantly plagued by inexplicably high costs and opaque pricing structures.


    Microsoft conceded that MAI-DxO has only been tested on these especially complex cases, so it’s unclear how it would handle everyday tasks. However, that gap may matter less if the agent isn’t intended to replace human doctors, a position Microsoft also maintained in the blog post.

    MAI-DxO is part of a “dedicated consumer health effort” Microsoft AI initiated last year, the company said in the release. Other AI products within that initiative include RAD-DINO, a radiology workflow tool, and Microsoft Dragon Copilot, a voice AI assistant designed for medical professionals. 
