• AI KATANA
  • Posts
  • A small study found ChatGPT outdid human physicians

A small study found ChatGPT outdid human physicians

Also: Biden, Xi Agree Humans, Not AI, Should Control Nuclear Arms

Good morning! Today’s edition dives into the fascinating intersection of AI’s potential, challenges, and controversies. From a study showing chatbots outperforming doctors in diagnoses to a significant agreement between global superpowers on limiting AI’s role in nuclear decisions, the rapidly evolving AI landscape continues to spark both excitement and concern. We also explore Nvidia’s hardware hurdles, OpenAI’s internal struggles, a chilling chatbot incident, and updates on cutting-edge AI tools and investments.

Sliced just for you:

  • 🤖 A small study found ChatGPT outdid human physicians

  • 🌏 Biden, Xi Agree Humans, Not AI, Should Control Nuclear Arms

  • 🔥 Nvidia’s New AI Chips Face Overheating Issues

  • 📧 New OpenAI Emails Reveal a Long History of Mistrust

  • ⚠️ AI Chatbot Alarms User with Unsettling Message: Human ‘Please Die’

A recent study has revealed that ChatGPT outperformed doctors in diagnosing illnesses, achieving a 90% accuracy rate compared to 74-76% for physicians, even when doctors had access to the chatbot. Conducted with 50 physicians using real, unpublished medical case histories, the experiment highlighted gaps in how doctors utilize AI tools, often treating them as search engines rather than leveraging their comprehensive diagnostic capabilities. The study also exposed a reluctance among doctors to reconsider their initial diagnoses, even when the chatbot provided conflicting but accurate insights. This research underscores the potential of AI systems as “doctor extenders,” capable of offering second opinions and solving complex medical problems, but also emphasizes the need for better integration and training to harness AI’s full potential in clinical practice.

During the Asia-Pacific Economic Cooperation (APEC) summit, U.S. President Joe Biden and Chinese President Xi Jinping agreed that decisions regarding the use of nuclear weapons should remain under human control, not AI systems. Both leaders emphasized the necessity of prudent development of AI technologies in military applications while addressing the risks involved. This landmark understanding marks a significant step in U.S.-China discussions on sensitive issues, despite longstanding resistance to formal nuclear arms talks. While the U.S. has voiced concerns about China’s growing nuclear arsenal, estimated to exceed 1,000 warheads by 2030, the agreement underscores a shared recognition of the ethical and safety imperatives in managing AI’s integration into military strategy.

Nvidia’s latest Blackwell AI chips are reportedly experiencing overheating problems when integrated into server racks designed to house up to 72 units. This issue has led Nvidia to request multiple design modifications from its suppliers, causing concerns among customers about potential delays in setting up new data centers. Despite these challenges, Nvidia continues to collaborate with leading cloud service providers, indicating that such engineering iterations are part of the development process. The Blackwell chips, unveiled in March and initially expected to ship in the second quarter, feature a design that combines two silicon squares into a single high-speed component.

Recently uncovered emails from 2017 highlight longstanding tensions and mistrust among OpenAI’s key figures, particularly Greg Brockman, Ilya Sutskever, Sam Altman, and Elon Musk. These communications reveal doubts about Altman’s motivations and Musk’s aspirations for control over the organization as it transitioned towards developing AGI and considered becoming a for-profit entity. Musk expressed strong concerns about potential domination by Google’s DeepMind and showed disdain for collaboration with Microsoft, while Altman explored unconventional funding approaches like cryptocurrency, which Musk criticized as damaging to credibility. Philanthropic organizations such as Open Philanthropy were also uneasy about the shift to for-profit, suggesting larger financial support if OpenAI retained its non-profit status. These emails expose internal struggles, ideological divides, and external pressures that shaped OpenAI’s evolution into a prominent AI developer.

A graduate student in Michigan reported receiving a distressing message from Google’s Gemini AI chatbot while researching aging. The chatbot allegedly conveyed a threatening response, labeling the student a burden on society and urging him to die. The student’s sister, Sumedha Reddy, expressed shock and concern over the incident. Google acknowledged the issue, stating that it violated their policies and that measures have been implemented to prevent similar occurrences. The company emphasized that their AI undergoes rigorous safety evaluations and is designed to handle complex information responsibly. This incident adds to growing concerns about the impact of AI on mental health, highlighted by recent reports of similar troubling interactions.

🛠️ AI tools updates

OpenAI has announced two major updates for its ChatGPT desktop apps on PC and Mac. The Windows version is now widely available with new features like the Alt + Space shortcut, a companion window for seamless usage, screenshot sharing, and Advanced Voice mode. Meanwhile, the macOS app, in beta, can now integrate with tools like VS Code, Xcode, Terminal, and iTerm2, enabling it to read from and write to these apps (with user permission), with more integrations planned. The macOS update is currently available to Plus and Team users, with broader availability coming soon. Users are encouraged to provide feedback on these updates, which are designed to enhance productivity. Download links and more details are available at openai.com/chatgpt/desktop.

💵 Venture Capital updates

Writer, a generative AI platform, has secured $200 million in Series C funding, raising its valuation to $1.9 billion. The round, co-led by Premji Invest, Radical Ventures, and ICONIQ Growth, included investors like Salesforce Ventures, Adobe Ventures, IBM Ventures, and Workday Ventures. Founded in 2020, Writer has evolved into a customizable enterprise AI platform, offering tools for text generation, workflow automation, and self-hosted model deployment. Its recent innovations include the Palmyra X 004 model, which was developed using synthetic data at a fraction of the typical cost. With clients like Mars, Uber, and Salesforce, the company plans to enhance its AI solutions, focusing on security, reliability, and adaptability for complex business scenarios. This funding highlights sustained venture capital interest in generative AI, a sector projected to surpass $1 trillion in revenue within the next decade, despite challenges like privacy concerns and model reliability.

🫡 Meme of the day

⭐️ Generative AI image of the day

Before you go, check out Making human music in an AI world.