• AI KATANA
  • Posts
  • OpenAI rolls out Advanced Voice Mode with more voices and a new look

OpenAI rolls out Advanced Voice Mode with more voices and a new look

Also: Singapore Startup Cosmos Innovation Harnesses AI to Supercharge Solar Efficiency

Good morning! In today’s roundup, we explore OpenAI’s latest feature rollout, providing a more interactive voice experience with their Advanced Voice Mode, plus a fresh design to keep the platform dynamic. As AI continues to evolve at breakneck speed, OpenAI’s CEO Sam Altman is making bold predictions about the advent of AI superintelligence in the next decade, offering a glimpse into a future where AI reshapes everything from healthcare to education. On the educational front, Princeton researchers are urging us to stay grounded in reality amidst the increasing hype around generative AI, advocating for a more informed, critical approach to understanding the technology. Meanwhile, exciting innovations in solar energy and key updates on the competitive AI landscape remind us just how diverse the field is, with companies like Microsoft and Singapore’s Cosmos Innovation making waves. With new tools, venture capital milestones, and AI-driven startups emerging every day, there’s no shortage of insights to keep you on top of the game.

Sliced just for you:

  • 🔈 OpenAI rolls out Advanced Voice Mode with more voices and a new look

  • 🤖 OpenAI CEO: We may have AI superintelligence in “a few thousand days”

  • 🎓 OpenAI Academy to help train developers, offer free credits

  • 📚 Generative AI Hype Feels Inescapable. Tackle It Head On With Education

  • 🌞 Singapore Startup Cosmos Innovation Harnesses AI to Supercharge Solar Efficiency

  • 📉 Microsoft's AI Lead Has Shrunk, Analyst Says

OpenAI has launched its Advanced Voice Mode (AVM) for ChatGPT, introducing enhanced features that make the chatbot more conversational and interactive. Initially available to Plus and Teams subscribers, the rollout will extend to Enterprise and Edu users in the coming weeks. The update includes a refreshed design featuring a blue animated sphere and five new voices—Arbor, Maple, Sol, Spruce, and Vale—designed to provide a more natural and dynamic user experience. Notably absent is the Sky voice, which was removed following legal concerns from actress Scarlett Johansson. While video and screen-sharing functionalities demonstrated earlier remain unreleased, AVM has improved voice recognition for accents and offers smoother, faster interactions. Additionally, users can now customize responses through ChatGPT’s Custom Instructions and Memory features. Despite these advancements, AVM is not yet available in the EU, U.K., and several other regions.

OpenAI CEO Sam Altman has outlined his vision of a future where AI reaches superintelligence in as little as a few thousand days, potentially within the next decade. In a blog post titled "The Intelligence Age," Altman emphasizes that advancements in deep learning have been pivotal in this progress, leading to a new era of technological and societal transformation. He predicts AI will enhance global prosperity, drive breakthroughs in sectors like education and healthcare, and introduce personal AI teams capable of assisting in a vast range of tasks. While he acknowledges potential challenges such as labor market disruptions, Altman remains optimistic about the overall positive impact on humanity. However, he warns that inadequate infrastructure could lead to AI being a resource controlled by the wealthy, further emphasizing the need for abundant and affordable computing power. Although Altman faces criticism from skeptics who argue his predictions are overly optimistic, his stance reflects a strong belief in AI’s transformative potential and the importance of navigating its risks wisely.

OpenAI has announced the launch of OpenAI Academy, a global initiative aimed at training developers in AI and generative AI, with a focus on empowering developers, particularly in developing countries and emerging tech sectors. The academy will provide training, technical guidance, and free API credits, with an initial distribution of $1 million in API credits. This program also includes professional translations of the Massive Multitask Language Understanding (MMLU) benchmark in 14 languages to help developers better understand large language models. OpenAI aims to foster a collaborative network of developers while addressing local needs through tailored AI applications, and is looking to partner with philanthropists to establish incubators for further growth.

In the midst of rising generative AI hype, Princeton researchers Arvind Narayanan and Sayash Kapoor advocate for a critical, education-driven approach to combat misinformation. In their book AI Snake Oil, they argue that companies, researchers, and journalists contribute to unrealistic expectations about AI’s capabilities, often leading to biased outcomes and public confusion. The authors stress the importance of understanding AI's true limitations, emphasizing that education is key to demystifying the technology and reducing undue trust in AI systems. By fostering a more informed understanding of AI, especially in terms of machine learning and neural networks, the public can better navigate the challenges posed by generative tools and resist the allure of sensationalized narratives.

Singapore-based startup Cosmos Innovation is using AI to break through efficiency barriers in solar energy. Founded by AI scientists Dr. Vijay Chandrasekhar and Dr. Joel Li, the company focuses on improving the performance of solar cells through the integration of AI-driven optimization processes. Leveraging perovskite, a material with exceptional semiconductor properties, Cosmos Innovation aims to surpass the efficiency limitations of traditional silicon solar cells, which dominate the market but are nearing their practical efficiency limits. Their proprietary AI platform, Mobius, accelerates recipe development for solar cell production, allowing for optimization in materials and processes. By collaborating with Singapore's Agency for Science, Technology and Research (ASTAR) and tapping into expert talent through the T-Up scheme, Cosmos Innovation has significantly reduced development time, achieving optimization results 10 times faster than traditional methods. The company is committed to translating research into practical, mass-produced solar technology, contributing to the global push for low-cost, high-efficiency renewable energy solutions.

Microsoft's stock was downgraded to neutral from buy by D.A. Davidson analyst Gil Luria, who cited increasing competition in the AI market. Despite Microsoft's early lead in generative AI, particularly through its partnership with OpenAI, rivals like Google Cloud and Amazon Web Services have advanced with their own custom AI processors, reducing Microsoft's competitive edge. This development leaves Microsoft reliant on Nvidia for AI processors, potentially impacting shareholder value. While Luria maintained a price target of $475, Microsoft's stock saw a slight dip to $433.51. Though this downgrade is notable, most analysts remain bullish on the stock, with 56 out of 59 still holding buy ratings.

🛠️ AI tools updates

Duolingo has introduced two major new features—Adventures and AI video calls—to enhance language learning. Adventures immerse users in gamified storylines that apply real-world language scenarios, while AI video calls offer conversational practice with characters in a low-pressure, personalized environment. These features aim to make language learning more engaging and effective, with the AI video calling feature available to subscribers on iOS.

Alibaba has introduced a new AI video generator as part of its Tongyi Wanxiang portfolio, joining an increasingly crowded field of text-to-video tools. Announced at the Alibaba Cloud Apsara Conference, this tool generates high-quality videos from text prompts in both English and Chinese. It offers a range of video styles, from realistic live-action to various animation formats, powered by advanced diffusion transformer architecture. This is part of Alibaba's broader push into AI, with over 100 new large language models being introduced. The tool is designed for use in marketing, entertainment, and even video games, as Alibaba expands its third-party partnerships.

💵 Venture Capital updates

Nurix AI, a startup launched in 2024, secured $27.5 million in Series A funding led by Accel and General Catalyst. The company focuses on creating AI solutions that integrate generative AI with human oversight, initially targeting the business process outsourcing (BPO) industry. Its custom AI agents, designed to replicate human-like reasoning, aim to optimize customer service at scale. The funds will fuel Nurix's R&D and technology expansion in Asia and North America.

🫡 Meme of the day

⭐️ Generative AI image of the day