Home TechnologyMicrosoft launches 3 new AI models in direct shot ...
Technology⭐ Featured

Microsoft launches 3 new AI models in direct shot at OpenAI and Google

Microsoft on Thursday launched three new foundational AI models it built entirely in-house — a state-of-the-art speech transcription system, a voice generation engine, and an upgraded image creator — marking the most concrete evidence yet that the $3 trillion software giant intends to compete directly with OpenAI , Google , and other frontier labs on model development, not just distribution. The trio of models — MAI-Transcribe-1 , MAI-Voice-1 , and MAI-Image-2 — are available immediately through Microsoft Foundry and a new MAI Playground . They span three of the most commercially valuable modalities in enterprise AI: converting speech to text, generating realistic human voice, and creating images. Together, they represent the opening salvo from Microsoft's superintelligence team , which Suleyman formed just six months ago to pursue what he calls " AI self-sufficiency ." "I'm very excited that we've now got the first models out, which are the very best in the world for transcription," Suleyman told VentureBeat in an interview ahead of the public announcement. "Not only that, we're able to deliver the model with half the GPUs of the state-of-the-art competition." The announcement lands at a precarious moment for Microsoft. The company's stock just closed its worst quarter since the 2008 financial crisis , as investors increasingly demand proof that hundreds of billions of dollars in AI infrastructure spending will translate into revenue. These models — priced aggressively and positioned to reduce Microsoft's own cost of goods sold — are Suleyman's first answer to that pressure. Microsoft's new

5 April 2026 at 05:53 pm
1 views
Microsoft launches 3 new AI models in direct shot at OpenAI and Google

Microsoft has made a bold move in the AI race by launching three new foundational models that directly challenge OpenAI and Google. The company, worth $3 trillion, has developed these models in-house, marking a significant shift in its strategy from focusing on distribution to model development. The trio of models—MAI-Transcribe-1, MAI-Voice-1, and MAI-Image-2—are available immediately through Microsoft Foundry and a new MAI Playground. These models cover three of the most commercially valuable modalities in enterprise AI: speech-to-text conversion, voice generation, and image creation.

The launch of these models represents the opening salvo from Microsoft's superintelligence team, led by Yusuf Hamied Suleyman, who formed the team just six months ago. Suleyman aims to achieve "AI self-sufficiency," a goal that has been a priority for Microsoft in recent months. In an interview with VentureBeat ahead of the announcement, Suleyman expressed excitement about the new models, particularly MAI-Transcribe-1, which he claimed is the best in the world for transcription. He also highlighted that Microsoft's models can be delivered with half the GPUs required by the state-of-the-art competition.

This launch comes at a critical time for Microsoft. The company's stock has closed its worst quarter since the 2008 financial crisis, with investors increasingly demanding proof that the hundreds of billions of dollars invested in AI infrastructure will yield returns. The new models, priced aggressively and designed to reduce Microsoft's cost of goods sold, are Suleyman's first response to these concerns.

MAI-Transcribe-1, the standout model, claims best-in-class accuracy across 25 languages. The speech-to-text model achieves the lowest average Word Error Rate on the FLEURS benchmark, the industry-standard multilingual test, for the top 25 languages by Microsoft product usage. This benchmark is widely regarded as a critical measure of a model's performance, and Microsoft's achievement here is a significant milestone.

MAI-Voice-1, the voice generation engine, is another notable addition. Microsoft has not disclosed specific details about this model, but the company has emphasized its commitment to delivering cutting-edge AI solutions. The voice generation engine is expected to be a powerful tool for businesses looking to integrate natural language processing into their applications.

MAI-Image-2, the upgraded image creator, builds on Microsoft's existing capabilities in the field. The company has not released specifics about this model, but it is clear that Microsoft is investing heavily in AI to stay competitive in the rapidly evolving market.

The launch of these models is a clear indication that Microsoft is serious about competing with OpenAI and Google in the AI space. The company's strategy of building models in-house and offering them through its existing platforms, such as Microsoft Foundry and the new MAI Playground, is a smart move. By positioning these models as cost-effective alternatives to competitors', Microsoft can attract businesses looking for reliable and affordable AI solutions.

The timing of this launch is also significant. Microsoft's stock has been under pressure, with investors demanding tangible results from the company's massive AI investments. The new models, with their aggressive pricing and potential to reduce Microsoft's own costs, are a direct answer to these concerns. They provide a tangible product that can generate revenue and demonstrate the value of Microsoft's AI strategy.

In conclusion, Microsoft's launch of three new AI models is a significant development in the AI race. The company's in-house development of these models, combined with their immediate availability through established platforms, signals a clear intent to compete with OpenAI and Google. The models' performance, particularly in speech transcription, is a testament to Microsoft's capabilities in AI. As Microsoft continues to invest heavily in AI, the success of these models will be crucial in reassuring investors and establishing Microsoft as a leading player in the AI industry.

Source: VentureBeat
📰 Related News
Ekaya Banaras Founder Palak Shah’s ₹40 Lakh Billboard Mistake Became a Masterclass in Startup Marketing
Ekaya Banaras Founder Palak Shah’s ₹40 Lakh Billboard Mistake Became a Masterclass in Startup Marketing
Ekaya Banaras founder Palak Shah recently opened up about one of the most expensive mistakes she made while building her luxury textile brand. During the early years of the company, Shah rented a premium billboard near Delhi’s DLF Emporio to increase brand visibility. However, after forgetting to cancel the campaign, the hoarding reportedly continued running for months — resulting in losses of nearly ₹40 lakh. The incident has now become a viral example of how small operational oversights can turn into costly business lessons for startups and entrepreneurs.
28 May
Betting On AI: Jensen Huang And NVIDIA’s Rise To The Top
Betting On AI: Jensen Huang And NVIDIA’s Rise To The Top
Before AI was inevitable, it was a gamble—and Jensen Huang went all in.
14 Apr
Red Hat OpenShift sandboxed containers 1.12 and Red Hat build of Trustee 1.1 bring confidential computing to bare metal and AI workloads
Red Hat OpenShift sandboxed containers 1.12 and Red Hat build of Trustee 1.1 bring confidential computing to bare metal and AI workloads
Red Hat is excited to announce the release of Red Hat OpenShift sandboxed containers 1.12 and Red Hat build of Trustee 1.1, marking a major leap forward in our confidential computing journey. These releases graduate confidential containers on bare metal from …
14 Apr
Large AI firms hoovering maximum funding, not enough for smaller startups: Y Combinator’s Ankit Gupta
Large AI firms hoovering maximum funding, not enough for smaller startups: Y Combinator’s Ankit Gupta
YC Startup School: India’s talent pool across colleges and universities are key for building next-gen startups, which is what YC is looking to tap into. It wants to target entrepreneurs building for global markets, focussed on fintech, consumer, B2B, and ecom…
14 Apr
TSMC likely to book fourth straight quarter of record profit on insatiable AI demand
TSMC likely to book fourth straight quarter of record profit on insatiable AI demand
TSMC-RESULTS/ (PREVIEW, PIX):PREVIEW-TSMC likely to book fourth straight quarter of record profit on insatiable AI demand
14 Apr
TSMC likely to book fourth straight quarter of record profit on insatiable AI demand
TSMC likely to book fourth straight quarter of record profit on insatiable AI demand
Any profit result ‌above T$505.7 billion would mark the company's highest-ever quarterly net income ​and its ninth consecutive quarter of profit growth
14 Apr
TSMC likely to book fourth straight quarter of record profit on insatiable AI demand
TSMC likely to book fourth straight quarter of record profit on insatiable AI demand
On Thursday, ​TSMC is expected to report a net profit of $17.1 billion for the quarter, according to an LSEG SmartEstimate compiled from 19 analysts. The war in the Middle East threatens to disrupt the supply of production materials for semiconductors such as…
14 Apr
If we can’t kick the habit, how do we manage AI’s energy needs?
If we can’t kick the habit, how do we manage AI’s energy needs?
One can only hope that OpenAI’s Sam Altman was joking when he sought to justify the immense energy consumption of artificial intelligence
14 Apr
What caused Nvidia Blackwell GPU prices to spike? #tech
What caused Nvidia Blackwell GPU prices to spike? #tech
Blackwell GPU hourly “rent” surges on agentic AI demand A compute pricing index tracking hourly costs for Nvidia Blackwell GPUs shows a sharp climb: hourly rental hit $4.08 , up 48% from $2.75 just two months earlier. The reported driver is rising demand tied…
14 Apr
Anthropic Releases Claude Mythos Preview with Cybersecurity Capabilities but Withholds Public Access
Anthropic Releases Claude Mythos Preview with Cybersecurity Capabilities but Withholds Public Access
Anthropic has introduced Claude Mythos Preview, its most advanced AI model, improving significantly in reasoning, coding, and cybersecurity. Unlike previous releases, it will not be publicly available. Access is limited to a consortium of tech companies throu…
14 Apr