×

注意!页面内容来自https://www.geekwire.com/2026/microsoft-releases-new-ai-models-to-further-expand-beyond-openai/,本站不储存任何内容,为了更好的阅读体验进行在线解析,若有广告出现,请及时反馈。若您觉得侵犯了您的利益,请通知我们进行删除,然后访问 原网页

Mustafa SuleymanCEO of Microsoft AI. (GeekWire File Photo / Kevin Lisota)

Microsoft is expanding its roster of in-house AI modelsreleasing a new speech-to-text system and making two existing models broadly available to developers for the first time.

The moves by Microsoft AI (MAI) are part of a broader effort by the company to expand its proprietary AI capabilities beyond its partnership with OpenAIgiving Microsoft more control over its own destiny in the competition against GoogleAmazonand others.

Microsoft announced MAI-Transcribe-1 on Thursdaya speech-to-text model that it says is the most accurate currently available. The company also released its existing voice and image generation modelsknown as MAI-Voice-1 and MAI-Image-2for broad commercial use.

It’s Microsoft’s first major model release since a March reorganizationannounced by CEO Satya Nadellain which Microsoft AI CEO Mustafa Suleyman shifted away from day-to-day Copilot oversight to focus on frontier model development and superintelligence.

Suleyman told The Verge that the transcription model runs at “half the GPU cost of the other state-of-the-art models.” He told VentureBeat that the model was built by a team of just 10 peopleand that Microsoft plans to eventually build a frontier large language model to be “completely independent” if needed.

Microsoft also recently hired former Allen Institute for CEO Ali Farhadi and other top AI researchers from the Seattle-based institute to further bolster Suleyman’s teamas GeekWire reported last week.

MAI-Transcribe-1 is designed to handle noisy real-world conditions such as call centers and conference roomsand Microsoft says it is testing integrations with Copilot and Teams. Microsoft says it offers the best price-performance of any large cloud providercompeting directly with OpenAI’s Whisper and Google’s Gemini on the FLEURS benchmark.

In a blog postSuleyman called the model “not just the most accurate but also lightning fast.”

MAI-Voice-1 generates natural-sounding speech and now lets developers create custom voices from short snippets of sample audio. MAI-Image-2 ranks in the top three on the Arena.ai image generation leaderboard and is rolling out in Bing and PowerPoint.

All three are available on the Microsoft Foundry developer AI platform and MAI Playground.

Job Listings on GeekWork

Find more jobs on GeekWork. Employerspost a job here.