newsio aggregates and links to original sources. We do not own the original images or content. If you believe content infringes on intellectual property rights, contact us — it will be removed at first notice.

genai / news / / South China Morning Post

Alibaba's Fun-Realtime-ASR model ranked first with a word error rate of 1.8 per cent.

Alibaba's AI voice model outperformed OpenAI and xAI on the Speech Arena global benchmark.

KEY POINTS
A new artificial intelligence voice model from Alibaba Group Holding has beaten out Western rivals OpenAI and xAI on a major global benchmark, underscoring its technical edge in capturing complex Chinese dialects and accents. Alibaba owns the South China Morning Post. The Speech Arena benchmark is operated by Artificial Analysis, a San Francisco-based AI evaluation organisation backed by investors including former GitHub chief executive Nat Friedman and Google Brain founder Andrew Ng. The platform ranks models through blind user evaluations of generated speech clips using an Elo-based system. Speech Arena users test how well models can perform across three core capabilities – converting speech into text, enabling end-to-end voice understanding and conversational interaction, and transforming text into natural-sounding speech. In a separate Artificial-Analysis Word Error Rate index, Alibaba’s Fun-Realtime-ASR model ranked first with a word error rate of 1.8 per cent, meaning fewer than two words out of every 100 were transcribed incorrectly.
COMPANIES
Read the full story on South China Morning Post →
Share X LinkedIn

Summarized by Newsio from South China Morning Post. How we summarize →