The future of audio AI: adoption use cases powering the Middle East

MBZUAI · Notable

Summary

ElevenLabs, a voice AI research and product company, presented at MBZUAI's Incubation and Entrepreneurship Center (IEC) on the adoption of audio AI in the Middle East. Hussein Makki, general manager for the Middle East at ElevenLabs, highlighted the potential of voice-native AI across sectors like telecommunications, banking, and education. ElevenLabs focuses on making content accessible and engaging across languages and voices through its text-to-speech models. Why it matters: This signals growing interest and investment in voice AI applications within the region, potentially transforming customer service and content accessibility in Arabic.

Keywords

ElevenLabs · MBZUAI · voice AI · text-to-speech · audio AI

Read original article →

Get the weekly digest

Top AI stories from the GCC region, every week.

Research talk on Privacy and Security Issues in Speech

MBZUAI · Invalid Date

A research talk was given on privacy and security issues in speech processing, highlighting the unique privacy challenges due to the biometric information embedded in speech. The talk covered the legal landscape, proposed solutions like cryptographic and hashing-based methods, and adversarial processing techniques. Dr. Bhiksha Raj from Carnegie Mellon University, an expert in speech and audio processing, delivered the talk. Why it matters: As speech-based interfaces become more prevalent in the Middle East, understanding and addressing the associated privacy risks is crucial for ethical AI development and deployment.

N-Shot Benchmarking of Whisper on Diverse Arabic Speech Recognition

arXiv · Jun 5

This paper benchmarks the performance of OpenAI's Whisper model on diverse Arabic speech recognition tasks, using publicly available data and novel dialect evaluation sets. The study explores zero-shot, few-shot, and full finetuning scenarios. Results indicate that while Whisper outperforms XLS-R models in zero-shot settings on standard datasets, its performance drops significantly when applied to unseen Arabic dialects.

The future of audio AI: adoption use cases powering the Middle East

Summary

Keywords

Related

Research talk on Privacy and Security Issues in Speech

N-Shot Benchmarking of Whisper on Diverse Arabic Speech Recognition