Friday 17 October 2025 - 11am | ILCC

Speaker: Hanan Aldarmaki, Mohamed Bin Zayed University of Artificial Intelligence (MBZUAI)

Title: Arabic-Centric Speech Processing: Diglossia, Code-Switching, and Other Quirks

Abstract: Research on speech and language processing is increasingly moving towards large-scale multilingual and multimodal models. While these developments have led to impressive general-purpose systems, they often overlook the intricacies of specific target languages, each with its own linguistic quirks that can challenge model assumptions and design. Arabic exemplifies this challenge: it is characterized by diglossia, dialectal variation, frequent code-switching, and spelling variation.

This talk summarizes recent findings on Automatic Speech Recognition (ASR) from empirical studies of Arabic-centric pre-training, multi-dialectal fine-tuning, and multilingual pre-training, analyzing their impact on ASR performance across various dialects and conditions. The discussion highlights both the limitations of current models and the value of targeted strategies for robust speech processing systems.

This article was published on Wednesday 23 July 2025