avatar

Adel Moumen

Research MSc Computer Science Student
Research Engineer
SpeechBrain Core Maintainer

Avignon Computer Science Laboratory
adel.moumen@univ-avignon.fr


About Me

I am a 22-year-old research engineer at Avignon University. I completed my Bachelor’s degree in computer science with distinction in an innovation and research-devoted curriculum and earned a two-year entrepreneurship diploma in 2022. Currently, I am participating in a master’s apprenticeship program in computer science with a specialization in AI, where I am professionally contributing to the development of SpeechBrain, an all-in-one, open-source, PyTorch-based speech processing toolkit with more than 6,600+ stars on GitHub. At SpeechBrain, I lead the efforts of the Automatic Speech Recognition community. In 2019, I started as an autodidact on deep learning and helped frame the largest French AI community.

Research Interests

My current research interests revolve around improving Deep Neural Networks. Specifically, I focus on exploring innovative methods to enhance their efficiency by rethinking their core architecture. I am particularly interested in addressing concrete challenges, such as the gradient exploding problem in Recurrent Neural Networks. My research primarily applies these concepts to the field of Automatic Speech Recognition.

SpeechBrain

I serve as a core maintainer of the SpeechBrain toolkit, responsible for its core development and overall management. My role entails actively supporting the toolkit by engaging in discussions, addressing issues, and reviewing pull requests. Additionally, I focus on expanding the toolkit’s capabilities by introducing new features for automatic speech recognition.

One of my notable contributions was integrating the openAI’s Whisper model. My ongoing work involves incorporating advanced decoding methods into SpeechBrain speech recognition systems. It includes integrating CTC frame-synchronous beam search and CTC/Att joint decoding, leveraging language models such as kenLM and TransformerLM (e.g., GPT2) for improved performance. I am also working on a new large-scale ASR model, “openSB-ASR,” trained solely on open-source data.

News

Publications

  1. Adel Moumen, Titouan Parcollet
    IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2023.