Massive generation of TTS for Deepfake detection

Submitted by Damien LOLIVE on
Position / Category
Research engineer
Location
Lannion
Contract type
Fixed-term contract (CDD)
Research team
Context
The Expression team at IRISA is hiring a computer science engineer on a full-time, 12-month contract (which may be extended). The Expression research team is at the heart of the AI revolution, as it studies and generates human language in different modalities, namely text, speech and sign language. In particular, the team participates in a project targeting the development and evaluation of deepfake speech detection systems. To this end, we have to implement a large variety of speech synthesis systems, including voice cloning and voice conversion systems. The engineer will work on the massive generation of synthesized speech in the context of deepfake detection.
Mission

Development of a wide range of speech synthesis systems, including voice cloning and voice conversion systems:
• Prepare data for different languages;

• Set up a global framework for text-to-speech (TTS) synthesis;

• Implement different TTS systems for different languages;

• Contribute to the development of deepfake detection systems.

Profile / Skills
PhD in Computer Science, Master in Machine Learning or Master in Speech and Language Processing

Required skills: software engineering (C++, Python); machine learning methods and tools (TensorFlow, PyTorch, Keras); automatic speech and language processing; CI/CD.
Contract duration (in months)
12
Working time
100%
Workplace
IRISA / Enssat, Lannion
Required degree
PhD in Computer Science, Master in Machine Learning or Master in Speech and Language Processing
Gross monthly salary
Depending on experience
Expected start date
As soon as possible (May 2024)
Application deadline
How to apply
Send a CV and a cover letter to damien.lolive@irisa.fr, arnaud.delhay@irisa.fr, vincent.barreaud@irisa.fr