Ivan Yakovlev

ivan [at] iyakovlev.dev · Barcelona

I am a Senior Research Engineer at Palabra AI, where I work on multilingual text-to-speech, streaming ASR, and speaker recognition. My research focuses on building production-grade speech models that scale across languages with limited supervision.

Prior to joining Palabra, I was a Senior Machine Learning Engineer at ID R&D Inc. from 2019 to 2025, working on speaker verification. There I designed the ReDimNet architecture and contributed to our entries in the VoxCeleb and SASV challenges, which received top placements at Interspeech workshops.

I received my B.Sc. in Quantum Mechanics from Saint Petersburg State University in 2016.

news

Mar 2026 Submitted ReDimNet2 to Interspeech 2026 — second-generation ReDimNet with time-pooled dimension reshaping (arXiv)
Mar 2025 Joined Palabra AI as Senior Research Engineer
Jul 2024 Reshape Dimensions Network for Speaker Recognition accepted to Interspeech 2024 (arXiv)
Aug 2023 1st place (open track) at the VoxCeleb Speaker Recognition Challenge 2023; oral presentation at Interspeech 2023

selected publications

arXiv2026
Ivan Yakovlev, Anton Okhotnikov
Submitted to Interspeech 2026
Interspeech2024
Ivan Yakovlev, Rostislav Makarov, Andrei Balykin, Pavel Malov, Anton Okhotnikov, Nikita Torgashov
Interspeech 2024
Interspeech2023
Ivan Yakovlev, Anton Okhotnikov, Nikita Torgashov, Rostislav Makarov, Yuri Voevodin, Konstantin Simonchik
Interspeech 2023
Interspeech2022
Alexander Alenin, Nikita Torgashov, Anton Okhotnikov, Rostislav Makarov, Ivan Yakovlev
Interspeech 2022

See all publications →