- Automatic closed captioning for Estonian live broadcastsAlumäe, Tanel; Kalda, Joonas; Bode, Külliki; Kaitsa, MartinProceedings of the 24th Nordic Conference on Computational Linguistics (NoDaLiDa)2023 / p. 492–499 https://aclanthology.org/2023.nodalida-1.49 https://aclanthology.org/2023.nodalida-1.49.pdf
- Collar-aware training for streaming speaker change detection in broadcast speechKalda, Joonas; Alumäe, TanelThe Speaker and Language Recognition Workshop (Odyssey 2022), 28 June - 1 July 2022, Beijing, China : proceedings2022 / p. 141–147 https://doi.org/10.21437/Odyssey.2022-20
- Design choices for PixIT-based speaker-attributed ASR: Team ToTaTo at the NOTSOFAR-1 challengeKalda, Joonas; Baroudi, Séverin; Lebourdais, Martin; Pages, Clement; Marxer, Ricard; Alumäe, Tanel; Bredin, HervéComputer Speech & Language2026 / art. 101824, 16 p. : ill https://doi.org/10.1016/j.csl.2025.101824
- Diarization-Guided Multi-Speaker EmbeddingsKalda, Joonas; Pages, Clement; Alumäe, Tanel; Bredin, HervéInterspeech 2025, Rotterdam, The Netherlands, 17-21 August 20252025 / p. 5233–5237 : ill https://doi.org/10.21437/Interspeech.2025-1807
- Improved Training Methods for Multi-Talker Speech Processing = Treeningmeetodid mitme rääkijaga kõne töötluseksKalda, Joonas2026 https://www.ester.ee/record=b6035751*est https://digikogu.taltech.ee/et/Item/8b828320-473c-4232-bca5-100d40f3c7ed https://doi.org/10.23658/taltech.15/2026
- PixIT: joint training of speaker diarization and speech separation from real-world multi-speaker recordingsKalda, Joonas; Pagés, Clément; Marxer, Ricard; Alumäe, Tanel; Bredin, HervéThe Speaker and Language Recognition Workshop (Odyssey 2024), 18-21 June 2024, Quebec City, Canada2024 / p. 115-122 p. : ill https://doi.org/10.21437/odyssey.2024-17
- TalTech-IRIT-LIS speaker and language diarization systems for DISPLACE 2024Kalda, Joonas; Alumäe, Tanel; Lebourdais, Martin; Bredin, Hervé; Baroudi, Séverin; Marxer, RicardInterspeech 2024, 1-5 September 2024, Kos, Greece2024 / p. 1635–1639 : ill https://doi.org/10.21437/Interspeech.2024-2462 https://www.scopus.com/sourceid/21100212301 https://www.scopus.com/pages/publications/85214842577?inward https://www.webofscience.com/wos/woscc/full-record/WOS:001331850101156
- ToTaTo system descriptions for the NOTSOFAR1 challengeKalda, Joonas; Alumäe, Tanel; Baroudi, Séverin; Lebourdais, Martin; Bredin, Hervé; Marxer, Ricard8th International Workshop on Speech Processing in Everyday Environments (CHiME 2024), 6 September 2024, Kos, Greece2024 / p. 23-25 https://doi.org/10.21437/CHiME.2024-5