The effectiveness of morphology-aware segmentation in low-resource neural machine translation J Sälevä, C Lignos arXiv preprint arXiv:2103.11189, 2021 | 16 | 2021 |
Toward more meaningful resources for lower-resourced languages C Lignos, N Holley, C Palen-Michel, J Sälevä arXiv preprint arXiv:2202.12288, 2022 | 15 | 2022 |
What changes when you randomly choose BPE merge operations? Not much J Sälevä, C Lignos arXiv preprint arXiv:2305.03029, 2023 | 5 | 2023 |
ParaNames: A massively multilingual entity name corpus J Sälevä, C Lignos arXiv preprint arXiv:2202.14035, 2022 | 4 | 2022 |
Evaluating morphological compositional generalization in large language models M Ismayilzada, D Circi, J Sälevä, H Sirin, A Köksal, B Dhingra, A Bosselut, ... arXiv preprint arXiv:2410.12656, 2024 | 2 | 2024 |
ParaNames 1.0: Creating an entity name corpus for 400+ languages using Wikidata J Sälevä, C Lignos arXiv preprint arXiv:2405.09496, 2024 | 2 | 2024 |
Findings of the CoCo4MT 2023 Shared Task on Corpus Construction for Machine Translation A Ganesh, M Carpuat, W Chen, K Kann, C Lignos, JE Ortega, J Sälevä, ... Proceedings of the Second Workshop on Corpus Generation and Corpus …, 2023 | 1 | 2023 |
Mining Wikidata for Name Resources for African Languages J Sälevä, C Lignos AfricaNLP Workshop, EACL 2021, 2021 | 1 | 2021 |
A Multi-Orthography Parallel Corpus of Yiddish Nouns J Sälevä Proceedings of the Twelfth Language Resources and Evaluation Conference, 948-952, 2020 | 1 | 2020 |
OpenNER 1.0: Standardized Open-Access Named Entity Recognition Datasets in 50+ Languages C Palen-Michel, M Pickering, M Kruse, J Sälevä, C Lignos arXiv preprint arXiv:2412.09587, 2024 | | 2024 |
Proceedings of the Fourth Workshop on Multilingual Representation Learning (MRL 2024) J Sälevä, A Owodunni Proceedings of the Fourth Workshop on Multilingual Representation Learning …, 2024 | | 2024 |
Language Model Priors and Data Augmentation Strategies for Low-resource Machine Translation: A Case Study Using Finnish to Northern Sámi J Sälevä, C Lignos Findings of the Association for Computational Linguistics ACL 2024, 12949-12956, 2024 | | 2024 |
Proceedings of the First Workshop on Natural Language Processing for Turkic Languages (SIGTURK 2024) D Ataman, MO Derin, S Ivanova, A Köksal, J Sälevä, D Zeyrek Proceedings of the First Workshop on Natural Language Processing for Turkic …, 2024 | | 2024 |
Brandeis at VarDial 2024 DSL-ML Shared Task: Multilingual Models, Simple Baselines and Data Augmentation J Sälevä, C Palen-Michel Proceedings of the Eleventh Workshop on NLP for Similar Languages, Varieties …, 2024 | | 2024 |
Proceedings of the Workshop on Dataset Creation for Lower-Resourced Languages within the 13th Language Resources and Evaluation Conference J Sälevä, C Lignos Proceedings of the Workshop on Dataset Creation for Lower-Resourced …, 2022 | | 2022 |