Papers
2025
- T. Turatali, A. Alekseev, G. Jumalieva, G. Kabaeva, and S. Nikolenko. Human-Annotated NER Dataset for the Kyrgyz Language. Accepted to TurkLang-2025.
- T. Turatali, A. Turdubaeva, I. Zhenishbekov, Zh. Suranbaev, A. Alekseev, and R. Izmailov. Bridging the Gap in Less-Resourced languages: Building a Benchmark for Kyrgyz Language Models. Accepted to TurkLang-2025.
- A. Alekseev, A. Tillabaeva, G. Dzh. Kabaeva, and S. Nikolenko. Syntactic Transfer to Kyrgyz Using the Treebank Translation Method (in print). Journal of Mathematical Sciences, 2025. arXiv
- A. Drozin and A. Alekseev. Label Transfer Across Languages for Information Extraction: Yet Another Alignment-Based Approach (in print). Системы управления и цифровые технологии, 2025
2024
- A. Alekseev and T. Turatali. KyrgyzNLP: Challenges, Progress, and Future (in print). In Proceedings of the 12th International Conference on Analysis of Images, Social Networks and Texts. Springer, Cham, 2024. arXiv
2023
- A. Alekseev and G. Kabaeva. HJ-Ky-0.1: an Evaluation Dataset for Kyrgyz Word Embeddings. Herald of KSTU, 68(4), 2023. arXiv
- A. Alekseev, S. Nikolenko, and G. Kabaeva. Benchmarking Multilabel Topic Classification in the Kyrgyz Language. In International Conference on Analysis of Images, Social Networks and Texts, pages 21–35. Springer Nature Switzerland Cham, 2023. arXiv
Models
- Trained a better-performing fastText embeddings; published on Zenodo.
- apertium2ud: conversion of apertium-kir tags to the UD project tags
Other Materials
…supporting the development of Kyrgyz NLP
- KyrgyzNLP bibliometrics: all papers on KyrgyzNLP I am aware of + the co-authorship graph
- Awesome Kyrgyz NLP on github: curated list of Kyrgyz language processing software, relevant datasets, etc.
- Apertium’s List of Symbols: PDF poster, A0 format; tables copied from the Apertium Project Wiki
- Apertium tags description: a PDF table; Apertium tags glossary translated into Russian where possible.