№1, 2026

LINGUISTIC SOVEREIGNTY IN VIRTUAL SPACE: LANGUAGE ECOSYSTEM, PROBLEMS AND SOLUTIONS
Afruz Gurbanova, Mehriban Baghirova

Against the backdrop of globalization and digital transformation in the modern era, linguistic sovereignty has become a strategic cybersecurity issue, extending beyond the traditional paradigm of state security. As languages migrate from physical space to virtual ecosystems, national languages face the hegemonic influence of global tech giants, and, especially in the case of dominant languages, artificial intelligence systems. The lack of high-quality text corpora and natural language processing technologies for national languages in virtual environments leads to linguistic asymmetries in artificial intelligence systems, thereby limiting the functionality of national languages on digital platforms. Protecting linguistic sovereignty in the digital environment is crucial to ensuring the functional existence and sociocultural sustainability of national languages. This article analyzes the functional independence and sustainability of languages in the context of the digital ecosystem, identifying the key aspects of linguistic sovereignty. Conceptual approaches to ensuring and developing linguistic sovereignty in the digital environment are analyzed, and a multi-level model of linguistic sovereignty in this context is proposed. The main threats to linguistic sovereignty in the virtual space are identified, and the need to develop an ecosystem of national languages to address them is emphasized. The possibilities and key components of developing a language ecosystem on a digital government platform are identified. Recommendations are provided for expanding the functionality of national languages in the digital space and ensuring digital sovereignty. The development of a sustainable ecosystem of the Azerbaijani language on a digital government platform is proposed as a promising direction (pp.103-116).

Keywords:Linguistic sovereignty, Digital ecosystem, Artificial intelligence, National language technologies, Content production, Digital language policy, Technological independence
References
  • Alguliyev, R. M., & Gurbanova, A. M. (2018). The conceptual foundations of national terminological information system. IJ of Education and Management Engineering, 8(4), 19-30. DOI: 10.5815/ijeme.2018.04.03
  • Alguliyev, R.M. et al. (2025). Conceptual model of azerbaijani language ecosystem in the electronic government. Comparative Cultural Studies: European and Latin American Perspectives, (20), 1-24. https://doi.org/10.46661/ccselap-11174
  • Alguliyev, R.M., Mahmudov, R.S. (2018). Language industry: opportunities, perspectives and problems. Problems of Information society, 1, 3-26. (in Azerbaijani)
  • https://doi.org/10.25045/jpis.v09.i1.01   
  • Aliguliyev, R.M. et al. (2021). From cosmopolitan e-government to national e-government: prospects for creating Azerbaijani-language services in a virtual environment. Problems of Information society, №2, 3–25. https://doi.org/10.25045/JPIS.V12.I2.01 
  • Anastasopoulos, A., Neubig, G. (2020). Should All Cross-Lingual Embeddings Be Aligned? Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. 4514–4525. DOI: 10.18653/v1/2020.acl-main.766
  • Barrinha, A., & Christou, G. (2022). Speaking sovereignty: the EU in the cyber domain. European Security, 31(3), 356-376. https://doi.org/10.1080/09662839.2022.2102895
  • Bella, G. et al., (2023). Towards bridging the digital language divide. arXiv preprint arXiv:2307.13405.  https://doi.org/10.48550/arXiv.2307.13405
  • Bird, S. (2020). Decolonising speech and language technology. Proceedings of the 28th International Conference on Computational Linguistics, 3504–3519 DOI: 10.18653/v1/2020.coling-main.313
  • Bradshaw, S., DeNardis, L. (2018). The politicization of the Internet’s Domain Name System: Implications for Internet security, universality, and freedom. New media & society, 20(1), 332-350. https://doi.org/10.1177/14614448166629
  • Chen, J. et al., (2024). When large language models meet personalization: Perspectives of challenges and opportunities. World Wide Web, 27(4), 42.https://link.springer.com/article/10.1007/s11280-024-01276-1
  • Council of Europe (2001). Council for Cultural Co-operation. Education Committee. Modern Languages Division. Common European framework of reference for languages: Learning, teaching, assessment. – Cambridge University Press, 278 p. https://rm.coe.int/common-european-framework-of-reference-for-languages-learning-teaching/16809ea0d4
  • Eckes, C. (2015). The reflexive relationship between internal and external sovereignty. Irish J. European L. 18, 33. https://www.researchgate.net/publication/272244952_The_Reflexive_Relationship_between_Internal_and_External_Sovereignty
  • Floridi, L. (2020). The fight for digital sovereignty: What it is, and why it matters, especially for the EU. Philosophy & technology, 33(3), 369-378. https://doi.org/10.1007/s13347-020-00423-6
  • Galla, C. K. (2016). Indigenous language revitalization, promotion, and education: Function of digital technology. Computer Assisted Language Learning, 29(7), 1137-1151. http://dx.doi.org/10.1080/09588221.2016.1166137
  • Gazzola, M. (2016). Multilingual communication for whom? Language policy and fairness in the European Union. European Union Politics, 17(4), 546-569.
  • Gurbanova A.M. (2015). On terminological threats to the Azerbaijani language in the context of globalization. Problems of Information society, №2, 87-95.
  • Gurbanova, A. (2022, November). Problems and Prospects for Minority Languages in the Age of Industry 4.0. In the International Symposium on Computer Science, Digital Economy and Intelligent Systems (pp. 722-734). Cham: Springer Nature Switzerland. https://doi.org/10.1007/978-3-031-24475-9_59  
  • Haddow, B. et al., (2022). Survey of low-resource machine translation. Computational Linguistics, 48(3), 673-732. https://aclanthology.org/2022.cl-3.6.pdf
  • Hayman And, P. A., Williams, J. (2006). Westphalian sovereignty: Rights, intervention, meaning and context. Global society, 20(4), 521-541.
  • Hornberger, N. H. et al., (2018). Ethnography of language planning and policy. Language Teaching, 51(2), 152–186. https://doi.org/10.1017/S0261444817000496
  • Hung, H. T. (2025). Exploring China’s cyber sovereignty concept and artificial intelligence governance model: a machine learning approach. Journal of Computational Social Science, 8(1), 24.
  • https://doi.org/10.1007/s42001-024-00346-8 
  • Joshi, P. et al., (2020). The state and fate of linguistic diversity and inclusion in the NLP world. arXiv preprint arXiv:2004.09095. https://doi.org/10.48550/arXiv.2004.09095
  • Kahiraba, K. M. K. (2025). The Impact of Globalization on Linguistic and Cultural Sovereignty in Mali. Interdisciplinary Language and Culture Studies. 3(1), 1-14. https://doi.org/10.5281/zenodo.15756840
  • Kelly-Holmes, H. (2019). Multilingualism and technology: A review of developments in digital communication from monolingualism to idiolingualism. Annual Review of Applied Linguistics. 39, 24-39. https://doi.org/10.1017/S0267190519000102 
  • Kornai, A. (2013). Digital language death. PloS one, 8(10), e77056.
  • Kraus, Peter A. (2018). “Between minority protection and linguistic sovereignty”. Revista de Llengua i Dret, Journal of Language and Law, Issue 69, 6-17, DOI:10.2436/rld.i69.2018.3122
  • Moshagen, S. N. et al., (2024). Indigenous language technology in the age of machine learning. Acta Borealia. 41(2), 102-116. https://doi.org/10.1080/08003831.2024.2410124   
  • Paris, R. (2020). The right to dominate: How old ideas about sovereignty pose new challenges for world order. International Organization, 74(3), 453-489. 
  • Phillipson, R. (2008). The linguistic imperialism of neoliberal empire. Critical inquiry in language studies. 5(1), 1-43.https://doi.org/10.1080/15427580701696886 
  • Rehm, G. et al., (2020, May). The European language technology landscape in 2020: Language-centric and human-centric AI for cross-cultural communication in multilingual Europe. In Proceedings of the Twelfth Language Resources and Evaluation Conference. 3322-3332. https://aclanthology.org/2020.lrec-1.407.pdf 
  • Scannell, K. P. (2007). The Crúbadán project: Corpus building for under-resourced languages ın Building and exploring Web corpora: Proceedings of the 3rd Web as Corpus Workshop. University Press of Louvain, vol. 4, 5–15. https://kevinscannell.com/files/wac3.pdf
  • Siminyu, K. et al., (2021). AI4D--African Language Program. arXiv preprint arXiv:2104.02516. https://arxiv.org/pdf/2104.02516
  • Tan, Y., Jehom, W. J. (2024). The Function of Digital Technology in Minority Language Preservation: The Case of the Gyalrong Tibetan Language. Preservation, Digital Technology & Culture, 53(3), 165-177.
  • UNESCO. (2001). Universal Declaration on Cultural Diversity. Paris: UNESCO. https://www.refworld.org/legal/resolution/unesco/2001/en/35901
  • UNESCO. (2023). Digital Initiatives for Indigenous Languages. Paris: UNESCO.  https://www.defindia.org/wp-content/uploads/2024/12/Digital-Initiatives-for-Indigenous-Languages.pdf?utm_source=chatgpt.com