Welcome to IndoNLP: The First Workshop on Natural Language Processing for Indo-Aryan and Dravidian Languages. This will be collocated with COLING 2025, in Abu Dhabi, UAE. on January 20, 2025.

Important Dates

All deadlines are 11:59PM UTC-12:00 ('Anywhere on Earth'). We will accept paper submissions; corresponding deadlines will be announced at a later moment in time.

Workshop description

The rapid advancement of Natural Language Processing (NLP) and Large Language Models (LLMs) has transformed the landscape of computational linguistics. However, Indo-Aryan and Dravidian Languages (IADL), which represent a significant portion of South Asia's linguistic heritage, remain under-resourced and under-researched in these technological developments. This workshop aims to bridge this gap by bringing together researchers, linguists, and technologists to focus on the unique challenges and opportunities. Participants will explore innovative methods for creating and annotating digital corpora, develop speech and language technologies suited to IADL, and promote interdisciplinary collaborations. By leveraging LLMs, we seek to address the complexities of syntax, morphology, and semantics in these languages to enhance the performance of NLP applications. Furthermore, the workshop will provide a platform for sharing best practices, tools, and resources, enhancing the digital infrastructure necessary for language preservation. Through collaborative efforts, we aim to build a research community to advance NLP for IADL, contributing to linguistic diversity and cultural preservation in the digital age.

The topics of the workshop include, but are not limited to:

Papers that can cover one or more of these areas are invited to submit.

Submission guidelines

Authors are invited to submit their unpublished work that represents novel research. The papers should be written in English using the *ACL style. Authors can also submit supplementary materials, including technical appendices, source codes, datasets, and multimedia appendices. All submissions, including the main paper and its supplementary materials, should be fully anonymised. For more information on formatting and anonymity guidelines, please refer to COLING 2022 submission guidelines.

The workshop accepts both long papers (8 pages) and short papers (4 pages). The paper can include unlimited appendix and references. Upon acceptance, the authors are provided with 1 more page to address the reviewer's comments.

All papers will be double-blind peer-reviewed. Two reviewers with the same technical expertise will review each paper. Authors of the accepted papers will present their work in either the Oral or Poster session. All accepted papers will appear in the workshop proceedings that will be published in ACL Anthology.

Both research paper and shared task paper must be submitted using SoftConf at https://softconf.com/coling2025/IndoNLP25/

Organisers

Ruvan Weerasinghe,Informatics Institute of Technology, Sri Lanka
Isuri Anuradha, Lancaster University, UK
Deshan Sumanathilaka, Swansea University, UK
Mo El-Haj, Lancaster University, UK
Chamila Liyanage, University of Colombo School of Computing, Sri Lanka
Fahad Khan, Istituto di Linguistica Computazionale in CNR, Italy
Andrew Hardie, Lancaster University, UK
Asim Abbas, Birmingham University, UK
Ruslan Mitkov Lancaster University, UK
Julian Hough, Swansea University, UK
Nicholas Micallef, Swansea University, UK
Naomi Krishnarajah, Informatics Institute of Technology, Sri Lanka

Contact us

You can reach the organizers by e-mail to indonlp2025@gmail.com

For more information, follow us in X