Welcome to IndoNLP: The First Workshop on Natural Language Processing for Indo-Aryan and Dravidian Languages. This will be collocated with COLING 2025, in Abu Dhabi, UAE. on January 20, 2025.

Proceeding of INDONLP 2025

Important Dates

- 1^st Call for Papers: July 20, 2024
- 2^nd Call for Papers: August 15, 2024
- Paper Submission Deadline : November 12, 2024
- Notification of Paper Acceptance: November 30, 2024 - December 5,2024
- Camera-ready Paper Deadline: December 13, 2024
- Workshop Date: January 20, 2025

All deadlines are 11:59PM UTC-12:00 ('Anywhere on Earth').

Workshop description

The rapid advancement of Natural Language Processing (NLP) and Large Language Models (LLMs) has transformed the landscape of computational linguistics. However, Indo-Aryan and Dravidian Languages (IADL), which represent a significant portion of South Asia's linguistic heritage, remain under-resourced and under-researched in these technological developments. This workshop aims to bridge this gap by bringing together researchers, linguists, and technologists to focus on the unique challenges and opportunities. Participants will explore innovative methods for creating and annotating digital corpora, develop speech and language technologies suited to IADL, and promote interdisciplinary collaborations. By leveraging LLMs, we seek to address the complexities of syntax, morphology, and semantics in these languages to enhance the performance of NLP applications. Furthermore, the workshop will provide a platform for sharing best practices, tools, and resources, enhancing the digital infrastructure necessary for language preservation. Through collaborative efforts, we aim to build a research community to advance NLP for IADL, contributing to linguistic diversity and cultural preservation in the digital age.

The topics of the workshop include, but are not limited to:

- Large Language Models for Indo-Aryan Languages and Dravidian Languages.
- Developing a cleaned Indo-Aryan and Dravidian language corpora (UNICODE) and digital linguistic resources.
- Machine Translation and Cross-Lingual Systems
- Speech Technologies: Recognition and Synthesis
- Language Identification and Dialect Detection
- Information Extraction, OCR systems and Knowledge Modelling
- NLP Applications - Fake News, Spam, and Rumor Detection
- Hate speech and Offensive Language Detection
- Sentiment Analysis and Text Summarisation
- NLP applications: Misinformation, Conspiracy theories. Rumours, SPAM, Phishing, and similar applications.

Papers that can cover one or more of these areas are invited to submit.

Submission guidelines

Authors are invited to submit their unpublished work that represents novel research. The papers should be written in English using the *ACL style. Authors can also submit supplementary materials, including technical appendices, source codes, datasets, and multimedia appendices. All submissions, including the main paper and its supplementary materials, should be fully anonymised. For more information on formatting and anonymity guidelines, please refer to COLING 2022 submission guidelines.

The workshop accepts both long papers (8 pages) and short papers (4 pages). The paper can include unlimited appendix and references. At the end of the paper (after the conclusions but before the references), papers need to include a mandatory section discussing the limitations of the work and, optionally, a section discussing ethical considerations. Papers can include unlimited pages of references and an unlimited appendix.
Upon acceptance, the authors are provided with 1 more page to address the reviewer's comments.

All papers will be double-blind peer-reviewed. Two reviewers with the same technical expertise will review each paper. Authors of the accepted papers will present their work in either the Oral or Poster session. All accepted papers will appear in the workshop proceedings that will be published in ACL Anthology.

To prepare your submission, please make sure to use the COLING 2025 style files available here:

Research paper and shared task paper must be submitted using SoftConf at https://softconf.com/coling2025/IndoNLP25/

Organising Committee

Ruvan Weerasinghe,Informatics Institute of Technology, Sri Lanka
Isuri Anuradha, Lancaster University, UK
Deshan Sumanathilaka, Swansea University, UK
Mo El-Haj, Lancaster University, UK
Chamila Liyanage, University of Colombo School of Computing, Sri Lanka
Fahad Khan, Istituto di Linguistica Computazionale in CNR, Italy
Andrew Hardie, Lancaster University, UK
Asim Abbas, Birmingham University, UK
Ruslan Mitkov Lancaster University, UK
Julian Hough, Swansea University, UK
Nicholas Micallef, Swansea University, UK
Naomi Krishnarajah, Informatics Institute of Technology, Sri Lanka

Programme Committee

Randil Pushpanandha, University of Colombo, Sri Lanka
Dulip Herath, Queensland University, Australia
Daisy Lal, Lancaster University, UK
Damith Premasiri, Lancaster University, UK
Venkatesh Raju, Stealth Mode AI Startup, India
Gayanath Chandrasena, University of Helsinki, Finland
Torin Wirasinghe,Informatics Institute of Technology, Sri Lanka
Kaza Sri Sai Swaroop, IBM, India
Asanka Wasala, Dell Technologies, Ireland
Kengatharaiyer Sarveswaran, University of Jaffna, Sri Lanka
Sinnathamby Mahesan, University of Jaffna, Sri Lanka
Nishantha Medagoda, Auckland University of Technology, New Zealand
Prasan Yapa, Kyoto University of Advance Science, Japan
Paul Rayson, Lancaster University, UK
Lochandaka Ranathunga, University of Moratuwa, Sri Lanka
Pumudu Fernando,Informatics Institute of Technology, Sri Lanka
Arjumand Younus, University College Dublin, Ireland
Abdul Nazeer, National Institute of Technology, Calicut, India
Pabitra Mitra, Indian Institute of Technology, Kharagpur, India
Tanmoy Chakraborty, Indian Institute of Technology, Delhi, India
Tirthankar Dasgupta, Indian Institute of Technology, Kharagpur, India
Girish Nath Jha, School for Sanskrit and Indic Studies, JNU, India
Arka Majhi, Indian Institute of Technology, Bombay, India
Anand Kumar, National Institute of Technology, Karnataka, India
Kishorjit Nongmeikapam, Indian Institute of Information Technology (IIIT) Manipur, India
Abdullah Alzahrani, Swansea University, Wales, UK
Jiby Mariya Jose, Indian Institute of Information Technology, India
Ayush Agarwal, Walmart, USA
Sobha Lalitha Devi, AU-KBC Research Centre, Anna University Chennai, India
Saman Galgodage, Swansea University, UK

Contact us

You can reach the organizers by e-mail to indonlp2025@gmail.com