Workshop Program

Keynote Speaker
Prof. Pushpak Bhattacharyya
Prof. Pushpak Bhattacharyya is a Professor in the Computer Science and Engineering Department at IIT Bombay. His areas of research are Natural Language Processing (NLP) and Machine Learning (ML). He currently holds the Major Bhagat Singh Rekhi Chair Professorship of IIT Bombay and formerly held the Vijay and Sita Vashi Chair Professorship (2014-16). He was formerly Director of IIT Patna (2015-20) and President of the Association for Computational Linguistics (2016). He is a Fellow of the National Academy of Engineering (2015) and an Abdul Kalam National Fellow (2020).

Prof. Bhattacharyya received his B.Tech from IIT Kharagpur (1984; Distinguished Alumnus Awardee in 2018), M.Tech from IIT Kanpur (1986), and Ph.D. from IIT Bombay (1994). He has also been a visiting scientist, researcher, or faculty member at MIT (Cambridge, USA), Stanford University (USA), University Joseph Fourier (Grenoble, France), and the University of Texas at Houston (USA). Prof. Bhattacharyya has lectured on NLP at leading national and international academic institutions and in industry throughout the world.

Tentative Program Schedule

08:45-09:00
Opening Remarks: Welcome Speech by the Conference Chair
09:00-09:50
Keynote Speech: Prof. Pushpak Bhattacharyya
Note: Each author will have 10 minutes for the presentation and 3 minutes for Q&A.

Theme: Language Processing and Evaluation

10:00-10:15
Hindi Reading Comprehension: Do Large Language Models Exhibit Semantic Understanding?
Daisy Monika Lal, Paul Rayson, Mo El-Haj
10:15-10:30
Crossing Language Boundaries: Evaluation of Large Language Models on Urdu-English Question Answering
Samreen Kazi, Maria Rahim, Shakeel Khoja
10:30-11:00 Coffee Break

Theme: Building Resources and Improving Techniques for Indic Language NLP

11:00-11:15
Machine Translation and Transliteration for Indo-Aryan Languages: A Systematic Review
Sameera Perera, T.G.D.K. Sumanathilaka
11:15-11:30
Investigating the Effect of Back Translation for Indic Languages
Sudhansu Bala Das, Samujjal Choudhury, Tapas Kumar Mishra, Bidyut Kr. Patra
11:30-11:45
BERTopic for Topic Modeling of Hindi Short Texts: A Comparative Study
Atharva Mutsaddi, Anvi Jamkhande, Aryan Thakre, Yashodhara Haribhakta
11:45-12:00
Evaluating Structural and Linguistic Quality in Urdu DRS Parsing and Generation through Bidirectional Evaluation
Muhammad Saad Amin, Luca Anselma, Alessandro Mazzei
12:00-12:15
Studying the Effect of Hindi Tokenizer Performance on Downstream Tasks
Rashi Goel, Fatiha Sadat
12:15-12:30
Adapting Multilingual LLMs to Low-Resource Languages using Continued Pre-training and Synthetic Corpus: A Case Study for Hindi LLMs
Raviraj Joshi, Kanishk Singla, Anusha Kamath, Raunak Kalani, Rakesh Paul, Utkarsh Vaidya, Sanjay Singh Chauhan, Niranjan Wartikar, Eileen Long
12:30-12:45
OVQA: A Dataset for Visual Question Answering and Multimodal Research in Odia Language
Shantipriya Parida, Shashikanta Sahoo, Sambit Sekhar, Kalyanamalini Sahoo, Ketan Kotwal, Sonal Khosla, Satya Ranjan Dash, Aneesh Bose, Guneet Singh Kohli, Smruti Smita Lenka, Ondřej Bojar
12:45-13:00
Advancing Multilingual Speaker Identification and Verification for Indo-Aryan and Dravidian Languages
Braveenan Sritharan, Uthayasanker Thayasivam
13:00-14:00 Lunch

Theme: Applications and Societal Impact: Applying NLP to Real-World Problems and Societal Challenges

14:00-14:15
Sentiment Analysis of Sinhala News Comments using Transformers
Isuru Bandaranayake, Hakim Usoof
14:15-14:30
ExMute: A Context-Enriched Multimodal Dataset for Hateful Memes
Riddhiman Swanan Debnath, Nahian Beente Firuj, Abdul Wadud Shakib, Sadia Sultana, Md Saiful Islam
14:30-14:45
Studying the capabilities of Large Language Models in solving Combinatorics Problems posed in Hindi
Yash Kumar, Subhajit Roy
14:45-15:00
From Scarcity to Capability: Empowering Fake News Detection in Low-Resource Languages with LLMs
Hrithik Majumdar Shibu, Shrestha Datta, Md. Sumon Miah, Nasrullah Sami, Mahruba Sharmin Chowdhury, Md. Saiful Islam
15:00-15:15
Enhancing Participatory Development Research in South Asia through LLM Agents System: An Empirically-Grounded Methodological Initiative and Agenda from Field Evidence in Sri Lanka
Xinjie Zhao, Hao Wang, Shyaman Maduranga Sriwarnasinghe, Jiacheng Tang, Shiyun Wang, Sayaka Sugiyama, So Morikawa
15:15-15:30
Identifying Aggression and Offensive Language in Code-Mixed Tweets: A Multi-Task Transfer Learning Approach
Bharath Kancharla, Prabhjot Singh, Lohith Bhagavan Kancharla, Yashita Chama, Raksha Sharma
15:30-16:00 Coffee Break
16:00-16:15
Shared Task Overview and Discussion
16:15-16:30
Closing Remarks

Accepted Shared Tasks

  1. Team IndiDataMiner at IndoNLP 2025: Hindi Back Transliteration - Roman to Devanagari using LLaMa
  2. IndoNLP 2025 Shared Task: Romanized Sinhala to Sinhala Reverse Transliteration Using BERT
  3. Sinhala Transliteration: A Comparative Analysis Between Rule-based and Seq2Seq Approaches
  4. Deep Learning Approach for Romanized English to Malayalam Script Transliteration Using an Encoder-Decoder Framework