Publications
Publications
71 works grouped by research theme, including 13 CORE A* and 9 CORE A papers. My name is shown in bold.
A continuously updated list also lives on my Google Scholar profile. Most titles link to the full PDF.
Speech Processing, Recognition & Audio Security (12)
- ConferenceAbdul Hameed Azeemi, Ihsan Ayyub Qazi, Maryam Mustafa, Agha Ali Raza. Dissecting ASR Failures in Low-Resource South Asian Languages. Interspeech 2026.
- ConferenceSamee Arif, Aamina Jamal Khan, Mustafa Abbas, Agha Ali Raza, Awais Athar. WER We Stand: Benchmarking Urdu ASR Models. COLING 2025, Abu Dhabi, UAE.
- JournalNimra Zaheer, Agha Ali Raza, Mudassir Shabbir. Conversations in the Wild: Data Collection, Automatic Generation and Evaluation. Computer Speech & Language, vol. 89, 2025.
- ConferenceSheza Munir, Wassay Sajjad, Mukeet Raza, Emaan Mujahid Abbas, Abdul Hameed Azeemi, Ihsan Ayyub Qazi, Agha Ali Raza. Deepfake Defense: Constructing and Evaluating a Specialized Urdu Deepfake Audio Dataset. Findings of ACL 2024, Bangkok, Thailand.
- JournalNimra Zaheer, Obaid Ullah Ahmad, Mudassir Shabbir, Agha Ali Raza. Speech Emotion Recognition for the Urdu Language. Language Resources and Evaluation (LRE), 2023.
- ConferenceHira Dhamyal, Ayesha Ali, Ihsan Ayyub Qazi, Agha Ali Raza. Using Self-Attention DNNs to Discover Phonemic Features for Audio Deep Fake Detection. ASRU 2021, Cartagena, Colombia.
- ConferenceHira Dhamyal, Ayesha Ali, Ihsan Ayyub Qazi, Agha Ali Raza. Fake Audio Detection in Resource-Constrained Settings using Microfeatures. Interspeech 2021, Brno, Czechia.
- ConferenceAgha Ali Raza, Awais Athar, Shan Randhawa, Zain Tariq, Muhammad Bilal Saleem, Haris Bin Zia, Umar Saif, Roni Rosenfeld. Rapid Collection of Spontaneous Speech Corpora using Telephonic Community Forums. Interspeech 2018, Hyderabad, India.
- ConferenceAgha Ali Raza, Sarmad Hussain, Huda Sarfraz, Inam Ullah, Zahid Sarfraz. An ASR System for Spontaneous Urdu Speech. Oriental COCOSDA 2010, Kathmandu, Nepal.
- ConferenceHuda Sarfraz, Sarmad Hussain, Riffat Bokhari, Agha Ali Raza, Inam Ullah, Zahid Sarfraz, Sophia Pervez, Asad Mustafa, Iqra Javed, Rahila Parveen. Speech Corpus Development for a Speaker-Independent Spontaneous Urdu Speech Recognition System. Oriental COCOSDA 2010, Kathmandu, Nepal.
- ConferenceHuda Sarfraz, Sarmad Hussain, Riffat Bokhari, Agha Ali Raza, Inam Ullah, Zahid Sarfraz, Sophia Pervez, Asad Mustafa, Iqra Javed, Rahila Parveen. Large Vocabulary Continuous Speech Recognition for Urdu. Frontiers of Information Technology (FIT) 2010, Islamabad, Pakistan.
- ConferenceAgha Ali Raza, Sarmad Hussain, Huda Sarfraz, Inam Ullah, Zahid Sarfraz. Design and Development of a Phonetically Rich Urdu Speech Corpus. O-COCOSDA 2009, Ürümqi, China.
Natural Language Processing & LLMs for Low-Resource Languages (14)
- JournalIhsan Ayyub Qazi, Zohaib Khan, Abdullah Ghani, Agha Ali Raza, Zafar Ayyub Qazi, Wassay Sajjad, Ayesha Ali, Asher Javaid, Muhammad Abdullah Sohail, Abdul H. Azeemi. Large Language Models Show Dunning-Kruger-Like Effects in Multilingual Fact-Checking. Scientific Reports 16, 7594 (2026), a Nature Portfolio journal.
- WorkshopSamee Arif, Muhammad Saad Haroon, Aamina Jamal Khan, Taimoor Arif, Agha Ali Raza, Awais Athar. Kahaani: A Multimodal Co-Creative Storytelling System. EACL 2026 Student Research Workshop, Rabat, Morocco.
- ConferenceAbdullah Hashmat, Muhammad Arham Mirza, Agha Ali Raza. PakBBQ: A Culturally Adapted Bias Benchmark for QA. EMNLP 2025 (Main), Suzhou, China.
- WorkshopSamee Arif, Sualeha Farid, Abdul Hameed Azeemi, Awais Athar, Agha Ali Raza. The Fellowship of the LLMs: Multi-Model Workflows for Synthetic Preference Optimization Dataset Generation. GEM² Workshop at ACL 2025.
- JournalHafiz Rizwan Iqbal, Muhammad Sharjeel, Jawad Shafi, Usama Mehmood, Agha Ali Raza. Urdu Sentential Paraphrased Plagiarism Detection Using Large Language Models. ACM TALLIP, vol. 24(9), 2025.
- JournalHafiz Rizwan Iqbal, Muhammad Sharjeel, Jawad Shafi, Usama Mehmood, Saeed-Ul Hassan, Agha Ali Raza. Urdu Paraphrased Text Reuse and Plagiarism Detection using Pre-trained LLMs and Deep Hybrid Neural Networks. Multimedia Tools and Applications, vol. 84(35), 2025.
- ConferenceSamee Arif, Abdul Hameed Azeemi, Agha Ali Raza, Awais Athar. Generalists vs. Specialists: Evaluating Large Language Models for Urdu. Findings of EMNLP 2024, Miami, FL, USA.
- ConferenceSamee Arif, Sualeha Farid, Awais Athar, Agha Ali Raza. UQA: Corpus for Urdu Question Answering. LREC-COLING 2024, Torino, Italy.
- WorkshopAbdul Basit, Abdul Hameed Azeemi, Agha Ali Raza. Challenges in Urdu Machine Translation. LoResMT Workshop at ACL 2024, Bangkok, Thailand.
- JournalHafiz Rizwan Iqbal, Rashad Maqsood, Agha Ali Raza, Saeed-Ul Hassan. Urdu Paraphrase Detection: A Novel DNN-based Implementation using a Semi-Automatically Generated Corpus. Natural Language Engineering (NLE), vol. 30(2), 2024.
- ConferenceNamoos Hayat Qasmi, Haris Bin Zia, Awais Athar, Agha Ali Raza. SimplifyUR: Unsupervised Lexical Text Simplification for Urdu. LREC 2020, Marseille, France.
- ConferenceHaris Bin Zia, Agha Ali Raza, Awais Athar. Urdu Word Segmentation using Conditional Random Fields. COLING 2018, Santa Fe, NM, USA.
- ConferenceHaris Bin Zia, Agha Ali Raza, Awais Athar. PronouncUR: An Urdu Pronunciation Lexicon Generator. LREC 2018, Miyazaki, Japan.
- ConferenceAgha Ali Raza, Awais Athar, Sajid Nadeem. N-Gram Based Authorship Attribution in Urdu Poetry. Conference on Language and Technology (CLT) 2009, Lahore, Pakistan.
Efficient Machine Learning: Data Selection, Pruning, Active & Federated Learning (8)
- ConferenceAbdul Hameed Azeemi, Ihsan Ayyub Qazi, Agha Ali Raza. Language Model-Driven Data Pruning Enables Efficient Active Learning. Findings of EACL 2026, Rabat, Morocco.
- ConferenceAbdul Hameed Azeemi, Ihsan Ayyub Qazi, Agha Ali Raza. To Label or Not to Label: Hybrid Active Learning for Neural Machine Translation. COLING 2025, Abu Dhabi, UAE.
- JournalAbdul Hameed Azeemi, Ihsan Ayyub Qazi, Agha Ali Raza. A Survey on Data Selection for Efficient Speech Processing. IEEE Access, vol. 13, 2025.
- ConferenceAbdul Hameed Azeemi, Ihsan Ayyub Qazi, Agha Ali Raza. Data Pruning for Efficient Model Pruning in Neural Machine Translation. Findings of EMNLP 2023, Singapore.
- ConferenceMuhammad Tahir Munir, Muhammad Mustansar Saeed, Mahad Ali, Zafar Ayyub Qazi, Ihsan Ayyub Qazi, Agha Ali Raza. Learning Fast and Slow: Towards Inclusive Federated Learning. ECML PKDD 2023, Turin, Italy.
- ConferenceAbdul Hameed Azeemi, Ihsan Ayyub Qazi, Agha Ali Raza. Self-Supervised Dataset Pruning for Efficient Training in Audio Anti-spoofing. Interspeech 2023, Dublin, Ireland.
- WorkshopAbdul Hameed Azeemi, Ihsan Ayyub Qazi, Agha Ali Raza. Representative Subset Selection for Efficient Fine-Tuning in Self-Supervised Speech Recognition. NeurIPS 2023 Workshop (ENLSP), New Orleans, USA.
- ConferenceAbdul Hameed Azeemi, Ihsan Ayyub Qazi, Agha Ali Raza. Dataset Pruning for Resource-Constrained Spoofed Audio Detection. Interspeech 2022, Incheon, Korea.
Speech-Based Interfaces & ICT for Development (ICT4D) (14)
- ConferenceShan M. Randhawa, Tallal Ahmad, Jay Chen, Agha Ali Raza. Karamad: A Voice-based Crowdsourcing Platform for Underserved Populations. CHI 2021, Yokohama, Japan.
- JournalMuhammad Qasim, Haris Bin Zia, Awais Athar, Tania Habib, Agha Ali Raza. Personalized Weather Information for Low-Literate Farmers using Multimodal Dialog Systems. International Journal of Speech Technology, vol. 24(2), 2021.
- JournalAditya Vashistha, Umar Saif, Agha Ali Raza. The Internet of the Orals. Communications of the ACM, vol. 62(11), 2019.
- ConferenceAgha Ali Raza, Zain Tariq, Shan Randhawa, Bilal Saleem, Awais Athar, Umar Saif, Roni Rosenfeld. Voice-Based Quizzes for Measuring Knowledge Retention in Under-Connected Populations. CHI 2019, Glasgow, UK.
- ConferenceAditya Vashistha, Abhinav Garg, Richard Anderson, Agha Ali Raza. Threats, Abuses, Flirting, and Blackmail: Gender Inequity in Social Media Voice Forums. CHI 2019, Glasgow, UK.
- ConferenceAgha Ali Raza, Bilal Saleem, Shan Randhawa, Zain Tariq, Awais Athar, Umar Saif, Roni Rosenfeld. Baang: A Viral Speech-based Social Platform for Under-Connected Populations. CHI 2018, Montréal, Canada.
- ConferenceWaleed Riaz, Haris Durrani, Suleman Shahid, Agha Ali Raza. ICT Intervention for Agriculture Development: Designing an IVR System for Farmers in Pakistan. ICTD 2017, Lahore, Pakistan.
- ConferenceAgha Ali Raza, Rajat Kulshreshtha, Spandana Gella, Sean Blagsvedt, Maya Chandrasekaran, Bhiksha Raj, Roni Rosenfeld. Viral Spread via Entertainment and Voice-Messaging Among Telephone Users in India. ICTD 2016, Ann Arbor, MI, USA.
- ConferenceAgha Ali Raza, Samia Razaq, Amna Raja, Rizwan Naru, Ali Gibran, Abdullah Sabri, Haroon Niaz, Muhammad Bilal Saleem, Umar Saif. Real-Time Automated Surveys among Low-Literate Masses using Voice-based Telephone Services. ACM DEV 2016, Nairobi, Kenya.
- WorkshopNikolas Wolfe, Juneki Hong, Agha Ali Raza, Bhiksha Raj, Roni Rosenfeld. Rapid Development of Public Health Education Systems in Low-Literacy Multilingual Environments: Combating Ebola Through Voice Messaging. SLaTE 2015 (Interspeech satellite), Leipzig, Germany.
- ConferenceCHI Best PaperAgha Ali Raza, Farhan Ul Haq, Zain Tariq, Mansoor Pervaiz, Samia Razaq, Umar Saif, Roni Rosenfeld. Job Opportunities through Entertainment: Virally Spread Speech-Based Services for Low-Literate Users. CHI 2013, Paris, France.
- ConferenceHaohan Wang, Agha Ali Raza, Yibin Lin, Roni Rosenfeld. Behavior Analysis of Low-Literate Users of a Viral Speech-based Telephone Service. ACM DEV 2013, Cape Town, South Africa.
- ConferenceAgha Ali Raza, Farhan Ul Haq, Zain Tariq, Umar Saif, Roni Rosenfeld. Spread and Sustainability: The Geography and Economics of Speech-Based Services. ACM DEV 2013, Bangalore, India.
- ConferenceAgha Ali Raza, Mansoor Pervaiz, Christina Milo, Samia Razaq, Guy Alster, Jahanzeb Sherwani, Umar Saif, Roni Rosenfeld. Viral Entertainment as a Vehicle for Disseminating Speech-Based Services to Low-Literate Users. ICTD 2012, Atlanta, GA, USA.
HCI for Development: Health, Gender & Financial Inclusion (9)
- JournalSarojini Hirshleifer, Mustafa Naseem, Agha Ali Raza, Arman Rezaee. The Spread of (Mis)information: A Social Media Experiment in Pakistan. Journal of Development Economics, 2026.
- ConferenceAli Saif, Mohammad Taha Zakir, Agha Ali Raza, Mustafa Naseem. EvolveUI: User Interfaces that Evolve with User Proficiency. ACM COMPASS 2024, IIIT-Delhi, India.
- ConferenceImama Zahoor, Shiza Ihtsham, Umar Ramzan, Agha Ali Raza, Basmaa Ali. AI-Driven Healthcare Delivery in Pakistan: A Framework for Systemic Improvement. ACM COMPASS 2024, IIIT-Delhi, India.
- JournalAyesha Ali, Agha Ali Raza, Ihsan Ayyub Qazi. Validated Digital Literacy Measures for Populations with Low Levels of Internet Experiences. Development Engineering, vol. 8, 2023.
- ConferenceAgha Ali Raza, Mustafa Naseem, Namoos Hayat Qasmi, Shan Randhawa, Fizzah Malik, Behzad Taimur, Sacha St-Onge Ahmad, Sarojini Hirshleifer, Arman Rezaee, Aditya Vashistha. Fostering Engagement of Underserved Communities with Credible Health Information on Social Media. Web4Good at TheWebConf 2022 (WWW), Lyon, France.
- ConferenceCHI Honorable MentionMustafa Naseem, Bilal Saleem, Sacha Ahmad, Jay Chen, Agha Ali Raza. An Empirical Comparison of Technologically Mediated Advertising in Under-connected Populations. CHI 2020, Honolulu, HI, USA.
- ConferenceMaryam Mustafa, Amna Batool, Beenish Fatima, Fareeda Nawaz, Kentaro Toyama, Agha Ali Raza. Patriarchy, Maternal Health and Spiritual Healing: Designing Maternal Health Interventions in Pakistan. CHI 2020, Honolulu, HI, USA.
- JournalMaryam Mustafa, Amna Batool, Agha Ali Raza. Designing ICT Interventions for Women in Pakistan. Communications of the ACM, vol. 62(11), 2019.
- JournalHamid Mehmood, Tallal Ahmad, Lubna Razaq, Shrirang Mare, Maryam Zafar Usmani, Richard Anderson, Agha Ali Raza. Towards Digitization of Collaborative Savings Among Low-Income Groups. PACM on Human-Computer Interaction (CSCW), 2019.
Data Mining, Information Networks & Applied Optimization (2)
- JournalAbinta Mehmood Mir, Ali Hassan, Asma Khalid, Zohair Raza Hassan, Faisal Kamiran, Agha Ali Raza, Saeed-Ul Hassan, Mudassir Shabbir. Data-Driven Smart Policing: A Novel Road-Distance-based k-Median Model for Optimal Substation Placement. Computers in Human Behavior, vol. 127, 2022.
- ConferenceYibin Lin, Agha Ali Raza, Jay-Yoon Lee, Danai Koutra, Roni Rosenfeld, Christos Faloutsos. Influence Propagation: Patterns, Model and a Case Study. PAKDD 2014, Tainan, Taiwan.
Book Chapters (1)
- Book ChapterAditya Vashistha, Agha Ali Raza. Voice Interfaces for Underserved Communities. In An Introduction to Development Engineering, Springer, 2021.
Posters, Extended Abstracts & Symposium Papers (7)
- PosterHira Ejaz, Syed Ali Hussain, Agha Ali Raza. The Case for IVR-Based Citizen Journalism in Pakistan. MobileHCI 2018 (Adjunct), Barcelona, Spain.
- SymposiumHamid Mehmood, Lubna Razaq, Jennifer Webster, Amna Batool, Maryam Mustafa, Agha Ali Raza, Richard Anderson. Save My Money: Digitizing Informal Savings in Pakistan. HCI Across Borders Symposium, CHI 2018.
- SymposiumMaryam Ayub, Sacha St-Onge Ahmad, Bilal Saleem, Agha Ali Raza, Mustafa Naseem, Jay Chen. How to Advertise a Speech-Based Service to Offline Populations: A Case Study from Pakistan. HCI Across Borders Symposium, CHI 2018.
- SymposiumSacha St-Onge Ahmad, Muhammad Bilal Saleem, Maryam Ayub, Tallal Ahmad, Shan Randhawa, Zain Tariq, Mustafa Naseem, Agha Ali Raza. Usage and Feedback from a 3-Week Launch of a Maternal Health Line for Men in Pakistan. HCI Across Borders Symposium, CHI 2018.
- SymposiumSacha St-Onge Ahmad, Mustafa Naseem, Agha Ali Raza. Maternal Awareness for Low-Literate Expecting Parents via Voice-Based Telephone Services. HCI Across Borders Symposium, CHI 2017.
- SymposiumAbdullah Kharal, Mustafa Naseem, Sacha St-Onge Ahmad, Agha Ali Raza. Sustainable IVR-Based Social Media for the Developing World. HCI Across Borders Symposium, CHI 2017.
- SymposiumAditya Vashistha, Agha Ali Raza, Umar Saif, Roni Rosenfeld, Richard Anderson. Changing Perceptions of Citizens of India and Pakistan. HCI Across Borders Development Consortium, CHI 2016.
Early Work: Mathematics & Geometric Modeling (2)
- ConferenceAgha Ali Raza, Zulfiqar Habib, Manabu Sakai. Interpolation with Rational Cubic Spirals. IEEE ICET 2008, Rawalpindi, Pakistan.
- WorkshopAgha Ali Raza, Asma Zaib, Sana Altaf, Zulfiqar Habib. On a Special Case of Path Planning. 7th CIIT Workshop on Research in Computing, 2008, Lahore, Pakistan.
Preprints & Manuscripts Under Review (2)
- PreprintShan Randhawa, Agha Ali Raza, Kentaro Toyama, Julie Hui, Mustafa Naseem. Empathy Applicability Modeling for General Health Queries. arXiv:2601.09696, 2026 (under submission).
- PreprintZohaib Khan, Muhammad Khaquan, Omer Tafveez, Burhanuddin Samiwala, Agha Ali Raza. Beyond Uniform Query Distribution: Key-Driven Grouped Query Attention. arXiv:2408.08454, 2024.