Shounak Paul, Dhananjay Ghumare, Pawan Goyal, Saptarshi Ghosh, Ashutosh Modi. IL-PCSR: Legal Corpus for Prior Case and Statute Retrieval. Conference on Empirical Methods in Natural Language Processing (EMNLP Main Conference), Suzhou, China, November 2025.[Core A*] [Link]
Soumilya De, Soumyajit Datta, Koustav Rudra, Saptarshi Ghosh, Ashiqur R. Khudabukhsh, Kripabandhu Ghosh. Justice for the Disadvantaged: A Study of Public Reactions on Indian Supreme Court Judgments. International Conference on Social Networks Analysis and Mining (ASONAM), Niagara Falls, Ontario, Canada, August 2025.
Upal Bhattacharya, Aniket Deroy, Ayan Bandyopadhyay, Gourish Majumdar, Shouvik Kumar Guha, Koustav Rudra, Saptarshi Ghosh, Kripabandhu Ghosh. ARDI: a new dataset for automatic advocate recommendation in the Indian Legal System. . Artificial Intelligence and Law, Springer, 2025. [Link]
Sagar Chakraborty, Gaurav Harit, Saptarshi Ghosh. How well do MLLMs understand handwritten legal documents? A Novel Dataset for Benchmarking. International Journal on Document Analysis and Recognition (IJDAR), Springer, 2025. [Dataset] [Link]
Aniket Deroy, Kripabandhu Ghosh, Saptarshi Ghosh. Investigating Legal Question Generation using Large Language Models. Artificial Intelligence and Law, Springer, 2025. [Dataset] [Link]
Purbid Bambroo, Subinay Adhikary, Paheli Bhattacharya, Abhijnan Chakraborty, Saptarshi Ghosh, Kripabandhu Ghosh. MARRO: Multi-headed Attention for Rhetorical Role Labeling in Legal Documents. Artificial Intelligence and Law, Springer, 2025. [Dataset] [Link]
Sayan Mahapatra, Debtanu Datta, Shubham Soni, Adrijit Goswami, Saptarshi Ghosh. MILPaC: A Novel Benchmark for Evaluating Translation of Legal Text to Indian Languages. ACM Transactions on Asian and Low-Resource Language Information Processing (TALLIP), vol. 24, issue 8, pages 80:1--80:30, August 2025. [Dataset] [Translation Model] [Link]
Shounak Paul, Rohit Kumar Prajapati, Pawan Goyal, Saptarshi Ghosh. An Investigation into the Understanding and Reasoning Capabilities of LLMs for Legal Statute Identification. Workshop on Informing ML with Knowledge Engineering for Hybrid Intelligent Systems (co-located with International Conference on Hybrid Human-Artificial Intelligence), Pisa, Italy, June 2025.
2024
Abhinav Joshi, Shounak Paul, Akshat Sharma, Pawan Goyal, Saptarshi Ghosh, Ashutosh Modi. IL-TUR: Benchmark for Indian Legal Text Understanding and Reasoning. Annual Conference of the Association for Computational Linguistics (ACL Main), pp. 11460--11499, Bangkok, Thailand, August 2024. [Core A*] [Website] [Link]
Shounak Paul, Rajas Bhatt, Pawan Goyal, Saptarshi Ghosh. Legal Statute Identification: A Case Study using State-of-the-Art Datasets and Methods. ACM SIGIR Conference (Resource & Reproducibility track), pp. 2231--2240, Washington D.C., USA, July 2024. [Core A*] [Dataset & Codes] [Link]
Aniket Deroy, Kripabandhu Ghosh, Saptarshi Ghosh. Applicability of Large Language Models and Generative Models for Legal Case Judgement Summarization. Artificial Intelligence and Law, Springer, 2024. [Link]
Aniket Deroy, Kripabandhu Ghosh, Saptarshi Ghosh. Ensemble methods for improving extractive summarization of legal case judgements. Artificial Intelligence and Law, Springer, vol. 32, pp. 231--289, 2024. [Link]
2023
Debtanu Datta, Shubham Soni, Rajdeep Mukherjee, Saptarshi Ghosh. MILDSum: A Novel Benchmark Dataset for Multilingual Summarization of Indian Legal Case Judgments. Conference on Empirical Methods in Natural Language Processing (EMNLP), pp. 5291--5302, Singapore, December 2023. [Short Paper] [Core A*] [Dataset] [Link]
Sagar Chakraborty, Gaurav Harit, Saptarshi Ghosh. TransDocAnalyser: A framework for semi-structured offline handwritten documents analysis with an application to legal domain. International Conference on Document Analysis and Recognition (ICDAR), pp. 45--62, San José, California, USA, August 2023. [Core A] [Dataset] [Link]
Shounak Paul, Arpan Mandal, Pawan Goyal, Saptarshi Ghosh. Pre-trained Language Models for the Legal Domain: A Case Study on Indian Law. International Conference on Artificial Intelligence and Law (ICAIL), pp. 187-196, Braga, Portugal, June 2023. [Best pre-trained model (1 million+ downloads till date)] [Codes] [Link]
Paheli Bhattacharya, Shounak Paul, Kripabandhu Ghosh, Saptarshi Ghosh, Adam Wyner. DeepRhole: Deep Learning for Rhetorical Role Labeling of Sentences in Legal Case Documents. Artificial Intelligence and Law, Springer, vol. 31, pp. 53--90, 2023. [Link]
Aniket Deroy, Naksatra Kumar Bailung, Kripabandhu Ghosh, Saptarshi Ghosh, Abhijnan Chakraborty. Artificial Intelligence (AI) in Legal Data Mining. Technology and Analytics for Law & Justice, OakBridge Publishing, pp. 273--297, 2023 (ISBN: 978-939-5764-68-1). [Book - online version]
Debasis Ganguly, Jack G. Conrad, Kripabandhu Ghosh, Saptarshi Ghosh, Pawan Goyal, Paheli Bhattacharya, Shubham Kumar Nigam, Shounak Paul. Legal IR and NLP: the History, Challenges, and State-of-the-Art. Tutorial at the European Conference on Information Retrieval (ECIR), pp. 331--340, Dublin, Ireland, April 2023. [Core A] [Resource Page]
Jack Conrad, Shirsha Ray Chaudhuri, Shounak Paul, Saptarshi Ghosh. AI & Law: Formative Developments, State-of-the-Art Approaches, Challenges & Opportunities. Tutorial at the Joint International Conference on Data Science & Management of Data (CODS-COMAD), pp. 320–323, Mumbai, India, January 2023. [Pdf] [Resource Page]
2022
Abhay Shukla, Paheli Bhattacharya, Soham Poddar, Rajdeep Mukherjee, Kripabandhu Ghosh, Pawan Goyal, Saptarshi Ghosh. Legal Case Document Summarization: Extractive and Abstractive Methods and their Evaluation. Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the International Joint Conference on Natural Language Processing (AACL-IJCNLP), Virtual Event, pp. 1048--1064, November 2022. [Dataset + Codes] [Link]
Shounak Paul, Pawan Goyal, Saptarshi Ghosh. LeSICiN: A Heterogeneous Graph-based Approach for Automatic Legal Statute Identification from Indian Legal Documents. AAAI Conference on Artificial Intelligence (AAAI), Virtual Event, Canada, pp. 11139--11146, February 2022. [Core A*] [Dataset + Codes] [Link]
Paheli Bhattacharya, Kripabandhu Ghosh, Arindam Pal, Saptarshi Ghosh. Legal Case Document Similarity: You Need Both Network and Text. Information Processing and Management, Elsevier, vol. 59, issue 6, pp. 103069-1 -- 103069-24, November 2022. [Dataset] [Link]
Arpan Mandal, Kripabandhu Ghosh, Saptarshi Ghosh, Sekhar Mandal. A Sequence Labeling Model for Catchphrase Identification from Legal Case Documents. Artificial Intelligence and Law, Springer, vol. 30, pp. 325--358, 2022. [Codes] [Link]
Saptarshi Ghosh et al., Report on the 2nd Symposium on Artificial Intelligence and Law (SAIL) 2022. ACM SIGIR Forum Newsletter, Volume 56, number 1, June 2022. [Online Pdf]
2021
Aniket Deroy, Paheli Bhattacharya, Saptarshi Ghosh, Kripabandhu Ghosh. An Analytical Study of Algorithmic and Expert Summaries of Legal Cases. International Conference on Legal Knowledge and Information Systems (JURIX), Vilnius, Lithuania, pp. 90--99, December 2021. [Link]
Arpan Mandal, Paheli Bhattacharya, Sekhar Mandal, Saptarshi Ghosh. Improving Legal Case Document Summarization using Document-specific Catchphrases. International Conference on Legal Knowledge and Information Systems (JURIX), Vilnius, Lithuania, pp. 76--81, December 2021. [Short Paper] [Link]
Paheli Bhattacharya, Soham Poddar, Koustav Rudra, Kripabandhu Ghosh, Saptarshi Ghosh. Incorporating Domain Knowledge for Extractive Summarization of Legal Case Documents. International Conference on Artificial Intelligence and Law (ICAIL), Virtual Event, Brazil, pp. 22--31, June 2021. Donald Berman Award for Best Student Paper at ICAIL 2021 [Code] [Link]
Arpan Mandal, Kripabandhu Ghosh, Saptarshi Ghosh, Sekhar Mandal. Unsupervised Approaches for Measuring Textual Similarity between Legal Court Case Reports. Artificial Intelligence and Law, Springer, vol. 29, pp. 417--451, January 2021. [Link]
Vedant Parikh, Upal Bhattacharya, Parth Mehta, Ayan Bandyopadhyay, Paheli Bhattacharya, Kripabandhu Ghosh, Saptarshi Ghosh, Arindam Pal, Arnab Bhattacharya, Prasenjit Majumder. AILA 2021: Shared task on Artificial Intelligence for Legal Assistance. Proceedings of FIRE 2021 - Annual Meeting of the Forum for Information Retrieval Evaluation, pp. 12--15, Virtual Event, December 2021. [Link]
2020
Shounak Paul, Pawan Goyal, Saptarshi Ghosh. Automatic Charge Identification from Facts: A Few Sentence-Level Charge Annotations is All You Need. International Conference on Computational Linguistics (COLING), Virtual Event, Spain, pp. 1011–1022, December 2020. [Code] [Link]
Paheli Bhattacharya, Kripabandhu Ghosh, Arindam Pal, Saptarshi Ghosh. Hier-SPCNet: A Legal Statute Hierarchy-based Heterogeneous Network for Computing Legal Document Similarity. ACM SIGIR, Virtual Event, China, pp. 1657–1660, July 2020. [Short Paper] [Core A*] [Link]
Paheli Bhattacharya, Parth Mehta, Kripabandhu Ghosh, Saptarshi Ghosh, Arindam Pal, Arnab Bhattacharya, Prasenjit Majumder. FIRE 2020 AILA Track: Artificial Intelligence for Legal Assistance. Proceedings of FIRE 2020 - Annual Meeting of the Forum for Information Retrieval Evaluation, pp. 1-3, Virtual Event, December 2020. [Link]
2019
Paheli Bhattacharya, Shounak Paul, Kripabandhu Ghosh, Saptarshi Ghosh, Adam Wyner. Identification of Rhetorical Roles of Sentences in Indian Legal Judgments. International Conference on Legal Knowledge and Information Systems (JURIX), Madrid, Spain, December 2019. [ Proceedings published as Legal Knowledge and Information Systems, Series on Frontiers in Artificial Intelligence and Applications, vol. 322, pp. 3--12] [JURIX 2019 Best Paper Award] [Dataset + Codes] [Link]
Paheli Bhattacharya, Kaustubh Hiware, Subham Rajgaria, Nilay Pochhi, Kripabandhu Ghosh, Saptarshi Ghosh. A Comparative Study of Summarization Algorithms applied to Legal Case Judgments. European Conference on Information Retrieval (ECIR), Cologne, Germany, pp. 413-428, April 2019. [Core A] [Codes] [Link]
Paheli Bhattacharya, Kripabandhu Ghosh, Arindam Pal, Saptarshi Ghosh. Methods for Computing Legal Document Similarity: A Comparative Study. Workshop on Legal Data Analysis, co-located with JURIX 2019, Madrid, Spain, December 2019. [Link]
Paheli Bhattacharya, Kripabandhu Ghosh, Saptarshi Ghosh, Arindam Pal, Parth Mehta, Arnab Bhattacharya, Prasenjit Majumder. Overview of the FIRE 2019 AILA track: Artificial Intelligence for Legal Assistance. Working Notes of FIRE 2019 - Annual Meeting of the Forum for Information Retrieval Evaluation, CEUR Workshop Proceedings, vol. 2517, pp. 1-12, Kolkata, India, December 2019. [Link]
2017
Arpan Mandal, Kripabandhu Ghosh, Arindam Pal, Saptarshi Ghosh. Automatic Catchphrase Identification from Legal Court Case Documents. ACM Conference on Information and Knowledge Management (CIKM), Singapore, pp. 2267-2270, November 2017. [Short Paper] [Core A] [Codes] [Link]
Arpan Mandal, Kripabandhu Ghosh, Arnab Bhattacharya, Arindam Pal, Saptarshi Ghosh. Overview of the FIRE 2017 IRLeD Track: Information Retrieval from Legal Documents. Working notes of FIRE 2017 - Annual Meeting of the Forum for Information Retrieval Evaluation, CEUR workshop proceedings, vol. 2036, pp. 63-68, Bangalore, India, December 2017. [Link]