Probit Regressive Tversky Indexed Rocchio Convolutive Deep Neural Learning for Legal Document Data Analytics

Abstract: Legal document data analytics is an important task in computational law. Semantic analysis of legal documents is challenging because they are often more complex than open-domain documents. Efficient document analysis is crucial to current legal applications such as case-based reasoning and legal citations. As the volume of documents has grown, several statistical machine-learning methods have been developed for legal document data analytics. However, legal documents are large and highly complex, so traditional machine-learning classification models struggle to deliver accurate analytics in minimal time. To improve the accuracy of legal document data analytics while keeping computation time low, a technique called Probit Regressive Tversky Indexed Rocchio Convolutive Deep Neural Learning (PRTIRCDNL) is introduced. PRTIRCDNL applies convolutive deep neural learning, which learns from the input through multiple layers, to produce accurate classification results. The network performs two processing steps, keyword extraction and classification, across four layers: an input layer, two hidden layers, and an output layer. First, a large number of legal documents are collected from the dataset and passed to the input layer. The documents are then transferred to the first hidden layer, where keyword extraction is carried out by applying Target Projective Probit Regression; the regression function extracts keywords based on their frequency-of-occurrence score. The extracted keywords are then transferred to the second hidden layer, where document classification is performed using the Tversky similarity indexive Rocchio classifier. In this way, all legal documents are classified into different classes.
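The keyword extraction and classification steps described above can be illustrated with a minimal sketch. It is not the PRTIRCDNL implementation, and it is not a neural network: the abstract does not give the exact formulation, so the sketch assumes that (a) the probit step maps each term's standardized frequency through the standard normal CDF and keeps terms at or above a hypothetical threshold, and (b) the Rocchio step assigns a document to the class whose prototype keyword set has the highest Tversky index. The function names, the threshold, the alpha and beta weights, and the prototype sets are all illustrative assumptions.

```python
import math
from collections import Counter


def normal_cdf(z):
    """Standard normal CDF, the link function used in probit models."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))


def extract_keywords(tokens, threshold=0.5):
    """Assumed stand-in for the first hidden layer.

    Scores each term by passing its standardized frequency through the
    normal CDF and keeps terms whose score is at least `threshold`.
    """
    counts = Counter(tokens)
    if not counts:
        return set()
    freqs = list(counts.values())
    mean = sum(freqs) / len(freqs)
    variance = sum((f - mean) ** 2 for f in freqs) / len(freqs)
    std = math.sqrt(variance) or 1.0  # avoid division by zero
    return {
        term
        for term, freq in counts.items()
        if normal_cdf((freq - mean) / std) >= threshold
    }


def tversky(x, y, alpha=0.5, beta=0.5):
    """Tversky index: |X & Y| / (|X & Y| + alpha*|X - Y| + beta*|Y - X|)."""
    common = len(x & y)
    denom = common + alpha * len(x - y) + beta * len(y - x)
    return common / denom if denom else 0.0


def classify(keywords, prototypes):
    """Assumed stand-in for the second hidden layer.

    Rocchio-style decision: pick the class whose prototype keyword set
    is most similar to the document's keywords under the Tversky index.
    """
    return max(prototypes, key=lambda label: tversky(keywords, prototypes[label]))


# Hypothetical class prototypes and a toy document.
prototypes = {
    "criminal": {"court", "appeal", "sentence"},
    "contract": {"contract", "breach", "damages"},
}
tokens = "court appeal court judge appeal court contract".split()
keywords = extract_keywords(tokens)
label = classify(keywords, prototypes)
```

With alpha = beta = 0.5 the Tversky index reduces to the Dice coefficient; unequal weights make the similarity asymmetric, which lets the classifier penalize keywords missing from the document differently from keywords missing from the class prototype.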
The experimental evaluation uses accuracy, precision, recall, F-measure, and computational time as performance metrics, each measured against the number of legal documents collected from the dataset. The observed results confirm that the PRTIRCDNL technique performs better, achieving higher accuracy, precision, recall, and F-measure with minimum computation time.
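For reference, the four quality metrics named above can be computed from true and predicted labels as in the following sketch. It assumes the standard one-vs-rest definitions for a chosen positive class; the abstract does not state the exact formulas used in the evaluation, and the function name and arguments are illustrative.

```python
def classification_metrics(y_true, y_pred, positive):
    """Accuracy, precision, recall and F-measure for one class of interest."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(t == positive and p == positive for t, p in pairs)
    fp = sum(t != positive and p == positive for t, p in pairs)
    fn = sum(t == positive and p != positive for t, p in pairs)
    tn = sum(t != positive and p != positive for t, p in pairs)

    accuracy = (tp + tn) / len(pairs) if pairs else 0.0
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    # F-measure is the harmonic mean of precision and recall.
    f_measure = (
        2 * precision * recall / (precision + recall)
        if (precision + recall)
        else 0.0
    )
    return {
        "accuracy": accuracy,
        "precision": precision,
        "recall": recall,
        "f_measure": f_measure,
    }


# Toy labels for illustration only; these are not results from the paper.
metrics = classification_metrics(
    y_true=["a", "a", "b", "b"],
    y_pred=["a", "b", "b", "b"],
    positive="a",
)
```

Computation time can be measured separately by wrapping the classification call with `time.perf_counter()` readings.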
