Document (#37522)

Author
Chen, Y.-L.
Liu, Y.-H.
Ho, W.-L.
Title
¬A text mining approach to assist the general public in the retrieval of legal documents
Source
Journal of the American Society for Information Science and Technology. 64(2013) no.2, S.280-290
Year
2013
Abstract
Applying text mining techniques to legal issues has been an emerging research topic in recent years. Although some previous studies focused on assisting professionals in the retrieval of related legal documents, they did not take into account the general public and their difficulty in describing legal problems in professional legal terms. Because this problem has not been addressed by previous research, this study aims to design a text-mining-based method that allows the general public to use everyday vocabulary to search for and retrieve criminal judgments. The experimental results indicate that our method can help the general public, who are not familiar with professional legal terms, to acquire relevant criminal judgments more accurately and effectively.
Theme
Data Mining
Field
Rechtswissenschaft

Similar documents (author)

  1. Chen, Y.N.; Chen, S.J.: ¬A metadata practice of the OFLA FRBR model : a case study for the National Palace Museum in Taipai (2004) 4.34
    4.3394766 = sum of:
      4.3394766 = weight(author_txt:chen in 4384) [ClassicSimilarity], result of:
        4.3394766 = score(doc=4384,freq=2.0), product of:
          0.99999994 = queryWeight, product of:
            6.136947 = idf(docFreq=260, maxDocs=44421)
            0.16294746 = queryNorm
          4.339477 = fieldWeight in 4384, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            6.136947 = idf(docFreq=260, maxDocs=44421)
            0.5 = fieldNorm(doc=4384)
    
  2. Chen, C.C.; Chen, H.H.; Chen, K.H.: ¬The design of the XML/Metadata management system (2000) 3.99
    3.9860637 = sum of:
      3.9860637 = weight(author_txt:chen in 5633) [ClassicSimilarity], result of:
        3.9860637 = score(doc=5633,freq=3.0), product of:
          0.99999994 = queryWeight, product of:
            6.136947 = idf(docFreq=260, maxDocs=44421)
            0.16294746 = queryNorm
          3.986064 = fieldWeight in 5633, product of:
            1.7320508 = tf(freq=3.0), with freq of:
              3.0 = termFreq=3.0
            6.136947 = idf(docFreq=260, maxDocs=44421)
            0.375 = fieldNorm(doc=5633)
    
  3. Chen, W.Y.: Observations on cataloguing and classification (1991) 3.84
    3.8355918 = sum of:
      3.8355918 = weight(author_txt:chen in 4183) [ClassicSimilarity], result of:
        3.8355918 = score(doc=4183,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            6.136947 = idf(docFreq=260, maxDocs=44421)
            0.16294746 = queryNorm
          3.835592 = fieldWeight in 4183, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            6.136947 = idf(docFreq=260, maxDocs=44421)
            0.625 = fieldNorm(doc=4183)
    
  4. Chen, H.: Knowledge-based document retrieval : framework and design (1992) 3.84
    3.8355918 = sum of:
      3.8355918 = weight(author_txt:chen in 5282) [ClassicSimilarity], result of:
        3.8355918 = score(doc=5282,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            6.136947 = idf(docFreq=260, maxDocs=44421)
            0.16294746 = queryNorm
          3.835592 = fieldWeight in 5282, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            6.136947 = idf(docFreq=260, maxDocs=44421)
            0.625 = fieldNorm(doc=5282)
    
  5. Chen, P.S.: On inference rules of logic-based information retrieval systems (1994) 3.84
    3.8355918 = sum of:
      3.8355918 = weight(author_txt:chen in 6730) [ClassicSimilarity], result of:
        3.8355918 = score(doc=6730,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            6.136947 = idf(docFreq=260, maxDocs=44421)
            0.16294746 = queryNorm
          3.835592 = fieldWeight in 6730, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            6.136947 = idf(docFreq=260, maxDocs=44421)
            0.625 = fieldNorm(doc=6730)
    

Similar documents (content)

  1. Berry, M.W.; Esau, R.; Kiefer, B.: ¬The use of text mining techniques in electronic discovery for legal matters (2012) 0.23
    0.22995368 = sum of:
      0.22995368 = product of:
        0.82126313 = sum of:
          0.020485545 = weight(abstract_txt:retrieval in 1091) [ClassicSimilarity], result of:
            0.020485545 = score(doc=1091,freq=1.0), product of:
              0.062854156 = queryWeight, product of:
                1.1755714 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.015379518 = queryNorm
              0.3259219 = fieldWeight in 1091, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.09375 = fieldNorm(doc=1091)
          0.03255814 = weight(abstract_txt:been in 1091) [ClassicSimilarity], result of:
            0.03255814 = score(doc=1091,freq=2.0), product of:
              0.06794102 = queryWeight, product of:
                1.2222162 = boost
                3.614442 = idf(docFreq=3251, maxDocs=44421)
                0.015379518 = queryNorm
              0.4792118 = fieldWeight in 1091, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.614442 = idf(docFreq=3251, maxDocs=44421)
                0.09375 = fieldNorm(doc=1091)
          0.03417925 = weight(abstract_txt:documents in 1091) [ClassicSimilarity], result of:
            0.03417925 = score(doc=1091,freq=1.0), product of:
              0.08841868 = queryWeight, product of:
                1.394293 = boost
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.015379518 = queryNorm
              0.38656145 = fieldWeight in 1091, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.09375 = fieldNorm(doc=1091)
          0.04825481 = weight(abstract_txt:text in 1091) [ClassicSimilarity], result of:
            0.04825481 = score(doc=1091,freq=1.0), product of:
              0.12737763 = queryWeight, product of:
                2.0496242 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.015379518 = queryNorm
              0.3788327 = fieldWeight in 1091, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.09375 = fieldNorm(doc=1091)
          0.1588586 = weight(abstract_txt:judgments in 1091) [ClassicSimilarity], result of:
            0.1588586 = score(doc=1091,freq=1.0), product of:
              0.24624994 = queryWeight, product of:
                2.3268592 = boost
                6.881186 = idf(docFreq=123, maxDocs=44421)
                0.015379518 = queryNorm
              0.6451112 = fieldWeight in 1091, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.881186 = idf(docFreq=123, maxDocs=44421)
                0.09375 = fieldNorm(doc=1091)
          0.17194833 = weight(abstract_txt:mining in 1091) [ClassicSimilarity], result of:
            0.17194833 = score(doc=1091,freq=1.0), product of:
              0.29716527 = queryWeight, product of:
                3.1305935 = boost
                6.1720386 = idf(docFreq=251, maxDocs=44421)
                0.015379518 = queryNorm
              0.5786286 = fieldWeight in 1091, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1720386 = idf(docFreq=251, maxDocs=44421)
                0.09375 = fieldNorm(doc=1091)
          0.35497844 = weight(abstract_txt:legal in 1091) [ClassicSimilarity], result of:
            0.35497844 = score(doc=1091,freq=1.0), product of:
              0.6070308 = queryWeight, product of:
                6.327731 = boost
                6.2376356 = idf(docFreq=235, maxDocs=44421)
                0.015379518 = queryNorm
              0.5847783 = fieldWeight in 1091, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2376356 = idf(docFreq=235, maxDocs=44421)
                0.09375 = fieldNorm(doc=1091)
        0.28 = coord(7/25)
    
  2. Turle, H.: Text retrieval in the legal world (1995) 0.20
    0.19774358 = sum of:
      0.19774358 = product of:
        0.9887179 = sum of:
          0.03588281 = weight(abstract_txt:research in 4552) [ClassicSimilarity], result of:
            0.03588281 = score(doc=4552,freq=4.0), product of:
              0.0519169 = queryWeight, product of:
                1.0684062 = boost
                3.159582 = idf(docFreq=5124, maxDocs=44421)
                0.015379518 = queryNorm
              0.69115853 = fieldWeight in 4552, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.159582 = idf(docFreq=5124, maxDocs=44421)
                0.109375 = fieldNorm(doc=4552)
          0.04139567 = weight(abstract_txt:retrieval in 4552) [ClassicSimilarity], result of:
            0.04139567 = score(doc=4552,freq=3.0), product of:
              0.062854156 = queryWeight, product of:
                1.1755714 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.015379518 = queryNorm
              0.6585988 = fieldWeight in 4552, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.109375 = fieldNorm(doc=4552)
          0.026859095 = weight(abstract_txt:been in 4552) [ClassicSimilarity], result of:
            0.026859095 = score(doc=4552,freq=1.0), product of:
              0.06794102 = queryWeight, product of:
                1.2222162 = boost
                3.614442 = idf(docFreq=3251, maxDocs=44421)
                0.015379518 = queryNorm
              0.3953296 = fieldWeight in 4552, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.614442 = idf(docFreq=3251, maxDocs=44421)
                0.109375 = fieldNorm(doc=4552)
          0.05629728 = weight(abstract_txt:text in 4552) [ClassicSimilarity], result of:
            0.05629728 = score(doc=4552,freq=1.0), product of:
              0.12737763 = queryWeight, product of:
                2.0496242 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.015379518 = queryNorm
              0.44197148 = fieldWeight in 4552, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.109375 = fieldNorm(doc=4552)
          0.8282831 = weight(abstract_txt:legal in 4552) [ClassicSimilarity], result of:
            0.8282831 = score(doc=4552,freq=4.0), product of:
              0.6070308 = queryWeight, product of:
                6.327731 = boost
                6.2376356 = idf(docFreq=235, maxDocs=44421)
                0.015379518 = queryNorm
              1.3644828 = fieldWeight in 4552, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.2376356 = idf(docFreq=235, maxDocs=44421)
                0.109375 = fieldNorm(doc=4552)
        0.2 = coord(5/25)
    
  3. Cumyn, M.; Reiner, G.; Mas, S.; Lesieur, D.: Legal knowledge representation using a faceted scheme (2019) 0.16
    0.15931699 = sum of:
      0.15931699 = product of:
        0.99573123 = sum of:
          0.020504463 = weight(abstract_txt:research in 788) [ClassicSimilarity], result of:
            0.020504463 = score(doc=788,freq=1.0), product of:
              0.0519169 = queryWeight, product of:
                1.0684062 = boost
                3.159582 = idf(docFreq=5124, maxDocs=44421)
                0.015379518 = queryNorm
              0.39494774 = fieldWeight in 788, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.159582 = idf(docFreq=5124, maxDocs=44421)
                0.125 = fieldNorm(doc=788)
          0.06444901 = weight(abstract_txt:documents in 788) [ClassicSimilarity], result of:
            0.06444901 = score(doc=788,freq=2.0), product of:
              0.08841868 = queryWeight, product of:
                1.394293 = boost
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.015379518 = queryNorm
              0.7289072 = fieldWeight in 788, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.125 = fieldNorm(doc=788)
          0.09099014 = weight(abstract_txt:text in 788) [ClassicSimilarity], result of:
            0.09099014 = score(doc=788,freq=2.0), product of:
              0.12737763 = queryWeight, product of:
                2.0496242 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.015379518 = queryNorm
              0.7143338 = fieldWeight in 788, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.125 = fieldNorm(doc=788)
          0.8197876 = weight(abstract_txt:legal in 788) [ClassicSimilarity], result of:
            0.8197876 = score(doc=788,freq=3.0), product of:
              0.6070308 = queryWeight, product of:
                6.327731 = boost
                6.2376356 = idf(docFreq=235, maxDocs=44421)
                0.015379518 = queryNorm
              1.3504877 = fieldWeight in 788, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.2376356 = idf(docFreq=235, maxDocs=44421)
                0.125 = fieldNorm(doc=788)
        0.16 = coord(4/25)
    
  4. Russell-Rose, T.; Chamberlain, J.; Azzopardi, L.: Information retrieval in the workplace : a comparison of professional search practices (2018) 0.14
    0.14300495 = sum of:
      0.14300495 = product of:
        0.510732 = sum of:
          0.008970702 = weight(abstract_txt:research in 48) [ClassicSimilarity], result of:
            0.008970702 = score(doc=48,freq=1.0), product of:
              0.0519169 = queryWeight, product of:
                1.0684062 = boost
                3.159582 = idf(docFreq=5124, maxDocs=44421)
                0.015379518 = queryNorm
              0.17278963 = fieldWeight in 48, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.159582 = idf(docFreq=5124, maxDocs=44421)
                0.0546875 = fieldNorm(doc=48)
          0.0119499015 = weight(abstract_txt:retrieval in 48) [ClassicSimilarity], result of:
            0.0119499015 = score(doc=48,freq=1.0), product of:
              0.062854156 = queryWeight, product of:
                1.1755714 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.015379518 = queryNorm
              0.1901211 = fieldWeight in 48, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.0546875 = fieldNorm(doc=48)
          0.013429548 = weight(abstract_txt:been in 48) [ClassicSimilarity], result of:
            0.013429548 = score(doc=48,freq=1.0), product of:
              0.06794102 = queryWeight, product of:
                1.2222162 = boost
                3.614442 = idf(docFreq=3251, maxDocs=44421)
                0.015379518 = queryNorm
              0.1976648 = fieldWeight in 48, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.614442 = idf(docFreq=3251, maxDocs=44421)
                0.0546875 = fieldNorm(doc=48)
          0.019937897 = weight(abstract_txt:documents in 48) [ClassicSimilarity], result of:
            0.019937897 = score(doc=48,freq=1.0), product of:
              0.08841868 = queryWeight, product of:
                1.394293 = boost
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.015379518 = queryNorm
              0.22549418 = fieldWeight in 48, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.0546875 = fieldNorm(doc=48)
          0.040379245 = weight(abstract_txt:previous in 48) [ClassicSimilarity], result of:
            0.040379245 = score(doc=48,freq=1.0), product of:
              0.1415348 = queryWeight, product of:
                1.7640612 = boost
                5.216832 = idf(docFreq=654, maxDocs=44421)
                0.015379518 = queryNorm
              0.28529552 = fieldWeight in 48, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.216832 = idf(docFreq=654, maxDocs=44421)
                0.0546875 = fieldNorm(doc=48)
          0.05740762 = weight(abstract_txt:professional in 48) [ClassicSimilarity], result of:
            0.05740762 = score(doc=48,freq=2.0), product of:
              0.1420346 = queryWeight, product of:
                1.7671732 = boost
                5.226035 = idf(docFreq=648, maxDocs=44421)
                0.015379518 = queryNorm
              0.40418053 = fieldWeight in 48, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.226035 = idf(docFreq=648, maxDocs=44421)
                0.0546875 = fieldNorm(doc=48)
          0.3586571 = weight(abstract_txt:legal in 48) [ClassicSimilarity], result of:
            0.3586571 = score(doc=48,freq=3.0), product of:
              0.6070308 = queryWeight, product of:
                6.327731 = boost
                6.2376356 = idf(docFreq=235, maxDocs=44421)
                0.015379518 = queryNorm
              0.5908384 = fieldWeight in 48, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.2376356 = idf(docFreq=235, maxDocs=44421)
                0.0546875 = fieldNorm(doc=48)
        0.28 = coord(7/25)
    
  5. Moens, M.-F.; Uyttendaele, C.: Automatic text structuring and categorization as a first step in summarizing legal cases (1997) 0.14
    0.13898453 = sum of:
      0.13898453 = product of:
        0.86865336 = sum of:
          0.11373768 = weight(abstract_txt:text in 3256) [ClassicSimilarity], result of:
            0.11373768 = score(doc=3256,freq=8.0), product of:
              0.12737763 = queryWeight, product of:
                2.0496242 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.015379518 = queryNorm
              0.8929172 = fieldWeight in 3256, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.078125 = fieldNorm(doc=3256)
          0.066053346 = weight(abstract_txt:general in 3256) [ClassicSimilarity], result of:
            0.066053346 = score(doc=3256,freq=1.0), product of:
              0.19517747 = queryWeight, product of:
                2.9296238 = boost
                4.3318667 = idf(docFreq=1586, maxDocs=44421)
                0.015379518 = queryNorm
              0.3384271 = fieldWeight in 3256, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3318667 = idf(docFreq=1586, maxDocs=44421)
                0.078125 = fieldNorm(doc=3256)
          0.3930469 = weight(abstract_txt:criminal in 3256) [ClassicSimilarity], result of:
            0.3930469 = score(doc=3256,freq=2.0), product of:
              0.40374708 = queryWeight, product of:
                2.9794543 = boost
                8.811096 = idf(docFreq=17, maxDocs=44421)
                0.015379518 = queryNorm
              0.97349775 = fieldWeight in 3256, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.811096 = idf(docFreq=17, maxDocs=44421)
                0.078125 = fieldNorm(doc=3256)
          0.2958154 = weight(abstract_txt:legal in 3256) [ClassicSimilarity], result of:
            0.2958154 = score(doc=3256,freq=1.0), product of:
              0.6070308 = queryWeight, product of:
                6.327731 = boost
                6.2376356 = idf(docFreq=235, maxDocs=44421)
                0.015379518 = queryNorm
              0.4873153 = fieldWeight in 3256, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2376356 = idf(docFreq=235, maxDocs=44421)
                0.078125 = fieldNorm(doc=3256)
        0.16 = coord(4/25)