Document (#41094)

Author
Mai, F.
Galke, L.
Scherp, A.
Title
Using deep learning for title-based semantic subject indexing to reach competitive performance to full-text
Source
https://arxiv.org/abs/1801.06717
Year
2018
Abstract
For (semi-)automated subject indexing systems in digital libraries, it is often more practical to use metadata such as the title of a publication instead of the full-text or the abstract. Therefore, it is desirable to have text mining and text classification algorithms that already perform well on the title of a publication alone. So far, the classification performance on titles is not competitive with the performance on full-texts when the same number of training samples is used. However, it is much easier to obtain title data in large quantities and to use it for training than full-text data. In this paper, we investigate how models trained on increasing amounts of title data compare to models trained on a constant number of full-texts. We evaluate this question on two large-scale datasets, one from the medical domain (PubMed) and one from economics (EconBiz). In these datasets, the titles and annotations of millions of publications are available, and they outnumber the available full-texts by a factor of 20 and 15, respectively. To exploit these large amounts of data to their full potential, we develop three strong deep learning classifiers and evaluate their performance on the two datasets. The results are promising. On the EconBiz dataset, all three classifiers outperform their full-text counterparts by a large margin. The best title-based classifier outperforms the best full-text method by 9.9%. On the PubMed dataset, the best title-based method almost reaches the performance of the best full-text classifier, with a difference of only 2.9%.

Similar documents (content)

  1. Mens, G. Le; Kovács, B.; Hannan, M.T.; Pros, G.: Uncovering the semantics of concepts using GPT-4 (2023) 0.28
    0.27510345 = sum of:
      0.27510345 = product of:
        0.6877586 = sum of:
          0.015745047 = weight(abstract_txt:based in 2305) [ClassicSimilarity], result of:
            0.015745047 = score(doc=2305,freq=3.0), product of:
              0.052221388 = queryWeight, product of:
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.016405955 = queryNorm
              0.30150574 = fieldWeight in 2305, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2305)
          0.028234057 = weight(abstract_txt:evaluate in 2305) [ClassicSimilarity], result of:
            0.028234057 = score(doc=2305,freq=1.0), product of:
              0.097113125 = queryWeight, product of:
                1.1134459 = boost
                5.316273 = idf(docFreq=592, maxDocs=44421)
                0.016405955 = queryNorm
              0.2907337 = fieldWeight in 2305, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.316273 = idf(docFreq=592, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2305)
          0.05276092 = weight(abstract_txt:datasets in 2305) [ClassicSimilarity], result of:
            0.05276092 = score(doc=2305,freq=1.0), product of:
              0.14733434 = queryWeight, product of:
                1.3714569 = boost
                6.548176 = idf(docFreq=172, maxDocs=44421)
                0.016405955 = queryNorm
              0.35810336 = fieldWeight in 2305, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.548176 = idf(docFreq=172, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2305)
          0.024041578 = weight(abstract_txt:data in 2305) [ClassicSimilarity], result of:
            0.024041578 = score(doc=2305,freq=3.0), product of:
              0.07621503 = queryWeight, product of:
                1.394972 = boost
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.016405955 = queryNorm
              0.31544405 = fieldWeight in 2305, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2305)
          0.050506175 = weight(abstract_txt:texts in 2305) [ClassicSimilarity], result of:
            0.050506175 = score(doc=2305,freq=1.0), product of:
              0.16381581 = queryWeight, product of:
                1.7711433 = boost
                5.6376824 = idf(docFreq=429, maxDocs=44421)
                0.016405955 = queryNorm
              0.30831075 = fieldWeight in 2305, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6376824 = idf(docFreq=429, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2305)
          0.057066433 = weight(abstract_txt:large in 2305) [ClassicSimilarity], result of:
            0.057066433 = score(doc=2305,freq=3.0), product of:
              0.13561857 = queryWeight, product of:
                1.8608216 = boost
                4.4423513 = idf(docFreq=1420, maxDocs=44421)
                0.016405955 = queryNorm
              0.4207863 = fieldWeight in 2305, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.4423513 = idf(docFreq=1420, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2305)
          0.11834128 = weight(abstract_txt:dataset in 2305) [ClassicSimilarity], result of:
            0.11834128 = score(doc=2305,freq=2.0), product of:
              0.22937196 = queryWeight, product of:
                2.0957813 = boost
                6.6710296 = idf(docFreq=152, maxDocs=44421)
                0.016405955 = queryNorm
              0.51593614 = fieldWeight in 2305, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.6710296 = idf(docFreq=152, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2305)
          0.08022506 = weight(abstract_txt:performance in 2305) [ClassicSimilarity], result of:
            0.08022506 = score(doc=2305,freq=3.0), product of:
              0.18333358 = queryWeight, product of:
                2.4189181 = boost
                4.619759 = idf(docFreq=1189, maxDocs=44421)
                0.016405955 = queryNorm
              0.43759066 = fieldWeight in 2305, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.619759 = idf(docFreq=1189, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2305)
          0.14993979 = weight(abstract_txt:training in 2305) [ClassicSimilarity], result of:
            0.14993979 = score(doc=2305,freq=4.0), product of:
              0.268572 = queryWeight, product of:
                3.2071638 = boost
                5.104322 = idf(docFreq=732, maxDocs=44421)
                0.016405955 = queryNorm
              0.55828524 = fieldWeight in 2305, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.104322 = idf(docFreq=732, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2305)
          0.110898316 = weight(abstract_txt:text in 2305) [ClassicSimilarity], result of:
            0.110898316 = score(doc=2305,freq=5.0), product of:
              0.2244273 = queryWeight, product of:
                3.3853066 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.016405955 = queryNorm
              0.49413916 = fieldWeight in 2305, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2305)
        0.4 = coord(10/25)
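
The arithmetic in the score breakdown above can be reproduced with a short sketch. It assumes the usual ClassicSimilarity layout visible in the tree: tf is the square root of the term frequency, queryWeight is boost × idf × queryNorm, fieldWeight is tf × idf × fieldNorm, and the summed term scores are scaled by the coord factor. The function names `term_score` and `document_score` are illustrative only and are not part of any library; the numeric inputs are copied from the entries for document 2305.

```python
import math

def term_score(freq, idf, field_norm, query_norm, boost=1.0):
    """Score of one query term, mirroring the structure of the explain tree."""
    tf = math.sqrt(freq)                     # "tf(freq=...)" line
    query_weight = boost * idf * query_norm  # "queryWeight" line
    field_weight = tf * idf * field_norm     # "fieldWeight" line
    return query_weight * field_weight

def document_score(term_sum, matched, total):
    """Scale the summed term scores by coord(matched/total)."""
    return term_sum * (matched / total)

QUERY_NORM = 0.016405955  # queryNorm, identical for every term of the query
FIELD_NORM = 0.0546875    # fieldNorm(doc=2305)

# Term "based": no boost line appears in the tree, so boost defaults to 1.0.
based = term_score(3.0, 3.1830752, FIELD_NORM, QUERY_NORM)

# Term "evaluate": the tree lists a boost of 1.1134459.
evaluate = term_score(1.0, 5.316273, FIELD_NORM, QUERY_NORM, boost=1.1134459)

# Final score: sum of the ten matching term scores times coord(10/25).
final = document_score(0.6877586, 10, 25)

print(based, evaluate, final)
```

Up to rounding of the single-precision values shown in the tree, the results agree with the listed 0.015745047, 0.028234057 and 0.27510345.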
    
  2. Gauch, S.; Chandramouli, A.; Ranganathan, S.: Training a hierarchical classifier using inter document relationships (2009) 0.26
    0.2585845 = sum of:
      0.2585845 = product of:
        0.80807656 = sum of:
          0.040334366 = weight(abstract_txt:evaluate in 3697) [ClassicSimilarity], result of:
            0.040334366 = score(doc=3697,freq=1.0), product of:
              0.097113125 = queryWeight, product of:
                1.1134459 = boost
                5.316273 = idf(docFreq=592, maxDocs=44421)
                0.016405955 = queryNorm
              0.41533384 = fieldWeight in 3697, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.316273 = idf(docFreq=592, maxDocs=44421)
                0.078125 = fieldNorm(doc=3697)
          0.01595298 = weight(abstract_txt:from in 3697) [ClassicSimilarity], result of:
            0.01595298 = score(doc=3697,freq=2.0), product of:
              0.05232658 = queryWeight, product of:
                1.1558629 = boost
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.016405955 = queryNorm
              0.30487338 = fieldWeight in 3697, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.078125 = fieldNorm(doc=3697)
          0.028042665 = weight(abstract_txt:data in 3697) [ClassicSimilarity], result of:
            0.028042665 = score(doc=3697,freq=2.0), product of:
              0.07621503 = queryWeight, product of:
                1.394972 = boost
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.016405955 = queryNorm
              0.3679414 = fieldWeight in 3697, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.078125 = fieldNorm(doc=3697)
          0.14449897 = weight(abstract_txt:classifier in 3697) [ClassicSimilarity], result of:
            0.14449897 = score(doc=3697,freq=2.0), product of:
              0.18046553 = queryWeight, product of:
                1.5178446 = boost
                7.2471204 = idf(docFreq=85, maxDocs=44421)
                0.016405955 = queryNorm
              0.80070126 = fieldWeight in 3697, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.2471204 = idf(docFreq=85, maxDocs=44421)
                0.078125 = fieldNorm(doc=3697)
          0.19828676 = weight(abstract_txt:classifiers in 3697) [ClassicSimilarity], result of:
            0.19828676 = score(doc=3697,freq=3.0), product of:
              0.19467781 = queryWeight, product of:
                1.5764798 = boost
                7.5270805 = idf(docFreq=64, maxDocs=44421)
                0.016405955 = queryNorm
              1.018538 = fieldWeight in 3697, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.5270805 = idf(docFreq=64, maxDocs=44421)
                0.078125 = fieldNorm(doc=3697)
          0.06656364 = weight(abstract_txt:large in 3697) [ClassicSimilarity], result of:
            0.06656364 = score(doc=3697,freq=2.0), product of:
              0.13561857 = queryWeight, product of:
                1.8608216 = boost
                4.4423513 = idf(docFreq=1420, maxDocs=44421)
                0.016405955 = queryNorm
              0.4908151 = fieldWeight in 3697, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.4423513 = idf(docFreq=1420, maxDocs=44421)
                0.078125 = fieldNorm(doc=3697)
          0.21419969 = weight(abstract_txt:training in 3697) [ClassicSimilarity], result of:
            0.21419969 = score(doc=3697,freq=4.0), product of:
              0.268572 = queryWeight, product of:
                3.2071638 = boost
                5.104322 = idf(docFreq=732, maxDocs=44421)
                0.016405955 = queryNorm
              0.7975503 = fieldWeight in 3697, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.104322 = idf(docFreq=732, maxDocs=44421)
                0.078125 = fieldNorm(doc=3697)
          0.1001975 = weight(abstract_txt:text in 3697) [ClassicSimilarity], result of:
            0.1001975 = score(doc=3697,freq=2.0), product of:
              0.2244273 = queryWeight, product of:
                3.3853066 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.016405955 = queryNorm
              0.4464586 = fieldWeight in 3697, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.078125 = fieldNorm(doc=3697)
        0.32 = coord(8/25)
    
  3. Qu, B.; Cong, G.; Li, C.; Sun, A.; Chen, H.: An evaluation of classification models for question topic categorization (2012) 0.25
    0.25071403 = sum of:
      0.25071403 = product of:
        0.6267851 = sum of:
          0.060451124 = weight(abstract_txt:question in 1237) [ClassicSimilarity], result of:
            0.060451124 = score(doc=1237,freq=4.0), product of:
              0.09297168 = queryWeight, product of:
                1.0894455 = boost
                5.2016807 = idf(docFreq=664, maxDocs=44421)
                0.016405955 = queryNorm
              0.6502101 = fieldWeight in 1237, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.2016807 = idf(docFreq=664, maxDocs=44421)
                0.0625 = fieldNorm(doc=1237)
          0.032267492 = weight(abstract_txt:evaluate in 1237) [ClassicSimilarity], result of:
            0.032267492 = score(doc=1237,freq=1.0), product of:
              0.097113125 = queryWeight, product of:
                1.1134459 = boost
                5.316273 = idf(docFreq=592, maxDocs=44421)
                0.016405955 = queryNorm
              0.33226708 = fieldWeight in 1237, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.316273 = idf(docFreq=592, maxDocs=44421)
                0.0625 = fieldNorm(doc=1237)
          0.0127623845 = weight(abstract_txt:from in 1237) [ClassicSimilarity], result of:
            0.0127623845 = score(doc=1237,freq=2.0), product of:
              0.05232658 = queryWeight, product of:
                1.1558629 = boost
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.016405955 = queryNorm
              0.2438987 = fieldWeight in 1237, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.0625 = fieldNorm(doc=1237)
          0.022434132 = weight(abstract_txt:data in 1237) [ClassicSimilarity], result of:
            0.022434132 = score(doc=1237,freq=2.0), product of:
              0.07621503 = queryWeight, product of:
                1.394972 = boost
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.016405955 = queryNorm
              0.29435313 = fieldWeight in 1237, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.0625 = fieldNorm(doc=1237)
          0.057721347 = weight(abstract_txt:texts in 1237) [ClassicSimilarity], result of:
            0.057721347 = score(doc=1237,freq=1.0), product of:
              0.16381581 = queryWeight, product of:
                1.7711433 = boost
                5.6376824 = idf(docFreq=429, maxDocs=44421)
                0.016405955 = queryNorm
              0.35235515 = fieldWeight in 1237, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6376824 = idf(docFreq=429, maxDocs=44421)
                0.0625 = fieldNorm(doc=1237)
          0.037654083 = weight(abstract_txt:large in 1237) [ClassicSimilarity], result of:
            0.037654083 = score(doc=1237,freq=1.0), product of:
              0.13561857 = queryWeight, product of:
                1.8608216 = boost
                4.4423513 = idf(docFreq=1420, maxDocs=44421)
                0.016405955 = queryNorm
              0.27764696 = fieldWeight in 1237, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4423513 = idf(docFreq=1420, maxDocs=44421)
                0.0625 = fieldNorm(doc=1237)
          0.13524717 = weight(abstract_txt:dataset in 1237) [ClassicSimilarity], result of:
            0.13524717 = score(doc=1237,freq=2.0), product of:
              0.22937196 = queryWeight, product of:
                2.0957813 = boost
                6.6710296 = idf(docFreq=152, maxDocs=44421)
                0.016405955 = queryNorm
              0.5896413 = fieldWeight in 1237, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.6710296 = idf(docFreq=152, maxDocs=44421)
                0.0625 = fieldNorm(doc=1237)
          0.07669785 = weight(abstract_txt:best in 1237) [ClassicSimilarity], result of:
            0.07669785 = score(doc=1237,freq=2.0), product of:
              0.17296433 = queryWeight, product of:
                2.1014712 = boost
                5.0168557 = idf(docFreq=799, maxDocs=44421)
                0.016405955 = queryNorm
              0.4434316 = fieldWeight in 1237, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.0168557 = idf(docFreq=799, maxDocs=44421)
                0.0625 = fieldNorm(doc=1237)
          0.10586962 = weight(abstract_txt:performance in 1237) [ClassicSimilarity], result of:
            0.10586962 = score(doc=1237,freq=4.0), product of:
              0.18333358 = queryWeight, product of:
                2.4189181 = boost
                4.619759 = idf(docFreq=1189, maxDocs=44421)
                0.016405955 = queryNorm
              0.5774699 = fieldWeight in 1237, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.619759 = idf(docFreq=1189, maxDocs=44421)
                0.0625 = fieldNorm(doc=1237)
          0.085679874 = weight(abstract_txt:training in 1237) [ClassicSimilarity], result of:
            0.085679874 = score(doc=1237,freq=1.0), product of:
              0.268572 = queryWeight, product of:
                3.2071638 = boost
                5.104322 = idf(docFreq=732, maxDocs=44421)
                0.016405955 = queryNorm
              0.31902012 = fieldWeight in 1237, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.104322 = idf(docFreq=732, maxDocs=44421)
                0.0625 = fieldNorm(doc=1237)
        0.4 = coord(10/25)
    
  4. Hung, C.-M.; Chien, L.-F.: Web-based text classification in the absence of manually labeled training documents (2007) 0.20
    0.2037459 = sum of:
      0.2037459 = product of:
        0.63670594 = sum of:
          0.040334366 = weight(abstract_txt:evaluate in 1087) [ClassicSimilarity], result of:
            0.040334366 = score(doc=1087,freq=1.0), product of:
              0.097113125 = queryWeight, product of:
                1.1134459 = boost
                5.316273 = idf(docFreq=592, maxDocs=44421)
                0.016405955 = queryNorm
              0.41533384 = fieldWeight in 1087, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.316273 = idf(docFreq=592, maxDocs=44421)
                0.078125 = fieldNorm(doc=1087)
          0.019538332 = weight(abstract_txt:from in 1087) [ClassicSimilarity], result of:
            0.019538332 = score(doc=1087,freq=3.0), product of:
              0.05232658 = queryWeight, product of:
                1.1558629 = boost
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.016405955 = queryNorm
              0.3733921 = fieldWeight in 1087, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.078125 = fieldNorm(doc=1087)
          0.019829158 = weight(abstract_txt:data in 1087) [ClassicSimilarity], result of:
            0.019829158 = score(doc=1087,freq=1.0), product of:
              0.07621503 = queryWeight, product of:
                1.394972 = boost
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.016405955 = queryNorm
              0.26017386 = fieldWeight in 1087, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.078125 = fieldNorm(doc=1087)
          0.10217621 = weight(abstract_txt:classifier in 1087) [ClassicSimilarity], result of:
            0.10217621 = score(doc=1087,freq=1.0), product of:
              0.18046553 = queryWeight, product of:
                1.5178446 = boost
                7.2471204 = idf(docFreq=85, maxDocs=44421)
                0.016405955 = queryNorm
              0.5661813 = fieldWeight in 1087, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.2471204 = idf(docFreq=85, maxDocs=44421)
                0.078125 = fieldNorm(doc=1087)
          0.114480905 = weight(abstract_txt:classifiers in 1087) [ClassicSimilarity], result of:
            0.114480905 = score(doc=1087,freq=1.0), product of:
              0.19467781 = queryWeight, product of:
                1.5764798 = boost
                7.5270805 = idf(docFreq=64, maxDocs=44421)
                0.016405955 = queryNorm
              0.58805317 = fieldWeight in 1087, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5270805 = idf(docFreq=64, maxDocs=44421)
                0.078125 = fieldNorm(doc=1087)
          0.06616851 = weight(abstract_txt:performance in 1087) [ClassicSimilarity], result of:
            0.06616851 = score(doc=1087,freq=1.0), product of:
              0.18333358 = queryWeight, product of:
                2.4189181 = boost
                4.619759 = idf(docFreq=1189, maxDocs=44421)
                0.016405955 = queryNorm
              0.36091867 = fieldWeight in 1087, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.619759 = idf(docFreq=1189, maxDocs=44421)
                0.078125 = fieldNorm(doc=1087)
          0.15146205 = weight(abstract_txt:training in 1087) [ClassicSimilarity], result of:
            0.15146205 = score(doc=1087,freq=2.0), product of:
              0.268572 = queryWeight, product of:
                3.2071638 = boost
                5.104322 = idf(docFreq=732, maxDocs=44421)
                0.016405955 = queryNorm
              0.5639532 = fieldWeight in 1087, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.104322 = idf(docFreq=732, maxDocs=44421)
                0.078125 = fieldNorm(doc=1087)
          0.12271637 = weight(abstract_txt:text in 1087) [ClassicSimilarity], result of:
            0.12271637 = score(doc=1087,freq=3.0), product of:
              0.2244273 = queryWeight, product of:
                3.3853066 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.016405955 = queryNorm
              0.5467979 = fieldWeight in 1087, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.078125 = fieldNorm(doc=1087)
        0.32 = coord(8/25)
    
  5. Levin, M.; Krawczyk, S.; Bethard, S.; Jurafsky, D.: Citation-based bootstrapping for large-scale author disambiguation (2012) 0.19
    0.19067447 = sum of:
      0.19067447 = product of:
        0.59585774 = sum of:
          0.010389037 = weight(abstract_txt:based in 1246) [ClassicSimilarity], result of:
            0.010389037 = score(doc=1246,freq=1.0), product of:
              0.052221388 = queryWeight, product of:
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.016405955 = queryNorm
              0.1989422 = fieldWeight in 1246, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.0625 = fieldNorm(doc=1246)
          0.015630664 = weight(abstract_txt:from in 1246) [ClassicSimilarity], result of:
            0.015630664 = score(doc=1246,freq=3.0), product of:
              0.05232658 = queryWeight, product of:
                1.1558629 = boost
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.016405955 = queryNorm
              0.29871368 = fieldWeight in 1246, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.0625 = fieldNorm(doc=1246)
          0.08174097 = weight(abstract_txt:classifier in 1246) [ClassicSimilarity], result of:
            0.08174097 = score(doc=1246,freq=1.0), product of:
              0.18046553 = queryWeight, product of:
                1.5178446 = boost
                7.2471204 = idf(docFreq=85, maxDocs=44421)
                0.016405955 = queryNorm
              0.45294502 = fieldWeight in 1246, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.2471204 = idf(docFreq=85, maxDocs=44421)
                0.0625 = fieldNorm(doc=1246)
          0.037654083 = weight(abstract_txt:large in 1246) [ClassicSimilarity], result of:
            0.037654083 = score(doc=1246,freq=1.0), product of:
              0.13561857 = queryWeight, product of:
                1.8608216 = boost
                4.4423513 = idf(docFreq=1420, maxDocs=44421)
                0.016405955 = queryNorm
              0.27764696 = fieldWeight in 1246, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4423513 = idf(docFreq=1420, maxDocs=44421)
                0.0625 = fieldNorm(doc=1246)
          0.0956342 = weight(abstract_txt:dataset in 1246) [ClassicSimilarity], result of:
            0.0956342 = score(doc=1246,freq=1.0), product of:
              0.22937196 = queryWeight, product of:
                2.0957813 = boost
                6.6710296 = idf(docFreq=152, maxDocs=44421)
                0.016405955 = queryNorm
              0.41693935 = fieldWeight in 1246, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.6710296 = idf(docFreq=152, maxDocs=44421)
                0.0625 = fieldNorm(doc=1246)
          0.085679874 = weight(abstract_txt:training in 1246) [ClassicSimilarity], result of:
            0.085679874 = score(doc=1246,freq=1.0), product of:
              0.268572 = queryWeight, product of:
                3.2071638 = boost
                5.104322 = idf(docFreq=732, maxDocs=44421)
                0.016405955 = queryNorm
              0.31902012 = fieldWeight in 1246, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.104322 = idf(docFreq=732, maxDocs=44421)
                0.0625 = fieldNorm(doc=1246)
          0.14085993 = weight(abstract_txt:title in 1246) [ClassicSimilarity], result of:
            0.14085993 = score(doc=1246,freq=1.0), product of:
              0.39383602 = queryWeight, product of:
                4.194903 = boost
                5.722582 = idf(docFreq=394, maxDocs=44421)
                0.016405955 = queryNorm
              0.35766137 = fieldWeight in 1246, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.722582 = idf(docFreq=394, maxDocs=44421)
                0.0625 = fieldNorm(doc=1246)
          0.12826899 = weight(abstract_txt:full in 1246) [ClassicSimilarity], result of:
            0.12826899 = score(doc=1246,freq=1.0), product of:
              0.41671476 = queryWeight, product of:
                5.157445 = boost
                4.9249606 = idf(docFreq=876, maxDocs=44421)
                0.016405955 = queryNorm
              0.30781004 = fieldWeight in 1246, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.9249606 = idf(docFreq=876, maxDocs=44421)
                0.0625 = fieldNorm(doc=1246)
        0.32 = coord(8/25)
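
The idf values that recur throughout the breakdowns above can be checked from the docFreq and maxDocs figures printed next to them. The sketch below assumes the older ClassicSimilarity form idf = 1 + ln(maxDocs / (docFreq + 1)), which reproduces the listed values; newer Lucene releases use a slightly different numerator, so treat this as an approximation rather than the exact implementation behind this index. The function name `classic_idf` is illustrative.

```python
import math

def classic_idf(doc_freq, max_docs):
    """Inverse document frequency, assuming idf = 1 + ln(maxDocs / (docFreq + 1))."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))

MAX_DOCS = 44421  # maxDocs as shown in every idf line

idf_based = classic_idf(5005, MAX_DOCS)  # term "based"
idf_title = classic_idf(394, MAX_DOCS)   # term "title"
idf_full = classic_idf(876, MAX_DOCS)    # term "full"

print(idf_based, idf_title, idf_full)
```

The rarer a term is in the collection (lower docFreq), the larger its idf: "title" occurs in 394 of 44421 documents and therefore weighs more than "based", which occurs in 5005.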