Document (#28238)

Author
Wu, T.
Pottenger, W.M.
Title
¬A semi-supervised active learning algorithm for information extraction from textual data
Source
Journal of the American Society for Information Science and Technology. 56(2005) no.3, S.258-271
Year
2005
Abstract
In this article we present a semi-supervised active learning algorithm for pattern discovery in information extraction from textual data. The patterns are reduced regular expressions composed of various characteristics of features useful in information extraction. Our major contribution is a semi-supervised learning algorithm that extracts information from a set of examples labeled as relevant or irrelevant to a given attribute. The approach is semi-supervised because it does not require precise labeling of the exact location of features in the training data. This significantly reduces the effort needed to develop a training set. An active learning algorithm is used to assist the semi-supervised learning algorithm to further reduce the training set development effort. The active learning algorithm is seeded with a Single positive example of a given attribute. The context of the seed is used to automatically identify candidates for additional positive examples of the given attribute. Candidate examples are manually pruned during the active learning phase, and our semi-supervised learning algorithm automatically discovers reduced regular expressions for each attribute. We have successfully applied this learning technique in the extraction of textual features from police incident reports, university crime reports, and patents. The performance of our algorithm compares favorably with competitive extraction systems being used in criminal justice information systems.
Footnote
Beitrag in einem Themenheft zu: 'Intelligence and security informatics'
Theme
Data Mining

Similar documents (content)

  1. Levin, M.; Krawczyk, S.; Bethard, S.; Jurafsky, D.: Citation-based bootstrapping for large-scale author disambiguation (2012) 0.35
    0.35004652 = sum of:
      0.35004652 = product of:
        0.9723514 = sum of:
          0.014204536 = weight(abstract_txt:used in 1246) [ClassicSimilarity], result of:
            0.014204536 = score(doc=1246,freq=2.0), product of:
              0.04786905 = queryWeight, product of:
                1.1023455 = boost
                3.3572001 = idf(docFreq=4205, maxDocs=44421)
                0.012934804 = queryNorm
              0.29673737 = fieldWeight in 1246, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.3572001 = idf(docFreq=4205, maxDocs=44421)
                0.0625 = fieldNorm(doc=1246)
          0.012880218 = weight(abstract_txt:from in 1246) [ClassicSimilarity], result of:
            0.012880218 = score(doc=1246,freq=3.0), product of:
              0.043118943 = queryWeight, product of:
                1.2080746 = boost
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.012934804 = queryNorm
              0.29871368 = fieldWeight in 1246, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.0625 = fieldNorm(doc=1246)
          0.036221832 = weight(abstract_txt:positive in 1246) [ClassicSimilarity], result of:
            0.036221832 = score(doc=1246,freq=1.0), product of:
              0.098339945 = queryWeight, product of:
                1.2900593 = boost
                5.8933253 = idf(docFreq=332, maxDocs=44421)
                0.012934804 = queryNorm
              0.36833283 = fieldWeight in 1246, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8933253 = idf(docFreq=332, maxDocs=44421)
                0.0625 = fieldNorm(doc=1246)
          0.055566285 = weight(abstract_txt:features in 1246) [ClassicSimilarity], result of:
            0.055566285 = score(doc=1246,freq=5.0), product of:
              0.087565094 = queryWeight, product of:
                1.4909252 = boost
                4.5406218 = idf(docFreq=1287, maxDocs=44421)
                0.012934804 = queryNorm
              0.6345712 = fieldWeight in 1246, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.5406218 = idf(docFreq=1287, maxDocs=44421)
                0.0625 = fieldNorm(doc=1246)
          0.032952875 = weight(abstract_txt:examples in 1246) [ClassicSimilarity], result of:
            0.032952875 = score(doc=1246,freq=1.0), product of:
              0.10569205 = queryWeight, product of:
                1.637991 = boost
                4.9885116 = idf(docFreq=822, maxDocs=44421)
                0.012934804 = queryNorm
              0.31178197 = fieldWeight in 1246, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.9885116 = idf(docFreq=822, maxDocs=44421)
                0.0625 = fieldNorm(doc=1246)
          0.03530161 = weight(abstract_txt:training in 1246) [ClassicSimilarity], result of:
            0.03530161 = score(doc=1246,freq=1.0), product of:
              0.11065637 = queryWeight, product of:
                1.6760175 = boost
                5.104322 = idf(docFreq=732, maxDocs=44421)
                0.012934804 = queryNorm
              0.31902012 = fieldWeight in 1246, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.104322 = idf(docFreq=732, maxDocs=44421)
                0.0625 = fieldNorm(doc=1246)
          0.10503611 = weight(abstract_txt:extraction in 1246) [ClassicSimilarity], result of:
            0.10503611 = score(doc=1246,freq=1.0), product of:
              0.27140766 = queryWeight, product of:
                3.3886425 = boost
                6.192079 = idf(docFreq=246, maxDocs=44421)
                0.012934804 = queryNorm
              0.38700494 = fieldWeight in 1246, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.192079 = idf(docFreq=246, maxDocs=44421)
                0.0625 = fieldNorm(doc=1246)
          0.49430707 = weight(abstract_txt:supervised in 1246) [ClassicSimilarity], result of:
            0.49430707 = score(doc=1246,freq=5.0), product of:
              0.47365776 = queryWeight, product of:
                4.9038553 = boost
                7.467361 = idf(docFreq=68, maxDocs=44421)
                0.012934804 = queryNorm
              1.0435954 = fieldWeight in 1246, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                7.467361 = idf(docFreq=68, maxDocs=44421)
                0.0625 = fieldNorm(doc=1246)
          0.18588084 = weight(abstract_txt:algorithm in 1246) [ClassicSimilarity], result of:
            0.18588084 = score(doc=1246,freq=2.0), product of:
              0.3686233 = queryWeight, product of:
                4.995352 = boost
                5.7050157 = idf(docFreq=401, maxDocs=44421)
                0.012934804 = queryNorm
              0.5042569 = fieldWeight in 1246, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.7050157 = idf(docFreq=401, maxDocs=44421)
                0.0625 = fieldNorm(doc=1246)
        0.36 = coord(9/25)
    
  2. Billal, B.; Fonseca, A.; Sadat, F.; Lounis, H.: Semi-supervised learning and social media text analysis towards multi-labeling categorization (2017) 0.33
    0.32530674 = sum of:
      0.32530674 = product of:
        1.0165836 = sum of:
          0.021012852 = weight(abstract_txt:data in 95) [ClassicSimilarity], result of:
            0.021012852 = score(doc=95,freq=6.0), product of:
              0.0471029 = queryWeight, product of:
                1.0934883 = boost
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.012934804 = queryNorm
              0.44610527 = fieldWeight in 95, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.0546875 = fieldNorm(doc=95)
          0.008788608 = weight(abstract_txt:used in 95) [ClassicSimilarity], result of:
            0.008788608 = score(doc=95,freq=1.0), product of:
              0.04786905 = queryWeight, product of:
                1.1023455 = boost
                3.3572001 = idf(docFreq=4205, maxDocs=44421)
                0.012934804 = queryNorm
              0.18359688 = fieldWeight in 95, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3572001 = idf(docFreq=4205, maxDocs=44421)
                0.0546875 = fieldNorm(doc=95)
          0.0065068477 = weight(abstract_txt:from in 95) [ClassicSimilarity], result of:
            0.0065068477 = score(doc=95,freq=1.0), product of:
              0.043118943 = queryWeight, product of:
                1.2080746 = boost
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.012934804 = queryNorm
              0.15090463 = fieldWeight in 95, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.0546875 = fieldNorm(doc=95)
          0.03088891 = weight(abstract_txt:training in 95) [ClassicSimilarity], result of:
            0.03088891 = score(doc=95,freq=1.0), product of:
              0.11065637 = queryWeight, product of:
                1.6760175 = boost
                5.104322 = idf(docFreq=732, maxDocs=44421)
                0.012934804 = queryNorm
              0.27914262 = fieldWeight in 95, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.104322 = idf(docFreq=732, maxDocs=44421)
                0.0546875 = fieldNorm(doc=95)
          0.0919066 = weight(abstract_txt:extraction in 95) [ClassicSimilarity], result of:
            0.0919066 = score(doc=95,freq=1.0), product of:
              0.27140766 = queryWeight, product of:
                3.3886425 = boost
                6.192079 = idf(docFreq=246, maxDocs=44421)
                0.012934804 = queryNorm
              0.33862934 = fieldWeight in 95, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.192079 = idf(docFreq=246, maxDocs=44421)
                0.0546875 = fieldNorm(doc=95)
          0.25881186 = weight(abstract_txt:semi in 95) [ClassicSimilarity], result of:
            0.25881186 = score(doc=95,freq=4.0), product of:
              0.362316 = queryWeight, product of:
                4.2889314 = boost
                6.5309834 = idf(docFreq=175, maxDocs=44421)
                0.012934804 = queryNorm
              0.7143263 = fieldWeight in 95, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.5309834 = idf(docFreq=175, maxDocs=44421)
                0.0546875 = fieldNorm(doc=95)
          0.16614924 = weight(abstract_txt:learning in 95) [ClassicSimilarity], result of:
            0.16614924 = score(doc=95,freq=5.0), product of:
              0.28652164 = queryWeight, product of:
                4.671213 = boost
                4.7420692 = idf(docFreq=1052, maxDocs=44421)
                0.012934804 = queryNorm
              0.57988375 = fieldWeight in 95, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.7420692 = idf(docFreq=1052, maxDocs=44421)
                0.0546875 = fieldNorm(doc=95)
          0.4325187 = weight(abstract_txt:supervised in 95) [ClassicSimilarity], result of:
            0.4325187 = score(doc=95,freq=5.0), product of:
              0.47365776 = queryWeight, product of:
                4.9038553 = boost
                7.467361 = idf(docFreq=68, maxDocs=44421)
                0.012934804 = queryNorm
              0.913146 = fieldWeight in 95, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                7.467361 = idf(docFreq=68, maxDocs=44421)
                0.0546875 = fieldNorm(doc=95)
        0.32 = coord(8/25)
    
  3. Kholghi, M.; Vine, L.D.; Sitbon, L.; Zuccon, G.; Nguyen, A.: Clinical information extraction using small data : an active learning approach based on sequence representations and word embeddings (2017) 0.32
    0.3234847 = sum of:
      0.3234847 = product of:
        0.8985685 = sum of:
          0.007436398 = weight(abstract_txt:from in 4920) [ClassicSimilarity], result of:
            0.007436398 = score(doc=4920,freq=1.0), product of:
              0.043118943 = queryWeight, product of:
                1.2080746 = boost
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.012934804 = queryNorm
              0.17246243 = fieldWeight in 4920, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.0625 = fieldNorm(doc=4920)
          0.0332522 = weight(abstract_txt:effort in 4920) [ClassicSimilarity], result of:
            0.0332522 = score(doc=4920,freq=1.0), product of:
              0.09288878 = queryWeight, product of:
                1.2537944 = boost
                5.727658 = idf(docFreq=392, maxDocs=44421)
                0.012934804 = queryNorm
              0.3579786 = fieldWeight in 4920, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.727658 = idf(docFreq=392, maxDocs=44421)
                0.0625 = fieldNorm(doc=4920)
          0.0088551855 = weight(abstract_txt:information in 4920) [ClassicSimilarity], result of:
            0.0088551855 = score(doc=4920,freq=2.0), product of:
              0.041417588 = queryWeight, product of:
                1.3237535 = boost
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.012934804 = queryNorm
              0.21380253 = fieldWeight in 4920, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.0625 = fieldNorm(doc=4920)
          0.059800223 = weight(abstract_txt:reduced in 4920) [ClassicSimilarity], result of:
            0.059800223 = score(doc=4920,freq=1.0), product of:
              0.13736778 = queryWeight, product of:
                1.5247097 = boost
                6.965269 = idf(docFreq=113, maxDocs=44421)
                0.012934804 = queryNorm
              0.43532932 = fieldWeight in 4920, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.965269 = idf(docFreq=113, maxDocs=44421)
                0.0625 = fieldNorm(doc=4920)
          0.03530161 = weight(abstract_txt:training in 4920) [ClassicSimilarity], result of:
            0.03530161 = score(doc=4920,freq=1.0), product of:
              0.11065637 = queryWeight, product of:
                1.6760175 = boost
                5.104322 = idf(docFreq=732, maxDocs=44421)
                0.012934804 = queryNorm
              0.31902012 = fieldWeight in 4920, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.104322 = idf(docFreq=732, maxDocs=44421)
                0.0625 = fieldNorm(doc=4920)
          0.14854348 = weight(abstract_txt:extraction in 4920) [ClassicSimilarity], result of:
            0.14854348 = score(doc=4920,freq=2.0), product of:
              0.27140766 = queryWeight, product of:
                3.3886425 = boost
                6.192079 = idf(docFreq=246, maxDocs=44421)
                0.012934804 = queryNorm
              0.5473076 = fieldWeight in 4920, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.192079 = idf(docFreq=246, maxDocs=44421)
                0.0625 = fieldNorm(doc=4920)
          0.19443375 = weight(abstract_txt:active in 4920) [ClassicSimilarity], result of:
            0.19443375 = score(doc=4920,freq=3.0), product of:
              0.2837072 = queryWeight, product of:
                3.4645743 = boost
                6.3308296 = idf(docFreq=214, maxDocs=44421)
                0.012934804 = queryNorm
              0.6853324 = fieldWeight in 4920, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.3308296 = idf(docFreq=214, maxDocs=44421)
                0.0625 = fieldNorm(doc=4920)
          0.18988486 = weight(abstract_txt:learning in 4920) [ClassicSimilarity], result of:
            0.18988486 = score(doc=4920,freq=5.0), product of:
              0.28652164 = queryWeight, product of:
                4.671213 = boost
                4.7420692 = idf(docFreq=1052, maxDocs=44421)
                0.012934804 = queryNorm
              0.6627243 = fieldWeight in 4920, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.7420692 = idf(docFreq=1052, maxDocs=44421)
                0.0625 = fieldNorm(doc=4920)
          0.22106084 = weight(abstract_txt:supervised in 4920) [ClassicSimilarity], result of:
            0.22106084 = score(doc=4920,freq=1.0), product of:
              0.47365776 = queryWeight, product of:
                4.9038553 = boost
                7.467361 = idf(docFreq=68, maxDocs=44421)
                0.012934804 = queryNorm
              0.46671006 = fieldWeight in 4920, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.467361 = idf(docFreq=68, maxDocs=44421)
                0.0625 = fieldNorm(doc=4920)
        0.36 = coord(9/25)
    
  4. Suakkaphong, N.; Zhang, Z.; Chen, H.: Disease named entity recognition using semisupervised learning and conditional random fields (2011) 0.26
    0.2615762 = sum of:
      0.2615762 = product of:
        0.7266005 = sum of:
          0.019607909 = weight(abstract_txt:data in 367) [ClassicSimilarity], result of:
            0.019607909 = score(doc=367,freq=4.0), product of:
              0.0471029 = queryWeight, product of:
                1.0934883 = boost
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.012934804 = queryNorm
              0.41627818 = fieldWeight in 367, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.0625 = fieldNorm(doc=367)
          0.010516654 = weight(abstract_txt:from in 367) [ClassicSimilarity], result of:
            0.010516654 = score(doc=367,freq=2.0), product of:
              0.043118943 = queryWeight, product of:
                1.2080746 = boost
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.012934804 = queryNorm
              0.2438987 = fieldWeight in 367, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.0625 = fieldNorm(doc=367)
          0.012523123 = weight(abstract_txt:information in 367) [ClassicSimilarity], result of:
            0.012523123 = score(doc=367,freq=4.0), product of:
              0.041417588 = queryWeight, product of:
                1.3237535 = boost
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.012934804 = queryNorm
              0.30236244 = fieldWeight in 367, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.0625 = fieldNorm(doc=367)
          0.027515598 = weight(abstract_txt:given in 367) [ClassicSimilarity], result of:
            0.027515598 = score(doc=367,freq=1.0), product of:
              0.0937201 = queryWeight, product of:
                1.5424345 = boost
                4.6974936 = idf(docFreq=1100, maxDocs=44421)
                0.012934804 = queryNorm
              0.29359335 = fieldWeight in 367, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6974936 = idf(docFreq=1100, maxDocs=44421)
                0.0625 = fieldNorm(doc=367)
          0.03530161 = weight(abstract_txt:training in 367) [ClassicSimilarity], result of:
            0.03530161 = score(doc=367,freq=1.0), product of:
              0.11065637 = queryWeight, product of:
                1.6760175 = boost
                5.104322 = idf(docFreq=732, maxDocs=44421)
                0.012934804 = queryNorm
              0.31902012 = fieldWeight in 367, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.104322 = idf(docFreq=732, maxDocs=44421)
                0.0625 = fieldNorm(doc=367)
          0.14854348 = weight(abstract_txt:extraction in 367) [ClassicSimilarity], result of:
            0.14854348 = score(doc=367,freq=2.0), product of:
              0.27140766 = queryWeight, product of:
                3.3886425 = boost
                6.192079 = idf(docFreq=246, maxDocs=44421)
                0.012934804 = queryNorm
              0.5473076 = fieldWeight in 367, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.192079 = idf(docFreq=246, maxDocs=44421)
                0.0625 = fieldNorm(doc=367)
          0.120093726 = weight(abstract_txt:learning in 367) [ClassicSimilarity], result of:
            0.120093726 = score(doc=367,freq=2.0), product of:
              0.28652164 = queryWeight, product of:
                4.671213 = boost
                4.7420692 = idf(docFreq=1052, maxDocs=44421)
                0.012934804 = queryNorm
              0.41914365 = fieldWeight in 367, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.7420692 = idf(docFreq=1052, maxDocs=44421)
                0.0625 = fieldNorm(doc=367)
          0.22106084 = weight(abstract_txt:supervised in 367) [ClassicSimilarity], result of:
            0.22106084 = score(doc=367,freq=1.0), product of:
              0.47365776 = queryWeight, product of:
                4.9038553 = boost
                7.467361 = idf(docFreq=68, maxDocs=44421)
                0.012934804 = queryNorm
              0.46671006 = fieldWeight in 367, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.467361 = idf(docFreq=68, maxDocs=44421)
                0.0625 = fieldNorm(doc=367)
          0.1314376 = weight(abstract_txt:algorithm in 367) [ClassicSimilarity], result of:
            0.1314376 = score(doc=367,freq=1.0), product of:
              0.3686233 = queryWeight, product of:
                4.995352 = boost
                5.7050157 = idf(docFreq=401, maxDocs=44421)
                0.012934804 = queryNorm
              0.35656348 = fieldWeight in 367, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7050157 = idf(docFreq=401, maxDocs=44421)
                0.0625 = fieldNorm(doc=367)
        0.36 = coord(9/25)
    
  5. Thelwall, M.; Buckley, K.; Paltoglou, G.: Sentiment strength detection for the social web (2012) 0.22
    0.22382146 = sum of:
      0.22382146 = product of:
        0.6217263 = sum of:
          0.013864885 = weight(abstract_txt:data in 972) [ClassicSimilarity], result of:
            0.013864885 = score(doc=972,freq=2.0), product of:
              0.0471029 = queryWeight, product of:
                1.0934883 = boost
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.012934804 = queryNorm
              0.29435313 = fieldWeight in 972, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.0625 = fieldNorm(doc=972)
          0.010044124 = weight(abstract_txt:used in 972) [ClassicSimilarity], result of:
            0.010044124 = score(doc=972,freq=1.0), product of:
              0.04786905 = queryWeight, product of:
                1.1023455 = boost
                3.3572001 = idf(docFreq=4205, maxDocs=44421)
                0.012934804 = queryNorm
              0.20982501 = fieldWeight in 972, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3572001 = idf(docFreq=4205, maxDocs=44421)
                0.0625 = fieldNorm(doc=972)
          0.012880218 = weight(abstract_txt:from in 972) [ClassicSimilarity], result of:
            0.012880218 = score(doc=972,freq=3.0), product of:
              0.043118943 = queryWeight, product of:
                1.2080746 = boost
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.012934804 = queryNorm
              0.29871368 = fieldWeight in 972, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.0625 = fieldNorm(doc=972)
          0.036221832 = weight(abstract_txt:positive in 972) [ClassicSimilarity], result of:
            0.036221832 = score(doc=972,freq=1.0), product of:
              0.098339945 = queryWeight, product of:
                1.2900593 = boost
                5.8933253 = idf(docFreq=332, maxDocs=44421)
                0.012934804 = queryNorm
              0.36833283 = fieldWeight in 972, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8933253 = idf(docFreq=332, maxDocs=44421)
                0.0625 = fieldNorm(doc=972)
          0.0062615615 = weight(abstract_txt:information in 972) [ClassicSimilarity], result of:
            0.0062615615 = score(doc=972,freq=1.0), product of:
              0.041417588 = queryWeight, product of:
                1.3237535 = boost
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.012934804 = queryNorm
              0.15118122 = fieldWeight in 972, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.0625 = fieldNorm(doc=972)
          0.10503611 = weight(abstract_txt:extraction in 972) [ClassicSimilarity], result of:
            0.10503611 = score(doc=972,freq=1.0), product of:
              0.27140766 = queryWeight, product of:
                3.3886425 = boost
                6.192079 = idf(docFreq=246, maxDocs=44421)
                0.012934804 = queryNorm
              0.38700494 = fieldWeight in 972, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.192079 = idf(docFreq=246, maxDocs=44421)
                0.0625 = fieldNorm(doc=972)
          0.084919095 = weight(abstract_txt:learning in 972) [ClassicSimilarity], result of:
            0.084919095 = score(doc=972,freq=1.0), product of:
              0.28652164 = queryWeight, product of:
                4.671213 = boost
                4.7420692 = idf(docFreq=1052, maxDocs=44421)
                0.012934804 = queryNorm
              0.29637933 = fieldWeight in 972, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7420692 = idf(docFreq=1052, maxDocs=44421)
                0.0625 = fieldNorm(doc=972)
          0.22106084 = weight(abstract_txt:supervised in 972) [ClassicSimilarity], result of:
            0.22106084 = score(doc=972,freq=1.0), product of:
              0.47365776 = queryWeight, product of:
                4.9038553 = boost
                7.467361 = idf(docFreq=68, maxDocs=44421)
                0.012934804 = queryNorm
              0.46671006 = fieldWeight in 972, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.467361 = idf(docFreq=68, maxDocs=44421)
                0.0625 = fieldNorm(doc=972)
          0.1314376 = weight(abstract_txt:algorithm in 972) [ClassicSimilarity], result of:
            0.1314376 = score(doc=972,freq=1.0), product of:
              0.3686233 = queryWeight, product of:
                4.995352 = boost
                5.7050157 = idf(docFreq=401, maxDocs=44421)
                0.012934804 = queryNorm
              0.35656348 = fieldWeight in 972, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7050157 = idf(docFreq=401, maxDocs=44421)
                0.0625 = fieldNorm(doc=972)
        0.36 = coord(9/25)