Document (#34078)

Zhou, G.D.
Zhang, M.
Ji, D.H.
Zhu, Q.M.
Hierarchical learning strategy in semantic relation extraction
Information processing and management. 44(2008) no.3, pp.1008-1021
This paper proposes a novel hierarchical learning strategy to deal with the data sparseness problem in semantic relation extraction by modeling the commonality among related classes. For each class in the hierarchy, whether manually predefined or automatically clustered, a discriminative function is determined in a top-down way. As an upper-level class normally has many more positive training examples than a lower-level class, its discriminative function can be determined more reliably and can guide the learning of the lower-level discriminative function more effectively, which might otherwise suffer from limited training data. In this paper, two classifier learning approaches, namely the simple perceptron algorithm and the state-of-the-art Support Vector Machines, are applied using the hierarchical learning strategy. Moreover, several kinds of class hierarchies, either manually predefined or automatically clustered, are explored and compared. Evaluation on the ACE RDC 2003 and 2004 corpora shows that the hierarchical learning strategy substantially improves performance on the least frequent and medium-frequency relations.
Automatic classification
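
The top-down idea described in the abstract can be illustrated with a small sketch. Everything here is an assumption made for illustration: the toy feature vectors, the function names (train_perceptron, predict), and in particular the choice to pass the parent class's weight vector as the starting point for the child class. The paper's actual formulation may differ.

```python
def dot(w, x):
    return sum(wi * xi for wi, xi in zip(w, x))

def train_perceptron(examples, init_w=None, epochs=10, lr=1.0):
    """Simple perceptron. Labels are +1 / -1; x[0] is a constant bias feature."""
    dim = len(examples[0][0])
    # Hypothetical hierarchical step: start from the parent's weights if given.
    w = list(init_w) if init_w is not None else [0.0] * dim
    for _ in range(epochs):
        for x, y in examples:
            if y * dot(w, x) <= 0:  # misclassified, or exactly on the boundary
                w = [wi + lr * y * xi for wi, xi in zip(w, x)]
    return w

def predict(w, x):
    return 1 if dot(w, x) > 0 else -1

# Toy data (invented): the upper-level class has more positive examples
# than the lower-level class, which is a subset of it.
parent_examples = [([1, 2, 1], 1), ([1, 2, -1], 1),
                   ([1, -2, 1], -1), ([1, -2, -1], -1)]
child_examples = [([1, 2, 1], 1), ([1, 2, -1], -1), ([1, -2, 1], -1)]

parent_w = train_perceptron(parent_examples)                 # upper level first
child_w = train_perceptron(child_examples, init_w=parent_w)  # then lower level
```

On this toy data both classifiers separate their own training examples. The sketch shows only the order of training and the reuse of the parent's weights, not any accuracy gain.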

Similar documents (author)

  1. Zhou, L.; Zhang, D.: NLPIR: a theoretical framework for applying Natural Language Processing to information retrieval (2003) 5.04
    5.039768 = sum of:
      5.039768 = sum of:
        1.7948558 = weight(author_txt:zhang in 5148) [ClassicSimilarity], result of:
          1.7948558 = score(doc=5148,freq=1.0), product of:
            0.5588067 = queryWeight, product of:
              6.4238877 = idf(docFreq=194, maxDocs=44218)
              0.08698887 = queryNorm
            3.2119439 = fieldWeight in 5148, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              6.4238877 = idf(docFreq=194, maxDocs=44218)
              0.5 = fieldNorm(doc=5148)
        3.2449126 = weight(author_txt:zhou in 5148) [ClassicSimilarity], result of:
          3.2449126 = score(doc=5148,freq=1.0), product of:
            0.82929796 = queryWeight, product of:
              1.2182165 = boost
              7.825686 = idf(docFreq=47, maxDocs=44218)
              0.08698887 = queryNorm
            3.912843 = fieldWeight in 5148, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              7.825686 = idf(docFreq=47, maxDocs=44218)
              0.5 = fieldNorm(doc=5148)
  2. Zhou, G.D.; Zhang, M.: Extracting relation information from text documents by exploring various types of knowledge (2007) 5.04
    5.039768 = sum of:
      5.039768 = sum of:
        1.7948558 = weight(author_txt:zhang in 927) [ClassicSimilarity], result of:
          1.7948558 = score(doc=927,freq=1.0), product of:
            0.5588067 = queryWeight, product of:
              6.4238877 = idf(docFreq=194, maxDocs=44218)
              0.08698887 = queryNorm
            3.2119439 = fieldWeight in 927, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              6.4238877 = idf(docFreq=194, maxDocs=44218)
              0.5 = fieldNorm(doc=927)
        3.2449126 = weight(author_txt:zhou in 927) [ClassicSimilarity], result of:
          3.2449126 = score(doc=927,freq=1.0), product of:
            0.82929796 = queryWeight, product of:
              1.2182165 = boost
              7.825686 = idf(docFreq=47, maxDocs=44218)
              0.08698887 = queryNorm
            3.912843 = fieldWeight in 927, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              7.825686 = idf(docFreq=47, maxDocs=44218)
              0.5 = fieldNorm(doc=927)
  3. Zhang, M.; Zhou, G.D.; Aw, A.: Exploring syntactic structured features over parse trees for relation extraction using kernel methods (2008) 3.78
    3.7798266 = sum of:
      3.7798266 = sum of:
        1.3461419 = weight(author_txt:zhang in 2055) [ClassicSimilarity], result of:
          1.3461419 = score(doc=2055,freq=1.0), product of:
            0.5588067 = queryWeight, product of:
              6.4238877 = idf(docFreq=194, maxDocs=44218)
              0.08698887 = queryNorm
            2.408958 = fieldWeight in 2055, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              6.4238877 = idf(docFreq=194, maxDocs=44218)
              0.375 = fieldNorm(doc=2055)
        2.4336846 = weight(author_txt:zhou in 2055) [ClassicSimilarity], result of:
          2.4336846 = score(doc=2055,freq=1.0), product of:
            0.82929796 = queryWeight, product of:
              1.2182165 = boost
              7.825686 = idf(docFreq=47, maxDocs=44218)
              0.08698887 = queryNorm
            2.9346323 = fieldWeight in 2055, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              7.825686 = idf(docFreq=47, maxDocs=44218)
              0.375 = fieldNorm(doc=2055)
  4. Zhang, D.; Zambrowicz, C.; Zhou, H.; Roderer, N.K.: User information seeking behavior in a medical Web portal environment : a preliminary study (2004) 3.15
    3.1498551 = sum of:
      3.1498551 = sum of:
        1.1217848 = weight(author_txt:zhang in 2261) [ClassicSimilarity], result of:
          1.1217848 = score(doc=2261,freq=1.0), product of:
            0.5588067 = queryWeight, product of:
              6.4238877 = idf(docFreq=194, maxDocs=44218)
              0.08698887 = queryNorm
            2.007465 = fieldWeight in 2261, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              6.4238877 = idf(docFreq=194, maxDocs=44218)
              0.3125 = fieldNorm(doc=2261)
        2.0280704 = weight(author_txt:zhou in 2261) [ClassicSimilarity], result of:
          2.0280704 = score(doc=2261,freq=1.0), product of:
            0.82929796 = queryWeight, product of:
              1.2182165 = boost
              7.825686 = idf(docFreq=47, maxDocs=44218)
              0.08698887 = queryNorm
            2.4455268 = fieldWeight in 2261, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              7.825686 = idf(docFreq=47, maxDocs=44218)
              0.3125 = fieldNorm(doc=2261)
  5. Chang, K.-C.; Zhou, W.; Zhang, S.; Yuan, C.-C.: Threshold effects of the patent H-index in the relationship between patent citations and market value (2015) 3.15
    3.1498551 = sum of:
      3.1498551 = sum of:
        1.1217848 = weight(author_txt:zhang in 2344) [ClassicSimilarity], result of:
          1.1217848 = score(doc=2344,freq=1.0), product of:
            0.5588067 = queryWeight, product of:
              6.4238877 = idf(docFreq=194, maxDocs=44218)
              0.08698887 = queryNorm
            2.007465 = fieldWeight in 2344, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              6.4238877 = idf(docFreq=194, maxDocs=44218)
              0.3125 = fieldNorm(doc=2344)
        2.0280704 = weight(author_txt:zhou in 2344) [ClassicSimilarity], result of:
          2.0280704 = score(doc=2344,freq=1.0), product of:
            0.82929796 = queryWeight, product of:
              1.2182165 = boost
              7.825686 = idf(docFreq=47, maxDocs=44218)
              0.08698887 = queryNorm
            2.4455268 = fieldWeight in 2344, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              7.825686 = idf(docFreq=47, maxDocs=44218)
              0.3125 = fieldNorm(doc=2344)
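
The author scores above can be reproduced by hand. The sketch below assumes the idf and tf formulas that match the printed numbers, idf = 1 + ln(maxDocs / (docFreq + 1)) and tf = sqrt(freq). The function names are illustrative and are not part of any Lucene API.

```python
import math

def idf(doc_freq, max_docs):
    # Assumed form; it reproduces e.g. 6.4238877 for docFreq=194, maxDocs=44218.
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def tf(freq):
    return math.sqrt(freq)

def term_score(freq, doc_freq, max_docs, field_norm, query_norm, boost=1.0):
    """score = queryWeight * fieldWeight, as in the explain trees above."""
    term_idf = idf(doc_freq, max_docs)
    query_weight = boost * term_idf * query_norm
    field_weight = tf(freq) * term_idf * field_norm
    return query_weight * field_weight

MAX_DOCS = 44218
QUERY_NORM = 0.08698887

# Values copied from the first hit (doc=5148, fieldNorm=0.5).
zhang = term_score(1.0, 194, MAX_DOCS, 0.5, QUERY_NORM)
zhou = term_score(1.0, 47, MAX_DOCS, 0.5, QUERY_NORM, boost=1.2182165)
total = zhang + zhou

print(round(zhang, 4), round(zhou, 4), round(total, 4))
```

The first two hits have identical scores because both match the same two author terms with the same fieldNorm. Hits 3 to 5 score lower only because their fieldNorm is smaller (0.375 and 0.3125), which reflects longer author fields.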

Similar documents (content)

  1. Ru, C.; Tang, J.; Li, S.; Xie, S.; Wang, T.: Using semantic similarity to reduce wrong labels in distant supervision for relation extraction (2018) 0.13
    0.133515 = sum of:
      0.133515 = product of:
        0.4768393 = sum of:
          0.06784942 = weight(abstract_txt:suffer in 5055) [ClassicSimilarity], result of:
            0.06784942 = score(doc=5055,freq=1.0), product of:
              0.13286668 = queryWeight, product of:
                8.1705265 = idf(docFreq=33, maxDocs=44218)
                0.016261704 = queryNorm
              0.5106579 = fieldWeight in 5055, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.1705265 = idf(docFreq=33, maxDocs=44218)
                0.0625 = fieldNorm(doc=5055)
          0.044569477 = weight(abstract_txt:semantic in 5055) [ClassicSimilarity], result of:
            0.044569477 = score(doc=5055,freq=4.0), product of:
              0.07968936 = queryWeight, product of:
                1.0952345 = boost
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.016261704 = queryNorm
              0.5592902 = fieldWeight in 5055, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.0625 = fieldNorm(doc=5055)
          0.030642634 = weight(abstract_txt:much in 5055) [ClassicSimilarity], result of:
            0.030642634 = score(doc=5055,freq=1.0), product of:
              0.09854004 = queryWeight, product of:
                1.2179047 = boost
                4.9754615 = idf(docFreq=829, maxDocs=44218)
                0.016261704 = queryNorm
              0.31096634 = fieldWeight in 5055, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.9754615 = idf(docFreq=829, maxDocs=44218)
                0.0625 = fieldNorm(doc=5055)
          0.03323709 = weight(abstract_txt:training in 5055) [ClassicSimilarity], result of:
            0.03323709 = score(doc=5055,freq=1.0), product of:
              0.1040265 = queryWeight, product of:
                1.2513504 = boost
                5.112096 = idf(docFreq=723, maxDocs=44218)
                0.016261704 = queryNorm
              0.319506 = fieldWeight in 5055, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.112096 = idf(docFreq=723, maxDocs=44218)
                0.0625 = fieldNorm(doc=5055)
          0.123362064 = weight(abstract_txt:relation in 5055) [ClassicSimilarity], result of:
            0.123362064 = score(doc=5055,freq=10.0), product of:
              0.11574878 = queryWeight, product of:
                1.3199733 = boost
                5.3924384 = idf(docFreq=546, maxDocs=44218)
                0.016261704 = queryNorm
              1.0657742 = fieldWeight in 5055, product of:
                3.1622777 = tf(freq=10.0), with freq of:
                  10.0 = termFreq=10.0
                5.3924384 = idf(docFreq=546, maxDocs=44218)
                0.0625 = fieldNorm(doc=5055)
          0.05907713 = weight(abstract_txt:automatically in 5055) [ClassicSimilarity], result of:
            0.05907713 = score(doc=5055,freq=2.0), product of:
              0.121152274 = queryWeight, product of:
                1.350432 = boost
                5.5168705 = idf(docFreq=482, maxDocs=44218)
                0.016261704 = queryNorm
              0.48762706 = fieldWeight in 5055, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.5168705 = idf(docFreq=482, maxDocs=44218)
                0.0625 = fieldNorm(doc=5055)
          0.11810148 = weight(abstract_txt:extraction in 5055) [ClassicSimilarity], result of:
            0.11810148 = score(doc=5055,freq=4.0), product of:
              0.15259685 = queryWeight, product of:
                1.515583 = boost
                6.1915555 = idf(docFreq=245, maxDocs=44218)
                0.016261704 = queryNorm
              0.77394444 = fieldWeight in 5055, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.1915555 = idf(docFreq=245, maxDocs=44218)
                0.0625 = fieldNorm(doc=5055)
        0.28 = coord(7/25)
  2. Jiang, X.; Tan, A.-H.: CRCTOL: a semantic-based domain ontology learning system (2009) 0.13
    0.12607986 = sum of:
      0.12607986 = product of:
        0.39399958 = sum of:
          0.033773508 = weight(abstract_txt:semantic in 3320) [ClassicSimilarity], result of:
            0.033773508 = score(doc=3320,freq=3.0), product of:
              0.07968936 = queryWeight, product of:
                1.0952345 = boost
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.016261704 = queryNorm
              0.42381454 = fieldWeight in 3320, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.4743214 = idf(docFreq=1369, maxDocs=44218)
                0.0546875 = fieldNorm(doc=3320)
          0.01285766 = weight(abstract_txt:more in 3320) [ClassicSimilarity], result of:
            0.01285766 = score(doc=3320,freq=1.0), product of:
              0.069108 = queryWeight, product of:
                1.2491561 = boost
                3.402088 = idf(docFreq=4002, maxDocs=44218)
                0.016261704 = queryNorm
              0.18605168 = fieldWeight in 3320, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.402088 = idf(docFreq=4002, maxDocs=44218)
                0.0546875 = fieldNorm(doc=3320)
          0.04827304 = weight(abstract_txt:relation in 3320) [ClassicSimilarity], result of:
            0.04827304 = score(doc=3320,freq=2.0), product of:
              0.11574878 = queryWeight, product of:
                1.3199733 = boost
                5.3924384 = idf(docFreq=546, maxDocs=44218)
                0.016261704 = queryNorm
              0.41705012 = fieldWeight in 3320, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.3924384 = idf(docFreq=546, maxDocs=44218)
                0.0546875 = fieldNorm(doc=3320)
          0.03655211 = weight(abstract_txt:automatically in 3320) [ClassicSimilarity], result of:
            0.03655211 = score(doc=3320,freq=1.0), product of:
              0.121152274 = queryWeight, product of:
                1.350432 = boost
                5.5168705 = idf(docFreq=482, maxDocs=44218)
                0.016261704 = queryNorm
              0.30170387 = fieldWeight in 3320, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5168705 = idf(docFreq=482, maxDocs=44218)
                0.0546875 = fieldNorm(doc=3320)
          0.038113423 = weight(abstract_txt:either in 3320) [ClassicSimilarity], result of:
            0.038113423 = score(doc=3320,freq=1.0), product of:
              0.12457817 = queryWeight, product of:
                1.3693924 = boost
                5.5943284 = idf(docFreq=446, maxDocs=44218)
                0.016261704 = queryNorm
              0.30593982 = fieldWeight in 3320, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5943284 = idf(docFreq=446, maxDocs=44218)
                0.0546875 = fieldNorm(doc=3320)
          0.051669396 = weight(abstract_txt:extraction in 3320) [ClassicSimilarity], result of:
            0.051669396 = score(doc=3320,freq=1.0), product of:
              0.15259685 = queryWeight, product of:
                1.515583 = boost
                6.1915555 = idf(docFreq=245, maxDocs=44218)
                0.016261704 = queryNorm
              0.3386007 = fieldWeight in 3320, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1915555 = idf(docFreq=245, maxDocs=44218)
                0.0546875 = fieldNorm(doc=3320)
          0.051467318 = weight(abstract_txt:level in 3320) [ClassicSimilarity], result of:
            0.051467318 = score(doc=3320,freq=3.0), product of:
              0.1208002 = queryWeight, product of:
                1.6515298 = boost
                4.497956 = idf(docFreq=1337, maxDocs=44218)
                0.016261704 = queryNorm
              0.42605326 = fieldWeight in 3320, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.497956 = idf(docFreq=1337, maxDocs=44218)
                0.0546875 = fieldNorm(doc=3320)
          0.121293135 = weight(abstract_txt:learning in 3320) [ClassicSimilarity], result of:
            0.121293135 = score(doc=3320,freq=3.0), product of:
              0.26953435 = queryWeight, product of:
                3.4887884 = boost
                4.750873 = idf(docFreq=1038, maxDocs=44218)
                0.016261704 = queryNorm
              0.45000994 = fieldWeight in 3320, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.750873 = idf(docFreq=1038, maxDocs=44218)
                0.0546875 = fieldNorm(doc=3320)
        0.32 = coord(8/25)
  3. Yu, N.: Exploring co-training strategies for opinion detection (2014) 0.12
    0.12242734 = sum of:
      0.12242734 = product of:
        0.43724048 = sum of:
          0.030642634 = weight(abstract_txt:much in 1503) [ClassicSimilarity], result of:
            0.030642634 = score(doc=1503,freq=1.0), product of:
              0.09854004 = queryWeight, product of:
                1.2179047 = boost
                4.9754615 = idf(docFreq=829, maxDocs=44218)
                0.016261704 = queryNorm
              0.31096634 = fieldWeight in 1503, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.9754615 = idf(docFreq=829, maxDocs=44218)
                0.0625 = fieldNorm(doc=1503)
          0.014694469 = weight(abstract_txt:more in 1503) [ClassicSimilarity], result of:
            0.014694469 = score(doc=1503,freq=1.0), product of:
              0.069108 = queryWeight, product of:
                1.2491561 = boost
                3.402088 = idf(docFreq=4002, maxDocs=44218)
                0.016261704 = queryNorm
              0.2126305 = fieldWeight in 1503, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.402088 = idf(docFreq=4002, maxDocs=44218)
                0.0625 = fieldNorm(doc=1503)
          0.06647418 = weight(abstract_txt:training in 1503) [ClassicSimilarity], result of:
            0.06647418 = score(doc=1503,freq=4.0), product of:
              0.1040265 = queryWeight, product of:
                1.2513504 = boost
                5.112096 = idf(docFreq=723, maxDocs=44218)
                0.016261704 = queryNorm
              0.639012 = fieldWeight in 1503, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.112096 = idf(docFreq=723, maxDocs=44218)
                0.0625 = fieldNorm(doc=1503)
          0.05907713 = weight(abstract_txt:automatically in 1503) [ClassicSimilarity], result of:
            0.05907713 = score(doc=1503,freq=2.0), product of:
              0.121152274 = queryWeight, product of:
                1.350432 = boost
                5.5168705 = idf(docFreq=482, maxDocs=44218)
                0.016261704 = queryNorm
              0.48762706 = fieldWeight in 1503, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.5168705 = idf(docFreq=482, maxDocs=44218)
                0.0625 = fieldNorm(doc=1503)
          0.033959623 = weight(abstract_txt:level in 1503) [ClassicSimilarity], result of:
            0.033959623 = score(doc=1503,freq=1.0), product of:
              0.1208002 = queryWeight, product of:
                1.6515298 = boost
                4.497956 = idf(docFreq=1337, maxDocs=44218)
                0.016261704 = queryNorm
              0.28112224 = fieldWeight in 1503, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.497956 = idf(docFreq=1337, maxDocs=44218)
                0.0625 = fieldNorm(doc=1503)
          0.09377171 = weight(abstract_txt:strategy in 1503) [ClassicSimilarity], result of:
            0.09377171 = score(doc=1503,freq=1.0), product of:
              0.26168966 = queryWeight, product of:
                2.8068242 = boost
                5.733308 = idf(docFreq=388, maxDocs=44218)
                0.016261704 = queryNorm
              0.35833174 = fieldWeight in 1503, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.733308 = idf(docFreq=388, maxDocs=44218)
                0.0625 = fieldNorm(doc=1503)
          0.13862072 = weight(abstract_txt:learning in 1503) [ClassicSimilarity], result of:
            0.13862072 = score(doc=1503,freq=3.0), product of:
              0.26953435 = queryWeight, product of:
                3.4887884 = boost
                4.750873 = idf(docFreq=1038, maxDocs=44218)
                0.016261704 = queryNorm
              0.51429707 = fieldWeight in 1503, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.750873 = idf(docFreq=1038, maxDocs=44218)
                0.0625 = fieldNorm(doc=1503)
        0.28 = coord(7/25)
  4. Li, J.; Zhang, Z.; Li, X.; Chen, H.: Kernel-based learning for biomedical relation extraction (2008) 0.11
    0.10870323 = sum of:
      0.10870323 = product of:
        0.54351616 = sum of:
          0.11944481 = weight(abstract_txt:relation in 1611) [ClassicSimilarity], result of:
            0.11944481 = score(doc=1611,freq=6.0), product of:
              0.11574878 = queryWeight, product of:
                1.3199733 = boost
                5.3924384 = idf(docFreq=546, maxDocs=44218)
                0.016261704 = queryNorm
              1.0319315 = fieldWeight in 1611, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.3924384 = idf(docFreq=546, maxDocs=44218)
                0.078125 = fieldNorm(doc=1611)
          0.073846415 = weight(abstract_txt:automatically in 1611) [ClassicSimilarity], result of:
            0.073846415 = score(doc=1611,freq=2.0), product of:
              0.121152274 = queryWeight, product of:
                1.350432 = boost
                5.5168705 = idf(docFreq=482, maxDocs=44218)
                0.016261704 = queryNorm
              0.60953385 = fieldWeight in 1611, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.5168705 = idf(docFreq=482, maxDocs=44218)
                0.078125 = fieldNorm(doc=1611)
          0.12784861 = weight(abstract_txt:extraction in 1611) [ClassicSimilarity], result of:
            0.12784861 = score(doc=1611,freq=3.0), product of:
              0.15259685 = queryWeight, product of:
                1.515583 = boost
                6.1915555 = idf(docFreq=245, maxDocs=44218)
                0.016261704 = queryNorm
              0.83781946 = fieldWeight in 1611, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.1915555 = idf(docFreq=245, maxDocs=44218)
                0.078125 = fieldNorm(doc=1611)
          0.08089718 = weight(abstract_txt:function in 1611) [ClassicSimilarity], result of:
            0.08089718 = score(doc=1611,freq=1.0), product of:
              0.18568408 = queryWeight, product of:
                2.0475755 = boost
                5.5765896 = idf(docFreq=454, maxDocs=44218)
                0.016261704 = queryNorm
              0.43567106 = fieldWeight in 1611, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5765896 = idf(docFreq=454, maxDocs=44218)
                0.078125 = fieldNorm(doc=1611)
          0.1414792 = weight(abstract_txt:learning in 1611) [ClassicSimilarity], result of:
            0.1414792 = score(doc=1611,freq=2.0), product of:
              0.26953435 = queryWeight, product of:
                3.4887884 = boost
                4.750873 = idf(docFreq=1038, maxDocs=44218)
                0.016261704 = queryNorm
              0.5249023 = fieldWeight in 1611, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.750873 = idf(docFreq=1038, maxDocs=44218)
                0.078125 = fieldNorm(doc=1611)
        0.2 = coord(5/25)
  5. Wu, T.; Pottenger, W.M.: A semi-supervised active learning algorithm for information extraction from textual data (2005) 0.11
    0.10683572 = sum of:
      0.10683572 = product of:
        0.5341786 = sum of:
          0.057568323 = weight(abstract_txt:training in 3237) [ClassicSimilarity], result of:
            0.057568323 = score(doc=3237,freq=3.0), product of:
              0.1040265 = queryWeight, product of:
                1.2513504 = boost
                5.112096 = idf(docFreq=723, maxDocs=44218)
                0.016261704 = queryNorm
              0.5534006 = fieldWeight in 3237, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.112096 = idf(docFreq=723, maxDocs=44218)
                0.0625 = fieldNorm(doc=3237)
          0.05907713 = weight(abstract_txt:automatically in 3237) [ClassicSimilarity], result of:
            0.05907713 = score(doc=3237,freq=2.0), product of:
              0.121152274 = queryWeight, product of:
                1.350432 = boost
                5.5168705 = idf(docFreq=482, maxDocs=44218)
                0.016261704 = queryNorm
              0.48762706 = fieldWeight in 3237, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.5168705 = idf(docFreq=482, maxDocs=44218)
                0.0625 = fieldNorm(doc=3237)
          0.11810148 = weight(abstract_txt:extraction in 3237) [ClassicSimilarity], result of:
            0.11810148 = score(doc=3237,freq=4.0), product of:
              0.15259685 = queryWeight, product of:
                1.515583 = boost
                6.1915555 = idf(docFreq=245, maxDocs=44218)
                0.016261704 = queryNorm
              0.77394444 = fieldWeight in 3237, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.1915555 = idf(docFreq=245, maxDocs=44218)
                0.0625 = fieldNorm(doc=3237)
          0.073064975 = weight(abstract_txt:manually in 3237) [ClassicSimilarity], result of:
            0.073064975 = score(doc=3237,freq=1.0), product of:
              0.17587394 = queryWeight, product of:
                1.6270754 = boost
                6.6470313 = idf(docFreq=155, maxDocs=44218)
                0.016261704 = queryNorm
              0.41543946 = fieldWeight in 3237, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.6470313 = idf(docFreq=155, maxDocs=44218)
                0.0625 = fieldNorm(doc=3237)
          0.22636671 = weight(abstract_txt:learning in 3237) [ClassicSimilarity], result of:
            0.22636671 = score(doc=3237,freq=8.0), product of:
              0.26953435 = queryWeight, product of:
                3.4887884 = boost
                4.750873 = idf(docFreq=1038, maxDocs=44218)
                0.016261704 = queryNorm
              0.83984363 = fieldWeight in 3237, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                4.750873 = idf(docFreq=1038, maxDocs=44218)
                0.0625 = fieldNorm(doc=3237)
        0.2 = coord(5/25)
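
The content scores add one step to the per-term products: the sum of the matching term scores is multiplied by coord(matched/total), the fraction of the 25 query terms found in the document. The sketch below applies that step to the per-term scores printed above for hits 1 and 4. The function name is illustrative.

```python
def coord_score(term_scores, total_query_terms):
    """Sum the per-term scores and scale by the fraction of query terms matched."""
    coord = len(term_scores) / total_query_terms
    return coord * sum(term_scores)

TOTAL_QUERY_TERMS = 25

# Per-term scores copied from the explain output for doc=5055 (7 of 25 terms).
doc_5055 = [0.06784942, 0.044569477, 0.030642634, 0.03323709,
            0.123362064, 0.05907713, 0.11810148]
# Per-term scores copied from the explain output for doc=1611 (5 of 25 terms).
doc_1611 = [0.11944481, 0.073846415, 0.12784861, 0.08089718, 0.1414792]

score_5055 = coord_score(doc_5055, TOTAL_QUERY_TERMS)
score_1611 = coord_score(doc_1611, TOTAL_QUERY_TERMS)

print(round(score_5055, 6), round(score_1611, 6))
```

Note that doc 1611 has the larger raw sum (0.5435 against 0.4768) but ranks lower, because it matches only 5 of the 25 query terms.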