Document (#34078)

Author
Zhou, G.D.
Zhang, M.
Ji, D.H.
Zhu, Q.M.
Title
Hierarchical learning strategy in semantic relation extraction
Source
Information processing and management. 44(2008) no.3, S.1008-1021
Year
2008
Abstract
This paper proposes a novel hierarchical learning strategy to deal with the data sparseness problem in semantic relation extraction by modeling the commonality among related classes. For each class in the hierarchy either manually predefined or automatically clustered, a discriminative function is determined in a top-down way. As the upper-level class normally has much more positive training examples than the lower-level class, the corresponding discriminative function can be determined more reliably and guide the discriminative function learning in the lower-level one more effectively, which otherwise might suffer from limited training data. In this paper, two classifier learning approaches, i.e. the simple perceptron algorithm and the state-of-the-art Support Vector Machines, are applied using the hierarchical learning strategy. Moreover, several kinds of class hierarchies either manually predefined or automatically clustered are explored and compared. Evaluation on the ACE RDC 2003 and 2004 corpora shows that the hierarchical learning strategy much improves the performance on least- and medium-frequent relations.
Theme
Automatisches Klassifizieren

Similar documents (author)

  1. Zhou, L.; Zhang, D.: NLPIR: a theoretical framework for applying Natural Language Processing to information retrieval (2003) 5.01
    5.0124793 = sum of:
      5.0124793 = sum of:
        1.7987939 = weight(author_txt:zhang in 148) [ClassicSimilarity], result of:
          1.7987939 = score(doc=148,freq=1.0), product of:
            0.56184655 = queryWeight, product of:
              6.40315 = idf(docFreq=199, maxDocs=44421)
              0.08774533 = queryNorm
            3.201575 = fieldWeight in 148, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              6.40315 = idf(docFreq=199, maxDocs=44421)
              0.5 = fieldNorm(doc=148)
        3.2136853 = weight(author_txt:zhou in 148) [ClassicSimilarity], result of:
          3.2136853 = score(doc=148,freq=1.0), product of:
            0.82724154 = queryWeight, product of:
              1.2134093 = boost
              7.769642 = idf(docFreq=50, maxDocs=44421)
              0.08774533 = queryNorm
            3.884821 = fieldWeight in 148, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              7.769642 = idf(docFreq=50, maxDocs=44421)
              0.5 = fieldNorm(doc=148)
    
  2. Zhou, G.D.; Zhang, M.: Extracting relation information from text documents by exploring various types of knowledge (2007) 5.01
    5.0124793 = sum of:
      5.0124793 = sum of:
        1.7987939 = weight(author_txt:zhang in 1927) [ClassicSimilarity], result of:
          1.7987939 = score(doc=1927,freq=1.0), product of:
            0.56184655 = queryWeight, product of:
              6.40315 = idf(docFreq=199, maxDocs=44421)
              0.08774533 = queryNorm
            3.201575 = fieldWeight in 1927, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              6.40315 = idf(docFreq=199, maxDocs=44421)
              0.5 = fieldNorm(doc=1927)
        3.2136853 = weight(author_txt:zhou in 1927) [ClassicSimilarity], result of:
          3.2136853 = score(doc=1927,freq=1.0), product of:
            0.82724154 = queryWeight, product of:
              1.2134093 = boost
              7.769642 = idf(docFreq=50, maxDocs=44421)
              0.08774533 = queryNorm
            3.884821 = fieldWeight in 1927, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              7.769642 = idf(docFreq=50, maxDocs=44421)
              0.5 = fieldNorm(doc=1927)
    
  3. Zhang, M.; Zhou, G.D.; Aw, A.: Exploring syntactic structured features over parse trees for relation extraction using kernel methods (2008) 3.76
    3.7593594 = sum of:
      3.7593594 = sum of:
        1.3490953 = weight(author_txt:zhang in 3055) [ClassicSimilarity], result of:
          1.3490953 = score(doc=3055,freq=1.0), product of:
            0.56184655 = queryWeight, product of:
              6.40315 = idf(docFreq=199, maxDocs=44421)
              0.08774533 = queryNorm
            2.4011812 = fieldWeight in 3055, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              6.40315 = idf(docFreq=199, maxDocs=44421)
              0.375 = fieldNorm(doc=3055)
        2.410264 = weight(author_txt:zhou in 3055) [ClassicSimilarity], result of:
          2.410264 = score(doc=3055,freq=1.0), product of:
            0.82724154 = queryWeight, product of:
              1.2134093 = boost
              7.769642 = idf(docFreq=50, maxDocs=44421)
              0.08774533 = queryNorm
            2.9136157 = fieldWeight in 3055, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              7.769642 = idf(docFreq=50, maxDocs=44421)
              0.375 = fieldNorm(doc=3055)
    
  4. Zhang, D.; Zambrowicz, C.; Zhou, H.; Roderer, N.K.: User information seeking behavior in a medical Web portal environment : a preliminary study (2004) 3.13
    3.1327996 = sum of:
      3.1327996 = sum of:
        1.1242462 = weight(author_txt:zhang in 3261) [ClassicSimilarity], result of:
          1.1242462 = score(doc=3261,freq=1.0), product of:
            0.56184655 = queryWeight, product of:
              6.40315 = idf(docFreq=199, maxDocs=44421)
              0.08774533 = queryNorm
            2.0009844 = fieldWeight in 3261, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              6.40315 = idf(docFreq=199, maxDocs=44421)
              0.3125 = fieldNorm(doc=3261)
        2.0085533 = weight(author_txt:zhou in 3261) [ClassicSimilarity], result of:
          2.0085533 = score(doc=3261,freq=1.0), product of:
            0.82724154 = queryWeight, product of:
              1.2134093 = boost
              7.769642 = idf(docFreq=50, maxDocs=44421)
              0.08774533 = queryNorm
            2.428013 = fieldWeight in 3261, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              7.769642 = idf(docFreq=50, maxDocs=44421)
              0.3125 = fieldNorm(doc=3261)
    
  5. Chang, K.-C.; Zhou, W.; Zhang, S.; Yuan, C,-C.: Threshold effects of the patent H-index in the relationship between patent citations and market value (2015) 3.13
    3.1327996 = sum of:
      3.1327996 = sum of:
        1.1242462 = weight(author_txt:zhang in 3344) [ClassicSimilarity], result of:
          1.1242462 = score(doc=3344,freq=1.0), product of:
            0.56184655 = queryWeight, product of:
              6.40315 = idf(docFreq=199, maxDocs=44421)
              0.08774533 = queryNorm
            2.0009844 = fieldWeight in 3344, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              6.40315 = idf(docFreq=199, maxDocs=44421)
              0.3125 = fieldNorm(doc=3344)
        2.0085533 = weight(author_txt:zhou in 3344) [ClassicSimilarity], result of:
          2.0085533 = score(doc=3344,freq=1.0), product of:
            0.82724154 = queryWeight, product of:
              1.2134093 = boost
              7.769642 = idf(docFreq=50, maxDocs=44421)
              0.08774533 = queryNorm
            2.428013 = fieldWeight in 3344, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              7.769642 = idf(docFreq=50, maxDocs=44421)
              0.3125 = fieldNorm(doc=3344)
    

Similar documents (content)

  1. Ru, C.; Tang, J.; Li, S.; Xie, S.; Wang, T.: Using semantic similarity to reduce wrong labels in distant supervision for relation extraction (2018) 0.13
    0.13338745 = sum of:
      0.13338745 = product of:
        0.47638375 = sum of:
          0.06795596 = weight(abstract_txt:suffer in 55) [ClassicSimilarity], result of:
            0.06795596 = score(doc=55,freq=1.0), product of:
              0.13300076 = queryWeight, product of:
                8.175107 = idf(docFreq=33, maxDocs=44421)
                0.016268993 = queryNorm
              0.5109442 = fieldWeight in 55, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.175107 = idf(docFreq=33, maxDocs=44421)
                0.0625 = fieldNorm(doc=55)
          0.044570755 = weight(abstract_txt:semantic in 55) [ClassicSimilarity], result of:
            0.044570755 = score(doc=55,freq=4.0), product of:
              0.0796879 = queryWeight, product of:
                1.0946723 = boost
                4.4745317 = idf(docFreq=1375, maxDocs=44421)
                0.016268993 = queryNorm
              0.55931646 = fieldWeight in 55, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.4745317 = idf(docFreq=1375, maxDocs=44421)
                0.0625 = fieldNorm(doc=55)
          0.030524712 = weight(abstract_txt:much in 55) [ClassicSimilarity], result of:
            0.030524712 = score(doc=55,freq=1.0), product of:
              0.09828339 = queryWeight, product of:
                1.2157044 = boost
                4.969257 = idf(docFreq=838, maxDocs=44421)
                0.016268993 = queryNorm
              0.31057855 = fieldWeight in 55, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.969257 = idf(docFreq=838, maxDocs=44421)
                0.0625 = fieldNorm(doc=55)
          0.033081975 = weight(abstract_txt:training in 55) [ClassicSimilarity], result of:
            0.033081975 = score(doc=55,freq=1.0), product of:
              0.10369871 = queryWeight, product of:
                1.2487475 = boost
                5.104322 = idf(docFreq=732, maxDocs=44421)
                0.016268993 = queryNorm
              0.31902012 = fieldWeight in 55, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.104322 = idf(docFreq=732, maxDocs=44421)
                0.0625 = fieldNorm(doc=55)
          0.12291443 = weight(abstract_txt:relation in 55) [ClassicSimilarity], result of:
            0.12291443 = score(doc=55,freq=10.0), product of:
              0.11546428 = queryWeight, product of:
                1.3176855 = boost
                5.38611 = idf(docFreq=552, maxDocs=44421)
                0.016268993 = queryNorm
              1.0645235 = fieldWeight in 55, product of:
                3.1622777 = tf(freq=10.0), with freq of:
                  10.0 = termFreq=10.0
                5.38611 = idf(docFreq=552, maxDocs=44421)
                0.0625 = fieldNorm(doc=55)
          0.05921775 = weight(abstract_txt:automatically in 55) [ClassicSimilarity], result of:
            0.05921775 = score(doc=55,freq=2.0), product of:
              0.12133991 = queryWeight, product of:
                1.350796 = boost
                5.521451 = idf(docFreq=482, maxDocs=44421)
                0.016268993 = queryNorm
              0.48803192 = fieldWeight in 55, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.521451 = idf(docFreq=482, maxDocs=44421)
                0.0625 = fieldNorm(doc=55)
          0.11811818 = weight(abstract_txt:extraction in 55) [ClassicSimilarity], result of:
            0.11811818 = score(doc=55,freq=4.0), product of:
              0.15260552 = queryWeight, product of:
                1.514862 = boost
                6.192079 = idf(docFreq=246, maxDocs=44421)
                0.016268993 = queryNorm
              0.7740099 = fieldWeight in 55, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.192079 = idf(docFreq=246, maxDocs=44421)
                0.0625 = fieldNorm(doc=55)
        0.28 = coord(7/25)
    
  2. Jiang, X.; Tan, A.-H.: CRCTOL: a semantic-based domain ontology learning system (2009) 0.13
    0.12574996 = sum of:
      0.12574996 = product of:
        0.39296865 = sum of:
          0.03377448 = weight(abstract_txt:semantic in 307) [ClassicSimilarity], result of:
            0.03377448 = score(doc=307,freq=3.0), product of:
              0.0796879 = queryWeight, product of:
                1.0946723 = boost
                4.4745317 = idf(docFreq=1375, maxDocs=44421)
                0.016268993 = queryNorm
              0.42383447 = fieldWeight in 307, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.4745317 = idf(docFreq=1375, maxDocs=44421)
                0.0546875 = fieldNorm(doc=307)
          0.012789927 = weight(abstract_txt:more in 307) [ClassicSimilarity], result of:
            0.012789927 = score(doc=307,freq=1.0), product of:
              0.068862505 = queryWeight, product of:
                1.2463069 = boost
                3.3962307 = idf(docFreq=4044, maxDocs=44421)
                0.016268993 = queryNorm
              0.18573137 = fieldWeight in 307, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3962307 = idf(docFreq=4044, maxDocs=44421)
                0.0546875 = fieldNorm(doc=307)
          0.048097875 = weight(abstract_txt:relation in 307) [ClassicSimilarity], result of:
            0.048097875 = score(doc=307,freq=2.0), product of:
              0.11546428 = queryWeight, product of:
                1.3176855 = boost
                5.38611 = idf(docFreq=552, maxDocs=44421)
                0.016268993 = queryNorm
              0.41656065 = fieldWeight in 307, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.38611 = idf(docFreq=552, maxDocs=44421)
                0.0546875 = fieldNorm(doc=307)
          0.036639113 = weight(abstract_txt:automatically in 307) [ClassicSimilarity], result of:
            0.036639113 = score(doc=307,freq=1.0), product of:
              0.12133991 = queryWeight, product of:
                1.350796 = boost
                5.521451 = idf(docFreq=482, maxDocs=44421)
                0.016268993 = queryNorm
              0.30195436 = fieldWeight in 307, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.521451 = idf(docFreq=482, maxDocs=44421)
                0.0546875 = fieldNorm(doc=307)
          0.038020756 = weight(abstract_txt:either in 307) [ClassicSimilarity], result of:
            0.038020756 = score(doc=307,freq=1.0), product of:
              0.12437149 = queryWeight, product of:
                1.3675662 = boost
                5.59 = idf(docFreq=450, maxDocs=44421)
                0.016268993 = queryNorm
              0.30570313 = fieldWeight in 307, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.59 = idf(docFreq=450, maxDocs=44421)
                0.0546875 = fieldNorm(doc=307)
          0.051676705 = weight(abstract_txt:extraction in 307) [ClassicSimilarity], result of:
            0.051676705 = score(doc=307,freq=1.0), product of:
              0.15260552 = queryWeight, product of:
                1.514862 = boost
                6.192079 = idf(docFreq=246, maxDocs=44421)
                0.016268993 = queryNorm
              0.33862934 = fieldWeight in 307, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.192079 = idf(docFreq=246, maxDocs=44421)
                0.0546875 = fieldNorm(doc=307)
          0.051363252 = weight(abstract_txt:level in 307) [ClassicSimilarity], result of:
            0.051363252 = score(doc=307,freq=3.0), product of:
              0.12063279 = queryWeight, product of:
                1.649553 = boost
                4.4950905 = idf(docFreq=1347, maxDocs=44421)
                0.016268993 = queryNorm
              0.42578185 = fieldWeight in 307, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.4950905 = idf(docFreq=1347, maxDocs=44421)
                0.0546875 = fieldNorm(doc=307)
          0.12060653 = weight(abstract_txt:learning in 307) [ClassicSimilarity], result of:
            0.12060653 = score(doc=307,freq=3.0), product of:
              0.26850614 = queryWeight, product of:
                3.4803722 = boost
                4.7420692 = idf(docFreq=1052, maxDocs=44421)
                0.016268993 = queryNorm
              0.44917604 = fieldWeight in 307, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.7420692 = idf(docFreq=1052, maxDocs=44421)
                0.0546875 = fieldNorm(doc=307)
        0.32 = coord(8/25)
    
  3. Yu, N.: Exploring co-training strategies for opinion detection (2014) 0.12
    0.12204066 = sum of:
      0.12204066 = product of:
        0.4358595 = sum of:
          0.030524712 = weight(abstract_txt:much in 2503) [ClassicSimilarity], result of:
            0.030524712 = score(doc=2503,freq=1.0), product of:
              0.09828339 = queryWeight, product of:
                1.2157044 = boost
                4.969257 = idf(docFreq=838, maxDocs=44421)
                0.016268993 = queryNorm
              0.31057855 = fieldWeight in 2503, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.969257 = idf(docFreq=838, maxDocs=44421)
                0.0625 = fieldNorm(doc=2503)
          0.014617059 = weight(abstract_txt:more in 2503) [ClassicSimilarity], result of:
            0.014617059 = score(doc=2503,freq=1.0), product of:
              0.068862505 = queryWeight, product of:
                1.2463069 = boost
                3.3962307 = idf(docFreq=4044, maxDocs=44421)
                0.016268993 = queryNorm
              0.21226442 = fieldWeight in 2503, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3962307 = idf(docFreq=4044, maxDocs=44421)
                0.0625 = fieldNorm(doc=2503)
          0.06616395 = weight(abstract_txt:training in 2503) [ClassicSimilarity], result of:
            0.06616395 = score(doc=2503,freq=4.0), product of:
              0.10369871 = queryWeight, product of:
                1.2487475 = boost
                5.104322 = idf(docFreq=732, maxDocs=44421)
                0.016268993 = queryNorm
              0.63804024 = fieldWeight in 2503, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.104322 = idf(docFreq=732, maxDocs=44421)
                0.0625 = fieldNorm(doc=2503)
          0.05921775 = weight(abstract_txt:automatically in 2503) [ClassicSimilarity], result of:
            0.05921775 = score(doc=2503,freq=2.0), product of:
              0.12133991 = queryWeight, product of:
                1.350796 = boost
                5.521451 = idf(docFreq=482, maxDocs=44421)
                0.016268993 = queryNorm
              0.48803192 = fieldWeight in 2503, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.521451 = idf(docFreq=482, maxDocs=44421)
                0.0625 = fieldNorm(doc=2503)
          0.033890955 = weight(abstract_txt:level in 2503) [ClassicSimilarity], result of:
            0.033890955 = score(doc=2503,freq=1.0), product of:
              0.12063279 = queryWeight, product of:
                1.649553 = boost
                4.4950905 = idf(docFreq=1347, maxDocs=44421)
                0.016268993 = queryNorm
              0.28094316 = fieldWeight in 2503, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4950905 = idf(docFreq=1347, maxDocs=44421)
                0.0625 = fieldNorm(doc=2503)
          0.09360905 = weight(abstract_txt:strategy in 2503) [ClassicSimilarity], result of:
            0.09360905 = score(doc=2503,freq=1.0), product of:
              0.26137716 = queryWeight, product of:
                2.8037336 = boost
                5.7302055 = idf(docFreq=391, maxDocs=44421)
                0.016268993 = queryNorm
              0.35813785 = fieldWeight in 2503, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7302055 = idf(docFreq=391, maxDocs=44421)
                0.0625 = fieldNorm(doc=2503)
          0.13783602 = weight(abstract_txt:learning in 2503) [ClassicSimilarity], result of:
            0.13783602 = score(doc=2503,freq=3.0), product of:
              0.26850614 = queryWeight, product of:
                3.4803722 = boost
                4.7420692 = idf(docFreq=1052, maxDocs=44421)
                0.016268993 = queryNorm
              0.51334405 = fieldWeight in 2503, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.7420692 = idf(docFreq=1052, maxDocs=44421)
                0.0625 = fieldNorm(doc=2503)
        0.28 = coord(7/25)
    
  4. Li, J.; Zhang, Z.; Li, X.; Chen, H.: Kernel-based learning for biomedical relation extraction (2008) 0.11
    0.108457044 = sum of:
      0.108457044 = product of:
        0.5422852 = sum of:
          0.11901138 = weight(abstract_txt:relation in 2611) [ClassicSimilarity], result of:
            0.11901138 = score(doc=2611,freq=6.0), product of:
              0.11546428 = queryWeight, product of:
                1.3176855 = boost
                5.38611 = idf(docFreq=552, maxDocs=44421)
                0.016268993 = queryNorm
              1.0307204 = fieldWeight in 2611, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.38611 = idf(docFreq=552, maxDocs=44421)
                0.078125 = fieldNorm(doc=2611)
          0.07402219 = weight(abstract_txt:automatically in 2611) [ClassicSimilarity], result of:
            0.07402219 = score(doc=2611,freq=2.0), product of:
              0.12133991 = queryWeight, product of:
                1.350796 = boost
                5.521451 = idf(docFreq=482, maxDocs=44421)
                0.016268993 = queryNorm
              0.6100399 = fieldWeight in 2611, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.521451 = idf(docFreq=482, maxDocs=44421)
                0.078125 = fieldNorm(doc=2611)
          0.12786669 = weight(abstract_txt:extraction in 2611) [ClassicSimilarity], result of:
            0.12786669 = score(doc=2611,freq=3.0), product of:
              0.15260552 = queryWeight, product of:
                1.514862 = boost
                6.192079 = idf(docFreq=246, maxDocs=44421)
                0.016268993 = queryNorm
              0.83789027 = fieldWeight in 2611, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.192079 = idf(docFreq=246, maxDocs=44421)
                0.078125 = fieldNorm(doc=2611)
          0.08070666 = weight(abstract_txt:function in 2611) [ClassicSimilarity], result of:
            0.08070666 = score(doc=2611,freq=1.0), product of:
              0.18538548 = queryWeight, product of:
                2.0448968 = boost
                5.5724173 = idf(docFreq=458, maxDocs=44421)
                0.016268993 = queryNorm
              0.4353451 = fieldWeight in 2611, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5724173 = idf(docFreq=458, maxDocs=44421)
                0.078125 = fieldNorm(doc=2611)
          0.1406783 = weight(abstract_txt:learning in 2611) [ClassicSimilarity], result of:
            0.1406783 = score(doc=2611,freq=2.0), product of:
              0.26850614 = queryWeight, product of:
                3.4803722 = boost
                4.7420692 = idf(docFreq=1052, maxDocs=44421)
                0.016268993 = queryNorm
              0.52392954 = fieldWeight in 2611, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.7420692 = idf(docFreq=1052, maxDocs=44421)
                0.078125 = fieldNorm(doc=2611)
        0.2 = coord(5/25)
    
  5. Wu, T.; Pottenger, W.M.: ¬A semi-supervised active learning algorithm for information extraction from textual data (2005) 0.11
    0.10654359 = sum of:
      0.10654359 = product of:
        0.53271794 = sum of:
          0.057299662 = weight(abstract_txt:training in 4237) [ClassicSimilarity], result of:
            0.057299662 = score(doc=4237,freq=3.0), product of:
              0.10369871 = queryWeight, product of:
                1.2487475 = boost
                5.104322 = idf(docFreq=732, maxDocs=44421)
                0.016268993 = queryNorm
              0.5525591 = fieldWeight in 4237, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.104322 = idf(docFreq=732, maxDocs=44421)
                0.0625 = fieldNorm(doc=4237)
          0.05921775 = weight(abstract_txt:automatically in 4237) [ClassicSimilarity], result of:
            0.05921775 = score(doc=4237,freq=2.0), product of:
              0.12133991 = queryWeight, product of:
                1.350796 = boost
                5.521451 = idf(docFreq=482, maxDocs=44421)
                0.016268993 = queryNorm
              0.48803192 = fieldWeight in 4237, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.521451 = idf(docFreq=482, maxDocs=44421)
                0.0625 = fieldNorm(doc=4237)
          0.11811818 = weight(abstract_txt:extraction in 4237) [ClassicSimilarity], result of:
            0.11811818 = score(doc=4237,freq=4.0), product of:
              0.15260552 = queryWeight, product of:
                1.514862 = boost
                6.192079 = idf(docFreq=246, maxDocs=44421)
                0.016268993 = queryNorm
              0.7740099 = fieldWeight in 4237, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.192079 = idf(docFreq=246, maxDocs=44421)
                0.0625 = fieldNorm(doc=4237)
          0.07299711 = weight(abstract_txt:manually in 4237) [ClassicSimilarity], result of:
            0.07299711 = score(doc=4237,freq=1.0), product of:
              0.17575842 = queryWeight, product of:
                1.625721 = boost
                6.6452217 = idf(docFreq=156, maxDocs=44421)
                0.016268993 = queryNorm
              0.41532636 = fieldWeight in 4237, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.6452217 = idf(docFreq=156, maxDocs=44421)
                0.0625 = fieldNorm(doc=4237)
          0.22508529 = weight(abstract_txt:learning in 4237) [ClassicSimilarity], result of:
            0.22508529 = score(doc=4237,freq=8.0), product of:
              0.26850614 = queryWeight, product of:
                3.4803722 = boost
                4.7420692 = idf(docFreq=1052, maxDocs=44421)
                0.016268993 = queryNorm
              0.8382873 = fieldWeight in 4237, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                4.7420692 = idf(docFreq=1052, maxDocs=44421)
                0.0625 = fieldNorm(doc=4237)
        0.2 = coord(5/25)