Document (#38185)

Author
Silva, R.M.
Gonçalves, M.A.
Veloso, A.
Title
¬A Two-stage active learning method for learning to rank
Source
Journal of the Association for Information Science and Technology. 65(2014) no.1, S.109-128
Year
2014
Abstract
Learning to rank (L2R) algorithms use a labeled training set to generate a ranking model that can later be used to rank new query results. These training sets are costly and laborious to produce, requiring human annotators to assess the relevance or order of the documents in relation to a query. Active learning algorithms are able to reduce the labeling effort by selectively sampling an unlabeled set and choosing data instances that maximize a learning function's effectiveness. In this article, we propose a novel two-stage active learning method for L2R that combines and exploits interesting properties of its constituent parts, thus being effective and practical. In the first stage, an association rule active sampling algorithm is used to select a very small but effective initial training set. In the second stage, a query-by-committee strategy trained with the first-stage set is used to iteratively select more examples until a preset labeling budget is met or a target effectiveness is achieved. We test our method with various LETOR benchmarking data sets and compare it with several baselines to show that it achieves good results using only a small portion of the original training sets.
Content
Vgl.: http://onlinelibrary.wiley.com/doi/10.1002/asi.22958/abstract.
Theme
Retrievalalgorithmen

Similar documents (author)

  1. Cortez, E.; Silva, A.S. da; Gonçalves, M.A.; Mesquita, F.; Moura, E.S. de: ¬A flexible approach for extracting metadata from bibliographic citations (2009) 2.84
    2.8423824 = sum of:
      2.8423824 = sum of:
        1.1448084 = weight(author_txt:silva in 3848) [ClassicSimilarity], result of:
          1.1448084 = score(doc=3848,freq=1.0), product of:
            0.60960436 = queryWeight, product of:
              7.5118127 = idf(docFreq=65, maxDocs=44421)
              0.08115277 = queryNorm
            1.8779532 = fieldWeight in 3848, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              7.5118127 = idf(docFreq=65, maxDocs=44421)
              0.25 = fieldNorm(doc=3848)
        1.6975741 = weight(author_txt:gonçalves in 3848) [ClassicSimilarity], result of:
          1.6975741 = score(doc=3848,freq=1.0), product of:
            0.7927058 = queryWeight, product of:
              1.1403338 = boost
              8.565973 = idf(docFreq=22, maxDocs=44421)
              0.08115277 = queryNorm
            2.1414933 = fieldWeight in 3848, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.565973 = idf(docFreq=22, maxDocs=44421)
              0.25 = fieldNorm(doc=3848)
    
  2. Moura, E.S. de; Fernandes, D.; Ribeiro-Neto, B.; Silva, A.S. da; Gonçalves, M.A.: Using structural information to improve search in Web collections (2010) 2.84
    2.8423824 = sum of:
      2.8423824 = sum of:
        1.1448084 = weight(author_txt:silva in 119) [ClassicSimilarity], result of:
          1.1448084 = score(doc=119,freq=1.0), product of:
            0.60960436 = queryWeight, product of:
              7.5118127 = idf(docFreq=65, maxDocs=44421)
              0.08115277 = queryNorm
            1.8779532 = fieldWeight in 119, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              7.5118127 = idf(docFreq=65, maxDocs=44421)
              0.25 = fieldNorm(doc=119)
        1.6975741 = weight(author_txt:gonçalves in 119) [ClassicSimilarity], result of:
          1.6975741 = score(doc=119,freq=1.0), product of:
            0.7927058 = queryWeight, product of:
              1.1403338 = boost
              8.565973 = idf(docFreq=22, maxDocs=44421)
              0.08115277 = queryNorm
            2.1414933 = fieldWeight in 119, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.565973 = idf(docFreq=22, maxDocs=44421)
              0.25 = fieldNorm(doc=119)
    
  3. Silva, A.J.C.; Gonçalves, M.A.; Laender, A.H.F.; Modesto, M.A.B.; Cristo, M.; Ziviani, N.: Finding what is missing from a digital library : a case study in the computer science field (2009) 2.84
    2.8423824 = sum of:
      2.8423824 = sum of:
        1.1448084 = weight(author_txt:silva in 219) [ClassicSimilarity], result of:
          1.1448084 = score(doc=219,freq=1.0), product of:
            0.60960436 = queryWeight, product of:
              7.5118127 = idf(docFreq=65, maxDocs=44421)
              0.08115277 = queryNorm
            1.8779532 = fieldWeight in 219, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              7.5118127 = idf(docFreq=65, maxDocs=44421)
              0.25 = fieldNorm(doc=219)
        1.6975741 = weight(author_txt:gonçalves in 219) [ClassicSimilarity], result of:
          1.6975741 = score(doc=219,freq=1.0), product of:
            0.7927058 = queryWeight, product of:
              1.1403338 = boost
              8.565973 = idf(docFreq=22, maxDocs=44421)
              0.08115277 = queryNorm
            2.1414933 = fieldWeight in 219, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.565973 = idf(docFreq=22, maxDocs=44421)
              0.25 = fieldNorm(doc=219)
    
  4. Cavalcante Dourado, Í.; Galante, R.; Gonçalves, M.A.; Silva Torres, R. de: Bag of textual graphs (BoTG) : a general graph-based text representation model (2019) 2.84
    2.8423824 = sum of:
      2.8423824 = sum of:
        1.1448084 = weight(author_txt:silva in 291) [ClassicSimilarity], result of:
          1.1448084 = score(doc=291,freq=1.0), product of:
            0.60960436 = queryWeight, product of:
              7.5118127 = idf(docFreq=65, maxDocs=44421)
              0.08115277 = queryNorm
            1.8779532 = fieldWeight in 291, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              7.5118127 = idf(docFreq=65, maxDocs=44421)
              0.25 = fieldNorm(doc=291)
        1.6975741 = weight(author_txt:gonçalves in 291) [ClassicSimilarity], result of:
          1.6975741 = score(doc=291,freq=1.0), product of:
            0.7927058 = queryWeight, product of:
              1.1403338 = boost
              8.565973 = idf(docFreq=22, maxDocs=44421)
              0.08115277 = queryNorm
            2.1414933 = fieldWeight in 291, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.565973 = idf(docFreq=22, maxDocs=44421)
              0.25 = fieldNorm(doc=291)
    
  5. Sant'Ana, R.C. Gonçalves => Gonçalves Sant'Ana, R.C.: 1.80
    1.8005493 = sum of:
      1.8005493 = product of:
        3.6010985 = sum of:
          3.6010985 = weight(author_txt:gonçalves in 732) [ClassicSimilarity], result of:
            3.6010985 = score(doc=732,freq=2.0), product of:
              0.7927058 = queryWeight, product of:
                1.1403338 = boost
                8.565973 = idf(docFreq=22, maxDocs=44421)
                0.08115277 = queryNorm
              4.5427933 = fieldWeight in 732, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.565973 = idf(docFreq=22, maxDocs=44421)
                0.375 = fieldNorm(doc=732)
        0.5 = coord(1/2)
    

Similar documents (content)

  1. Ko, Y.; Seo, J.: Text classification from unlabeled documents with bootstrapping and feature projection techniques (2009) 0.24
    0.24286342 = sum of:
      0.24286342 = product of:
        0.7589482 = sum of:
          0.15785158 = weight(abstract_txt:unlabeled in 3452) [ClassicSimilarity], result of:
            0.15785158 = score(doc=3452,freq=2.0), product of:
              0.19001053 = queryWeight, product of:
                1.1455123 = boost
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.01764825 = queryNorm
              0.8307517 = fieldWeight in 3452, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.0625 = fieldNorm(doc=3452)
          0.010058608 = weight(abstract_txt:that in 3452) [ClassicSimilarity], result of:
            0.010058608 = score(doc=3452,freq=2.0), product of:
              0.04811978 = queryWeight, product of:
                1.1529295 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.01764825 = queryNorm
              0.20903271 = fieldWeight in 3452, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.0625 = fieldNorm(doc=3452)
          0.015260126 = weight(abstract_txt:used in 3452) [ClassicSimilarity], result of:
            0.015260126 = score(doc=3452,freq=1.0), product of:
              0.072727874 = queryWeight, product of:
                1.2275014 = boost
                3.3572001 = idf(docFreq=4205, maxDocs=44421)
                0.01764825 = queryNorm
              0.20982501 = fieldWeight in 3452, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3572001 = idf(docFreq=4205, maxDocs=44421)
                0.0625 = fieldNorm(doc=3452)
          0.0497934 = weight(abstract_txt:algorithms in 3452) [ClassicSimilarity], result of:
            0.0497934 = score(doc=3452,freq=1.0), product of:
              0.13976966 = queryWeight, product of:
                1.3894163 = boost
                5.7000527 = idf(docFreq=403, maxDocs=44421)
                0.01764825 = queryNorm
              0.3562533 = fieldWeight in 3452, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7000527 = idf(docFreq=403, maxDocs=44421)
                0.0625 = fieldNorm(doc=3452)
          0.082111396 = weight(abstract_txt:method in 3452) [ClassicSimilarity], result of:
            0.082111396 = score(doc=3452,freq=5.0), product of:
              0.13059938 = queryWeight, product of:
                1.64491 = boost
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.01764825 = queryNorm
              0.6287273 = fieldWeight in 3452, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.0625 = fieldNorm(doc=3452)
          0.12908174 = weight(abstract_txt:labeling in 3452) [ClassicSimilarity], result of:
            0.12908174 = score(doc=3452,freq=1.0), product of:
              0.26375958 = queryWeight, product of:
                1.9086666 = boost
                7.8302665 = idf(docFreq=47, maxDocs=44421)
                0.01764825 = queryNorm
              0.48939165 = fieldWeight in 3452, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.8302665 = idf(docFreq=47, maxDocs=44421)
                0.0625 = fieldNorm(doc=3452)
          0.07151208 = weight(abstract_txt:training in 3452) [ClassicSimilarity], result of:
            0.07151208 = score(doc=3452,freq=1.0), product of:
              0.22416165 = queryWeight, product of:
                2.488408 = boost
                5.104322 = idf(docFreq=732, maxDocs=44421)
                0.01764825 = queryNorm
              0.31902012 = fieldWeight in 3452, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.104322 = idf(docFreq=732, maxDocs=44421)
                0.0625 = fieldNorm(doc=3452)
          0.2432793 = weight(abstract_txt:learning in 3452) [ClassicSimilarity], result of:
            0.2432793 = score(doc=3452,freq=8.0), product of:
              0.29020992 = queryWeight, product of:
                3.4677093 = boost
                4.7420692 = idf(docFreq=1052, maxDocs=44421)
                0.01764825 = queryNorm
              0.8382873 = fieldWeight in 3452, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                4.7420692 = idf(docFreq=1052, maxDocs=44421)
                0.0625 = fieldNorm(doc=3452)
        0.32 = coord(8/25)
    
  2. Xu, B.; Lin, H.; Lin, Y.: Assessment of learning to rank methods for query expansion (2016) 0.23
    0.22826326 = sum of:
      0.22826326 = product of:
        0.7133227 = sum of:
          0.00711251 = weight(abstract_txt:that in 3929) [ClassicSimilarity], result of:
            0.00711251 = score(doc=3929,freq=1.0), product of:
              0.04811978 = queryWeight, product of:
                1.1529295 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.01764825 = queryNorm
              0.14780845 = fieldWeight in 3929, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.0625 = fieldNorm(doc=3929)
          0.043048434 = weight(abstract_txt:effective in 3929) [ClassicSimilarity], result of:
            0.043048434 = score(doc=3929,freq=2.0), product of:
              0.10067616 = queryWeight, product of:
                1.1792048 = boost
                4.837664 = idf(docFreq=956, maxDocs=44421)
                0.01764825 = queryNorm
              0.42759314 = fieldWeight in 3929, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.837664 = idf(docFreq=956, maxDocs=44421)
                0.0625 = fieldNorm(doc=3929)
          0.015260126 = weight(abstract_txt:used in 3929) [ClassicSimilarity], result of:
            0.015260126 = score(doc=3929,freq=1.0), product of:
              0.072727874 = queryWeight, product of:
                1.2275014 = boost
                3.3572001 = idf(docFreq=4205, maxDocs=44421)
                0.01764825 = queryNorm
              0.20982501 = fieldWeight in 3929, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3572001 = idf(docFreq=4205, maxDocs=44421)
                0.0625 = fieldNorm(doc=3929)
          0.05193181 = weight(abstract_txt:method in 3929) [ClassicSimilarity], result of:
            0.05193181 = score(doc=3929,freq=2.0), product of:
              0.13059938 = queryWeight, product of:
                1.64491 = boost
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.01764825 = queryNorm
              0.39764208 = fieldWeight in 3929, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.0625 = fieldNorm(doc=3929)
          0.06129905 = weight(abstract_txt:query in 3929) [ClassicSimilarity], result of:
            0.06129905 = score(doc=3929,freq=2.0), product of:
              0.14586619 = queryWeight, product of:
                1.7383968 = boost
                4.754492 = idf(docFreq=1039, maxDocs=44421)
                0.01764825 = queryNorm
              0.42024165 = fieldWeight in 3929, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.754492 = idf(docFreq=1039, maxDocs=44421)
                0.0625 = fieldNorm(doc=3929)
          0.12908174 = weight(abstract_txt:labeling in 3929) [ClassicSimilarity], result of:
            0.12908174 = score(doc=3929,freq=1.0), product of:
              0.26375958 = queryWeight, product of:
                1.9086666 = boost
                7.8302665 = idf(docFreq=47, maxDocs=44421)
                0.01764825 = queryNorm
              0.48939165 = fieldWeight in 3929, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.8302665 = idf(docFreq=47, maxDocs=44421)
                0.0625 = fieldNorm(doc=3929)
          0.21325989 = weight(abstract_txt:rank in 3929) [ClassicSimilarity], result of:
            0.21325989 = score(doc=3929,freq=4.0), product of:
              0.2658163 = queryWeight, product of:
                2.346726 = boost
                6.418264 = idf(docFreq=196, maxDocs=44421)
                0.01764825 = queryNorm
              0.802283 = fieldWeight in 3929, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.418264 = idf(docFreq=196, maxDocs=44421)
                0.0625 = fieldNorm(doc=3929)
          0.19232917 = weight(abstract_txt:learning in 3929) [ClassicSimilarity], result of:
            0.19232917 = score(doc=3929,freq=5.0), product of:
              0.29020992 = queryWeight, product of:
                3.4677093 = boost
                4.7420692 = idf(docFreq=1052, maxDocs=44421)
                0.01764825 = queryNorm
              0.6627243 = fieldWeight in 3929, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.7420692 = idf(docFreq=1052, maxDocs=44421)
                0.0625 = fieldNorm(doc=3929)
        0.32 = coord(8/25)
    
  3. Kuo, J.-S.; Li, H.; Yang, Y.-K.: Active learning for constructing transliteration lexicons from the Web (2008) 0.22
    0.21822666 = sum of:
      0.21822666 = product of:
        0.90927774 = sum of:
          0.13082492 = weight(abstract_txt:iteratively in 2345) [ClassicSimilarity], result of:
            0.13082492 = score(doc=2345,freq=1.0), product of:
              0.1611961 = queryWeight, product of:
                1.0550869 = boost
                8.656945 = idf(docFreq=20, maxDocs=44421)
                0.01764825 = queryNorm
              0.81158864 = fieldWeight in 2345, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.656945 = idf(docFreq=20, maxDocs=44421)
                0.09375 = fieldNorm(doc=2345)
          0.018478842 = weight(abstract_txt:that in 2345) [ClassicSimilarity], result of:
            0.018478842 = score(doc=2345,freq=3.0), product of:
              0.04811978 = queryWeight, product of:
                1.1529295 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.01764825 = queryNorm
              0.3840176 = fieldWeight in 2345, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.09375 = fieldNorm(doc=2345)
          0.045659762 = weight(abstract_txt:effective in 2345) [ClassicSimilarity], result of:
            0.045659762 = score(doc=2345,freq=1.0), product of:
              0.10067616 = queryWeight, product of:
                1.1792048 = boost
                4.837664 = idf(docFreq=956, maxDocs=44421)
                0.01764825 = queryNorm
              0.45353103 = fieldWeight in 2345, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.837664 = idf(docFreq=956, maxDocs=44421)
                0.09375 = fieldNorm(doc=2345)
          0.1936226 = weight(abstract_txt:labeling in 2345) [ClassicSimilarity], result of:
            0.1936226 = score(doc=2345,freq=1.0), product of:
              0.26375958 = queryWeight, product of:
                1.9086666 = boost
                7.8302665 = idf(docFreq=47, maxDocs=44421)
                0.01764825 = queryNorm
              0.73408747 = fieldWeight in 2345, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.8302665 = idf(docFreq=47, maxDocs=44421)
                0.09375 = fieldNorm(doc=2345)
          0.20466253 = weight(abstract_txt:active in 2345) [ClassicSimilarity], result of:
            0.20466253 = score(doc=2345,freq=1.0), product of:
              0.3448311 = queryWeight, product of:
                3.0863428 = boost
                6.3308296 = idf(docFreq=214, maxDocs=44421)
                0.01764825 = queryNorm
              0.5935153 = fieldWeight in 2345, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.3308296 = idf(docFreq=214, maxDocs=44421)
                0.09375 = fieldNorm(doc=2345)
          0.31602907 = weight(abstract_txt:learning in 2345) [ClassicSimilarity], result of:
            0.31602907 = score(doc=2345,freq=6.0), product of:
              0.29020992 = queryWeight, product of:
                3.4677093 = boost
                4.7420692 = idf(docFreq=1052, maxDocs=44421)
                0.01764825 = queryNorm
              1.0889672 = fieldWeight in 2345, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                4.7420692 = idf(docFreq=1052, maxDocs=44421)
                0.09375 = fieldNorm(doc=2345)
        0.24 = coord(6/25)
    
  4. Lin, Y.; Lin, H.; Xu, K.; Sun, X.: Learning to rank using smoothing methods for language modeling (2013) 0.21
    0.20549905 = sum of:
      0.20549905 = product of:
        0.64218456 = sum of:
          0.00711251 = weight(abstract_txt:that in 1687) [ClassicSimilarity], result of:
            0.00711251 = score(doc=1687,freq=1.0), product of:
              0.04811978 = queryWeight, product of:
                1.1529295 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.01764825 = queryNorm
              0.14780845 = fieldWeight in 1687, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.0625 = fieldNorm(doc=1687)
          0.043048434 = weight(abstract_txt:effective in 1687) [ClassicSimilarity], result of:
            0.043048434 = score(doc=1687,freq=2.0), product of:
              0.10067616 = queryWeight, product of:
                1.1792048 = boost
                4.837664 = idf(docFreq=956, maxDocs=44421)
                0.01764825 = queryNorm
              0.42759314 = fieldWeight in 1687, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.837664 = idf(docFreq=956, maxDocs=44421)
                0.0625 = fieldNorm(doc=1687)
          0.050324768 = weight(abstract_txt:effectiveness in 1687) [ClassicSimilarity], result of:
            0.050324768 = score(doc=1687,freq=2.0), product of:
              0.11172308 = queryWeight, product of:
                1.2422168 = boost
                5.0961695 = idf(docFreq=738, maxDocs=44421)
                0.01764825 = queryNorm
              0.450442 = fieldWeight in 1687, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.0961695 = idf(docFreq=738, maxDocs=44421)
                0.0625 = fieldNorm(doc=1687)
          0.0652522 = weight(abstract_txt:select in 1687) [ClassicSimilarity], result of:
            0.0652522 = score(doc=1687,freq=1.0), product of:
              0.16737676 = queryWeight, product of:
                1.5204549 = boost
                6.2376356 = idf(docFreq=235, maxDocs=44421)
                0.01764825 = queryNorm
              0.38985223 = fieldWeight in 1687, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2376356 = idf(docFreq=235, maxDocs=44421)
                0.0625 = fieldNorm(doc=1687)
          0.063603215 = weight(abstract_txt:method in 1687) [ClassicSimilarity], result of:
            0.063603215 = score(doc=1687,freq=3.0), product of:
              0.13059938 = queryWeight, product of:
                1.64491 = boost
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.01764825 = queryNorm
              0.4870101 = fieldWeight in 1687, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.0625 = fieldNorm(doc=1687)
          0.056130536 = weight(abstract_txt:sets in 1687) [ClassicSimilarity], result of:
            0.056130536 = score(doc=1687,freq=1.0), product of:
              0.17329855 = queryWeight, product of:
                1.8948247 = boost
                5.18232 = idf(docFreq=677, maxDocs=44421)
                0.01764825 = queryNorm
              0.323895 = fieldWeight in 1687, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.18232 = idf(docFreq=677, maxDocs=44421)
                0.0625 = fieldNorm(doc=1687)
          0.1846885 = weight(abstract_txt:rank in 1687) [ClassicSimilarity], result of:
            0.1846885 = score(doc=1687,freq=3.0), product of:
              0.2658163 = queryWeight, product of:
                2.346726 = boost
                6.418264 = idf(docFreq=196, maxDocs=44421)
                0.01764825 = queryNorm
              0.69479746 = fieldWeight in 1687, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.418264 = idf(docFreq=196, maxDocs=44421)
                0.0625 = fieldNorm(doc=1687)
          0.17202444 = weight(abstract_txt:learning in 1687) [ClassicSimilarity], result of:
            0.17202444 = score(doc=1687,freq=4.0), product of:
              0.29020992 = queryWeight, product of:
                3.4677093 = boost
                4.7420692 = idf(docFreq=1052, maxDocs=44421)
                0.01764825 = queryNorm
              0.59275866 = fieldWeight in 1687, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.7420692 = idf(docFreq=1052, maxDocs=44421)
                0.0625 = fieldNorm(doc=1687)
        0.32 = coord(8/25)
    
  5. Wu, T.; Pottenger, W.M.: ¬A semi-supervised active learning algorithm for information extraction from textual data (2005) 0.19
    0.19263618 = sum of:
      0.19263618 = product of:
        0.80265075 = sum of:
          0.00711251 = weight(abstract_txt:that in 4237) [ClassicSimilarity], result of:
            0.00711251 = score(doc=4237,freq=1.0), product of:
              0.04811978 = queryWeight, product of:
                1.1529295 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.01764825 = queryNorm
              0.14780845 = fieldWeight in 4237, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.0625 = fieldNorm(doc=4237)
          0.026431315 = weight(abstract_txt:used in 4237) [ClassicSimilarity], result of:
            0.026431315 = score(doc=4237,freq=3.0), product of:
              0.072727874 = queryWeight, product of:
                1.2275014 = boost
                3.3572001 = idf(docFreq=4205, maxDocs=44421)
                0.01764825 = queryNorm
              0.36342758 = fieldWeight in 4237, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.3572001 = idf(docFreq=4205, maxDocs=44421)
                0.0625 = fieldNorm(doc=4237)
          0.12908174 = weight(abstract_txt:labeling in 4237) [ClassicSimilarity], result of:
            0.12908174 = score(doc=4237,freq=1.0), product of:
              0.26375958 = queryWeight, product of:
                1.9086666 = boost
                7.8302665 = idf(docFreq=47, maxDocs=44421)
                0.01764825 = queryNorm
              0.48939165 = fieldWeight in 4237, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.8302665 = idf(docFreq=47, maxDocs=44421)
                0.0625 = fieldNorm(doc=4237)
          0.12386256 = weight(abstract_txt:training in 4237) [ClassicSimilarity], result of:
            0.12386256 = score(doc=4237,freq=3.0), product of:
              0.22416165 = queryWeight, product of:
                2.488408 = boost
                5.104322 = idf(docFreq=732, maxDocs=44421)
                0.01764825 = queryNorm
              0.5525591 = fieldWeight in 4237, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.104322 = idf(docFreq=732, maxDocs=44421)
                0.0625 = fieldNorm(doc=4237)
          0.2728834 = weight(abstract_txt:active in 4237) [ClassicSimilarity], result of:
            0.2728834 = score(doc=4237,freq=4.0), product of:
              0.3448311 = queryWeight, product of:
                3.0863428 = boost
                6.3308296 = idf(docFreq=214, maxDocs=44421)
                0.01764825 = queryNorm
              0.7913537 = fieldWeight in 4237, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.3308296 = idf(docFreq=214, maxDocs=44421)
                0.0625 = fieldNorm(doc=4237)
          0.2432793 = weight(abstract_txt:learning in 4237) [ClassicSimilarity], result of:
            0.2432793 = score(doc=4237,freq=8.0), product of:
              0.29020992 = queryWeight, product of:
                3.4677093 = boost
                4.7420692 = idf(docFreq=1052, maxDocs=44421)
                0.01764825 = queryNorm
              0.8382873 = fieldWeight in 4237, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                4.7420692 = idf(docFreq=1052, maxDocs=44421)
                0.0625 = fieldNorm(doc=4237)
        0.24 = coord(6/25)