Document (#36219)

Author
Li, M.
Li, H.
Zhou, Z.-H.
Title
Semi-supervised document retrieval
Source
Information processing and management. 45(2009) no.3, pp. 341-355
Year
2009
Abstract
This paper proposes a new machine learning method for constructing ranking models in document retrieval. The method, referred to as SSRank, aims to combine the advantages of traditional Information Retrieval (IR) methods with those of the supervised learning methods recently proposed for IR: the need for only a limited amount of labeled data, and a rich model representation. To do so, the method adopts a semi-supervised learning framework for ranking model construction. Specifically, given a small number of documents labeled with respect to some queries, the method labels the unlabeled documents for those queries. It then uses all the labeled data to train a machine learning model (in our case, a neural network). In the data labeling step, the method also makes use of a traditional IR model (in our case, BM25). A stopping criterion based on machine learning theory is given for the data labeling process. Experimental results on three benchmark datasets and one web search dataset indicate that SSRank consistently, and almost always significantly, outperforms the baseline methods (unsupervised and supervised learning methods) given the same amount of labeled data. This is because SSRank can make effective use of unlabeled data in learning.
Theme
Retrieval algorithms

Similar documents (author)

  1. Zhou, L.: Characteristics of material organization and classification in the Kinsey Institute Library (2003) 4.86
    4.8560257 = sum of:
      4.8560257 = weight(author_txt:zhou in 639) [ClassicSimilarity], result of:
        4.8560257 = score(doc=639,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            7.769642 = idf(docFreq=50, maxDocs=44421)
            0.12870605 = queryNorm
          4.856026 = fieldWeight in 639, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            7.769642 = idf(docFreq=50, maxDocs=44421)
            0.625 = fieldNorm(doc=639)
    
  2. Zhou, J.-z.: A new subclass for Library of Congress Classification, QF : Computer science (1998) 3.88
    3.8848207 = sum of:
      3.8848207 = weight(author_txt:zhou in 3846) [ClassicSimilarity], result of:
        3.8848207 = score(doc=3846,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            7.769642 = idf(docFreq=50, maxDocs=44421)
            0.12870605 = queryNorm
          3.884821 = fieldWeight in 3846, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            7.769642 = idf(docFreq=50, maxDocs=44421)
            0.5 = fieldNorm(doc=3846)
    
  3. Zhou, L.; Zhang, D.: NLPIR: a theoretical framework for applying Natural Language Processing to information retrieval (2003) 3.88
    3.8848207 = sum of:
      3.8848207 = weight(author_txt:zhou in 148) [ClassicSimilarity], result of:
        3.8848207 = score(doc=148,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            7.769642 = idf(docFreq=50, maxDocs=44421)
            0.12870605 = queryNorm
          3.884821 = fieldWeight in 148, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            7.769642 = idf(docFreq=50, maxDocs=44421)
            0.5 = fieldNorm(doc=148)
    
  4. Zhou, P.; Leydesdorff, L.: A comparison between the China Scientific and Technical Papers and Citations Database and the Science Citation Index in terms of journal hierarchies and interjournal citation relations (2007) 3.88
    3.8848207 = sum of:
      3.8848207 = weight(author_txt:zhou in 1070) [ClassicSimilarity], result of:
        3.8848207 = score(doc=1070,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            7.769642 = idf(docFreq=50, maxDocs=44421)
            0.12870605 = queryNorm
          3.884821 = fieldWeight in 1070, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            7.769642 = idf(docFreq=50, maxDocs=44421)
            0.5 = fieldNorm(doc=1070)
    
  5. Zhou, G.D.; Zhang, M.: Extracting relation information from text documents by exploring various types of knowledge (2007) 3.88
    3.8848207 = sum of:
      3.8848207 = weight(author_txt:zhou in 1927) [ClassicSimilarity], result of:
        3.8848207 = score(doc=1927,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            7.769642 = idf(docFreq=50, maxDocs=44421)
            0.12870605 = queryNorm
          3.884821 = fieldWeight in 1927, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            7.769642 = idf(docFreq=50, maxDocs=44421)
            0.5 = fieldNorm(doc=1927)
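
The score trees above all follow the same arithmetic: score = queryWeight x fieldWeight, where queryWeight = idf x queryNorm and fieldWeight = tf x idf x fieldNorm. The following Python sketch reproduces that arithmetic for the author term "zhou". It is an illustration only, not the search engine's code: the function names are hypothetical, and the idf formula 1 + ln(maxDocs / (docFreq + 1)) and tf = sqrt(freq) are assumed because they match the values printed in the trees.

```python
import math

def classic_idf(doc_freq: int, max_docs: int) -> float:
    # Assumed idf variant; it reproduces idf(docFreq=50, maxDocs=44421) = 7.769642.
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def classic_tf(freq: float) -> float:
    # tf(freq=4.0) = 2.0 and tf(freq=2.0) = 1.4142135 in the trees, i.e. sqrt(freq).
    return math.sqrt(freq)

def term_score(freq, doc_freq, max_docs, query_norm, field_norm, boost=1.0):
    """Score of one term in one document, following the explain-tree layout."""
    idf = classic_idf(doc_freq, max_docs)
    query_weight = boost * idf * query_norm                # "queryWeight, product of"
    field_weight = classic_tf(freq) * idf * field_norm     # "fieldWeight in <doc>, product of"
    return query_weight * field_weight

# Values copied from the trees for doc 639 (fieldNorm 0.625) and doc 3846 (fieldNorm 0.5).
idf_zhou = classic_idf(50, 44421)
score_639 = term_score(1.0, 50, 44421, 0.12870605, 0.625)
score_3846 = term_score(1.0, 50, 44421, 0.12870605, 0.5)

print(round(idf_zhou, 4), round(score_639, 3), round(score_3846, 3))
```

Under these assumptions, the only factor that differs between entry 1 and entries 2 to 5 is fieldNorm (0.625 versus 0.5), which is why the first hit ranks higher although the term frequency is identical.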
    

Similar documents (content)

  1. Ko, Y.; Seo, J.: Text classification from unlabeled documents with bootstrapping and feature projection techniques (2009) 0.60
    0.60128975 = sum of:
      0.60128975 = product of:
        1.6702492 = sum of:
          0.03425737 = weight(abstract_txt:documents in 3452) [ClassicSimilarity], result of:
            0.03425737 = score(doc=3452,freq=4.0), product of:
              0.06646557 = queryWeight, product of:
                1.041601 = boost
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.015475623 = queryNorm
              0.51541525 = fieldWeight in 3452, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.0625 = fieldNorm(doc=3452)
          0.068064086 = weight(abstract_txt:semi in 3452) [ClassicSimilarity], result of:
            0.068064086 = score(doc=3452,freq=1.0), product of:
              0.16674754 = queryWeight, product of:
                1.6498053 = boost
                6.5309834 = idf(docFreq=175, maxDocs=44421)
                0.015475623 = queryNorm
              0.40818647 = fieldWeight in 3452, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5309834 = idf(docFreq=175, maxDocs=44421)
                0.0625 = fieldNorm(doc=3452)
          0.117303744 = weight(abstract_txt:labeling in 3452) [ClassicSimilarity], result of:
            0.117303744 = score(doc=3452,freq=1.0), product of:
              0.23969299 = queryWeight, product of:
                1.9780198 = boost
                7.8302665 = idf(docFreq=47, maxDocs=44421)
                0.015475623 = queryNorm
              0.48939165 = fieldWeight in 3452, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.8302665 = idf(docFreq=47, maxDocs=44421)
                0.0625 = fieldNorm(doc=3452)
          0.076076545 = weight(abstract_txt:machine in 3452) [ClassicSimilarity], result of:
            0.076076545 = score(doc=3452,freq=2.0), product of:
              0.16316801 = queryWeight, product of:
                1.9987853 = boost
                5.274979 = idf(docFreq=617, maxDocs=44421)
                0.015475623 = queryNorm
              0.4662467 = fieldWeight in 3452, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.274979 = idf(docFreq=617, maxDocs=44421)
                0.0625 = fieldNorm(doc=3452)
          0.28689697 = weight(abstract_txt:unlabeled in 3452) [ClassicSimilarity], result of:
            0.28689697 = score(doc=3452,freq=2.0), product of:
              0.34534624 = queryWeight, product of:
                2.3742714 = boost
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.015475623 = queryNorm
              0.8307517 = fieldWeight in 3452, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.0625 = fieldNorm(doc=3452)
          0.12436531 = weight(abstract_txt:method in 3452) [ClassicSimilarity], result of:
            0.12436531 = score(doc=3452,freq=5.0), product of:
              0.19780484 = queryWeight, product of:
                2.8411322 = boost
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.015475623 = queryNorm
              0.6287273 = fieldWeight in 3452, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.0625 = fieldNorm(doc=3452)
          0.4069525 = weight(abstract_txt:supervised in 3452) [ClassicSimilarity], result of:
            0.4069525 = score(doc=3452,freq=4.0), product of:
              0.43598 = queryWeight, product of:
                3.7726912 = boost
                7.467361 = idf(docFreq=68, maxDocs=44421)
                0.015475623 = queryNorm
              0.9334201 = fieldWeight in 3452, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.467361 = idf(docFreq=68, maxDocs=44421)
                0.0625 = fieldNorm(doc=3452)
          0.29840446 = weight(abstract_txt:labeled in 3452) [ClassicSimilarity], result of:
            0.29840446 = score(doc=3452,freq=2.0), product of:
              0.44666743 = queryWeight, product of:
                3.8186524 = boost
                7.558333 = idf(docFreq=62, maxDocs=44421)
                0.015475623 = queryNorm
              0.6680685 = fieldWeight in 3452, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.558333 = idf(docFreq=62, maxDocs=44421)
                0.0625 = fieldNorm(doc=3452)
          0.25792825 = weight(abstract_txt:learning in 3452) [ClassicSimilarity], result of:
            0.25792825 = score(doc=3452,freq=8.0), product of:
              0.3076848 = queryWeight, product of:
                4.1926637 = boost
                4.7420692 = idf(docFreq=1052, maxDocs=44421)
                0.015475623 = queryNorm
              0.8382873 = fieldWeight in 3452, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                4.7420692 = idf(docFreq=1052, maxDocs=44421)
                0.0625 = fieldNorm(doc=3452)
        0.36 = coord(9/25)
    
  2. Billal, B.; Fonseca, A.; Sadat, F.; Lounis, H.: Semi-supervised learning and social media text analysis towards multi-labeling categorization (2017) 0.51
    0.5145408 = sum of:
      0.5145408 = product of:
        1.169411 = sum of:
          0.021623166 = weight(abstract_txt:traditional in 95) [ClassicSimilarity], result of:
            0.021623166 = score(doc=95,freq=1.0), product of:
              0.08486362 = queryWeight, product of:
                1.1769656 = boost
                4.6591816 = idf(docFreq=1143, maxDocs=44421)
                0.015475623 = queryNorm
              0.254799 = fieldWeight in 95, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6591816 = idf(docFreq=1143, maxDocs=44421)
                0.0546875 = fieldNorm(doc=95)
          0.023565583 = weight(abstract_txt:case in 95) [ClassicSimilarity], result of:
            0.023565583 = score(doc=95,freq=1.0), product of:
              0.089872636 = queryWeight, product of:
                1.2112024 = boost
                4.794713 = idf(docFreq=998, maxDocs=44421)
                0.015475623 = queryNorm
              0.26221088 = fieldWeight in 95, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.794713 = idf(docFreq=998, maxDocs=44421)
                0.0546875 = fieldNorm(doc=95)
          0.11911216 = weight(abstract_txt:semi in 95) [ClassicSimilarity], result of:
            0.11911216 = score(doc=95,freq=4.0), product of:
              0.16674754 = queryWeight, product of:
                1.6498053 = boost
                6.5309834 = idf(docFreq=175, maxDocs=44421)
                0.015475623 = queryNorm
              0.7143263 = fieldWeight in 95, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.5309834 = idf(docFreq=175, maxDocs=44421)
                0.0546875 = fieldNorm(doc=95)
          0.047069963 = weight(abstract_txt:machine in 95) [ClassicSimilarity], result of:
            0.047069963 = score(doc=95,freq=1.0), product of:
              0.16316801 = queryWeight, product of:
                1.9987853 = boost
                5.274979 = idf(docFreq=617, maxDocs=44421)
                0.015475623 = queryNorm
              0.28847542 = fieldWeight in 95, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.274979 = idf(docFreq=617, maxDocs=44421)
                0.0546875 = fieldNorm(doc=95)
          0.027013443 = weight(abstract_txt:model in 95) [ClassicSimilarity], result of:
            0.027013443 = score(doc=95,freq=1.0), product of:
              0.124023885 = queryWeight, product of:
                2.0121977 = boost
                3.9827821 = idf(docFreq=2249, maxDocs=44421)
                0.015475623 = queryNorm
              0.2178084 = fieldWeight in 95, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9827821 = idf(docFreq=2249, maxDocs=44421)
                0.0546875 = fieldNorm(doc=95)
          0.04301588 = weight(abstract_txt:methods in 95) [ClassicSimilarity], result of:
            0.04301588 = score(doc=95,freq=2.0), product of:
              0.13423361 = queryWeight, product of:
                2.0933826 = boost
                4.1434727 = idf(docFreq=1915, maxDocs=44421)
                0.015475623 = queryNorm
              0.32045534 = fieldWeight in 95, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.1434727 = idf(docFreq=1915, maxDocs=44421)
                0.0546875 = fieldNorm(doc=95)
          0.058024064 = weight(abstract_txt:data in 95) [ClassicSimilarity], result of:
            0.058024064 = score(doc=95,freq=6.0), product of:
              0.1300681 = queryWeight, product of:
                2.5237656 = boost
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.015475623 = queryNorm
              0.44610527 = fieldWeight in 95, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.0546875 = fieldNorm(doc=95)
          0.06882358 = weight(abstract_txt:method in 95) [ClassicSimilarity], result of:
            0.06882358 = score(doc=95,freq=2.0), product of:
              0.19780484 = queryWeight, product of:
                2.8411322 = boost
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.015475623 = queryNorm
              0.3479368 = fieldWeight in 95, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.0546875 = fieldNorm(doc=95)
          0.3981134 = weight(abstract_txt:supervised in 95) [ClassicSimilarity], result of:
            0.3981134 = score(doc=95,freq=5.0), product of:
              0.43598 = queryWeight, product of:
                3.7726912 = boost
                7.467361 = idf(docFreq=68, maxDocs=44421)
                0.015475623 = queryNorm
              0.913146 = fieldWeight in 95, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                7.467361 = idf(docFreq=68, maxDocs=44421)
                0.0546875 = fieldNorm(doc=95)
          0.18462834 = weight(abstract_txt:labeled in 95) [ClassicSimilarity], result of:
            0.18462834 = score(doc=95,freq=1.0), product of:
              0.44666743 = queryWeight, product of:
                3.8186524 = boost
                7.558333 = idf(docFreq=62, maxDocs=44421)
                0.015475623 = queryNorm
              0.41334632 = fieldWeight in 95, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.558333 = idf(docFreq=62, maxDocs=44421)
                0.0546875 = fieldNorm(doc=95)
          0.17842142 = weight(abstract_txt:learning in 95) [ClassicSimilarity], result of:
            0.17842142 = score(doc=95,freq=5.0), product of:
              0.3076848 = queryWeight, product of:
                4.1926637 = boost
                4.7420692 = idf(docFreq=1052, maxDocs=44421)
                0.015475623 = queryNorm
              0.57988375 = fieldWeight in 95, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.7420692 = idf(docFreq=1052, maxDocs=44421)
                0.0546875 = fieldNorm(doc=95)
        0.44 = coord(11/25)
    
  3. Wu, T.; Pottenger, W.M.: A semi-supervised active learning algorithm for information extraction from textual data (2005) 0.36
    0.36330152 = sum of:
      0.36330152 = product of:
        1.2975054 = sum of:
          0.15219593 = weight(abstract_txt:semi in 4237) [ClassicSimilarity], result of:
            0.15219593 = score(doc=4237,freq=5.0), product of:
              0.16674754 = queryWeight, product of:
                1.6498053 = boost
                6.5309834 = idf(docFreq=175, maxDocs=44421)
                0.015475623 = queryNorm
              0.9127327 = fieldWeight in 4237, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                6.5309834 = idf(docFreq=175, maxDocs=44421)
                0.0625 = fieldNorm(doc=4237)
          0.06580103 = weight(abstract_txt:given in 4237) [ClassicSimilarity], result of:
            0.06580103 = score(doc=4237,freq=3.0), product of:
              0.1293975 = queryWeight, product of:
                1.7799654 = boost
                4.6974936 = idf(docFreq=1100, maxDocs=44421)
                0.015475623 = queryNorm
              0.5085186 = fieldWeight in 4237, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.6974936 = idf(docFreq=1100, maxDocs=44421)
                0.0625 = fieldNorm(doc=4237)
          0.117303744 = weight(abstract_txt:labeling in 4237) [ClassicSimilarity], result of:
            0.117303744 = score(doc=4237,freq=1.0), product of:
              0.23969299 = queryWeight, product of:
                1.9780198 = boost
                7.8302665 = idf(docFreq=47, maxDocs=44421)
                0.015475623 = queryNorm
              0.48939165 = fieldWeight in 4237, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.8302665 = idf(docFreq=47, maxDocs=44421)
                0.0625 = fieldNorm(doc=4237)
          0.03828595 = weight(abstract_txt:data in 4237) [ClassicSimilarity], result of:
            0.03828595 = score(doc=4237,freq=2.0), product of:
              0.1300681 = queryWeight, product of:
                2.5237656 = boost
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.015475623 = queryNorm
              0.29435313 = fieldWeight in 4237, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.0625 = fieldNorm(doc=4237)
          0.45498672 = weight(abstract_txt:supervised in 4237) [ClassicSimilarity], result of:
            0.45498672 = score(doc=4237,freq=5.0), product of:
              0.43598 = queryWeight, product of:
                3.7726912 = boost
                7.467361 = idf(docFreq=68, maxDocs=44421)
                0.015475623 = queryNorm
              1.0435954 = fieldWeight in 4237, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                7.467361 = idf(docFreq=68, maxDocs=44421)
                0.0625 = fieldNorm(doc=4237)
          0.21100383 = weight(abstract_txt:labeled in 4237) [ClassicSimilarity], result of:
            0.21100383 = score(doc=4237,freq=1.0), product of:
              0.44666743 = queryWeight, product of:
                3.8186524 = boost
                7.558333 = idf(docFreq=62, maxDocs=44421)
                0.015475623 = queryNorm
              0.4723958 = fieldWeight in 4237, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.558333 = idf(docFreq=62, maxDocs=44421)
                0.0625 = fieldNorm(doc=4237)
          0.25792825 = weight(abstract_txt:learning in 4237) [ClassicSimilarity], result of:
            0.25792825 = score(doc=4237,freq=8.0), product of:
              0.3076848 = queryWeight, product of:
                4.1926637 = boost
                4.7420692 = idf(docFreq=1052, maxDocs=44421)
                0.015475623 = queryNorm
              0.8382873 = fieldWeight in 4237, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                4.7420692 = idf(docFreq=1052, maxDocs=44421)
                0.0625 = fieldNorm(doc=4237)
        0.28 = coord(7/25)
    
  4. Silva, R.M.; Gonçalves, M.A.; Veloso, A.: A Two-stage active learning method for learning to rank (2014) 0.35
    0.34917158 = sum of:
      0.34917158 = product of:
        0.96992105 = sum of:
          0.017128685 = weight(abstract_txt:documents in 2184) [ClassicSimilarity], result of:
            0.017128685 = score(doc=2184,freq=1.0), product of:
              0.06646557 = queryWeight, product of:
                1.041601 = boost
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.015475623 = queryNorm
              0.25770763 = fieldWeight in 2184, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.0625 = fieldNorm(doc=2184)
          0.0428324 = weight(abstract_txt:ranking in 2184) [ClassicSimilarity], result of:
            0.0428324 = score(doc=2184,freq=1.0), product of:
              0.12245101 = queryWeight, product of:
                1.4137876 = boost
                5.5966744 = idf(docFreq=447, maxDocs=44421)
                0.015475623 = queryNorm
              0.34979215 = fieldWeight in 2184, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5966744 = idf(docFreq=447, maxDocs=44421)
                0.0625 = fieldNorm(doc=2184)
          0.16589254 = weight(abstract_txt:labeling in 2184) [ClassicSimilarity], result of:
            0.16589254 = score(doc=2184,freq=2.0), product of:
              0.23969299 = queryWeight, product of:
                1.9780198 = boost
                7.8302665 = idf(docFreq=47, maxDocs=44421)
                0.015475623 = queryNorm
              0.6921043 = fieldWeight in 2184, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.8302665 = idf(docFreq=47, maxDocs=44421)
                0.0625 = fieldNorm(doc=2184)
          0.030872507 = weight(abstract_txt:model in 2184) [ClassicSimilarity], result of:
            0.030872507 = score(doc=2184,freq=1.0), product of:
              0.124023885 = queryWeight, product of:
                2.0121977 = boost
                3.9827821 = idf(docFreq=2249, maxDocs=44421)
                0.015475623 = queryNorm
              0.24892388 = fieldWeight in 2184, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9827821 = idf(docFreq=2249, maxDocs=44421)
                0.0625 = fieldNorm(doc=2184)
          0.20286681 = weight(abstract_txt:unlabeled in 2184) [ClassicSimilarity], result of:
            0.20286681 = score(doc=2184,freq=1.0), product of:
              0.34534624 = queryWeight, product of:
                2.3742714 = boost
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.015475623 = queryNorm
              0.5874302 = fieldWeight in 2184, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.0625 = fieldNorm(doc=2184)
          0.03828595 = weight(abstract_txt:data in 2184) [ClassicSimilarity], result of:
            0.03828595 = score(doc=2184,freq=2.0), product of:
              0.1300681 = queryWeight, product of:
                2.5237656 = boost
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.015475623 = queryNorm
              0.29435313 = fieldWeight in 2184, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.0625 = fieldNorm(doc=2184)
          0.078655526 = weight(abstract_txt:method in 2184) [ClassicSimilarity], result of:
            0.078655526 = score(doc=2184,freq=2.0), product of:
              0.19780484 = queryWeight, product of:
                2.8411322 = boost
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.015475623 = queryNorm
              0.39764208 = fieldWeight in 2184, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.0625 = fieldNorm(doc=2184)
          0.21100383 = weight(abstract_txt:labeled in 2184) [ClassicSimilarity], result of:
            0.21100383 = score(doc=2184,freq=1.0), product of:
              0.44666743 = queryWeight, product of:
                3.8186524 = boost
                7.558333 = idf(docFreq=62, maxDocs=44421)
                0.015475623 = queryNorm
              0.4723958 = fieldWeight in 2184, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.558333 = idf(docFreq=62, maxDocs=44421)
                0.0625 = fieldNorm(doc=2184)
          0.18238284 = weight(abstract_txt:learning in 2184) [ClassicSimilarity], result of:
            0.18238284 = score(doc=2184,freq=4.0), product of:
              0.3076848 = queryWeight, product of:
                4.1926637 = boost
                4.7420692 = idf(docFreq=1052, maxDocs=44421)
                0.015475623 = queryNorm
              0.59275866 = fieldWeight in 2184, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.7420692 = idf(docFreq=1052, maxDocs=44421)
                0.0625 = fieldNorm(doc=2184)
        0.36 = coord(9/25)
    
  5. Yu, N.: Exploring co-training strategies for opinion detection (2014) 0.29
    0.29138428 = sum of:
      0.29138428 = product of:
        1.0406581 = sum of:
          0.076076545 = weight(abstract_txt:machine in 2503) [ClassicSimilarity], result of:
            0.076076545 = score(doc=2503,freq=2.0), product of:
              0.16316801 = queryWeight, product of:
                1.9987853 = boost
                5.274979 = idf(docFreq=617, maxDocs=44421)
                0.015475623 = queryNorm
              0.4662467 = fieldWeight in 2503, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.274979 = idf(docFreq=617, maxDocs=44421)
                0.0625 = fieldNorm(doc=2503)
          0.030872507 = weight(abstract_txt:model in 2503) [ClassicSimilarity], result of:
            0.030872507 = score(doc=2503,freq=1.0), product of:
              0.124023885 = queryWeight, product of:
                2.0121977 = boost
                3.9827821 = idf(docFreq=2249, maxDocs=44421)
                0.015475623 = queryNorm
              0.24892388 = fieldWeight in 2503, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9827821 = idf(docFreq=2249, maxDocs=44421)
                0.0625 = fieldNorm(doc=2503)
          0.03476208 = weight(abstract_txt:methods in 2503) [ClassicSimilarity], result of:
            0.03476208 = score(doc=2503,freq=1.0), product of:
              0.13423361 = queryWeight, product of:
                2.0933826 = boost
                4.1434727 = idf(docFreq=1915, maxDocs=44421)
                0.015475623 = queryNorm
              0.25896704 = fieldWeight in 2503, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1434727 = idf(docFreq=1915, maxDocs=44421)
                0.0625 = fieldNorm(doc=2503)
          0.20286681 = weight(abstract_txt:unlabeled in 2503) [ClassicSimilarity], result of:
            0.20286681 = score(doc=2503,freq=1.0), product of:
              0.34534624 = queryWeight, product of:
                2.3742714 = boost
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.015475623 = queryNorm
              0.5874302 = fieldWeight in 2503, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.0625 = fieldNorm(doc=2503)
          0.066313215 = weight(abstract_txt:data in 2503) [ClassicSimilarity], result of:
            0.066313215 = score(doc=2503,freq=6.0), product of:
              0.1300681 = queryWeight, product of:
                2.5237656 = boost
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.015475623 = queryNorm
              0.5098346 = fieldWeight in 2503, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.0625 = fieldNorm(doc=2503)
          0.47181886 = weight(abstract_txt:labeled in 2503) [ClassicSimilarity], result of:
            0.47181886 = score(doc=2503,freq=5.0), product of:
              0.44666743 = queryWeight, product of:
                3.8186524 = boost
                7.558333 = idf(docFreq=62, maxDocs=44421)
                0.015475623 = queryNorm
              1.0563091 = fieldWeight in 2503, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                7.558333 = idf(docFreq=62, maxDocs=44421)
                0.0625 = fieldNorm(doc=2503)
          0.15794817 = weight(abstract_txt:learning in 2503) [ClassicSimilarity], result of:
            0.15794817 = score(doc=2503,freq=3.0), product of:
              0.3076848 = queryWeight, product of:
                4.1926637 = boost
                4.7420692 = idf(docFreq=1052, maxDocs=44421)
                0.015475623 = queryNorm
              0.51334405 = fieldWeight in 2503, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.7420692 = idf(docFreq=1052, maxDocs=44421)
                0.0625 = fieldNorm(doc=2503)
        0.28 = coord(7/25)
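
For the content-based list, each document score is the sum of its matching term weights multiplied by a coordination factor coord(m/n), where m is the number of query terms that matched and n is the number of query terms (25 here). The Python sketch below recomputes the score of entry 5 (doc 2503) from the per-term weights printed in its tree. It is an illustration with hypothetical names, not the engine's implementation.

```python
# Per-term weights copied from the explain tree of doc 2503.
term_weights_2503 = {
    "machine":   0.076076545,
    "model":     0.030872507,
    "methods":   0.03476208,
    "unlabeled": 0.20286681,
    "data":      0.066313215,
    "labeled":   0.47181886,
    "learning":  0.15794817,
}

def coord(matched_terms: int, query_terms: int) -> float:
    # Fraction of query terms found in the document, e.g. coord(7/25) = 0.28.
    return matched_terms / query_terms

def document_score(term_weights: dict, query_terms: int) -> float:
    # "product of: <sum of term weights>, coord(m/n)" in the explain tree.
    return sum(term_weights.values()) * coord(len(term_weights), query_terms)

weight_sum_2503 = sum(term_weights_2503.values())
score_2503 = document_score(term_weights_2503, query_terms=25)

print(round(weight_sum_2503, 4), round(score_2503, 4))
```

The coordination factor explains why entry 2 (11 of 25 terms matched) outranks entry 3 (7 of 25 terms matched) even though entry 3 has the larger raw sum of term weights (1.2975054 versus 1.169411).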