Document (#41096)

Author
Billal, B.
Fonseca, A.
Sadat, F.
Lounis, H.
Title
Semi-supervised learning and social media text analysis towards multi-labeling categorization
Source
IEEE International Conference on Big Data (Big Data) (2017)
Year
2017
Pages
pp. 1907-1916
Abstract
In traditional text classification, classes are mutually exclusive, i.e., a text or text fragment cannot be assigned to more than one class. In multi-label classification, by contrast, an individual text may belong to several classes simultaneously. This type of classification is required by a large number of current applications, such as big data classification and image and video annotation. Supervised learning is the most widely used type of machine learning for classification tasks. It requires large quantities of labeled data and the intervention of a human tagger to create the training sets. When the data sets become very large or heavily noisy, this operation can be tedious, error-prone and time-consuming. In this case, semi-supervised learning, which requires only a few labels, is a better choice. In this paper, we study and evaluate several methods to address the problem of multi-label classification using semi-supervised learning and data from social networks. First, we propose a linguistic pre-processing step involving tokenisation, named entity recognition and hashtag segmentation in order to decrease the noise in this type of massive and unstructured real data, and then we perform word sense disambiguation using WordNet. Second, several experiments on multi-label classification and semi-supervised learning are carried out on these data sets, and the results of the approaches considered are compared. This paper proposes a method for combining semi-supervised methods with a graph method for the extraction of subjects in social networks using a multi-label classification approach. Experiments show that the proposed model improves classification precision by 4 percentage points compared to a baseline.
Footnote
Cf.: doi:10.1109/BigData.2017.8258136
Theme
Automatic classification

Similar documents (author)

  1. Fonseca, F.: The double role of ontologies in information science research (2007) 5.66
    5.664006 = sum of:
      5.664006 = weight(author_txt:fonseca in 1277) [ClassicSimilarity], result of:
        5.664006 = fieldWeight in 1277, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.06241 = idf(docFreq=13, maxDocs=44421)
          0.625 = fieldNorm(doc=1277)
    
  2. Fonseca, F.: Whether or when : the question on the use of theories in data science (2021) 5.66
    5.664006 = sum of:
      5.664006 = weight(author_txt:fonseca in 1410) [ClassicSimilarity], result of:
        5.664006 = fieldWeight in 1410, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.06241 = idf(docFreq=13, maxDocs=44421)
          0.625 = fieldNorm(doc=1410)
    
  3. Scott, M.; Fonseca, F.: Methodology for functional appraisal of records and creation of a functional thesaurus (1992) 4.53
    4.531205 = sum of:
      4.531205 = weight(author_txt:fonseca in 2095) [ClassicSimilarity], result of:
        4.531205 = fieldWeight in 2095, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.06241 = idf(docFreq=13, maxDocs=44421)
          0.5 = fieldNorm(doc=2095)
    
  4. Fonseca, F.T.; Martin, J.E.: Toward an alternative notion of information systems ontologies : information engineering as a hermeneutic enterprise (2005) 4.53
    4.531205 = sum of:
      4.531205 = weight(author_txt:fonseca in 4266) [ClassicSimilarity], result of:
        4.531205 = fieldWeight in 4266, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.06241 = idf(docFreq=13, maxDocs=44421)
          0.5 = fieldNorm(doc=4266)
    
  5. Câmara, G.; Fonseca, F.: Information policies and open source software in developing countries (2007) 4.53
    4.531205 = sum of:
      4.531205 = weight(author_txt:fonseca in 1090) [ClassicSimilarity], result of:
        4.531205 = fieldWeight in 1090, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.06241 = idf(docFreq=13, maxDocs=44421)
          0.5 = fieldNorm(doc=1090)
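
The author scores above are the product of the three factors each explanation lists. The following is a minimal sketch of that arithmetic, assuming the classic Lucene TF-IDF formulas (tf = sqrt(freq), idf = 1 + ln(maxDocs / (docFreq + 1))). These formulas are inferred from the numbers shown, not stated in this record, and the function names are illustrative. fieldNorm is taken from the listing as given, since it is a stored, length-dependent value.

```python
import math


def tf(freq: float) -> float:
    """Term-frequency factor: square root of the raw frequency (assumed formula)."""
    return math.sqrt(freq)


def idf(doc_freq: int, max_docs: int) -> float:
    """Inverse document frequency (assumed formula, consistent with the values listed)."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def field_weight(freq: float, doc_freq: int, max_docs: int, field_norm: float) -> float:
    """fieldWeight = tf * idf * fieldNorm, as in the explanation tree."""
    return tf(freq) * idf(doc_freq, max_docs) * field_norm


if __name__ == "__main__":
    # Values copied from the listing: author term "fonseca", docFreq=13, maxDocs=44421.
    print(round(idf(13, 44421), 4))
    print(round(field_weight(1.0, 13, 44421, 0.625), 3))  # entries 1 and 2
    print(round(field_weight(1.0, 13, 44421, 0.5), 3))    # entries 3 to 5
```

Under these assumptions the only factor that differs between the entries is fieldNorm (0.625 versus 0.5), which accounts for the two score levels of roughly 5.66 and 4.53.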
    

Similar documents (content)

  1. Ko, Y.; Seo, J.: Text classification from unlabeled documents with bootstrapping and feature projection techniques (2009) 0.54
    0.54457647 = sum of:
      0.54457647 = product of:
        1.2376738 = sum of:
          0.042769678 = weight(abstract_txt:method in 3452) [ClassicSimilarity], result of:
            0.042769678 = score(doc=3452,freq=5.0), product of:
              0.0680258 = queryWeight, product of:
                1.033527 = boost
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.014630342 = queryNorm
              0.6287273 = fieldWeight in 3452, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.0625 = fieldNorm(doc=3452)
          0.026406568 = weight(abstract_txt:compared in 3452) [ClassicSimilarity], result of:
            0.026406568 = score(doc=3452,freq=1.0), product of:
              0.084342726 = queryWeight, product of:
                1.1508238 = boost
                5.0093837 = idf(docFreq=805, maxDocs=44421)
                0.014630342 = queryNorm
              0.31308648 = fieldWeight in 3452, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.0093837 = idf(docFreq=805, maxDocs=44421)
                0.0625 = fieldNorm(doc=3452)
          0.013016664 = weight(abstract_txt:using in 3452) [ClassicSimilarity], result of:
            0.013016664 = score(doc=3452,freq=1.0), product of:
              0.060247157 = queryWeight, product of:
                1.1912391 = boost
                3.4568708 = idf(docFreq=3806, maxDocs=44421)
                0.014630342 = queryNorm
              0.21605442 = fieldWeight in 3452, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4568708 = idf(docFreq=3806, maxDocs=44421)
                0.0625 = fieldNorm(doc=3452)
          0.031714253 = weight(abstract_txt:experiments in 3452) [ClassicSimilarity], result of:
            0.031714253 = score(doc=3452,freq=1.0), product of:
              0.095296286 = queryWeight, product of:
                1.223272 = boost
                5.324741 = idf(docFreq=587, maxDocs=44421)
                0.014630342 = queryNorm
              0.3327963 = fieldWeight in 3452, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.324741 = idf(docFreq=587, maxDocs=44421)
                0.0625 = fieldNorm(doc=3452)
          0.027624156 = weight(abstract_txt:large in 3452) [ClassicSimilarity], result of:
            0.027624156 = score(doc=3452,freq=1.0), product of:
              0.09949382 = queryWeight, product of:
                1.5308361 = boost
                4.4423513 = idf(docFreq=1420, maxDocs=44421)
                0.014630342 = queryNorm
              0.27764696 = fieldWeight in 3452, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4423513 = idf(docFreq=1420, maxDocs=44421)
                0.0625 = fieldNorm(doc=3452)
          0.008779994 = weight(abstract_txt:this in 3452) [ClassicSimilarity], result of:
            0.008779994 = score(doc=3452,freq=1.0), product of:
              0.058381632 = queryWeight, product of:
                1.658379 = boost
                2.4062347 = idf(docFreq=10885, maxDocs=44421)
                0.014630342 = queryNorm
              0.15038967 = fieldWeight in 3452, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4062347 = idf(docFreq=10885, maxDocs=44421)
                0.0625 = fieldNorm(doc=3452)
          0.091680415 = weight(abstract_txt:text in 3452) [ClassicSimilarity], result of:
            0.091680415 = score(doc=3452,freq=7.0), product of:
              0.13720545 = queryWeight, product of:
                2.320816 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.014630342 = queryNorm
              0.66819805 = fieldWeight in 3452, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.0625 = fieldNorm(doc=3452)
          0.19007672 = weight(abstract_txt:learning in 3452) [ClassicSimilarity], result of:
            0.19007672 = score(doc=3452,freq=8.0), product of:
              0.22674413 = queryWeight, product of:
                3.268238 = boost
                4.7420692 = idf(docFreq=1052, maxDocs=44421)
                0.014630342 = queryNorm
              0.8382873 = fieldWeight in 3452, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                4.7420692 = idf(docFreq=1052, maxDocs=44421)
                0.0625 = fieldNorm(doc=3452)
          0.1462968 = weight(abstract_txt:semi in 3452) [ClassicSimilarity], result of:
            0.1462968 = score(doc=3452,freq=1.0), product of:
              0.35840678 = queryWeight, product of:
                3.7509663 = boost
                6.5309834 = idf(docFreq=175, maxDocs=44421)
                0.014630342 = queryNorm
              0.40818647 = fieldWeight in 3452, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5309834 = idf(docFreq=175, maxDocs=44421)
                0.0625 = fieldNorm(doc=3452)
          0.13448696 = weight(abstract_txt:classification in 3452) [ClassicSimilarity], result of:
            0.13448696 = score(doc=3452,freq=5.0), product of:
              0.2410501 = queryWeight, product of:
                4.1270995 = boost
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.014630342 = queryNorm
              0.55792123 = fieldWeight in 3452, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.0625 = fieldNorm(doc=3452)
          0.5248216 = weight(abstract_txt:supervised in 3452) [ClassicSimilarity], result of:
            0.5248216 = score(doc=3452,freq=4.0), product of:
              0.5622566 = queryWeight, product of:
                5.1465116 = boost
                7.467361 = idf(docFreq=68, maxDocs=44421)
                0.014630342 = queryNorm
              0.9334201 = fieldWeight in 3452, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.467361 = idf(docFreq=68, maxDocs=44421)
                0.0625 = fieldNorm(doc=3452)
        0.44 = coord(11/25)
    
  2. Stamatatos, E.: Author identification : using text sampling to handle the class imbalance problem (2008) 0.26
    0.25649 = sum of:
      0.25649 = product of:
        0.7124722 = sum of:
          0.031714253 = weight(abstract_txt:experiments in 3063) [ClassicSimilarity], result of:
            0.031714253 = score(doc=3063,freq=1.0), product of:
              0.095296286 = queryWeight, product of:
                1.223272 = boost
                5.324741 = idf(docFreq=587, maxDocs=44421)
                0.014630342 = queryNorm
              0.3327963 = fieldWeight in 3063, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.324741 = idf(docFreq=587, maxDocs=44421)
                0.0625 = fieldNorm(doc=3063)
          0.083103724 = weight(abstract_txt:classes in 3063) [ClassicSimilarity], result of:
            0.083103724 = score(doc=3063,freq=4.0), product of:
              0.114103764 = queryWeight, product of:
                1.3385513 = boost
                5.8265367 = idf(docFreq=355, maxDocs=44421)
                0.014630342 = queryNorm
              0.7283171 = fieldWeight in 3063, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.8265367 = idf(docFreq=355, maxDocs=44421)
                0.0625 = fieldNorm(doc=3063)
          0.012416787 = weight(abstract_txt:this in 3063) [ClassicSimilarity], result of:
            0.012416787 = score(doc=3063,freq=2.0), product of:
              0.058381632 = queryWeight, product of:
                1.658379 = boost
                2.4062347 = idf(docFreq=10885, maxDocs=44421)
                0.014630342 = queryNorm
              0.21268311 = fieldWeight in 3063, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4062347 = idf(docFreq=10885, maxDocs=44421)
                0.0625 = fieldNorm(doc=3063)
          0.02327562 = weight(abstract_txt:data in 3063) [ClassicSimilarity], result of:
            0.02327562 = score(doc=3063,freq=1.0), product of:
              0.11182723 = queryWeight, product of:
                2.2951944 = boost
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.014630342 = queryNorm
              0.20813909 = fieldWeight in 3063, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.0625 = fieldNorm(doc=3063)
          0.08487958 = weight(abstract_txt:text in 3063) [ClassicSimilarity], result of:
            0.08487958 = score(doc=3063,freq=6.0), product of:
              0.13720545 = queryWeight, product of:
                2.320816 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.014630342 = queryNorm
              0.61863124 = fieldWeight in 3063, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.0625 = fieldNorm(doc=3063)
          0.06720227 = weight(abstract_txt:learning in 3063) [ClassicSimilarity], result of:
            0.06720227 = score(doc=3063,freq=1.0), product of:
              0.22674413 = queryWeight, product of:
                3.268238 = boost
                4.7420692 = idf(docFreq=1052, maxDocs=44421)
                0.014630342 = queryNorm
              0.29637933 = fieldWeight in 3063, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7420692 = idf(docFreq=1052, maxDocs=44421)
                0.0625 = fieldNorm(doc=3063)
          0.15914954 = weight(abstract_txt:label in 3063) [ClassicSimilarity], result of:
            0.15914954 = score(doc=3063,freq=1.0), product of:
              0.35192755 = queryWeight, product of:
                3.3245025 = boost
                7.2355595 = idf(docFreq=86, maxDocs=44421)
                0.014630342 = queryNorm
              0.45222247 = fieldWeight in 3063, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.2355595 = idf(docFreq=86, maxDocs=44421)
                0.0625 = fieldNorm(doc=3063)
          0.190586 = weight(abstract_txt:multi in 3063) [ClassicSimilarity], result of:
            0.190586 = score(doc=3063,freq=3.0), product of:
              0.29641938 = queryWeight, product of:
                3.411209 = boost
                5.9394164 = idf(docFreq=317, maxDocs=44421)
                0.014630342 = queryNorm
              0.64296067 = fieldWeight in 3063, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.9394164 = idf(docFreq=317, maxDocs=44421)
                0.0625 = fieldNorm(doc=3063)
          0.0601444 = weight(abstract_txt:classification in 3063) [ClassicSimilarity], result of:
            0.0601444 = score(doc=3063,freq=1.0), product of:
              0.2410501 = queryWeight, product of:
                4.1270995 = boost
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.014630342 = queryNorm
              0.24950996 = fieldWeight in 3063, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.0625 = fieldNorm(doc=3063)
        0.36 = coord(9/25)
    
  3. Rodríguez-Vidal, J.; Gonzalo, J.; Plaza, L.; Anaya Sánchez, H.: Automatic detection of influencers in social networks : authority versus domain signals (2019) 0.26
    0.25579116 = sum of:
      0.25579116 = product of:
        0.710531 = sum of:
          0.013016664 = weight(abstract_txt:using in 301) [ClassicSimilarity], result of:
            0.013016664 = score(doc=301,freq=1.0), product of:
              0.060247157 = queryWeight, product of:
                1.1912391 = boost
                3.4568708 = idf(docFreq=3806, maxDocs=44421)
                0.014630342 = queryNorm
              0.21605442 = fieldWeight in 301, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4568708 = idf(docFreq=3806, maxDocs=44421)
                0.0625 = fieldNorm(doc=301)
          0.031714253 = weight(abstract_txt:experiments in 301) [ClassicSimilarity], result of:
            0.031714253 = score(doc=301,freq=1.0), product of:
              0.095296286 = queryWeight, product of:
                1.223272 = boost
                5.324741 = idf(docFreq=587, maxDocs=44421)
                0.014630342 = queryNorm
              0.3327963 = fieldWeight in 301, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.324741 = idf(docFreq=587, maxDocs=44421)
                0.0625 = fieldNorm(doc=301)
          0.023397828 = weight(abstract_txt:social in 301) [ClassicSimilarity], result of:
            0.023397828 = score(doc=301,freq=1.0), product of:
              0.08906774 = queryWeight, product of:
                1.4484079 = boost
                4.2031517 = idf(docFreq=1804, maxDocs=44421)
                0.014630342 = queryNorm
              0.26269698 = fieldWeight in 301, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2031517 = idf(docFreq=1804, maxDocs=44421)
                0.0625 = fieldNorm(doc=301)
          0.027624156 = weight(abstract_txt:large in 301) [ClassicSimilarity], result of:
            0.027624156 = score(doc=301,freq=1.0), product of:
              0.09949382 = queryWeight, product of:
                1.5308361 = boost
                4.4423513 = idf(docFreq=1420, maxDocs=44421)
                0.014630342 = queryNorm
              0.27764696 = fieldWeight in 301, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4423513 = idf(docFreq=1420, maxDocs=44421)
                0.0625 = fieldNorm(doc=301)
          0.0438554 = weight(abstract_txt:sets in 301) [ClassicSimilarity], result of:
            0.0438554 = score(doc=301,freq=1.0), product of:
              0.13540004 = queryWeight, product of:
                1.7858298 = boost
                5.18232 = idf(docFreq=677, maxDocs=44421)
                0.014630342 = queryNorm
              0.323895 = fieldWeight in 301, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.18232 = idf(docFreq=677, maxDocs=44421)
                0.0625 = fieldNorm(doc=301)
          0.02327562 = weight(abstract_txt:data in 301) [ClassicSimilarity], result of:
            0.02327562 = score(doc=301,freq=1.0), product of:
              0.11182723 = queryWeight, product of:
                2.2951944 = boost
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.014630342 = queryNorm
              0.20813909 = fieldWeight in 301, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.0625 = fieldNorm(doc=301)
          0.11639775 = weight(abstract_txt:learning in 301) [ClassicSimilarity], result of:
            0.11639775 = score(doc=301,freq=3.0), product of:
              0.22674413 = queryWeight, product of:
                3.268238 = boost
                4.7420692 = idf(docFreq=1052, maxDocs=44421)
                0.014630342 = queryNorm
              0.51334405 = fieldWeight in 301, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.7420692 = idf(docFreq=1052, maxDocs=44421)
                0.0625 = fieldNorm(doc=301)
          0.0601444 = weight(abstract_txt:classification in 301) [ClassicSimilarity], result of:
            0.0601444 = score(doc=301,freq=1.0), product of:
              0.2410501 = queryWeight, product of:
                4.1270995 = boost
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.014630342 = queryNorm
              0.24950996 = fieldWeight in 301, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.0625 = fieldNorm(doc=301)
          0.3711049 = weight(abstract_txt:supervised in 301) [ClassicSimilarity], result of:
            0.3711049 = score(doc=301,freq=2.0), product of:
              0.5622566 = queryWeight, product of:
                5.1465116 = boost
                7.467361 = idf(docFreq=68, maxDocs=44421)
                0.014630342 = queryNorm
              0.6600277 = fieldWeight in 301, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.467361 = idf(docFreq=68, maxDocs=44421)
                0.0625 = fieldNorm(doc=301)
        0.36 = coord(9/25)
    
  4. Xu, L.; Qiu, J.: Unsupervised multi-class sentiment classification approach (2019) 0.25
    0.24820115 = sum of:
      0.24820115 = product of:
        0.88643265 = sum of:
          0.03312925 = weight(abstract_txt:method in 3) [ClassicSimilarity], result of:
            0.03312925 = score(doc=3,freq=3.0), product of:
              0.0680258 = queryWeight, product of:
                1.033527 = boost
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.014630342 = queryNorm
              0.4870101 = fieldWeight in 3, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.0625 = fieldNorm(doc=3)
          0.026406568 = weight(abstract_txt:compared in 3) [ClassicSimilarity], result of:
            0.026406568 = score(doc=3,freq=1.0), product of:
              0.084342726 = queryWeight, product of:
                1.1508238 = boost
                5.0093837 = idf(docFreq=805, maxDocs=44421)
                0.014630342 = queryNorm
              0.31308648 = fieldWeight in 3, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.0093837 = idf(docFreq=805, maxDocs=44421)
                0.0625 = fieldNorm(doc=3)
          0.013016664 = weight(abstract_txt:using in 3) [ClassicSimilarity], result of:
            0.013016664 = score(doc=3,freq=1.0), product of:
              0.060247157 = queryWeight, product of:
                1.1912391 = boost
                3.4568708 = idf(docFreq=3806, maxDocs=44421)
                0.014630342 = queryNorm
              0.21605442 = fieldWeight in 3, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4568708 = idf(docFreq=3806, maxDocs=44421)
                0.0625 = fieldNorm(doc=3)
          0.24604549 = weight(abstract_txt:multi in 3) [ClassicSimilarity], result of:
            0.24604549 = score(doc=3,freq=5.0), product of:
              0.29641938 = queryWeight, product of:
                3.411209 = boost
                5.9394164 = idf(docFreq=317, maxDocs=44421)
                0.014630342 = queryNorm
              0.8300587 = fieldWeight in 3, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.9394164 = idf(docFreq=317, maxDocs=44421)
                0.0625 = fieldNorm(doc=3)
          0.1462968 = weight(abstract_txt:semi in 3) [ClassicSimilarity], result of:
            0.1462968 = score(doc=3,freq=1.0), product of:
              0.35840678 = queryWeight, product of:
                3.7509663 = boost
                6.5309834 = idf(docFreq=175, maxDocs=44421)
                0.014630342 = queryNorm
              0.40818647 = fieldWeight in 3, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5309834 = idf(docFreq=175, maxDocs=44421)
                0.0625 = fieldNorm(doc=3)
          0.15912712 = weight(abstract_txt:classification in 3) [ClassicSimilarity], result of:
            0.15912712 = score(doc=3,freq=7.0), product of:
              0.2410501 = queryWeight, product of:
                4.1270995 = boost
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.014630342 = queryNorm
              0.6601413 = fieldWeight in 3, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.0625 = fieldNorm(doc=3)
          0.2624108 = weight(abstract_txt:supervised in 3) [ClassicSimilarity], result of:
            0.2624108 = score(doc=3,freq=1.0), product of:
              0.5622566 = queryWeight, product of:
                5.1465116 = boost
                7.467361 = idf(docFreq=68, maxDocs=44421)
                0.014630342 = queryNorm
              0.46671006 = fieldWeight in 3, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.467361 = idf(docFreq=68, maxDocs=44421)
                0.0625 = fieldNorm(doc=3)
        0.28 = coord(7/25)
    
  5. Wang, J.: An extensive study on automated Dewey Decimal Classification (2009) 0.24
    0.24448161 = sum of:
      0.24448161 = product of:
        0.67911553 = sum of:
          0.044850726 = weight(abstract_txt:experiments in 159) [ClassicSimilarity], result of:
            0.044850726 = score(doc=159,freq=2.0), product of:
              0.095296286 = queryWeight, product of:
                1.223272 = boost
                5.324741 = idf(docFreq=587, maxDocs=44421)
                0.014630342 = queryNorm
              0.47064504 = fieldWeight in 159, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.324741 = idf(docFreq=587, maxDocs=44421)
                0.0625 = fieldNorm(doc=159)
          0.041551862 = weight(abstract_txt:classes in 159) [ClassicSimilarity], result of:
            0.041551862 = score(doc=159,freq=1.0), product of:
              0.114103764 = queryWeight, product of:
                1.3385513 = boost
                5.8265367 = idf(docFreq=355, maxDocs=44421)
                0.014630342 = queryNorm
              0.36415854 = fieldWeight in 159, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8265367 = idf(docFreq=355, maxDocs=44421)
                0.0625 = fieldNorm(doc=159)
          0.027624156 = weight(abstract_txt:large in 159) [ClassicSimilarity], result of:
            0.027624156 = score(doc=159,freq=1.0), product of:
              0.09949382 = queryWeight, product of:
                1.5308361 = boost
                4.4423513 = idf(docFreq=1420, maxDocs=44421)
                0.014630342 = queryNorm
              0.27764696 = fieldWeight in 159, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4423513 = idf(docFreq=1420, maxDocs=44421)
                0.0625 = fieldNorm(doc=159)
          0.008779994 = weight(abstract_txt:this in 159) [ClassicSimilarity], result of:
            0.008779994 = score(doc=159,freq=1.0), product of:
              0.058381632 = queryWeight, product of:
                1.658379 = boost
                2.4062347 = idf(docFreq=10885, maxDocs=44421)
                0.014630342 = queryNorm
              0.15038967 = fieldWeight in 159, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4062347 = idf(docFreq=10885, maxDocs=44421)
                0.0625 = fieldNorm(doc=159)
          0.032916695 = weight(abstract_txt:data in 159) [ClassicSimilarity], result of:
            0.032916695 = score(doc=159,freq=2.0), product of:
              0.11182723 = queryWeight, product of:
                2.2951944 = boost
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.014630342 = queryNorm
              0.29435313 = fieldWeight in 159, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.0625 = fieldNorm(doc=159)
          0.034651943 = weight(abstract_txt:text in 159) [ClassicSimilarity], result of:
            0.034651943 = score(doc=159,freq=1.0), product of:
              0.13720545 = queryWeight, product of:
                2.320816 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.014630342 = queryNorm
              0.25255513 = fieldWeight in 159, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.0625 = fieldNorm(doc=159)
          0.06720227 = weight(abstract_txt:learning in 159) [ClassicSimilarity], result of:
            0.06720227 = score(doc=159,freq=1.0), product of:
              0.22674413 = queryWeight, product of:
                3.268238 = boost
                4.7420692 = idf(docFreq=1052, maxDocs=44421)
                0.014630342 = queryNorm
              0.29637933 = fieldWeight in 159, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7420692 = idf(docFreq=1052, maxDocs=44421)
                0.0625 = fieldNorm(doc=159)
          0.15912712 = weight(abstract_txt:classification in 159) [ClassicSimilarity], result of:
            0.15912712 = score(doc=159,freq=7.0), product of:
              0.2410501 = queryWeight, product of:
                4.1270995 = boost
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.014630342 = queryNorm
              0.6601413 = fieldWeight in 159, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.0625 = fieldNorm(doc=159)
          0.2624108 = weight(abstract_txt:supervised in 159) [ClassicSimilarity], result of:
            0.2624108 = score(doc=159,freq=1.0), product of:
              0.5622566 = queryWeight, product of:
                5.1465116 = boost
                7.467361 = idf(docFreq=68, maxDocs=44421)
                0.014630342 = queryNorm
              0.46671006 = fieldWeight in 159, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.467361 = idf(docFreq=68, maxDocs=44421)
                0.0625 = fieldNorm(doc=159)
        0.36 = coord(9/25)
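
The content scores above combine a per-term score (queryWeight times fieldWeight) with a coordination factor for the share of query terms matched. The sketch below recomputes the score of the first entry (doc 3452) under the same assumed classic Lucene formulas (tf = sqrt(freq), idf = 1 + ln(maxDocs / (docFreq + 1))). The formulas are inferred from the listed numbers, and the names are illustrative. The boosts, queryNorm and fieldNorm are copied from the listing, because they depend on information this record does not show in full.

```python
import math

MAX_DOCS = 44421
QUERY_NORM = 0.014630342  # copied from the listing
FIELD_NORM = 0.0625       # copied from the listing for doc 3452

# (term, boost, freq in doc 3452, docFreq), all copied from the listing
TERMS = [
    ("method", 1.033527, 5, 1342),
    ("compared", 1.1508238, 1, 805),
    ("using", 1.1912391, 1, 3806),
    ("experiments", 1.223272, 1, 587),
    ("large", 1.5308361, 1, 1420),
    ("this", 1.658379, 1, 10885),
    ("text", 2.320816, 7, 2122),
    ("learning", 3.268238, 8, 1052),
    ("semi", 3.7509663, 1, 175),
    ("classification", 4.1270995, 5, 2228),
    ("supervised", 5.1465116, 4, 68),
]


def idf(doc_freq: int) -> float:
    """Assumed idf formula, consistent with the values in the listing."""
    return 1.0 + math.log(MAX_DOCS / (doc_freq + 1))


def term_score(boost: float, freq: float, doc_freq: int) -> float:
    """Per-term score = queryWeight * fieldWeight."""
    query_weight = boost * idf(doc_freq) * QUERY_NORM
    field_weight = math.sqrt(freq) * idf(doc_freq) * FIELD_NORM
    return query_weight * field_weight


def document_score(terms, total_query_terms: int) -> float:
    """Sum of the matched term scores, scaled by coord = matched / total."""
    coord = len(terms) / total_query_terms
    return coord * sum(term_score(b, f, df) for _, b, f, df in terms)


if __name__ == "__main__":
    print(round(document_score(TERMS, 25), 4))
```

Because idf enters both queryWeight and fieldWeight, rare terms such as "supervised" (docFreq=68) dominate the sum, while frequent terms such as "this" contribute very little.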