Document (#37691)

Author
Zhu, W.Z.
Allen, R.B.
Title
Document clustering using the LSI subspace signature model
Source
Journal of the American Society for Information Science and Technology. 64(2013) no.4, pp. 844-860
Year
2013
Abstract
We describe the latent semantic indexing subspace signature model (LSISSM) for semantic content representation of unstructured text. Grounded in singular value decomposition, the model represents terms and documents by the distribution signatures of their statistical contribution across the top-ranking latent concept dimensions. LSISSM matches term signatures with document signatures according to their mapping coherence between the latent semantic indexing (LSI) term subspace and the LSI document subspace. LSISSM performs feature reduction and finds a low-rank approximation of scalable and sparse term-document matrices. Experiments demonstrate that this approach significantly improves the performance of major clustering algorithms such as standard K-means and self-organizing maps compared with the vector space model and the traditional LSI model. The unique contribution ranking mechanism in LSISSM also improves the initialization of standard K-means compared with the random seeding procedure, which sometimes makes clustering inefficient and ineffective. A two-stage initialization strategy based on LSISSM significantly reduces the running time of standard K-means procedures.
Theme
Automatic classification
Object
Latent semantic indexing
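
The abstract's "low-rank approximation" grounded in singular value decomposition can be illustrated with a minimal sketch. This is plain rank-1 LSI-style approximation by power iteration, not the LSISSM signature model itself; the function name `rank1_svd` and the toy matrix are assumptions made for illustration only.

```python
import math

def rank1_svd(matrix, iterations=100):
    """Leading singular triplet (sigma, u, v) of a matrix via power iteration.

    Assumes a non-negative, non-zero term-document matrix, so a positive
    start vector is not orthogonal to the leading right singular vector.
    """
    rows, cols = len(matrix), len(matrix[0])
    v = [1.0 / math.sqrt(cols)] * cols
    for _ in range(iterations):
        # u = A v, then w = A^T u: one step of power iteration on A^T A
        u = [sum(matrix[i][j] * v[j] for j in range(cols)) for i in range(rows)]
        w = [sum(matrix[i][j] * u[i] for i in range(rows)) for j in range(cols)]
        norm = math.sqrt(sum(x * x for x in w))
        if norm == 0.0:
            break
        v = [x / norm for x in w]
    # Recover the singular value and the left singular vector from A v
    u = [sum(matrix[i][j] * v[j] for j in range(cols)) for i in range(rows)]
    sigma = math.sqrt(sum(x * x for x in u))
    if sigma > 0.0:
        u = [x / sigma for x in u]
    return sigma, u, v

def rank1_approximation(matrix):
    """Best rank-1 approximation sigma * u * v^T of the matrix."""
    sigma, u, v = rank1_svd(matrix)
    return [[sigma * ui * vj for vj in v] for ui in u]

# Toy term-document counts (rows: terms, columns: documents); hypothetical data.
toy = [[3.0, 4.0, 5.0],
       [6.0, 8.0, 10.0]]
approx = rank1_approximation(toy)
```

Because the toy matrix already has rank 1, the approximation reproduces it; on real term-document matrices one would keep the top k singular triplets rather than a single one.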

Similar documents (author)

  1. Allen, B.; Allen, G.: Cognitive abilities of academic librarians and their patrons (1993) 5.37
    5.36736 = sum of:
      5.36736 = weight(author_txt:allen in 6045) [ClassicSimilarity], result of:
        5.36736 = fieldWeight in 6045, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          7.590594 = idf(docFreq=60, maxDocs=44421)
          0.5 = fieldNorm(doc=6045)
    
  2. Allen, M.M.: Bluetooth bytes information retrieval (2001) 4.74
    4.744121 = sum of:
      4.744121 = weight(author_txt:allen in 746) [ClassicSimilarity], result of:
        4.744121 = fieldWeight in 746, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.590594 = idf(docFreq=60, maxDocs=44421)
          0.625 = fieldNorm(doc=746)
    
  3. Allen, B.: Topic knowledge and online catalog search formulation (1991) 4.74
    4.744121 = sum of:
      4.744121 = weight(author_txt:allen in 1070) [ClassicSimilarity], result of:
        4.744121 = fieldWeight in 1070, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.590594 = idf(docFreq=60, maxDocs=44421)
          0.625 = fieldNorm(doc=1070)
    
  4. Allen, L.: Alphabetical subject access, LCSH and a non-traditional approach (1981) 4.74
    4.744121 = sum of:
      4.744121 = weight(author_txt:allen in 1570) [ClassicSimilarity], result of:
        4.744121 = fieldWeight in 1570, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.590594 = idf(docFreq=60, maxDocs=44421)
          0.625 = fieldNorm(doc=1570)
    
  5. Allen, G.G.: Change in the catalogue in the context of library management (1976) 4.74
    4.744121 = sum of:
      4.744121 = weight(author_txt:allen in 1574) [ClassicSimilarity], result of:
        4.744121 = fieldWeight in 1574, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.590594 = idf(docFreq=60, maxDocs=44421)
          0.625 = fieldNorm(doc=1574)
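
The author-match scores above are each a product of three factors: tf, idf and fieldNorm. A minimal sketch that reproduces them follows, assuming tf is the square root of the term frequency and idf is 1 + ln(maxDocs / (docFreq + 1)), which is the formula that matches the values shown. The function names are illustrative, not Lucene API names, and Lucene's single-precision arithmetic may differ in the last digits.

```python
import math

def classic_tf(freq):
    # tf(freq=2.0) = 1.4142135 in the listing, i.e. sqrt(freq)
    return math.sqrt(freq)

def classic_idf(doc_freq, max_docs):
    # idf(docFreq=60, maxDocs=44421) = 7.590594 in the listing
    return 1.0 + math.log(max_docs / (doc_freq + 1.0))

def field_weight(freq, doc_freq, max_docs, field_norm):
    # fieldWeight = tf * idf * fieldNorm
    return classic_tf(freq) * classic_idf(doc_freq, max_docs) * field_norm

# Entry 1 (doc 6045): "allen" occurs twice in the author field, fieldNorm 0.5
first = field_weight(2.0, 60, 44421, 0.5)
# Entries 2 to 5: "allen" occurs once, fieldNorm 0.625
others = field_weight(1.0, 60, 44421, 0.625)
```

The first entry outranks the rest because the doubled term frequency (factor 1.4142135) outweighs its smaller fieldNorm (0.5 versus 0.625).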
    

Similar documents (content)

  1. Berry, M.W.; Dumais, S.T.; O'Brien, G.W.: Using linear algebra for intelligent information retrieval (1995) 0.45
    0.4458978 = sum of:
      0.4458978 = product of:
        1.1147444 = sum of:
          0.064083 = weight(abstract_txt:decomposition in 3206) [ClassicSimilarity], result of:
            0.064083 = score(doc=3206,freq=1.0), product of:
              0.10360411 = queryWeight, product of:
                1.0407716 = boost
                7.917278 = idf(docFreq=43, maxDocs=44421)
                0.012573195 = queryNorm
              0.6185373 = fieldWeight in 3206, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.917278 = idf(docFreq=43, maxDocs=44421)
                0.078125 = fieldNorm(doc=3206)
          0.067056976 = weight(abstract_txt:matrices in 3206) [ClassicSimilarity], result of:
            0.067056976 = score(doc=3206,freq=1.0), product of:
              0.106785186 = queryWeight, product of:
                1.0566288 = boost
                8.037906 = idf(docFreq=38, maxDocs=44421)
                0.012573195 = queryNorm
              0.6279614 = fieldWeight in 3206, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.037906 = idf(docFreq=38, maxDocs=44421)
                0.078125 = fieldNorm(doc=3206)
          0.10442559 = weight(abstract_txt:singular in 3206) [ClassicSimilarity], result of:
            0.10442559 = score(doc=3206,freq=2.0), product of:
              0.11387009 = queryWeight, product of:
                1.0911182 = boost
                8.30027 = idf(docFreq=29, maxDocs=44421)
                0.012573195 = queryNorm
              0.91705894 = fieldWeight in 3206, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.30027 = idf(docFreq=29, maxDocs=44421)
                0.078125 = fieldNorm(doc=3206)
          0.07881355 = weight(abstract_txt:sparse in 3206) [ClassicSimilarity], result of:
            0.07881355 = score(doc=3206,freq=1.0), product of:
              0.11892751 = queryWeight, product of:
                1.1150854 = boost
                8.482592 = idf(docFreq=24, maxDocs=44421)
                0.012573195 = queryNorm
              0.66270244 = fieldWeight in 3206, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.482592 = idf(docFreq=24, maxDocs=44421)
                0.078125 = fieldNorm(doc=3206)
          0.030069113 = weight(abstract_txt:indexing in 3206) [ClassicSimilarity], result of:
            0.030069113 = score(doc=3206,freq=2.0), product of:
              0.06255981 = queryWeight, product of:
                1.1437463 = boost
                4.3503094 = idf(docFreq=1557, maxDocs=44421)
                0.012573195 = queryNorm
              0.48064584 = fieldWeight in 3206, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.3503094 = idf(docFreq=1557, maxDocs=44421)
                0.078125 = fieldNorm(doc=3206)
          0.034703977 = weight(abstract_txt:semantic in 3206) [ClassicSimilarity], result of:
            0.034703977 = score(doc=3206,freq=1.0), product of:
              0.0992754 = queryWeight, product of:
                1.7646087 = boost
                4.4745317 = idf(docFreq=1375, maxDocs=44421)
                0.012573195 = queryNorm
              0.34957278 = fieldWeight in 3206, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4745317 = idf(docFreq=1375, maxDocs=44421)
                0.078125 = fieldNorm(doc=3206)
          0.042699654 = weight(abstract_txt:term in 3206) [ClassicSimilarity], result of:
            0.042699654 = score(doc=3206,freq=1.0), product of:
              0.113991305 = queryWeight, product of:
                1.8908777 = boost
                4.794713 = idf(docFreq=998, maxDocs=44421)
                0.012573195 = queryNorm
              0.37458694 = fieldWeight in 3206, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.794713 = idf(docFreq=998, maxDocs=44421)
                0.078125 = fieldNorm(doc=3206)
          0.057839144 = weight(abstract_txt:document in 3206) [ClassicSimilarity], result of:
            0.057839144 = score(doc=3206,freq=2.0), product of:
              0.12191008 = queryWeight, product of:
                2.2579627 = boost
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.012573195 = queryNorm
              0.47444102 = fieldWeight in 3206, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.078125 = fieldNorm(doc=3206)
          0.13241254 = weight(abstract_txt:latent in 3206) [ClassicSimilarity], result of:
            0.13241254 = score(doc=3206,freq=1.0), product of:
              0.242405 = queryWeight, product of:
                2.757391 = boost
                6.9919376 = idf(docFreq=110, maxDocs=44421)
                0.012573195 = queryNorm
              0.5462451 = fieldWeight in 3206, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9919376 = idf(docFreq=110, maxDocs=44421)
                0.078125 = fieldNorm(doc=3206)
          0.50264084 = weight(abstract_txt:subspace in 3206) [ClassicSimilarity], result of:
            0.50264084 = score(doc=3206,freq=1.0), product of:
              0.64924246 = queryWeight, product of:
                5.210752 = boost
                9.909708 = idf(docFreq=5, maxDocs=44421)
                0.012573195 = queryNorm
              0.7741959 = fieldWeight in 3206, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.909708 = idf(docFreq=5, maxDocs=44421)
                0.078125 = fieldNorm(doc=3206)
        0.4 = coord(10/25)
    
  2. Li, D.; Kwong, C.-P.; Lee, D.L.: Unified linear subspace approach to semantic analysis (2009) 0.24
    0.23894075 = sum of:
      0.23894075 = product of:
        0.59735185 = sum of:
          0.051266406 = weight(abstract_txt:decomposition in 308) [ClassicSimilarity], result of:
            0.051266406 = score(doc=308,freq=1.0), product of:
              0.10360411 = queryWeight, product of:
                1.0407716 = boost
                7.917278 = idf(docFreq=43, maxDocs=44421)
                0.012573195 = queryNorm
              0.49482986 = fieldWeight in 308, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.917278 = idf(docFreq=43, maxDocs=44421)
                0.0625 = fieldNorm(doc=308)
          0.059072033 = weight(abstract_txt:singular in 308) [ClassicSimilarity], result of:
            0.059072033 = score(doc=308,freq=1.0), product of:
              0.11387009 = queryWeight, product of:
                1.0911182 = boost
                8.30027 = idf(docFreq=29, maxDocs=44421)
                0.012573195 = queryNorm
              0.5187669 = fieldWeight in 308, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.30027 = idf(docFreq=29, maxDocs=44421)
                0.0625 = fieldNorm(doc=308)
          0.017009659 = weight(abstract_txt:indexing in 308) [ClassicSimilarity], result of:
            0.017009659 = score(doc=308,freq=1.0), product of:
              0.06255981 = queryWeight, product of:
                1.1437463 = boost
                4.3503094 = idf(docFreq=1557, maxDocs=44421)
                0.012573195 = queryNorm
              0.27189434 = fieldWeight in 308, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3503094 = idf(docFreq=1557, maxDocs=44421)
                0.0625 = fieldNorm(doc=308)
          0.034165047 = weight(abstract_txt:significantly in 308) [ClassicSimilarity], result of:
            0.034165047 = score(doc=308,freq=1.0), product of:
              0.09959092 = queryWeight, product of:
                1.4430847 = boost
                5.4888616 = idf(docFreq=498, maxDocs=44421)
                0.012573195 = queryNorm
              0.34305385 = fieldWeight in 308, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4888616 = idf(docFreq=498, maxDocs=44421)
                0.0625 = fieldNorm(doc=308)
          0.06800564 = weight(abstract_txt:semantic in 308) [ClassicSimilarity], result of:
            0.06800564 = score(doc=308,freq=6.0), product of:
              0.0992754 = queryWeight, product of:
                1.7646087 = boost
                4.4745317 = idf(docFreq=1375, maxDocs=44421)
                0.012573195 = queryNorm
              0.68501997 = fieldWeight in 308, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                4.4745317 = idf(docFreq=1375, maxDocs=44421)
                0.0625 = fieldNorm(doc=308)
          0.032771494 = weight(abstract_txt:standard in 308) [ClassicSimilarity], result of:
            0.032771494 = score(doc=308,freq=1.0), product of:
              0.11088164 = queryWeight, product of:
                1.864908 = boost
                4.7288613 = idf(docFreq=1066, maxDocs=44421)
                0.012573195 = queryNorm
              0.29555383 = fieldWeight in 308, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7288613 = idf(docFreq=1066, maxDocs=44421)
                0.0625 = fieldNorm(doc=308)
          0.05916638 = weight(abstract_txt:term in 308) [ClassicSimilarity], result of:
            0.05916638 = score(doc=308,freq=3.0), product of:
              0.113991305 = queryWeight, product of:
                1.8908777 = boost
                4.794713 = idf(docFreq=998, maxDocs=44421)
                0.012573195 = queryNorm
              0.5190429 = fieldWeight in 308, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.794713 = idf(docFreq=998, maxDocs=44421)
                0.0625 = fieldNorm(doc=308)
          0.046271313 = weight(abstract_txt:document in 308) [ClassicSimilarity], result of:
            0.046271313 = score(doc=308,freq=2.0), product of:
              0.12191008 = queryWeight, product of:
                2.2579627 = boost
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.012573195 = queryNorm
              0.3795528 = fieldWeight in 308, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.0625 = fieldNorm(doc=308)
          0.04614768 = weight(abstract_txt:model in 308) [ClassicSimilarity], result of:
            0.04614768 = score(doc=308,freq=2.0), product of:
              0.13108963 = queryWeight, product of:
                2.6177979 = boost
                3.9827821 = idf(docFreq=2249, maxDocs=44421)
                0.012573195 = queryNorm
              0.35203153 = fieldWeight in 308, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.9827821 = idf(docFreq=2249, maxDocs=44421)
                0.0625 = fieldNorm(doc=308)
          0.1834762 = weight(abstract_txt:latent in 308) [ClassicSimilarity], result of:
            0.1834762 = score(doc=308,freq=3.0), product of:
              0.242405 = queryWeight, product of:
                2.757391 = boost
                6.9919376 = idf(docFreq=110, maxDocs=44421)
                0.012573195 = queryNorm
              0.7568994 = fieldWeight in 308, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.9919376 = idf(docFreq=110, maxDocs=44421)
                0.0625 = fieldNorm(doc=308)
        0.4 = coord(10/25)
    
  3. Ding, C.H.Q.: A probabilistic model for Latent Semantic Indexing (2005) 0.18
    0.18273649 = sum of:
      0.18273649 = product of:
        0.57105154 = sum of:
          0.021262074 = weight(abstract_txt:indexing in 4459) [ClassicSimilarity], result of:
            0.021262074 = score(doc=4459,freq=1.0), product of:
              0.06255981 = queryWeight, product of:
                1.1437463 = boost
                4.3503094 = idf(docFreq=1557, maxDocs=44421)
                0.012573195 = queryNorm
              0.33986792 = fieldWeight in 4459, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3503094 = idf(docFreq=1557, maxDocs=44421)
                0.078125 = fieldNorm(doc=4459)
          0.05014549 = weight(abstract_txt:contribution in 4459) [ClassicSimilarity], result of:
            0.05014549 = score(doc=4459,freq=1.0), product of:
              0.110844195 = queryWeight, product of:
                1.5224339 = boost
                5.790671 = idf(docFreq=368, maxDocs=44421)
                0.012573195 = queryNorm
              0.45239615 = fieldWeight in 4459, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.790671 = idf(docFreq=368, maxDocs=44421)
                0.078125 = fieldNorm(doc=4459)
          0.07760046 = weight(abstract_txt:semantic in 4459) [ClassicSimilarity], result of:
            0.07760046 = score(doc=4459,freq=5.0), product of:
              0.0992754 = queryWeight, product of:
                1.7646087 = boost
                4.4745317 = idf(docFreq=1375, maxDocs=44421)
                0.012573195 = queryNorm
              0.7816685 = fieldWeight in 4459, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.4745317 = idf(docFreq=1375, maxDocs=44421)
                0.078125 = fieldNorm(doc=4459)
          0.078295775 = weight(abstract_txt:improves in 4459) [ClassicSimilarity], result of:
            0.078295775 = score(doc=4459,freq=1.0), product of:
              0.14918229 = queryWeight, product of:
                1.7662029 = boost
                6.717861 = idf(docFreq=145, maxDocs=44421)
                0.012573195 = queryNorm
              0.5248329 = fieldWeight in 4459, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.717861 = idf(docFreq=145, maxDocs=44421)
                0.078125 = fieldNorm(doc=4459)
          0.04096437 = weight(abstract_txt:standard in 4459) [ClassicSimilarity], result of:
            0.04096437 = score(doc=4459,freq=1.0), product of:
              0.11088164 = queryWeight, product of:
                1.864908 = boost
                4.7288613 = idf(docFreq=1066, maxDocs=44421)
                0.012573195 = queryNorm
              0.36944228 = fieldWeight in 4459, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7288613 = idf(docFreq=1066, maxDocs=44421)
                0.078125 = fieldNorm(doc=4459)
          0.057839144 = weight(abstract_txt:document in 4459) [ClassicSimilarity], result of:
            0.057839144 = score(doc=4459,freq=2.0), product of:
              0.12191008 = queryWeight, product of:
                2.2579627 = boost
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.012573195 = queryNorm
              0.47444102 = fieldWeight in 4459, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.078125 = fieldNorm(doc=4459)
          0.0576846 = weight(abstract_txt:model in 4459) [ClassicSimilarity], result of:
            0.0576846 = score(doc=4459,freq=2.0), product of:
              0.13108963 = queryWeight, product of:
                2.6177979 = boost
                3.9827821 = idf(docFreq=2249, maxDocs=44421)
                0.012573195 = queryNorm
              0.4400394 = fieldWeight in 4459, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.9827821 = idf(docFreq=2249, maxDocs=44421)
                0.078125 = fieldNorm(doc=4459)
          0.18725961 = weight(abstract_txt:latent in 4459) [ClassicSimilarity], result of:
            0.18725961 = score(doc=4459,freq=2.0), product of:
              0.242405 = queryWeight, product of:
                2.757391 = boost
                6.9919376 = idf(docFreq=110, maxDocs=44421)
                0.012573195 = queryNorm
              0.77250725 = fieldWeight in 4459, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.9919376 = idf(docFreq=110, maxDocs=44421)
                0.078125 = fieldNorm(doc=4459)
        0.32 = coord(8/25)
    
  4. Dunlavy, D.M.; O'Leary, D.P.; Conroy, J.M.; Schlesinger, J.D.: QCS: A system for querying, clustering and summarizing documents (2007) 0.18
    0.18231152 = sum of:
      0.18231152 = product of:
        0.4557788 = sum of:
          0.044858105 = weight(abstract_txt:decomposition in 1947) [ClassicSimilarity], result of:
            0.044858105 = score(doc=1947,freq=1.0), product of:
              0.10360411 = queryWeight, product of:
                1.0407716 = boost
                7.917278 = idf(docFreq=43, maxDocs=44421)
                0.012573195 = queryNorm
              0.43297613 = fieldWeight in 1947, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.917278 = idf(docFreq=43, maxDocs=44421)
                0.0546875 = fieldNorm(doc=1947)
          0.014883451 = weight(abstract_txt:indexing in 1947) [ClassicSimilarity], result of:
            0.014883451 = score(doc=1947,freq=1.0), product of:
              0.06255981 = queryWeight, product of:
                1.1437463 = boost
                4.3503094 = idf(docFreq=1557, maxDocs=44421)
                0.012573195 = queryNorm
              0.23790754 = fieldWeight in 1947, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3503094 = idf(docFreq=1557, maxDocs=44421)
                0.0546875 = fieldNorm(doc=1947)
          0.024292786 = weight(abstract_txt:semantic in 1947) [ClassicSimilarity], result of:
            0.024292786 = score(doc=1947,freq=1.0), product of:
              0.0992754 = queryWeight, product of:
                1.7646087 = boost
                4.4745317 = idf(docFreq=1375, maxDocs=44421)
                0.012573195 = queryNorm
              0.24470095 = fieldWeight in 1947, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4745317 = idf(docFreq=1375, maxDocs=44421)
                0.0546875 = fieldNorm(doc=1947)
          0.05480704 = weight(abstract_txt:improves in 1947) [ClassicSimilarity], result of:
            0.05480704 = score(doc=1947,freq=1.0), product of:
              0.14918229 = queryWeight, product of:
                1.7662029 = boost
                6.717861 = idf(docFreq=145, maxDocs=44421)
                0.012573195 = queryNorm
              0.36738303 = fieldWeight in 1947, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.717861 = idf(docFreq=145, maxDocs=44421)
                0.0546875 = fieldNorm(doc=1947)
          0.028675057 = weight(abstract_txt:standard in 1947) [ClassicSimilarity], result of:
            0.028675057 = score(doc=1947,freq=1.0), product of:
              0.11088164 = queryWeight, product of:
                1.864908 = boost
                4.7288613 = idf(docFreq=1066, maxDocs=44421)
                0.012573195 = queryNorm
              0.2586096 = fieldWeight in 1947, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7288613 = idf(docFreq=1066, maxDocs=44421)
                0.0546875 = fieldNorm(doc=1947)
          0.03421409 = weight(abstract_txt:means in 1947) [ClassicSimilarity], result of:
            0.03421409 = score(doc=1947,freq=1.0), product of:
              0.124736466 = queryWeight, product of:
                1.977991 = boost
                5.015607 = idf(docFreq=800, maxDocs=44421)
                0.012573195 = queryNorm
              0.274291 = fieldWeight in 1947, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.015607 = idf(docFreq=800, maxDocs=44421)
                0.0546875 = fieldNorm(doc=1947)
          0.040487397 = weight(abstract_txt:document in 1947) [ClassicSimilarity], result of:
            0.040487397 = score(doc=1947,freq=2.0), product of:
              0.12191008 = queryWeight, product of:
                2.2579627 = boost
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.012573195 = queryNorm
              0.3321087 = fieldWeight in 1947, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.0546875 = fieldNorm(doc=1947)
          0.09231966 = weight(abstract_txt:clustering in 1947) [ClassicSimilarity], result of:
            0.09231966 = score(doc=1947,freq=2.0), product of:
              0.19188584 = queryWeight, product of:
                2.453291 = boost
                6.2208285 = idf(docFreq=239, maxDocs=44421)
                0.012573195 = queryNorm
              0.48111764 = fieldWeight in 1947, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.2208285 = idf(docFreq=239, maxDocs=44421)
                0.0546875 = fieldNorm(doc=1947)
          0.028552422 = weight(abstract_txt:model in 1947) [ClassicSimilarity], result of:
            0.028552422 = score(doc=1947,freq=1.0), product of:
              0.13108963 = queryWeight, product of:
                2.6177979 = boost
                3.9827821 = idf(docFreq=2249, maxDocs=44421)
                0.012573195 = queryNorm
              0.2178084 = fieldWeight in 1947, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9827821 = idf(docFreq=2249, maxDocs=44421)
                0.0546875 = fieldNorm(doc=1947)
          0.09268879 = weight(abstract_txt:latent in 1947) [ClassicSimilarity], result of:
            0.09268879 = score(doc=1947,freq=1.0), product of:
              0.242405 = queryWeight, product of:
                2.757391 = boost
                6.9919376 = idf(docFreq=110, maxDocs=44421)
                0.012573195 = queryNorm
              0.3823716 = fieldWeight in 1947, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9919376 = idf(docFreq=110, maxDocs=44421)
                0.0546875 = fieldNorm(doc=1947)
        0.4 = coord(10/25)
    
  5. Chen, L.; Zeng, J.; Tokuda, N.: A "stereo" document representation for textual information retrieval (2006) 0.18
    0.18063119 = sum of:
      0.18063119 = product of:
        0.5644725 = sum of:
          0.021262074 = weight(abstract_txt:indexing in 292) [ClassicSimilarity], result of:
            0.021262074 = score(doc=292,freq=1.0), product of:
              0.06255981 = queryWeight, product of:
                1.1437463 = boost
                4.3503094 = idf(docFreq=1557, maxDocs=44421)
                0.012573195 = queryNorm
              0.33986792 = fieldWeight in 292, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3503094 = idf(docFreq=1557, maxDocs=44421)
                0.078125 = fieldNorm(doc=292)
          0.034703977 = weight(abstract_txt:semantic in 292) [ClassicSimilarity], result of:
            0.034703977 = score(doc=292,freq=1.0), product of:
              0.0992754 = queryWeight, product of:
                1.7646087 = boost
                4.4745317 = idf(docFreq=1375, maxDocs=44421)
                0.012573195 = queryNorm
              0.34957278 = fieldWeight in 292, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4745317 = idf(docFreq=1375, maxDocs=44421)
                0.078125 = fieldNorm(doc=292)
          0.078295775 = weight(abstract_txt:improves in 292) [ClassicSimilarity], result of:
            0.078295775 = score(doc=292,freq=1.0), product of:
              0.14918229 = queryWeight, product of:
                1.7662029 = boost
                6.717861 = idf(docFreq=145, maxDocs=44421)
                0.012573195 = queryNorm
              0.5248329 = fieldWeight in 292, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.717861 = idf(docFreq=145, maxDocs=44421)
                0.078125 = fieldNorm(doc=292)
          0.057932366 = weight(abstract_txt:standard in 292) [ClassicSimilarity], result of:
            0.057932366 = score(doc=292,freq=2.0), product of:
              0.11088164 = queryWeight, product of:
                1.864908 = boost
                4.7288613 = idf(docFreq=1066, maxDocs=44421)
                0.012573195 = queryNorm
              0.5224703 = fieldWeight in 292, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.7288613 = idf(docFreq=1066, maxDocs=44421)
                0.078125 = fieldNorm(doc=292)
          0.042699654 = weight(abstract_txt:term in 292) [ClassicSimilarity], result of:
            0.042699654 = score(doc=292,freq=1.0), product of:
              0.113991305 = queryWeight, product of:
                1.8908777 = boost
                4.794713 = idf(docFreq=998, maxDocs=44421)
                0.012573195 = queryNorm
              0.37458694 = fieldWeight in 292, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.794713 = idf(docFreq=998, maxDocs=44421)
                0.078125 = fieldNorm(doc=292)
          0.0817969 = weight(abstract_txt:document in 292) [ClassicSimilarity], result of:
            0.0817969 = score(doc=292,freq=4.0), product of:
              0.12191008 = queryWeight, product of:
                2.2579627 = boost
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.012573195 = queryNorm
              0.6709609 = fieldWeight in 292, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.078125 = fieldNorm(doc=292)
          0.1153692 = weight(abstract_txt:model in 292) [ClassicSimilarity], result of:
            0.1153692 = score(doc=292,freq=8.0), product of:
              0.13108963 = queryWeight, product of:
                2.6177979 = boost
                3.9827821 = idf(docFreq=2249, maxDocs=44421)
                0.012573195 = queryNorm
              0.8800788 = fieldWeight in 292, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                3.9827821 = idf(docFreq=2249, maxDocs=44421)
                0.078125 = fieldNorm(doc=292)
          0.13241254 = weight(abstract_txt:latent in 292) [ClassicSimilarity], result of:
            0.13241254 = score(doc=292,freq=1.0), product of:
              0.242405 = queryWeight, product of:
                2.757391 = boost
                6.9919376 = idf(docFreq=110, maxDocs=44421)
                0.012573195 = queryNorm
              0.5462451 = fieldWeight in 292, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9919376 = idf(docFreq=110, maxDocs=44421)
                0.078125 = fieldNorm(doc=292)
        0.32 = coord(8/25)
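
The content-similarity trees above combine per-term scores as follows: each term score is queryWeight (boost * idf * queryNorm) times fieldWeight (tf * idf * fieldNorm), and the sum over matching terms is scaled by coord (matched terms / query terms). A minimal sketch of that arithmetic, using only numbers copied from the listing; the function names are illustrative and not part of any Lucene API.

```python
def term_score(boost, idf, query_norm, tf, field_norm):
    # queryWeight = boost * idf * queryNorm
    query_weight = boost * idf * query_norm
    # fieldWeight = tf * idf * fieldNorm
    field_weight = tf * idf * field_norm
    return query_weight * field_weight

def document_score(term_scores, matched_terms, query_terms):
    # coord(matched/total) rewards documents that match more query terms
    return sum(term_scores) * (matched_terms / query_terms)

# "decomposition" in doc 3206, values as listed in the first content match
decomposition = term_score(1.0407716, 7.917278, 0.012573195, 1.0, 0.078125)

# The eight per-term scores listed for doc 292 (entry 5), with coord(8/25)
scores_292 = [0.021262074, 0.034703977, 0.078295775, 0.057932366,
              0.042699654, 0.0817969, 0.1153692, 0.13241254]
total_292 = document_score(scores_292, 8, 25)
```

Note that idf enters each term score twice, once through queryWeight and once through fieldWeight, which is why rare terms such as "subspace" dominate the first match.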