Document (#33368)

Author
Dang, E.K.F.
Luk, R.W.P.
Ho, K.S.
Chan, S.C.F.
Lee, D.L.
Title
¬A new measure of clustering effectiveness : algorithms and experimental studies
Source
Journal of the American Society for Information Science and Technology. 59(2008) no.3, S.390-406
Year
2008
Abstract
We propose a new optimal clustering effectiveness measure, called CS1, based on a combination of clusters rather than selecting a single optimal cluster as in the traditional MK1 measure. For hierarchical clustering, we present an algorithm to compute CS1, defined by seeking the optimal combinations of disjoint clusters obtained by cutting the hierarchical structure at a certain similarity level. By reformulating the optimization to a 0-1 linear fractional programming problem, we demonstrate that an exact solution can be obtained by a linear time algorithm. We further discuss how our approach can be generalized to more general problems involving overlapping clusters, and we show how optimal estimates can be obtained by greedy algorithms.
Theme
Automatisches Klassifizieren

Similar documents (author)

  1. Dang, E.K.F.; Luk, R.W.P.; Allan, J.: Beyond bag-of-words : bigram-enhanced context-dependent term weights (2014) 4.44
    4.439562 = sum of:
      4.439562 = product of:
        5.9194155 = sum of:
          1.84631 = weight(author_txt:r.w.p in 2283) [ClassicSimilarity], result of:
            1.84631 = score(doc=2283,freq=1.0), product of:
              0.52383816 = queryWeight, product of:
                1.2884451 = boost
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.04325686 = queryNorm
              3.524581 = fieldWeight in 2283, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.375 = fieldNorm(doc=2283)
          1.9090993 = weight(author_txt:dang in 2283) [ClassicSimilarity], result of:
            1.9090993 = score(doc=2283,freq=1.0), product of:
              0.5356483 = queryWeight, product of:
                1.3028884 = boost
                9.504243 = idf(docFreq=8, maxDocs=44421)
                0.04325686 = queryNorm
              3.5640912 = fieldWeight in 2283, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.504243 = idf(docFreq=8, maxDocs=44421)
                0.375 = fieldNorm(doc=2283)
          2.1640062 = weight(author_txt:e.k.f in 2283) [ClassicSimilarity], result of:
            2.1640062 = score(doc=2283,freq=1.0), product of:
              0.5823263 = queryWeight, product of:
                1.3584715 = boost
                9.909708 = idf(docFreq=5, maxDocs=44421)
                0.04325686 = queryNorm
              3.7161405 = fieldWeight in 2283, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.909708 = idf(docFreq=5, maxDocs=44421)
                0.375 = fieldNorm(doc=2283)
        0.75 = coord(3/4)
    
  2. Dang, E.K.F.; Luk, R.W.P.; Allan, J.: ¬A context-dependent relevance model (2016) 4.44
    4.439562 = sum of:
      4.439562 = product of:
        5.9194155 = sum of:
          1.84631 = weight(author_txt:r.w.p in 3778) [ClassicSimilarity], result of:
            1.84631 = score(doc=3778,freq=1.0), product of:
              0.52383816 = queryWeight, product of:
                1.2884451 = boost
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.04325686 = queryNorm
              3.524581 = fieldWeight in 3778, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.375 = fieldNorm(doc=3778)
          1.9090993 = weight(author_txt:dang in 3778) [ClassicSimilarity], result of:
            1.9090993 = score(doc=3778,freq=1.0), product of:
              0.5356483 = queryWeight, product of:
                1.3028884 = boost
                9.504243 = idf(docFreq=8, maxDocs=44421)
                0.04325686 = queryNorm
              3.5640912 = fieldWeight in 3778, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.504243 = idf(docFreq=8, maxDocs=44421)
                0.375 = fieldNorm(doc=3778)
          2.1640062 = weight(author_txt:e.k.f in 3778) [ClassicSimilarity], result of:
            2.1640062 = score(doc=3778,freq=1.0), product of:
              0.5823263 = queryWeight, product of:
                1.3584715 = boost
                9.909708 = idf(docFreq=5, maxDocs=44421)
                0.04325686 = queryNorm
              3.7161405 = fieldWeight in 3778, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.909708 = idf(docFreq=5, maxDocs=44421)
                0.375 = fieldNorm(doc=3778)
        0.75 = coord(3/4)
    
  3. Dang, E.K.F.; Luk, R.W.P.; Allan, J.: ¬A retrieval model family based on the probability ranking principle for ad hoc retrieval (2022) 4.44
    4.439562 = sum of:
      4.439562 = product of:
        5.9194155 = sum of:
          1.84631 = weight(author_txt:r.w.p in 1639) [ClassicSimilarity], result of:
            1.84631 = score(doc=1639,freq=1.0), product of:
              0.52383816 = queryWeight, product of:
                1.2884451 = boost
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.04325686 = queryNorm
              3.524581 = fieldWeight in 1639, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.375 = fieldNorm(doc=1639)
          1.9090993 = weight(author_txt:dang in 1639) [ClassicSimilarity], result of:
            1.9090993 = score(doc=1639,freq=1.0), product of:
              0.5356483 = queryWeight, product of:
                1.3028884 = boost
                9.504243 = idf(docFreq=8, maxDocs=44421)
                0.04325686 = queryNorm
              3.5640912 = fieldWeight in 1639, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.504243 = idf(docFreq=8, maxDocs=44421)
                0.375 = fieldNorm(doc=1639)
          2.1640062 = weight(author_txt:e.k.f in 1639) [ClassicSimilarity], result of:
            2.1640062 = score(doc=1639,freq=1.0), product of:
              0.5823263 = queryWeight, product of:
                1.3584715 = boost
                9.909708 = idf(docFreq=5, maxDocs=44421)
                0.04325686 = queryNorm
              3.7161405 = fieldWeight in 1639, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.909708 = idf(docFreq=5, maxDocs=44421)
                0.375 = fieldNorm(doc=1639)
        0.75 = coord(3/4)
    
  4. Dang, E.K.F.; Luk, R.W.P.; Allan, J.; Ho, K.S.; Chung, K.F.L.; Lee, D.L.: ¬A new context-dependent term weight computed by boost and discount using relevance information (2010) 2.96
    2.9597077 = sum of:
      2.9597077 = product of:
        3.9462771 = sum of:
          1.2308733 = weight(author_txt:r.w.p in 120) [ClassicSimilarity], result of:
            1.2308733 = score(doc=120,freq=1.0), product of:
              0.52383816 = queryWeight, product of:
                1.2884451 = boost
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.04325686 = queryNorm
              2.3497207 = fieldWeight in 120, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.25 = fieldNorm(doc=120)
          1.2727329 = weight(author_txt:dang in 120) [ClassicSimilarity], result of:
            1.2727329 = score(doc=120,freq=1.0), product of:
              0.5356483 = queryWeight, product of:
                1.3028884 = boost
                9.504243 = idf(docFreq=8, maxDocs=44421)
                0.04325686 = queryNorm
              2.3760607 = fieldWeight in 120, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.504243 = idf(docFreq=8, maxDocs=44421)
                0.25 = fieldNorm(doc=120)
          1.442671 = weight(author_txt:e.k.f in 120) [ClassicSimilarity], result of:
            1.442671 = score(doc=120,freq=1.0), product of:
              0.5823263 = queryWeight, product of:
                1.3584715 = boost
                9.909708 = idf(docFreq=5, maxDocs=44421)
                0.04325686 = queryNorm
              2.477427 = fieldWeight in 120, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.909708 = idf(docFreq=5, maxDocs=44421)
                0.25 = fieldNorm(doc=120)
        0.75 = coord(3/4)
    
  5. Luk, R.W.P.; Leong, H.V.; Dillon, T.S.; Chan, A.T.S.; Croft, W.B.; Allen, J.: ¬A survey in indexing and searching XML documents (2002) 0.90
    0.903167 = sum of:
      0.903167 = product of:
        1.806334 = sum of:
          0.5754607 = weight(author_txt:chan in 1460) [ClassicSimilarity], result of:
            0.5754607 = score(doc=1460,freq=1.0), product of:
              0.3155479 = queryWeight, product of:
                7.2947483 = idf(docFreq=81, maxDocs=44421)
                0.04325686 = queryNorm
              1.8236871 = fieldWeight in 1460, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.2947483 = idf(docFreq=81, maxDocs=44421)
                0.25 = fieldNorm(doc=1460)
          1.2308733 = weight(author_txt:r.w.p in 1460) [ClassicSimilarity], result of:
            1.2308733 = score(doc=1460,freq=1.0), product of:
              0.52383816 = queryWeight, product of:
                1.2884451 = boost
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.04325686 = queryNorm
              2.3497207 = fieldWeight in 1460, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.25 = fieldNorm(doc=1460)
        0.5 = coord(2/4)
    

Similar documents (content)

  1. Mather, L.A.: ¬A linear algebra measure of cluster quality (2000) 0.20
    0.20155923 = sum of:
      0.20155923 = product of:
        0.83983016 = sum of:
          0.074993946 = weight(abstract_txt:cluster in 5767) [ClassicSimilarity], result of:
            0.074993946 = score(doc=5767,freq=3.0), product of:
              0.10579502 = queryWeight, product of:
                1.0060943 = boost
                6.548176 = idf(docFreq=172, maxDocs=44421)
                0.016058544 = queryNorm
              0.7088608 = fieldWeight in 5767, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.548176 = idf(docFreq=172, maxDocs=44421)
                0.0625 = fieldNorm(doc=5767)
          0.10977296 = weight(abstract_txt:disjoint in 5767) [ClassicSimilarity], result of:
            0.10977296 = score(doc=5767,freq=1.0), product of:
              0.19670637 = queryWeight, product of:
                1.3718774 = boost
                8.928879 = idf(docFreq=15, maxDocs=44421)
                0.016058544 = queryNorm
              0.5580549 = fieldWeight in 5767, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.928879 = idf(docFreq=15, maxDocs=44421)
                0.0625 = fieldNorm(doc=5767)
          0.08077671 = weight(abstract_txt:algorithms in 5767) [ClassicSimilarity], result of:
            0.08077671 = score(doc=5767,freq=2.0), product of:
              0.16032907 = queryWeight, product of:
                1.751569 = boost
                5.7000527 = idf(docFreq=403, maxDocs=44421)
                0.016058544 = queryNorm
              0.5038182 = fieldWeight in 5767, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.7000527 = idf(docFreq=403, maxDocs=44421)
                0.0625 = fieldNorm(doc=5767)
          0.12948759 = weight(abstract_txt:linear in 5767) [ClassicSimilarity], result of:
            0.12948759 = score(doc=5767,freq=2.0), product of:
              0.219604 = queryWeight, product of:
                2.0499403 = boost
                6.6710296 = idf(docFreq=152, maxDocs=44421)
                0.016058544 = queryNorm
              0.5896413 = fieldWeight in 5767, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.6710296 = idf(docFreq=152, maxDocs=44421)
                0.0625 = fieldNorm(doc=5767)
          0.22274119 = weight(abstract_txt:clustering in 5767) [ClassicSimilarity], result of:
            0.22274119 = score(doc=5767,freq=4.0), product of:
              0.28644568 = queryWeight, product of:
                2.8673973 = boost
                6.2208285 = idf(docFreq=239, maxDocs=44421)
                0.016058544 = queryNorm
              0.77760357 = fieldWeight in 5767, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.2208285 = idf(docFreq=239, maxDocs=44421)
                0.0625 = fieldNorm(doc=5767)
          0.22205779 = weight(abstract_txt:clusters in 5767) [ClassicSimilarity], result of:
            0.22205779 = score(doc=5767,freq=3.0), product of:
              0.31462908 = queryWeight, product of:
                3.00515 = boost
                6.519684 = idf(docFreq=177, maxDocs=44421)
                0.016058544 = queryNorm
              0.70577645 = fieldWeight in 5767, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.519684 = idf(docFreq=177, maxDocs=44421)
                0.0625 = fieldNorm(doc=5767)
        0.24 = coord(6/25)
    
  2. Bose, I.; Chen, X.: ¬A method for extension of generative topographic mapping for fuzzy clustering (2009) 0.19
    0.19342618 = sum of:
      0.19342618 = product of:
        0.8059424 = sum of:
          0.054122217 = weight(abstract_txt:cluster in 3711) [ClassicSimilarity], result of:
            0.054122217 = score(doc=3711,freq=1.0), product of:
              0.10579502 = queryWeight, product of:
                1.0060943 = boost
                6.548176 = idf(docFreq=172, maxDocs=44421)
                0.016058544 = queryNorm
              0.51157624 = fieldWeight in 3711, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.548176 = idf(docFreq=172, maxDocs=44421)
                0.078125 = fieldNorm(doc=3711)
          0.10097089 = weight(abstract_txt:algorithms in 3711) [ClassicSimilarity], result of:
            0.10097089 = score(doc=3711,freq=2.0), product of:
              0.16032907 = queryWeight, product of:
                1.751569 = boost
                5.7000527 = idf(docFreq=403, maxDocs=44421)
                0.016058544 = queryNorm
              0.6297728 = fieldWeight in 3711, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.7000527 = idf(docFreq=403, maxDocs=44421)
                0.078125 = fieldNorm(doc=3711)
          0.07158386 = weight(abstract_txt:algorithm in 3711) [ClassicSimilarity], result of:
            0.07158386 = score(doc=3711,freq=1.0), product of:
              0.1606084 = queryWeight, product of:
                1.7530941 = boost
                5.7050157 = idf(docFreq=401, maxDocs=44421)
                0.016058544 = queryNorm
              0.44570434 = fieldWeight in 3711, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7050157 = idf(docFreq=401, maxDocs=44421)
                0.078125 = fieldNorm(doc=3711)
          0.15575139 = weight(abstract_txt:obtained in 3711) [ClassicSimilarity], result of:
            0.15575139 = score(doc=3711,freq=2.0), product of:
              0.24501906 = queryWeight, product of:
                2.6519582 = boost
                5.7534328 = idf(docFreq=382, maxDocs=44421)
                0.016058544 = queryNorm
              0.6356705 = fieldWeight in 3711, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.7534328 = idf(docFreq=382, maxDocs=44421)
                0.078125 = fieldNorm(doc=3711)
          0.19687724 = weight(abstract_txt:clustering in 3711) [ClassicSimilarity], result of:
            0.19687724 = score(doc=3711,freq=2.0), product of:
              0.28644568 = queryWeight, product of:
                2.8673973 = boost
                6.2208285 = idf(docFreq=239, maxDocs=44421)
                0.016058544 = queryNorm
              0.68731093 = fieldWeight in 3711, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.2208285 = idf(docFreq=239, maxDocs=44421)
                0.078125 = fieldNorm(doc=3711)
          0.2266368 = weight(abstract_txt:clusters in 3711) [ClassicSimilarity], result of:
            0.2266368 = score(doc=3711,freq=2.0), product of:
              0.31462908 = queryWeight, product of:
                3.00515 = boost
                6.519684 = idf(docFreq=177, maxDocs=44421)
                0.016058544 = queryNorm
              0.7203301 = fieldWeight in 3711, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.519684 = idf(docFreq=177, maxDocs=44421)
                0.078125 = fieldNorm(doc=3711)
        0.24 = coord(6/25)
    
  3. Cathey, R.J.; Jensen, E.C.; Beitzel, S.M.; Frieder, O.; Grossman, D.: Exploiting parallelism to support scalable hierarchical clustering (2007) 0.18
    0.18460605 = sum of:
      0.18460605 = product of:
        0.76919186 = sum of:
          0.043297775 = weight(abstract_txt:cluster in 1448) [ClassicSimilarity], result of:
            0.043297775 = score(doc=1448,freq=1.0), product of:
              0.10579502 = queryWeight, product of:
                1.0060943 = boost
                6.548176 = idf(docFreq=172, maxDocs=44421)
                0.016058544 = queryNorm
              0.409261 = fieldWeight in 1448, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.548176 = idf(docFreq=172, maxDocs=44421)
                0.0625 = fieldNorm(doc=1448)
          0.08077671 = weight(abstract_txt:algorithms in 1448) [ClassicSimilarity], result of:
            0.08077671 = score(doc=1448,freq=2.0), product of:
              0.16032907 = queryWeight, product of:
                1.751569 = boost
                5.7000527 = idf(docFreq=403, maxDocs=44421)
                0.016058544 = queryNorm
              0.5038182 = fieldWeight in 1448, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.7000527 = idf(docFreq=403, maxDocs=44421)
                0.0625 = fieldNorm(doc=1448)
          0.1280531 = weight(abstract_txt:algorithm in 1448) [ClassicSimilarity], result of:
            0.1280531 = score(doc=1448,freq=5.0), product of:
              0.1606084 = queryWeight, product of:
                1.7530941 = boost
                5.7050157 = idf(docFreq=401, maxDocs=44421)
                0.016058544 = queryNorm
              0.79730016 = fieldWeight in 1448, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.7050157 = idf(docFreq=401, maxDocs=44421)
                0.0625 = fieldNorm(doc=1448)
          0.11605802 = weight(abstract_txt:hierarchical in 1448) [ClassicSimilarity], result of:
            0.11605802 = score(doc=1448,freq=4.0), product of:
              0.16202982 = queryWeight, product of:
                1.7608347 = boost
                5.7302055 = idf(docFreq=391, maxDocs=44421)
                0.016058544 = queryNorm
              0.7162757 = fieldWeight in 1448, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.7302055 = idf(docFreq=391, maxDocs=44421)
                0.0625 = fieldNorm(doc=1448)
          0.27280113 = weight(abstract_txt:clustering in 1448) [ClassicSimilarity], result of:
            0.27280113 = score(doc=1448,freq=6.0), product of:
              0.28644568 = queryWeight, product of:
                2.8673973 = boost
                6.2208285 = idf(docFreq=239, maxDocs=44421)
                0.016058544 = queryNorm
              0.952366 = fieldWeight in 1448, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                6.2208285 = idf(docFreq=239, maxDocs=44421)
                0.0625 = fieldNorm(doc=1448)
          0.12820514 = weight(abstract_txt:clusters in 1448) [ClassicSimilarity], result of:
            0.12820514 = score(doc=1448,freq=1.0), product of:
              0.31462908 = queryWeight, product of:
                3.00515 = boost
                6.519684 = idf(docFreq=177, maxDocs=44421)
                0.016058544 = queryNorm
              0.40748024 = fieldWeight in 1448, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.519684 = idf(docFreq=177, maxDocs=44421)
                0.0625 = fieldNorm(doc=1448)
        0.24 = coord(6/25)
    
  4. Burgin, R.: ¬The retrieval effectiveness of 5 clustering algorithms as a function of indexing exhaustivity (1995) 0.18
    0.1844953 = sum of:
      0.1844953 = product of:
        0.76873046 = sum of:
          0.043297775 = weight(abstract_txt:cluster in 3433) [ClassicSimilarity], result of:
            0.043297775 = score(doc=3433,freq=1.0), product of:
              0.10579502 = queryWeight, product of:
                1.0060943 = boost
                6.548176 = idf(docFreq=172, maxDocs=44421)
                0.016058544 = queryNorm
              0.409261 = fieldWeight in 3433, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.548176 = idf(docFreq=172, maxDocs=44421)
                0.0625 = fieldNorm(doc=3433)
          0.08163871 = weight(abstract_txt:effectiveness in 3433) [ClassicSimilarity], result of:
            0.08163871 = score(doc=3433,freq=4.0), product of:
              0.12815697 = queryWeight, product of:
                1.5660018 = boost
                5.0961695 = idf(docFreq=738, maxDocs=44421)
                0.016058544 = queryNorm
              0.6370212 = fieldWeight in 3433, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.0961695 = idf(docFreq=738, maxDocs=44421)
                0.0625 = fieldNorm(doc=3433)
          0.05802901 = weight(abstract_txt:hierarchical in 3433) [ClassicSimilarity], result of:
            0.05802901 = score(doc=3433,freq=1.0), product of:
              0.16202982 = queryWeight, product of:
                1.7608347 = boost
                5.7302055 = idf(docFreq=391, maxDocs=44421)
                0.016058544 = queryNorm
              0.35813785 = fieldWeight in 3433, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7302055 = idf(docFreq=391, maxDocs=44421)
                0.0625 = fieldNorm(doc=3433)
          0.27280113 = weight(abstract_txt:clustering in 3433) [ClassicSimilarity], result of:
            0.27280113 = score(doc=3433,freq=6.0), product of:
              0.28644568 = queryWeight, product of:
                2.8673973 = boost
                6.2208285 = idf(docFreq=239, maxDocs=44421)
                0.016058544 = queryNorm
              0.952366 = fieldWeight in 3433, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                6.2208285 = idf(docFreq=239, maxDocs=44421)
                0.0625 = fieldNorm(doc=3433)
          0.12820514 = weight(abstract_txt:clusters in 3433) [ClassicSimilarity], result of:
            0.12820514 = score(doc=3433,freq=1.0), product of:
              0.31462908 = queryWeight, product of:
                3.00515 = boost
                6.519684 = idf(docFreq=177, maxDocs=44421)
                0.016058544 = queryNorm
              0.40748024 = fieldWeight in 3433, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.519684 = idf(docFreq=177, maxDocs=44421)
                0.0625 = fieldNorm(doc=3433)
          0.18475871 = weight(abstract_txt:optimal in 3433) [ClassicSimilarity], result of:
            0.18475871 = score(doc=3433,freq=1.0), product of:
              0.4418194 = queryWeight, product of:
                4.112051 = boost
                6.690832 = idf(docFreq=149, maxDocs=44421)
                0.016058544 = queryNorm
              0.418177 = fieldWeight in 3433, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.690832 = idf(docFreq=149, maxDocs=44421)
                0.0625 = fieldNorm(doc=3433)
        0.24 = coord(6/25)
    
  5. Kishida, K.: High-speed rough clustering for very large document collections (2010) 0.16
    0.15918921 = sum of:
      0.15918921 = product of:
        0.6632884 = sum of:
          0.043297775 = weight(abstract_txt:cluster in 450) [ClassicSimilarity], result of:
            0.043297775 = score(doc=450,freq=1.0), product of:
              0.10579502 = queryWeight, product of:
                1.0060943 = boost
                6.548176 = idf(docFreq=172, maxDocs=44421)
                0.016058544 = queryNorm
              0.409261 = fieldWeight in 450, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.548176 = idf(docFreq=172, maxDocs=44421)
                0.0625 = fieldNorm(doc=450)
          0.040819354 = weight(abstract_txt:effectiveness in 450) [ClassicSimilarity], result of:
            0.040819354 = score(doc=450,freq=1.0), product of:
              0.12815697 = queryWeight, product of:
                1.5660018 = boost
                5.0961695 = idf(docFreq=738, maxDocs=44421)
                0.016058544 = queryNorm
              0.3185106 = fieldWeight in 450, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.0961695 = idf(docFreq=738, maxDocs=44421)
                0.0625 = fieldNorm(doc=450)
          0.05711776 = weight(abstract_txt:algorithms in 450) [ClassicSimilarity], result of:
            0.05711776 = score(doc=450,freq=1.0), product of:
              0.16032907 = queryWeight, product of:
                1.751569 = boost
                5.7000527 = idf(docFreq=403, maxDocs=44421)
                0.016058544 = queryNorm
              0.3562533 = fieldWeight in 450, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7000527 = idf(docFreq=403, maxDocs=44421)
                0.0625 = fieldNorm(doc=450)
          0.09918951 = weight(abstract_txt:algorithm in 450) [ClassicSimilarity], result of:
            0.09918951 = score(doc=450,freq=3.0), product of:
              0.1606084 = queryWeight, product of:
                1.7530941 = boost
                5.7050157 = idf(docFreq=401, maxDocs=44421)
                0.016058544 = queryNorm
              0.6175861 = fieldWeight in 450, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.7050157 = idf(docFreq=401, maxDocs=44421)
                0.0625 = fieldNorm(doc=450)
          0.29465887 = weight(abstract_txt:clustering in 450) [ClassicSimilarity], result of:
            0.29465887 = score(doc=450,freq=7.0), product of:
              0.28644568 = queryWeight, product of:
                2.8673973 = boost
                6.2208285 = idf(docFreq=239, maxDocs=44421)
                0.016058544 = queryNorm
              1.0286728 = fieldWeight in 450, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                6.2208285 = idf(docFreq=239, maxDocs=44421)
                0.0625 = fieldNorm(doc=450)
          0.12820514 = weight(abstract_txt:clusters in 450) [ClassicSimilarity], result of:
            0.12820514 = score(doc=450,freq=1.0), product of:
              0.31462908 = queryWeight, product of:
                3.00515 = boost
                6.519684 = idf(docFreq=177, maxDocs=44421)
                0.016058544 = queryNorm
              0.40748024 = fieldWeight in 450, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.519684 = idf(docFreq=177, maxDocs=44421)
                0.0625 = fieldNorm(doc=450)
        0.24 = coord(6/25)