Document (#2414)

Author
Griffiths, A.
Robinson, L.A.
Willett, P.
Title
Hierarchic agglomerative clustering methods for automatic document classification
Source
Journal of documentation. 40(1984) no.3, S.175-205
Year
1984
Theme
Automatisches Indexieren

Similar documents (author)

  1. Griffiths, A.; Luckhurst, H.C.; Willett, P.: Using interdocument similarity information in document retrieval systems (1986) 2.68
    2.684671 = sum of:
      2.684671 = product of:
        4.027006 = sum of:
          1.6312693 = weight(author_txt:willett in 2414) [ClassicSimilarity], result of:
            1.6312693 = score(doc=2414,freq=1.0), product of:
              0.5411922 = queryWeight, product of:
                1.0764052 = boost
                8.037906 = idf(docFreq=38, maxDocs=44421)
                0.06255079 = queryNorm
              3.0142145 = fieldWeight in 2414, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.037906 = idf(docFreq=38, maxDocs=44421)
                0.375 = fieldNorm(doc=2414)
          2.395737 = weight(author_txt:griffiths in 2414) [ClassicSimilarity], result of:
            2.395737 = score(doc=2414,freq=1.0), product of:
              0.6992414 = queryWeight, product of:
                1.2235271 = boost
                9.1365185 = idf(docFreq=12, maxDocs=44421)
                0.06255079 = queryNorm
              3.4261944 = fieldWeight in 2414, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.1365185 = idf(docFreq=12, maxDocs=44421)
                0.375 = fieldNorm(doc=2414)
        0.6666667 = coord(2/3)
    
  2. Griffiths, R.: Health information (1993) 1.33
    1.330965 = sum of:
      1.330965 = product of:
        3.9928951 = sum of:
          3.9928951 = weight(author_txt:griffiths in 119) [ClassicSimilarity], result of:
            3.9928951 = score(doc=119,freq=1.0), product of:
              0.6992414 = queryWeight, product of:
                1.2235271 = boost
                9.1365185 = idf(docFreq=12, maxDocs=44421)
                0.06255079 = queryNorm
              5.7103243 = fieldWeight in 119, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.1365185 = idf(docFreq=12, maxDocs=44421)
                0.625 = fieldNorm(doc=119)
        0.33333334 = coord(1/3)
    
  3. Griffiths, J.: ¬The value of information and related systems, products and services (1982) 1.33
    1.330965 = sum of:
      1.330965 = product of:
        3.9928951 = sum of:
          3.9928951 = weight(author_txt:griffiths in 5903) [ClassicSimilarity], result of:
            3.9928951 = score(doc=5903,freq=1.0), product of:
              0.6992414 = queryWeight, product of:
                1.2235271 = boost
                9.1365185 = idf(docFreq=12, maxDocs=44421)
                0.06255079 = queryNorm
              5.7103243 = fieldWeight in 5903, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.1365185 = idf(docFreq=12, maxDocs=44421)
                0.625 = fieldNorm(doc=5903)
        0.33333334 = coord(1/3)
    
  4. Griffiths, P.: Personal searching gets the right results (1997) 1.33
    1.330965 = sum of:
      1.330965 = product of:
        3.9928951 = sum of:
          3.9928951 = weight(author_txt:griffiths in 1784) [ClassicSimilarity], result of:
            3.9928951 = score(doc=1784,freq=1.0), product of:
              0.6992414 = queryWeight, product of:
                1.2235271 = boost
                9.1365185 = idf(docFreq=12, maxDocs=44421)
                0.06255079 = queryNorm
              5.7103243 = fieldWeight in 1784, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.1365185 = idf(docFreq=12, maxDocs=44421)
                0.625 = fieldNorm(doc=1784)
        0.33333334 = coord(1/3)
    
  5. Griffiths, A.: Setting up a subject directory of Web sites : a case study of management links (1999) 1.33
    1.330965 = sum of:
      1.330965 = product of:
        3.9928951 = sum of:
          3.9928951 = weight(author_txt:griffiths in 5559) [ClassicSimilarity], result of:
            3.9928951 = score(doc=5559,freq=1.0), product of:
              0.6992414 = queryWeight, product of:
                1.2235271 = boost
                9.1365185 = idf(docFreq=12, maxDocs=44421)
                0.06255079 = queryNorm
              5.7103243 = fieldWeight in 5559, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.1365185 = idf(docFreq=12, maxDocs=44421)
                0.625 = fieldNorm(doc=5559)
        0.33333334 = coord(1/3)
    

Similar documents (content)

  1. Tombros, A.; Villa, R.; Rijsbergen, C.J. Van: ¬The effectiveness of query-specific hierarchic clustering in information retrieval (2002) 1.40
    1.4016899 = sum of:
      1.4016899 = product of:
        2.4529572 = sum of:
          0.03980808 = weight(abstract_txt:methods in 3586) [ClassicSimilarity], result of:
            0.03980808 = score(doc=3586,freq=1.0), product of:
              0.12297497 = queryWeight, product of:
                1.0379026 = boost
                4.1434727 = idf(docFreq=1915, maxDocs=44421)
                0.028595366 = queryNorm
              0.3237088 = fieldWeight in 3586, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1434727 = idf(docFreq=1915, maxDocs=44421)
                0.078125 = fieldNorm(doc=3586)
          0.0626649 = weight(abstract_txt:document in 3586) [ClassicSimilarity], result of:
            0.0626649 = score(doc=3586,freq=2.0), product of:
              0.13208155 = queryWeight, product of:
                1.0756459 = boost
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.028595366 = queryNorm
              0.47444102 = fieldWeight in 3586, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.078125 = fieldNorm(doc=3586)
          0.35642773 = weight(abstract_txt:clustering in 3586) [ClassicSimilarity], result of:
            0.35642773 = score(doc=3586,freq=7.0), product of:
              0.27719426 = queryWeight, product of:
                1.5582615 = boost
                6.2208285 = idf(docFreq=239, maxDocs=44421)
                0.028595366 = queryNorm
              1.285841 = fieldWeight in 3586, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                6.2208285 = idf(docFreq=239, maxDocs=44421)
                0.078125 = fieldNorm(doc=3586)
          1.9940563 = weight(title_txt:hierarchic in 3586) [ClassicSimilarity], result of:
            1.9940563 = score(doc=3586,freq=1.0), product of:
              0.6631639 = queryWeight, product of:
                2.410231 = boost
                9.622026 = idf(docFreq=7, maxDocs=44421)
                0.028595366 = queryNorm
              3.0068831 = fieldWeight in 3586, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.622026 = idf(docFreq=7, maxDocs=44421)
                0.3125 = fieldNorm(doc=3586)
        0.5714286 = coord(4/7)
    
  2. Kirriemuir, J.W.; Willet, P.: Identification of duplicate and near-duplicate full-text records in database search-outputs using hierarchic cluster analysis (1995) 1.01
    1.0093362 = sum of:
      1.0093362 = product of:
        1.7663383 = sum of:
          0.04272506 = weight(abstract_txt:classification in 2497) [ClassicSimilarity], result of:
            0.04272506 = score(doc=2497,freq=1.0), product of:
              0.11415726 = queryWeight, product of:
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.028595366 = queryNorm
              0.37426496 = fieldWeight in 2497, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.09375 = fieldNorm(doc=2497)
          0.047769696 = weight(abstract_txt:methods in 2497) [ClassicSimilarity], result of:
            0.047769696 = score(doc=2497,freq=1.0), product of:
              0.12297497 = queryWeight, product of:
                1.0379026 = boost
                4.1434727 = idf(docFreq=1915, maxDocs=44421)
                0.028595366 = queryNorm
              0.38845056 = fieldWeight in 2497, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1434727 = idf(docFreq=1915, maxDocs=44421)
                0.09375 = fieldNorm(doc=2497)
          0.28000408 = weight(abstract_txt:clustering in 2497) [ClassicSimilarity], result of:
            0.28000408 = score(doc=2497,freq=3.0), product of:
              0.27719426 = queryWeight, product of:
                1.5582615 = boost
                6.2208285 = idf(docFreq=239, maxDocs=44421)
                0.028595366 = queryNorm
              1.0101366 = fieldWeight in 2497, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.2208285 = idf(docFreq=239, maxDocs=44421)
                0.09375 = fieldNorm(doc=2497)
          1.3958396 = weight(title_txt:hierarchic in 2497) [ClassicSimilarity], result of:
            1.3958396 = score(doc=2497,freq=1.0), product of:
              0.6631639 = queryWeight, product of:
                2.410231 = boost
                9.622026 = idf(docFreq=7, maxDocs=44421)
                0.028595366 = queryNorm
              2.1048183 = fieldWeight in 2497, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.622026 = idf(docFreq=7, maxDocs=44421)
                0.21875 = fieldNorm(doc=2497)
        0.5714286 = coord(4/7)
    
  3. Miyamoto, S.: Information clustering based an fuzzy multisets (2003) 0.49
    0.48570547 = sum of:
      0.48570547 = product of:
        0.8499845 = sum of:
          0.03980808 = weight(abstract_txt:methods in 2071) [ClassicSimilarity], result of:
            0.03980808 = score(doc=2071,freq=1.0), product of:
              0.12297497 = queryWeight, product of:
                1.0379026 = boost
                4.1434727 = idf(docFreq=1915, maxDocs=44421)
                0.028595366 = queryNorm
              0.3237088 = fieldWeight in 2071, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1434727 = idf(docFreq=1915, maxDocs=44421)
                0.078125 = fieldNorm(doc=2071)
          0.04431078 = weight(abstract_txt:document in 2071) [ClassicSimilarity], result of:
            0.04431078 = score(doc=2071,freq=1.0), product of:
              0.13208155 = queryWeight, product of:
                1.0756459 = boost
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.028595366 = queryNorm
              0.33548045 = fieldWeight in 2071, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.078125 = fieldNorm(doc=2071)
          0.30123645 = weight(abstract_txt:clustering in 2071) [ClassicSimilarity], result of:
            0.30123645 = score(doc=2071,freq=5.0), product of:
              0.27719426 = queryWeight, product of:
                1.5582615 = boost
                6.2208285 = idf(docFreq=239, maxDocs=44421)
                0.028595366 = queryNorm
              1.086734 = fieldWeight in 2071, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                6.2208285 = idf(docFreq=239, maxDocs=44421)
                0.078125 = fieldNorm(doc=2071)
          0.4646292 = weight(abstract_txt:agglomerative in 2071) [ClassicSimilarity], result of:
            0.4646292 = score(doc=2071,freq=1.0), product of:
              0.6327618 = queryWeight, product of:
                2.3543355 = boost
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.028595366 = queryNorm
              0.73428774 = fieldWeight in 2071, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.078125 = fieldNorm(doc=2071)
        0.5714286 = coord(4/7)
    
  4. Rijsbergen, C.J. van: ¬A fast hierarchic clustering algorithm (1970) 0.40
    0.3988113 = sum of:
      0.3988113 = product of:
        2.7916791 = sum of:
          2.7916791 = weight(title_txt:hierarchic in 3299) [ClassicSimilarity], result of:
            2.7916791 = score(doc=3299,freq=1.0), product of:
              0.6631639 = queryWeight, product of:
                2.410231 = boost
                9.622026 = idf(docFreq=7, maxDocs=44421)
                0.028595366 = queryNorm
              4.2096367 = fieldWeight in 3299, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.622026 = idf(docFreq=7, maxDocs=44421)
                0.4375 = fieldNorm(doc=3299)
        0.14285715 = coord(1/7)
    
  5. Cathey, R.J.; Jensen, E.C.; Beitzel, S.M.; Frieder, O.; Grossman, D.: Exploiting parallelism to support scalable hierarchical clustering (2007) 0.36
    0.3599101 = sum of:
      0.3599101 = product of:
        0.8397902 = sum of:
          0.050131924 = weight(abstract_txt:document in 1448) [ClassicSimilarity], result of:
            0.050131924 = score(doc=1448,freq=2.0), product of:
              0.13208155 = queryWeight, product of:
                1.0756459 = boost
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.028595366 = queryNorm
              0.3795528 = fieldWeight in 1448, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.0625 = fieldNorm(doc=1448)
          0.2639904 = weight(abstract_txt:clustering in 1448) [ClassicSimilarity], result of:
            0.2639904 = score(doc=1448,freq=6.0), product of:
              0.27719426 = queryWeight, product of:
                1.5582615 = boost
                6.2208285 = idf(docFreq=239, maxDocs=44421)
                0.028595366 = queryNorm
              0.952366 = fieldWeight in 1448, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                6.2208285 = idf(docFreq=239, maxDocs=44421)
                0.0625 = fieldNorm(doc=1448)
          0.5256679 = weight(abstract_txt:agglomerative in 1448) [ClassicSimilarity], result of:
            0.5256679 = score(doc=1448,freq=2.0), product of:
              0.6327618 = queryWeight, product of:
                2.3543355 = boost
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.028595366 = queryNorm
              0.8307517 = fieldWeight in 1448, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.0625 = fieldNorm(doc=1448)
        0.42857143 = coord(3/7)