Document (#26104)

Author
Subramanian, S.
Shafer, K.E.
Title
Clustering
Source
http://www.oclc.org/research/publications/arr/1997/
Year
1998
Abstract
This article presents our exploration of computer science clustering algorithms as they relate to the Scorpion system. Scorpion is a research project at OCLC that explores the indexing and cataloging of electronic resources. For a more complete description of Scorpion, please visit the Scorpion Web site at <http://purl.oclc.org/scorpion>.
Theme
Automatic classification
Internet
Object
Scorpion
DDC

Similar documents (author)

  1. Shafer, K.: Scorpion Project explores using Dewey to organize the Web (1996) 5.47
    5.4731426 = sum of:
      5.4731426 = weight(author_txt:shafer in 6818) [ClassicSimilarity], result of:
        5.4731426 = fieldWeight in 6818, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.757029 = idf(docFreq=18, maxDocs=44421)
          0.625 = fieldNorm(doc=6818)
    
  2. Shafer, K.: Scorpion helps catalog the Web (1997) 5.47
    5.4731426 = sum of:
      5.4731426 = weight(author_txt:shafer in 3532) [ClassicSimilarity], result of:
        5.4731426 = fieldWeight in 3532, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.757029 = idf(docFreq=18, maxDocs=44421)
          0.625 = fieldNorm(doc=3532)
    
  3. Shafer, K.E.: Manipulating Tagged text (2001) 5.47
    5.4731426 = sum of:
      5.4731426 = weight(author_txt:shafer in 5011) [ClassicSimilarity], result of:
        5.4731426 = fieldWeight in 5011, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.757029 = idf(docFreq=18, maxDocs=44421)
          0.625 = fieldNorm(doc=5011)
    
  4. Shafer, K.E.: Mantis Project : A Toolkit for Cataloging (2001) 5.47
    5.4731426 = sum of:
      5.4731426 = weight(author_txt:shafer in 2028) [ClassicSimilarity], result of:
        5.4731426 = fieldWeight in 2028, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.757029 = idf(docFreq=18, maxDocs=44421)
          0.625 = fieldNorm(doc=2028)
    
  5. Shafer, K.E.: Translating Mathematical Markup for Electronic Journals (2001) 5.47
    5.4731426 = sum of:
      5.4731426 = weight(author_txt:shafer in 2030) [ClassicSimilarity], result of:
        5.4731426 = fieldWeight in 2030, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.757029 = idf(docFreq=18, maxDocs=44421)
          0.625 = fieldNorm(doc=2030)
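
The author-based scores above all follow the same arithmetic: fieldWeight = tf * idf * fieldNorm. The following is a minimal sketch of that calculation, not the search engine's own code. It assumes tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)), which is consistent with the figures listed here; the function names are hypothetical, and fieldNorm is taken as given because it is a stored, lossily encoded value that cannot be rederived from this listing.

```python
import math


def classic_idf(doc_freq: int, max_docs: int) -> float:
    """Inverse document frequency; assumed form, consistent with the listed idf values."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def classic_tf(freq: float) -> float:
    """Term-frequency factor: square root of the raw term frequency."""
    return math.sqrt(freq)


def field_weight(freq: float, doc_freq: int, max_docs: int, field_norm: float) -> float:
    """fieldWeight = tf * idf * fieldNorm, as in the explanation trees above."""
    return classic_tf(freq) * classic_idf(doc_freq, max_docs) * field_norm


# Values copied from the author_txt:shafer entries above.
score = field_weight(freq=1.0, doc_freq=18, max_docs=44421, field_norm=0.625)
print(f"{score:.4f}")  # should land close to the 5.4731426 listed above
```

Because every author match has the same term frequency, document frequency, and fieldNorm, all five entries receive the same score.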
    

Similar documents (content)

  1. Shafer, K.: Scorpion helps catalog the Web (1997) 1.00
    1.0032932 = sum of:
      1.0032932 = product of:
        4.1803885 = sum of:
          0.02226481 = weight(abstract_txt:resources in 3532) [ClassicSimilarity], result of:
            0.02226481 = score(doc=3532,freq=2.0), product of:
              0.034078155 = queryWeight, product of:
                1.2523407 = boost
                4.2238636 = idf(docFreq=1767, maxDocs=44421)
                0.006442341 = queryNorm
              0.6533455 = fieldWeight in 3532, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.2238636 = idf(docFreq=1767, maxDocs=44421)
                0.109375 = fieldNorm(doc=3532)
          0.01645204 = weight(abstract_txt:electronic in 3532) [ClassicSimilarity], result of:
            0.01645204 = score(doc=3532,freq=1.0), product of:
              0.035092954 = queryWeight, product of:
                1.2708504 = boost
                4.2862926 = idf(docFreq=1660, maxDocs=44421)
                0.006442341 = queryNorm
              0.46881324 = fieldWeight in 3532, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2862926 = idf(docFreq=1660, maxDocs=44421)
                0.109375 = fieldNorm(doc=3532)
          0.017200252 = weight(abstract_txt:indexing in 3532) [ClassicSimilarity], result of:
            0.017200252 = score(doc=3532,freq=1.0), product of:
              0.036149025 = queryWeight, product of:
                1.2898308 = boost
                4.3503094 = idf(docFreq=1557, maxDocs=44421)
                0.006442341 = queryNorm
              0.4758151 = fieldWeight in 3532, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3503094 = idf(docFreq=1557, maxDocs=44421)
                0.109375 = fieldNorm(doc=3532)
          0.017550237 = weight(abstract_txt:project in 3532) [ClassicSimilarity], result of:
            0.017550237 = score(doc=3532,freq=1.0), product of:
              0.036637746 = queryWeight, product of:
                1.2985206 = boost
                4.3796177 = idf(docFreq=1512, maxDocs=44421)
                0.006442341 = queryNorm
              0.47902068 = fieldWeight in 3532, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3796177 = idf(docFreq=1512, maxDocs=44421)
                0.109375 = fieldNorm(doc=3532)
          0.0407377 = weight(abstract_txt:oclc in 3532) [ClassicSimilarity], result of:
            0.0407377 = score(doc=3532,freq=1.0), product of:
              0.06422997 = queryWeight, product of:
                1.7193066 = boost
                5.7988343 = idf(docFreq=365, maxDocs=44421)
                0.006442341 = queryNorm
              0.6342475 = fieldWeight in 3532, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7988343 = idf(docFreq=365, maxDocs=44421)
                0.109375 = fieldNorm(doc=3532)
          4.0661836 = weight(title_txt:scorpion in 3532) [ClassicSimilarity], result of:
            4.0661836 = score(doc=3532,freq=1.0), product of:
              0.93788165 = queryWeight, product of:
                14.690733 = boost
                9.909708 = idf(docFreq=5, maxDocs=44421)
                0.006442341 = queryNorm
              4.3354974 = fieldWeight in 3532, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.909708 = idf(docFreq=5, maxDocs=44421)
                0.4375 = fieldNorm(doc=3532)
        0.24 = coord(6/25)
    
  2. Shafer, K.E.: Evaluating Scorpion results (1998) 0.95
    0.94922304 = sum of:
      0.94922304 = product of:
        4.746115 = sum of:
          0.013631588 = weight(abstract_txt:science in 2569) [ClassicSimilarity], result of:
            0.013631588 = score(doc=2569,freq=1.0), product of:
              0.028321074 = queryWeight, product of:
                1.1416667 = boost
                3.850585 = idf(docFreq=2567, maxDocs=44421)
                0.006442341 = queryNorm
              0.48132312 = fieldWeight in 2569, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.850585 = idf(docFreq=2567, maxDocs=44421)
                0.125 = fieldNorm(doc=2569)
          0.018802334 = weight(abstract_txt:electronic in 2569) [ClassicSimilarity], result of:
            0.018802334 = score(doc=2569,freq=1.0), product of:
              0.035092954 = queryWeight, product of:
                1.2708504 = boost
                4.2862926 = idf(docFreq=1660, maxDocs=44421)
                0.006442341 = queryNorm
              0.53578657 = fieldWeight in 2569, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2862926 = idf(docFreq=1660, maxDocs=44421)
                0.125 = fieldNorm(doc=2569)
          0.020057416 = weight(abstract_txt:project in 2569) [ClassicSimilarity], result of:
            0.020057416 = score(doc=2569,freq=1.0), product of:
              0.036637746 = queryWeight, product of:
                1.2985206 = boost
                4.3796177 = idf(docFreq=1512, maxDocs=44421)
                0.006442341 = queryNorm
              0.5474522 = fieldWeight in 2569, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3796177 = idf(docFreq=1512, maxDocs=44421)
                0.125 = fieldNorm(doc=2569)
          0.04655737 = weight(abstract_txt:oclc in 2569) [ClassicSimilarity], result of:
            0.04655737 = score(doc=2569,freq=1.0), product of:
              0.06422997 = queryWeight, product of:
                1.7193066 = boost
                5.7988343 = idf(docFreq=365, maxDocs=44421)
                0.006442341 = queryNorm
              0.7248543 = fieldWeight in 2569, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7988343 = idf(docFreq=365, maxDocs=44421)
                0.125 = fieldNorm(doc=2569)
          4.6470666 = weight(title_txt:scorpion in 2569) [ClassicSimilarity], result of:
            4.6470666 = score(doc=2569,freq=1.0), product of:
              0.93788165 = queryWeight, product of:
                14.690733 = boost
                9.909708 = idf(docFreq=5, maxDocs=44421)
                0.006442341 = queryNorm
              4.954854 = fieldWeight in 2569, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.909708 = idf(docFreq=5, maxDocs=44421)
                0.5 = fieldNorm(doc=2569)
        0.2 = coord(5/25)
    
  3. Shafer, K.: Scorpion Project explores using Dewey to organize the Web (1996) 0.72
    0.71842706 = sum of:
      0.71842706 = product of:
        2.993446 = sum of:
          0.010223691 = weight(abstract_txt:science in 6818) [ClassicSimilarity], result of:
            0.010223691 = score(doc=6818,freq=1.0), product of:
              0.028321074 = queryWeight, product of:
                1.1416667 = boost
                3.850585 = idf(docFreq=2567, maxDocs=44421)
                0.006442341 = queryNorm
              0.36099234 = fieldWeight in 6818, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.850585 = idf(docFreq=2567, maxDocs=44421)
                0.09375 = fieldNorm(doc=6818)
          0.014101749 = weight(abstract_txt:electronic in 6818) [ClassicSimilarity], result of:
            0.014101749 = score(doc=6818,freq=1.0), product of:
              0.035092954 = queryWeight, product of:
                1.2708504 = boost
                4.2862926 = idf(docFreq=1660, maxDocs=44421)
                0.006442341 = queryNorm
              0.4018399 = fieldWeight in 6818, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2862926 = idf(docFreq=1660, maxDocs=44421)
                0.09375 = fieldNorm(doc=6818)
          0.014743073 = weight(abstract_txt:indexing in 6818) [ClassicSimilarity], result of:
            0.014743073 = score(doc=6818,freq=1.0), product of:
              0.036149025 = queryWeight, product of:
                1.2898308 = boost
                4.3503094 = idf(docFreq=1557, maxDocs=44421)
                0.006442341 = queryNorm
              0.4078415 = fieldWeight in 6818, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3503094 = idf(docFreq=1557, maxDocs=44421)
                0.09375 = fieldNorm(doc=6818)
          0.015043061 = weight(abstract_txt:project in 6818) [ClassicSimilarity], result of:
            0.015043061 = score(doc=6818,freq=1.0), product of:
              0.036637746 = queryWeight, product of:
                1.2985206 = boost
                4.3796177 = idf(docFreq=1512, maxDocs=44421)
                0.006442341 = queryNorm
              0.41058916 = fieldWeight in 6818, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3796177 = idf(docFreq=1512, maxDocs=44421)
                0.09375 = fieldNorm(doc=6818)
          0.03491803 = weight(abstract_txt:oclc in 6818) [ClassicSimilarity], result of:
            0.03491803 = score(doc=6818,freq=1.0), product of:
              0.06422997 = queryWeight, product of:
                1.7193066 = boost
                5.7988343 = idf(docFreq=365, maxDocs=44421)
                0.006442341 = queryNorm
              0.54364073 = fieldWeight in 6818, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7988343 = idf(docFreq=365, maxDocs=44421)
                0.09375 = fieldNorm(doc=6818)
          2.9044166 = weight(title_txt:scorpion in 6818) [ClassicSimilarity], result of:
            2.9044166 = score(doc=6818,freq=1.0), product of:
              0.93788165 = queryWeight, product of:
                14.690733 = boost
                9.909708 = idf(docFreq=5, maxDocs=44421)
                0.006442341 = queryNorm
              3.0967836 = fieldWeight in 6818, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.909708 = idf(docFreq=5, maxDocs=44421)
                0.3125 = fieldNorm(doc=6818)
        0.24 = coord(6/25)
    
  4. Shafer, K.E.: Evaluating Scorpion Results (2001) 0.57
    0.568943 = sum of:
      0.568943 = product of:
        4.741192 = sum of:
          0.04498171 = weight(abstract_txt:resources in 5085) [ClassicSimilarity], result of:
            0.04498171 = score(doc=5085,freq=1.0), product of:
              0.034078155 = queryWeight, product of:
                1.2523407 = boost
                4.2238636 = idf(docFreq=1767, maxDocs=44421)
                0.006442341 = queryNorm
              1.3199574 = fieldWeight in 5085, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2238636 = idf(docFreq=1767, maxDocs=44421)
                0.3125 = fieldNorm(doc=5085)
          0.049143575 = weight(abstract_txt:indexing in 5085) [ClassicSimilarity], result of:
            0.049143575 = score(doc=5085,freq=1.0), product of:
              0.036149025 = queryWeight, product of:
                1.2898308 = boost
                4.3503094 = idf(docFreq=1557, maxDocs=44421)
                0.006442341 = queryNorm
              1.3594717 = fieldWeight in 5085, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3503094 = idf(docFreq=1557, maxDocs=44421)
                0.3125 = fieldNorm(doc=5085)
          4.6470666 = weight(title_txt:scorpion in 5085) [ClassicSimilarity], result of:
            4.6470666 = score(doc=5085,freq=1.0), product of:
              0.93788165 = queryWeight, product of:
                14.690733 = boost
                9.909708 = idf(docFreq=5, maxDocs=44421)
                0.006442341 = queryNorm
              4.954854 = fieldWeight in 5085, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.909708 = idf(docFreq=5, maxDocs=44421)
                0.5 = fieldNorm(doc=5085)
        0.12 = coord(3/25)
    
  5. Shafer, K.E.: Automatic Subject Assignment via the Scorpion System (2001) 0.14
    0.139412 = sum of:
      0.139412 = product of:
        3.4853 = sum of:
          3.4853 = weight(title_txt:scorpion in 2043) [ClassicSimilarity], result of:
            3.4853 = score(doc=2043,freq=1.0), product of:
              0.93788165 = queryWeight, product of:
                14.690733 = boost
                9.909708 = idf(docFreq=5, maxDocs=44421)
                0.006442341 = queryNorm
              3.7161405 = fieldWeight in 2043, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.909708 = idf(docFreq=5, maxDocs=44421)
                0.375 = fieldNorm(doc=2043)
        0.04 = coord(1/25)
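
The content-based scores above combine several matching terms: each term contributes queryWeight * fieldWeight, the contributions are summed, and the sum is scaled by coord (matched terms divided by the 25 query terms). The sketch below reproduces that arithmetic under the same assumptions as the listing (tf = sqrt(freq), idf = 1 + ln(maxDocs / (docFreq + 1))). The TermMatch class and the function names are hypothetical; boost, queryNorm, and fieldNorm are copied from the explanation output rather than derived.

```python
import math
from dataclasses import dataclass


@dataclass
class TermMatch:
    """One matching query term, with values read from an explanation tree."""
    boost: float
    doc_freq: int
    freq: float
    field_norm: float


def idf(doc_freq: int, max_docs: int) -> float:
    """Assumed idf form, consistent with the listed values."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(m: TermMatch, max_docs: int, query_norm: float) -> float:
    """score = queryWeight * fieldWeight for a single term."""
    term_idf = idf(m.doc_freq, max_docs)
    query_weight = m.boost * term_idf * query_norm
    field_weight = math.sqrt(m.freq) * term_idf * m.field_norm
    return query_weight * field_weight


def document_score(matches, query_terms: int, max_docs: int, query_norm: float) -> float:
    """Sum of term scores, scaled by coord = matched terms / query terms."""
    coord = len(matches) / query_terms
    return coord * sum(term_score(m, max_docs, query_norm) for m in matches)


# Entry 5 above (doc 2043): only title_txt:scorpion matches, so coord = 1/25.
doc_2043 = [TermMatch(boost=14.690733, doc_freq=5, freq=1.0, field_norm=0.375)]
total = document_score(doc_2043, query_terms=25, max_docs=44421, query_norm=0.006442341)
print(f"{total:.4f}")  # should land close to the 0.139412 listed above
```

The title match on "scorpion" dominates every entry because its boost (14.690733) and idf (9.909708) are far larger than those of the abstract terms; the ranking among the five documents is then driven mainly by fieldNorm and by coord.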