Document (#13106)

Author
O'Kane, K.C.
Title
Generating hierarchical document indices from common denominators in large document collections
Source
Information processing and management. 32(1996) no.2, S.105-115
Year
1996
Abstract
Describes an effective, simple and efficient algorithm for computer generation of hierarchical indices from Document Term matrices by means of calculating common denominator vectors from the document vector set. This procedure produces an intuitive, user friendly hierarchical index of a document collection not unlike that which would be expected had a manual indexer set about to create an index or outline of a collection. The resulting index, when presented with a graphical user interface, provides the user with a natural easily comprehended view of the document collection, permits general browsing and informal search activities with an access method that requires no keyboard entry or prior knowledge of the vocabulary
Theme
Automatisches Indexieren
Register

Similar documents (content)

  1. Hartman, J.H.; Proebsting, T.A.; Sundaram, R.: Index-based hyperlinks (1997) 0.22
    0.21831007 = sum of:
      0.21831007 = product of:
        1.0915504 = sum of:
          0.050029505 = weight(abstract_txt:user in 3723) [ClassicSimilarity], result of:
            0.050029505 = score(doc=3723,freq=1.0), product of:
              0.124267586 = queryWeight, product of:
                1.6757932 = boost
                3.6808684 = idf(docFreq=3042, maxDocs=44421)
                0.020145923 = queryNorm
              0.40259498 = fieldWeight in 3723, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6808684 = idf(docFreq=3042, maxDocs=44421)
                0.109375 = fieldNorm(doc=3723)
          0.10749113 = weight(abstract_txt:index in 3723) [ClassicSimilarity], result of:
            0.10749113 = score(doc=3723,freq=1.0), product of:
              0.20691349 = queryWeight, product of:
                2.1623993 = boost
                4.7496953 = idf(docFreq=1044, maxDocs=44421)
                0.020145923 = queryNorm
              0.51949793 = fieldWeight in 3723, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7496953 = idf(docFreq=1044, maxDocs=44421)
                0.109375 = fieldNorm(doc=3723)
          0.58641195 = weight(abstract_txt:indices in 3723) [ClassicSimilarity], result of:
            0.58641195 = score(doc=3723,freq=5.0), product of:
              0.32758334 = queryWeight, product of:
                2.2215533 = boost
                7.319441 = idf(docFreq=79, maxDocs=44421)
                0.020145923 = queryNorm
              1.7901152 = fieldWeight in 3723, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                7.319441 = idf(docFreq=79, maxDocs=44421)
                0.109375 = fieldNorm(doc=3723)
          0.18874954 = weight(abstract_txt:hierarchical in 3723) [ClassicSimilarity], result of:
            0.18874954 = score(doc=3723,freq=1.0), product of:
              0.30116025 = queryWeight, product of:
                2.608797 = boost
                5.7302055 = idf(docFreq=391, maxDocs=44421)
                0.020145923 = queryNorm
              0.62674123 = fieldWeight in 3723, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7302055 = idf(docFreq=391, maxDocs=44421)
                0.109375 = fieldNorm(doc=3723)
          0.1588682 = weight(abstract_txt:document in 3723) [ClassicSimilarity], result of:
            0.1588682 = score(doc=3723,freq=1.0), product of:
              0.33825302 = queryWeight, product of:
                3.910005 = boost
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.020145923 = queryNorm
              0.46967265 = fieldWeight in 3723, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.109375 = fieldNorm(doc=3723)
        0.2 = coord(5/25)
    
  2. Kim, P.J.; Lee, J.Y.; Park, J.-H.: Developing a new collection-evaluation method : mapping and the user-side h-index (2009) 0.16
    0.16228035 = sum of:
      0.16228035 = product of:
        0.67616814 = sum of:
          0.054672565 = weight(abstract_txt:procedure in 158) [ClassicSimilarity], result of:
            0.054672565 = score(doc=158,freq=1.0), product of:
              0.13275115 = queryWeight, product of:
                6.58948 = idf(docFreq=165, maxDocs=44421)
                0.020145923 = queryNorm
              0.4118425 = fieldWeight in 158, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.58948 = idf(docFreq=165, maxDocs=44421)
                0.0625 = fieldNorm(doc=158)
          0.008915464 = weight(abstract_txt:with in 158) [ClassicSimilarity], result of:
            0.008915464 = score(doc=158,freq=1.0), product of:
              0.057147212 = queryWeight, product of:
                1.1364204 = boost
                2.4961398 = idf(docFreq=9949, maxDocs=44421)
                0.020145923 = queryNorm
              0.15600874 = fieldWeight in 158, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4961398 = idf(docFreq=9949, maxDocs=44421)
                0.0625 = fieldNorm(doc=158)
          0.063925356 = weight(abstract_txt:user in 158) [ClassicSimilarity], result of:
            0.063925356 = score(doc=158,freq=5.0), product of:
              0.124267586 = queryWeight, product of:
                1.6757932 = boost
                3.6808684 = idf(docFreq=3042, maxDocs=44421)
                0.020145923 = queryNorm
              0.514417 = fieldWeight in 158, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.6808684 = idf(docFreq=3042, maxDocs=44421)
                0.0625 = fieldNorm(doc=158)
          0.15245314 = weight(abstract_txt:collection in 158) [ClassicSimilarity], result of:
            0.15245314 = score(doc=158,freq=7.0), product of:
              0.19828536 = queryWeight, product of:
                2.116834 = boost
                4.649612 = idf(docFreq=1154, maxDocs=44421)
                0.020145923 = queryNorm
              0.7688573 = fieldWeight in 158, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                4.649612 = idf(docFreq=1154, maxDocs=44421)
                0.0625 = fieldNorm(doc=158)
          0.1842705 = weight(abstract_txt:index in 158) [ClassicSimilarity], result of:
            0.1842705 = score(doc=158,freq=9.0), product of:
              0.20691349 = queryWeight, product of:
                2.1623993 = boost
                4.7496953 = idf(docFreq=1044, maxDocs=44421)
                0.020145923 = queryNorm
              0.8905679 = fieldWeight in 158, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                4.7496953 = idf(docFreq=1044, maxDocs=44421)
                0.0625 = fieldNorm(doc=158)
          0.21193112 = weight(abstract_txt:indices in 158) [ClassicSimilarity], result of:
            0.21193112 = score(doc=158,freq=2.0), product of:
              0.32758334 = queryWeight, product of:
                2.2215533 = boost
                7.319441 = idf(docFreq=79, maxDocs=44421)
                0.020145923 = queryNorm
              0.6469533 = fieldWeight in 158, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.319441 = idf(docFreq=79, maxDocs=44421)
                0.0625 = fieldNorm(doc=158)
        0.24 = coord(6/25)
    
  3. Safder, I.; Ali, M.; Aljohani, N.R.; Nawaz, R.; Hassan, S.-U.: Neural machine translation for in-text citation classification (2023) 0.15
    0.14716269 = sum of:
      0.14716269 = product of:
        0.52558106 = sum of:
          0.06771278 = weight(abstract_txt:unlike in 2055) [ClassicSimilarity], result of:
            0.06771278 = score(doc=2055,freq=1.0), product of:
              0.15309902 = queryWeight, product of:
                1.073908 = boost
                7.0764947 = idf(docFreq=101, maxDocs=44421)
                0.020145923 = queryNorm
              0.44228092 = fieldWeight in 2055, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.0764947 = idf(docFreq=101, maxDocs=44421)
                0.0625 = fieldNorm(doc=2055)
          0.01260837 = weight(abstract_txt:with in 2055) [ClassicSimilarity], result of:
            0.01260837 = score(doc=2055,freq=2.0), product of:
              0.057147212 = queryWeight, product of:
                1.1364204 = boost
                2.4961398 = idf(docFreq=9949, maxDocs=44421)
                0.020145923 = queryNorm
              0.22062966 = fieldWeight in 2055, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4961398 = idf(docFreq=9949, maxDocs=44421)
                0.0625 = fieldNorm(doc=2055)
          0.09030992 = weight(abstract_txt:vectors in 2055) [ClassicSimilarity], result of:
            0.09030992 = score(doc=2055,freq=1.0), product of:
              0.18550216 = queryWeight, product of:
                1.182103 = boost
                7.7894444 = idf(docFreq=49, maxDocs=44421)
                0.020145923 = queryNorm
              0.48684028 = fieldWeight in 2055, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.7894444 = idf(docFreq=49, maxDocs=44421)
                0.0625 = fieldNorm(doc=2055)
          0.10119291 = weight(abstract_txt:calculating in 2055) [ClassicSimilarity], result of:
            0.10119291 = score(doc=2055,freq=1.0), product of:
              0.20012072 = queryWeight, product of:
                1.2277979 = boost
                8.090549 = idf(docFreq=36, maxDocs=44421)
                0.020145923 = queryNorm
              0.50565934 = fieldWeight in 2055, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.090549 = idf(docFreq=36, maxDocs=44421)
                0.0625 = fieldNorm(doc=2055)
          0.01703318 = weight(abstract_txt:from in 2055) [ClassicSimilarity], result of:
            0.01703318 = score(doc=2055,freq=2.0), product of:
              0.06983711 = queryWeight, product of:
                1.2562746 = boost
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.020145923 = queryNorm
              0.2438987 = fieldWeight in 2055, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.0625 = fieldNorm(doc=2055)
          0.08686595 = weight(abstract_txt:index in 2055) [ClassicSimilarity], result of:
            0.08686595 = score(doc=2055,freq=2.0), product of:
              0.20691349 = queryWeight, product of:
                2.1623993 = boost
                4.7496953 = idf(docFreq=1044, maxDocs=44421)
                0.020145923 = queryNorm
              0.41981772 = fieldWeight in 2055, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.7496953 = idf(docFreq=1044, maxDocs=44421)
                0.0625 = fieldNorm(doc=2055)
          0.14985794 = weight(abstract_txt:indices in 2055) [ClassicSimilarity], result of:
            0.14985794 = score(doc=2055,freq=1.0), product of:
              0.32758334 = queryWeight, product of:
                2.2215533 = boost
                7.319441 = idf(docFreq=79, maxDocs=44421)
                0.020145923 = queryNorm
              0.45746505 = fieldWeight in 2055, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.319441 = idf(docFreq=79, maxDocs=44421)
                0.0625 = fieldNorm(doc=2055)
        0.28 = coord(7/25)
    
  4. Kim, Y.W.; Kim, J.H.: ¬A model of knowledge based information retrieval with hierarchical concept graph (1990) 0.14
    0.13937253 = sum of:
      0.13937253 = product of:
        0.5807189 = sum of:
          0.12616488 = weight(abstract_txt:intuitive in 3908) [ClassicSimilarity], result of:
            0.12616488 = score(doc=3908,freq=2.0), product of:
              0.15856269 = queryWeight, product of:
                1.0929023 = boost
                7.201658 = idf(docFreq=89, maxDocs=44421)
                0.020145923 = queryNorm
              0.79567826 = fieldWeight in 3908, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.201658 = idf(docFreq=89, maxDocs=44421)
                0.078125 = fieldNorm(doc=3908)
          0.019302547 = weight(abstract_txt:with in 3908) [ClassicSimilarity], result of:
            0.019302547 = score(doc=3908,freq=3.0), product of:
              0.057147212 = queryWeight, product of:
                1.1364204 = boost
                2.4961398 = idf(docFreq=9949, maxDocs=44421)
                0.020145923 = queryNorm
              0.33776882 = fieldWeight in 3908, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.4961398 = idf(docFreq=9949, maxDocs=44421)
                0.078125 = fieldNorm(doc=3908)
          0.021291476 = weight(abstract_txt:from in 3908) [ClassicSimilarity], result of:
            0.021291476 = score(doc=3908,freq=2.0), product of:
              0.06983711 = queryWeight, product of:
                1.2562746 = boost
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.020145923 = queryNorm
              0.30487338 = fieldWeight in 3908, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.078125 = fieldNorm(doc=3908)
          0.03573536 = weight(abstract_txt:user in 3908) [ClassicSimilarity], result of:
            0.03573536 = score(doc=3908,freq=1.0), product of:
              0.124267586 = queryWeight, product of:
                1.6757932 = boost
                3.6808684 = idf(docFreq=3042, maxDocs=44421)
                0.020145923 = queryNorm
              0.28756785 = fieldWeight in 3908, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6808684 = idf(docFreq=3042, maxDocs=44421)
                0.078125 = fieldNorm(doc=3908)
          0.10858244 = weight(abstract_txt:index in 3908) [ClassicSimilarity], result of:
            0.10858244 = score(doc=3908,freq=2.0), product of:
              0.20691349 = queryWeight, product of:
                2.1623993 = boost
                4.7496953 = idf(docFreq=1044, maxDocs=44421)
                0.020145923 = queryNorm
              0.52477217 = fieldWeight in 3908, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.7496953 = idf(docFreq=1044, maxDocs=44421)
                0.078125 = fieldNorm(doc=3908)
          0.2696422 = weight(abstract_txt:hierarchical in 3908) [ClassicSimilarity], result of:
            0.2696422 = score(doc=3908,freq=4.0), product of:
              0.30116025 = queryWeight, product of:
                2.608797 = boost
                5.7302055 = idf(docFreq=391, maxDocs=44421)
                0.020145923 = queryNorm
              0.8953446 = fieldWeight in 3908, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.7302055 = idf(docFreq=391, maxDocs=44421)
                0.078125 = fieldNorm(doc=3908)
        0.24 = coord(6/25)
    
  5. Crestani, F.; Vegas, J.; Fuente, P. de la: ¬A graphical user interface for the retrieval of hierarchically structured documents (2004) 0.14
    0.13724072 = sum of:
      0.13724072 = product of:
        0.57183635 = sum of:
          0.09855651 = weight(abstract_txt:graphical in 3555) [ClassicSimilarity], result of:
            0.09855651 = score(doc=3555,freq=2.0), product of:
              0.13449275 = queryWeight, product of:
                1.0065383 = boost
                6.6325636 = idf(docFreq=158, maxDocs=44421)
                0.020145923 = queryNorm
              0.7328016 = fieldWeight in 3555, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.6325636 = idf(docFreq=158, maxDocs=44421)
                0.078125 = fieldNorm(doc=3555)
          0.08921205 = weight(abstract_txt:intuitive in 3555) [ClassicSimilarity], result of:
            0.08921205 = score(doc=3555,freq=1.0), product of:
              0.15856269 = queryWeight, product of:
                1.0929023 = boost
                7.201658 = idf(docFreq=89, maxDocs=44421)
                0.020145923 = queryNorm
              0.5626295 = fieldWeight in 3555, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.201658 = idf(docFreq=89, maxDocs=44421)
                0.078125 = fieldNorm(doc=3555)
          0.01114433 = weight(abstract_txt:with in 3555) [ClassicSimilarity], result of:
            0.01114433 = score(doc=3555,freq=1.0), product of:
              0.057147212 = queryWeight, product of:
                1.1364204 = boost
                2.4961398 = idf(docFreq=9949, maxDocs=44421)
                0.020145923 = queryNorm
              0.19501092 = fieldWeight in 3555, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4961398 = idf(docFreq=9949, maxDocs=44421)
                0.078125 = fieldNorm(doc=3555)
          0.015055347 = weight(abstract_txt:from in 3555) [ClassicSimilarity], result of:
            0.015055347 = score(doc=3555,freq=1.0), product of:
              0.06983711 = queryWeight, product of:
                1.2562746 = boost
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.020145923 = queryNorm
              0.21557805 = fieldWeight in 3555, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.078125 = fieldNorm(doc=3555)
          0.079906695 = weight(abstract_txt:user in 3555) [ClassicSimilarity], result of:
            0.079906695 = score(doc=3555,freq=5.0), product of:
              0.124267586 = queryWeight, product of:
                1.6757932 = boost
                3.6808684 = idf(docFreq=3042, maxDocs=44421)
                0.020145923 = queryNorm
              0.6430212 = fieldWeight in 3555, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.6808684 = idf(docFreq=3042, maxDocs=44421)
                0.078125 = fieldNorm(doc=3555)
          0.27796146 = weight(abstract_txt:document in 3555) [ClassicSimilarity], result of:
            0.27796146 = score(doc=3555,freq=6.0), product of:
              0.33825302 = queryWeight, product of:
                3.910005 = boost
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.020145923 = queryNorm
              0.821756 = fieldWeight in 3555, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.078125 = fieldNorm(doc=3555)
        0.24 = coord(6/25)