Document (#33025)

Author
Mai, J.-E.
Title
Analysis in indexing : document and domain centered approaches
Source
Information processing and management. 41(2005) no.3, S.599-611
Year
2005
Abstract
The paper discusses the notion of steps in indexing and reveals that the document-centered approach to indexing is prevalent and argues that the document-centered approach is problematic because it blocks out context-dependent factors in the indexing process. A domain-centered approach to indexing is presented as an alternative and the paper discusses how this approach includes a broader range of analyses and how it requires a new set of actions from using this approach; analysis of the domain, users and indexers. The paper concludes that the two-step procedure to indexing is insufficient to explain the indexing process and suggests that the domain-centered approach offers a guide for indexers that can help them manage the complexity of indexing.
Theme
Inhaltsanalyse

Similar documents (content)

  1. Jens-Erik Mai, J.-E.: ¬The role of documents, domains and decisions in indexing (2004) 0.48
    0.48248184 = sum of:
      0.48248184 = product of:
        1.5077558 = sum of:
          0.04911545 = weight(abstract_txt:analysis in 3653) [ClassicSimilarity], result of:
            0.04911545 = score(doc=3653,freq=4.0), product of:
              0.071852006 = queryWeight, product of:
                1.2095513 = boost
                3.6456752 = idf(docFreq=3151, maxDocs=44421)
                0.016294334 = queryNorm
              0.68356407 = fieldWeight in 3653, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.6456752 = idf(docFreq=3151, maxDocs=44421)
                0.09375 = fieldNorm(doc=3653)
          0.033641282 = weight(abstract_txt:process in 3653) [ClassicSimilarity], result of:
            0.033641282 = score(doc=3653,freq=1.0), product of:
              0.08862614 = queryWeight, product of:
                1.3433394 = boost
                4.048922 = idf(docFreq=2105, maxDocs=44421)
                0.016294334 = queryNorm
              0.37958646 = fieldWeight in 3653, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.048922 = idf(docFreq=2105, maxDocs=44421)
                0.09375 = fieldNorm(doc=3653)
          0.0546186 = weight(abstract_txt:paper in 3653) [ClassicSimilarity], result of:
            0.0546186 = score(doc=3653,freq=3.0), product of:
              0.09716963 = queryWeight, product of:
                1.7227242 = boost
                3.4616103 = idf(docFreq=3788, maxDocs=44421)
                0.016294334 = queryNorm
              0.5620954 = fieldWeight in 3653, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.4616103 = idf(docFreq=3788, maxDocs=44421)
                0.09375 = fieldNorm(doc=3653)
          0.029027695 = weight(abstract_txt:that in 3653) [ClassicSimilarity], result of:
            0.029027695 = score(doc=3653,freq=3.0), product of:
              0.07558949 = queryWeight, product of:
                1.961578 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.016294334 = queryNorm
              0.3840176 = fieldWeight in 3653, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.09375 = fieldNorm(doc=3653)
          0.15158944 = weight(abstract_txt:domain in 3653) [ClassicSimilarity], result of:
            0.15158944 = score(doc=3653,freq=2.0), product of:
              0.2417832 = queryWeight, product of:
                3.1378553 = boost
                4.7288613 = idf(docFreq=1066, maxDocs=44421)
                0.016294334 = queryNorm
              0.62696433 = fieldWeight in 3653, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.7288613 = idf(docFreq=1066, maxDocs=44421)
                0.09375 = fieldNorm(doc=3653)
          0.13789527 = weight(abstract_txt:approach in 3653) [ClassicSimilarity], result of:
            0.13789527 = score(doc=3653,freq=3.0), product of:
              0.22699331 = queryWeight, product of:
                3.7236772 = boost
                3.741144 = idf(docFreq=2864, maxDocs=44421)
                0.016294334 = queryNorm
              0.60748607 = fieldWeight in 3653, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.741144 = idf(docFreq=2864, maxDocs=44421)
                0.09375 = fieldNorm(doc=3653)
          0.5797838 = weight(abstract_txt:centered in 3653) [ClassicSimilarity], result of:
            0.5797838 = score(doc=3653,freq=2.0), product of:
              0.63698256 = queryWeight, product of:
                5.694278 = boost
                6.8651857 = idf(docFreq=125, maxDocs=44421)
                0.016294334 = queryNorm
              0.9102036 = fieldWeight in 3653, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.8651857 = idf(docFreq=125, maxDocs=44421)
                0.09375 = fieldNorm(doc=3653)
          0.47208434 = weight(abstract_txt:indexing in 3653) [ClassicSimilarity], result of:
            0.47208434 = score(doc=3653,freq=8.0), product of:
              0.40924484 = queryWeight, product of:
                5.773331 = boost
                4.3503094 = idf(docFreq=1557, maxDocs=44421)
                0.016294334 = queryNorm
              1.1535499 = fieldWeight in 3653, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                4.3503094 = idf(docFreq=1557, maxDocs=44421)
                0.09375 = fieldNorm(doc=3653)
        0.32 = coord(8/25)
    
  2. Fidel, R.: User-centered indexing (1994) 0.28
    0.28349394 = sum of:
      0.28349394 = product of:
        1.1812247 = sum of:
          0.0280344 = weight(abstract_txt:process in 8258) [ClassicSimilarity], result of:
            0.0280344 = score(doc=8258,freq=1.0), product of:
              0.08862614 = queryWeight, product of:
                1.3433394 = boost
                4.048922 = idf(docFreq=2105, maxDocs=44421)
                0.016294334 = queryNorm
              0.31632203 = fieldWeight in 8258, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.048922 = idf(docFreq=2105, maxDocs=44421)
                0.078125 = fieldNorm(doc=8258)
          0.019750847 = weight(abstract_txt:that in 8258) [ClassicSimilarity], result of:
            0.019750847 = score(doc=8258,freq=2.0), product of:
              0.07558949 = queryWeight, product of:
                1.961578 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.016294334 = queryNorm
              0.2612909 = fieldWeight in 8258, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.078125 = fieldNorm(doc=8258)
          0.10032886 = weight(abstract_txt:document in 8258) [ClassicSimilarity], result of:
            0.10032886 = score(doc=8258,freq=4.0), product of:
              0.14953011 = queryWeight, product of:
                2.1370506 = boost
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.016294334 = queryNorm
              0.6709609 = fieldWeight in 8258, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.078125 = fieldNorm(doc=8258)
          0.13268979 = weight(abstract_txt:approach in 8258) [ClassicSimilarity], result of:
            0.13268979 = score(doc=8258,freq=4.0), product of:
              0.22699331 = queryWeight, product of:
                3.7236772 = boost
                3.741144 = idf(docFreq=2864, maxDocs=44421)
                0.016294334 = queryNorm
              0.5845537 = fieldWeight in 8258, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.741144 = idf(docFreq=2864, maxDocs=44421)
                0.078125 = fieldNorm(doc=8258)
          0.4831532 = weight(abstract_txt:centered in 8258) [ClassicSimilarity], result of:
            0.4831532 = score(doc=8258,freq=2.0), product of:
              0.63698256 = queryWeight, product of:
                5.694278 = boost
                6.8651857 = idf(docFreq=125, maxDocs=44421)
                0.016294334 = queryNorm
              0.758503 = fieldWeight in 8258, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.8651857 = idf(docFreq=125, maxDocs=44421)
                0.078125 = fieldNorm(doc=8258)
          0.41726756 = weight(abstract_txt:indexing in 8258) [ClassicSimilarity], result of:
            0.41726756 = score(doc=8258,freq=9.0), product of:
              0.40924484 = queryWeight, product of:
                5.773331 = boost
                4.3503094 = idf(docFreq=1557, maxDocs=44421)
                0.016294334 = queryNorm
              1.0196037 = fieldWeight in 8258, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                4.3503094 = idf(docFreq=1557, maxDocs=44421)
                0.078125 = fieldNorm(doc=8258)
        0.24 = coord(6/25)
    
  3. Wu, Y.: Indexing historical, political cartoons for retrieval (2013) 0.20
    0.19789432 = sum of:
      0.19789432 = product of:
        0.8245597 = sum of:
          0.020464772 = weight(abstract_txt:analysis in 2070) [ClassicSimilarity], result of:
            0.020464772 = score(doc=2070,freq=1.0), product of:
              0.071852006 = queryWeight, product of:
                1.2095513 = boost
                3.6456752 = idf(docFreq=3151, maxDocs=44421)
                0.016294334 = queryNorm
              0.28481838 = fieldWeight in 2070, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6456752 = idf(docFreq=3151, maxDocs=44421)
                0.078125 = fieldNorm(doc=2070)
          0.026278391 = weight(abstract_txt:paper in 2070) [ClassicSimilarity], result of:
            0.026278391 = score(doc=2070,freq=1.0), product of:
              0.09716963 = queryWeight, product of:
                1.7227242 = boost
                3.4616103 = idf(docFreq=3788, maxDocs=44421)
                0.016294334 = queryNorm
              0.2704383 = fieldWeight in 2070, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4616103 = idf(docFreq=3788, maxDocs=44421)
                0.078125 = fieldNorm(doc=2070)
          0.027931914 = weight(abstract_txt:that in 2070) [ClassicSimilarity], result of:
            0.027931914 = score(doc=2070,freq=4.0), product of:
              0.07558949 = queryWeight, product of:
                1.961578 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.016294334 = queryNorm
              0.3695211 = fieldWeight in 2070, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.078125 = fieldNorm(doc=2070)
          0.24370106 = weight(abstract_txt:indexers in 2070) [ClassicSimilarity], result of:
            0.24370106 = score(doc=2070,freq=4.0), product of:
              0.23604016 = queryWeight, product of:
                2.192289 = boost
                6.6077175 = idf(docFreq=162, maxDocs=44421)
                0.016294334 = queryNorm
              1.0324559 = fieldWeight in 2070, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.6077175 = idf(docFreq=162, maxDocs=44421)
                0.078125 = fieldNorm(doc=2070)
          0.066344894 = weight(abstract_txt:approach in 2070) [ClassicSimilarity], result of:
            0.066344894 = score(doc=2070,freq=1.0), product of:
              0.22699331 = queryWeight, product of:
                3.7236772 = boost
                3.741144 = idf(docFreq=2864, maxDocs=44421)
                0.016294334 = queryNorm
              0.29227686 = fieldWeight in 2070, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.741144 = idf(docFreq=2864, maxDocs=44421)
                0.078125 = fieldNorm(doc=2070)
          0.43983865 = weight(abstract_txt:indexing in 2070) [ClassicSimilarity], result of:
            0.43983865 = score(doc=2070,freq=10.0), product of:
              0.40924484 = queryWeight, product of:
                5.773331 = boost
                4.3503094 = idf(docFreq=1557, maxDocs=44421)
                0.016294334 = queryNorm
              1.0747567 = fieldWeight in 2070, product of:
                3.1622777 = tf(freq=10.0), with freq of:
                  10.0 = termFreq=10.0
                4.3503094 = idf(docFreq=1557, maxDocs=44421)
                0.078125 = fieldNorm(doc=2070)
        0.24 = coord(6/25)
    
  4. Sigel, A.: How can user-oriented depth analysis be constructively guided? (2000) 0.19
    0.19347148 = sum of:
      0.19347148 = product of:
        0.53742075 = sum of:
          0.023701578 = weight(abstract_txt:step in 1133) [ClassicSimilarity], result of:
            0.023701578 = score(doc=1133,freq=1.0), product of:
              0.09983799 = queryWeight, product of:
                1.0081793 = boost
                6.0774503 = idf(docFreq=276, maxDocs=44421)
                0.016294334 = queryNorm
              0.2374004 = fieldWeight in 1133, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.0774503 = idf(docFreq=276, maxDocs=44421)
                0.0390625 = fieldNorm(doc=1133)
          0.020464772 = weight(abstract_txt:analysis in 1133) [ClassicSimilarity], result of:
            0.020464772 = score(doc=1133,freq=4.0), product of:
              0.071852006 = queryWeight, product of:
                1.2095513 = boost
                3.6456752 = idf(docFreq=3151, maxDocs=44421)
                0.016294334 = queryNorm
              0.28481838 = fieldWeight in 1133, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.6456752 = idf(docFreq=3151, maxDocs=44421)
                0.0390625 = fieldNorm(doc=1133)
          0.019823315 = weight(abstract_txt:process in 1133) [ClassicSimilarity], result of:
            0.019823315 = score(doc=1133,freq=2.0), product of:
              0.08862614 = queryWeight, product of:
                1.3433394 = boost
                4.048922 = idf(docFreq=2105, maxDocs=44421)
                0.016294334 = queryNorm
              0.22367345 = fieldWeight in 1133, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.048922 = idf(docFreq=2105, maxDocs=44421)
                0.0390625 = fieldNorm(doc=1133)
          0.013139196 = weight(abstract_txt:paper in 1133) [ClassicSimilarity], result of:
            0.013139196 = score(doc=1133,freq=1.0), product of:
              0.09716963 = queryWeight, product of:
                1.7227242 = boost
                3.4616103 = idf(docFreq=3788, maxDocs=44421)
                0.016294334 = queryNorm
              0.13521916 = fieldWeight in 1133, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4616103 = idf(docFreq=3788, maxDocs=44421)
                0.0390625 = fieldNorm(doc=1133)
          0.0156144155 = weight(abstract_txt:that in 1133) [ClassicSimilarity], result of:
            0.0156144155 = score(doc=1133,freq=5.0), product of:
              0.07558949 = queryWeight, product of:
                1.961578 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.016294334 = queryNorm
              0.2065686 = fieldWeight in 1133, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.0390625 = fieldNorm(doc=1133)
          0.025082216 = weight(abstract_txt:document in 1133) [ClassicSimilarity], result of:
            0.025082216 = score(doc=1133,freq=1.0), product of:
              0.14953011 = queryWeight, product of:
                2.1370506 = boost
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.016294334 = queryNorm
              0.16774023 = fieldWeight in 1133, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.0390625 = fieldNorm(doc=1133)
          0.060925264 = weight(abstract_txt:indexers in 1133) [ClassicSimilarity], result of:
            0.060925264 = score(doc=1133,freq=1.0), product of:
              0.23604016 = queryWeight, product of:
                2.192289 = boost
                6.6077175 = idf(docFreq=162, maxDocs=44421)
                0.016294334 = queryNorm
              0.25811398 = fieldWeight in 1133, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.6077175 = idf(docFreq=162, maxDocs=44421)
                0.0390625 = fieldNorm(doc=1133)
          0.089324936 = weight(abstract_txt:domain in 1133) [ClassicSimilarity], result of:
            0.089324936 = score(doc=1133,freq=4.0), product of:
              0.2417832 = queryWeight, product of:
                3.1378553 = boost
                4.7288613 = idf(docFreq=1066, maxDocs=44421)
                0.016294334 = queryNorm
              0.36944228 = fieldWeight in 1133, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.7288613 = idf(docFreq=1066, maxDocs=44421)
                0.0390625 = fieldNorm(doc=1133)
          0.26934507 = weight(abstract_txt:indexing in 1133) [ClassicSimilarity], result of:
            0.26934507 = score(doc=1133,freq=15.0), product of:
              0.40924484 = queryWeight, product of:
                5.773331 = boost
                4.3503094 = idf(docFreq=1557, maxDocs=44421)
                0.016294334 = queryNorm
              0.65815145 = fieldWeight in 1133, product of:
                3.8729835 = tf(freq=15.0), with freq of:
                  15.0 = termFreq=15.0
                4.3503094 = idf(docFreq=1557, maxDocs=44421)
                0.0390625 = fieldNorm(doc=1133)
        0.36 = coord(9/25)
    
  5. Cooper, W.S.: Indexing documents by Gedanken experimentation (1978) 0.17
    0.17235608 = sum of:
      0.17235608 = product of:
        0.7181504 = sum of:
          0.048557006 = weight(abstract_txt:process in 411) [ClassicSimilarity], result of:
            0.048557006 = score(doc=411,freq=3.0), product of:
              0.08862614 = queryWeight, product of:
                1.3433394 = boost
                4.048922 = idf(docFreq=2105, maxDocs=44421)
                0.016294334 = queryNorm
              0.54788584 = fieldWeight in 411, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.048922 = idf(docFreq=2105, maxDocs=44421)
                0.078125 = fieldNorm(doc=411)
          0.013965957 = weight(abstract_txt:that in 411) [ClassicSimilarity], result of:
            0.013965957 = score(doc=411,freq=1.0), product of:
              0.07558949 = queryWeight, product of:
                1.961578 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.016294334 = queryNorm
              0.18476056 = fieldWeight in 411, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.078125 = fieldNorm(doc=411)
          0.05016443 = weight(abstract_txt:document in 411) [ClassicSimilarity], result of:
            0.05016443 = score(doc=411,freq=1.0), product of:
              0.14953011 = queryWeight, product of:
                2.1370506 = boost
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.016294334 = queryNorm
              0.33548045 = fieldWeight in 411, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.078125 = fieldNorm(doc=411)
          0.12185053 = weight(abstract_txt:indexers in 411) [ClassicSimilarity], result of:
            0.12185053 = score(doc=411,freq=1.0), product of:
              0.23604016 = queryWeight, product of:
                2.192289 = boost
                6.6077175 = idf(docFreq=162, maxDocs=44421)
                0.016294334 = queryNorm
              0.51622796 = fieldWeight in 411, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.6077175 = idf(docFreq=162, maxDocs=44421)
                0.078125 = fieldNorm(doc=411)
          0.066344894 = weight(abstract_txt:approach in 411) [ClassicSimilarity], result of:
            0.066344894 = score(doc=411,freq=1.0), product of:
              0.22699331 = queryWeight, product of:
                3.7236772 = boost
                3.741144 = idf(docFreq=2864, maxDocs=44421)
                0.016294334 = queryNorm
              0.29227686 = fieldWeight in 411, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.741144 = idf(docFreq=2864, maxDocs=44421)
                0.078125 = fieldNorm(doc=411)
          0.41726756 = weight(abstract_txt:indexing in 411) [ClassicSimilarity], result of:
            0.41726756 = score(doc=411,freq=9.0), product of:
              0.40924484 = queryWeight, product of:
                5.773331 = boost
                4.3503094 = idf(docFreq=1557, maxDocs=44421)
                0.016294334 = queryNorm
              1.0196037 = fieldWeight in 411, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                4.3503094 = idf(docFreq=1557, maxDocs=44421)
                0.078125 = fieldNorm(doc=411)
        0.24 = coord(6/25)