Document (#30440)

Author
Bloomfield, M.
Title
Indexing : neglected and poorly understood
Source
Cataloging and classification quarterly. 33(2001) no.1, pp.63-75
Year
2001
Abstract
The growth of the Internet has highlighted the use of machine indexing. Using the Internet as a searching device can be frustrating; the term "Python" is given as an example. Machine indexing is described as "rotten" and human indexing as "capricious." The problem seems to be the lack of a theoretical foundation for the art of indexing. What librarians have learned over the last hundred years has yet to yield a consistent approach to what really works best in preparing index terms and in the ability of our customers to search the various indexes. An attempt is made to consider the elements of indexing and their pros and cons. The argument is made that machine indexing is far too prolific in its production of index terms. Neither librarians nor computer programmers have made much progress in improving Internet indexing. Human indexing has had the same problems for over fifty years.
Footnote
See also: http://catalogingandclassificationquarterly.com/
Theme
Automatic indexing
Internet

Similar documents (content)

  1. Lancaster, F.W.: Trends in subject indexing from 1957 to 2000 (1980) 0.17
    0.16972578 = sum of:
      0.16972578 = product of:
        0.8486289 = sum of:
          0.052341916 = weight(abstract_txt:terms in 208) [ClassicSimilarity], result of:
            0.052341916 = score(doc=208,freq=2.0), product of:
              0.097629964 = queryWeight, product of:
                1.1123384 = boost
                4.043712 = idf(docFreq=2116, maxDocs=44421)
                0.021705309 = queryNorm
              0.53612554 = fieldWeight in 208, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.043712 = idf(docFreq=2116, maxDocs=44421)
                0.09375 = fieldNorm(doc=208)
          0.08482155 = weight(abstract_txt:index in 208) [ClassicSimilarity], result of:
            0.08482155 = score(doc=208,freq=2.0), product of:
              0.13469584 = queryWeight, product of:
                1.3065392 = boost
                4.7496953 = idf(docFreq=1044, maxDocs=44421)
                0.021705309 = queryNorm
              0.6297266 = fieldWeight in 208, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.7496953 = idf(docFreq=1044, maxDocs=44421)
                0.09375 = fieldNorm(doc=208)
          0.08025136 = weight(abstract_txt:made in 208) [ClassicSimilarity], result of:
            0.08025136 = score(doc=208,freq=1.0), product of:
              0.1872228 = queryWeight, product of:
                1.8865589 = boost
                4.5721703 = idf(docFreq=1247, maxDocs=44421)
                0.021705309 = queryNorm
              0.42864096 = fieldWeight in 208, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5721703 = idf(docFreq=1247, maxDocs=44421)
                0.09375 = fieldNorm(doc=208)
          0.123238795 = weight(abstract_txt:machine in 208) [ClassicSimilarity], result of:
            0.123238795 = score(doc=208,freq=1.0), product of:
              0.24920423 = queryWeight, product of:
                2.1765504 = boost
                5.274979 = idf(docFreq=617, maxDocs=44421)
                0.021705309 = queryNorm
              0.4945293 = fieldWeight in 208, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.274979 = idf(docFreq=617, maxDocs=44421)
                0.09375 = fieldNorm(doc=208)
          0.5079753 = weight(abstract_txt:indexing in 208) [ClassicSimilarity], result of:
            0.5079753 = score(doc=208,freq=6.0), product of:
              0.5084819 = queryWeight, product of:
                5.3850455 = boost
                4.3503094 = idf(docFreq=1557, maxDocs=44421)
                0.021705309 = queryNorm
              0.9990036 = fieldWeight in 208, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                4.3503094 = idf(docFreq=1557, maxDocs=44421)
                0.09375 = fieldNorm(doc=208)
        0.2 = coord(5/25)
    
  2. Lauser, B.; Johannsen, G.; Caracciolo, C.; Hage, W.R. van; Keizer, J.; Mayr, P.: Comparing human and automatic thesaurus mapping approaches in the agricultural domain (2008) 0.13
    0.13075931 = sum of:
      0.13075931 = product of:
        0.5448305 = sum of:
          0.12742746 = weight(abstract_txt:pros in 3627) [ClassicSimilarity], result of:
            0.12742746 = score(doc=3627,freq=1.0), product of:
              0.19951685 = queryWeight, product of:
                1.1243982 = boost
                8.175107 = idf(docFreq=33, maxDocs=44421)
                0.021705309 = queryNorm
              0.6386802 = fieldWeight in 3627, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.175107 = idf(docFreq=33, maxDocs=44421)
                0.078125 = fieldNorm(doc=3627)
          0.12742746 = weight(abstract_txt:cons in 3627) [ClassicSimilarity], result of:
            0.12742746 = score(doc=3627,freq=1.0), product of:
              0.19951685 = queryWeight, product of:
                1.1243982 = boost
                8.175107 = idf(docFreq=33, maxDocs=44421)
                0.021705309 = queryNorm
              0.6386802 = fieldWeight in 3627, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.175107 = idf(docFreq=33, maxDocs=44421)
                0.078125 = fieldNorm(doc=3627)
          0.0527267 = weight(abstract_txt:what in 3627) [ClassicSimilarity], result of:
            0.0527267 = score(doc=3627,freq=2.0), product of:
              0.11078764 = queryWeight, product of:
                1.1849254 = boost
                4.3075895 = idf(docFreq=1625, maxDocs=44421)
                0.021705309 = queryNorm
              0.47592586 = fieldWeight in 3627, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.3075895 = idf(docFreq=1625, maxDocs=44421)
                0.078125 = fieldNorm(doc=3627)
          0.06767382 = weight(abstract_txt:human in 3627) [ClassicSimilarity], result of:
            0.06767382 = score(doc=3627,freq=2.0), product of:
              0.13084325 = queryWeight, product of:
                1.2877188 = boost
                4.681277 = idf(docFreq=1118, maxDocs=44421)
                0.021705309 = queryNorm
              0.51721287 = fieldWeight in 3627, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.681277 = idf(docFreq=1118, maxDocs=44421)
                0.078125 = fieldNorm(doc=3627)
          0.06687613 = weight(abstract_txt:made in 3627) [ClassicSimilarity], result of:
            0.06687613 = score(doc=3627,freq=1.0), product of:
              0.1872228 = queryWeight, product of:
                1.8865589 = boost
                4.5721703 = idf(docFreq=1247, maxDocs=44421)
                0.021705309 = queryNorm
              0.3572008 = fieldWeight in 3627, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5721703 = idf(docFreq=1247, maxDocs=44421)
                0.078125 = fieldNorm(doc=3627)
          0.10269899 = weight(abstract_txt:machine in 3627) [ClassicSimilarity], result of:
            0.10269899 = score(doc=3627,freq=1.0), product of:
              0.24920423 = queryWeight, product of:
                2.1765504 = boost
                5.274979 = idf(docFreq=617, maxDocs=44421)
                0.021705309 = queryNorm
              0.41210774 = fieldWeight in 3627, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.274979 = idf(docFreq=617, maxDocs=44421)
                0.078125 = fieldNorm(doc=3627)
        0.24 = coord(6/25)
    
  3. Mooers, C.N.: The indexing language of an information retrieval system (1985) 0.13
    0.12662481 = sum of:
      0.12662481 = product of:
        0.39570254 = sum of:
          0.058268707 = weight(abstract_txt:hundred in 4644) [ClassicSimilarity], result of:
            0.058268707 = score(doc=4644,freq=1.0), product of:
              0.16646655 = queryWeight, product of:
                1.0270554 = boost
                7.467361 = idf(docFreq=68, maxDocs=44421)
                0.021705309 = queryNorm
              0.35003254 = fieldWeight in 4644, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.467361 = idf(docFreq=68, maxDocs=44421)
                0.046875 = fieldNorm(doc=4644)
          0.026170958 = weight(abstract_txt:terms in 4644) [ClassicSimilarity], result of:
            0.026170958 = score(doc=4644,freq=2.0), product of:
              0.097629964 = queryWeight, product of:
                1.1123384 = boost
                4.043712 = idf(docFreq=2116, maxDocs=44421)
                0.021705309 = queryNorm
              0.26806277 = fieldWeight in 4644, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.043712 = idf(docFreq=2116, maxDocs=44421)
                0.046875 = fieldNorm(doc=4644)
          0.021374326 = weight(abstract_txt:over in 4644) [ClassicSimilarity], result of:
            0.021374326 = score(doc=4644,freq=1.0), product of:
              0.10747521 = queryWeight, product of:
                1.1670771 = boost
                4.242705 = idf(docFreq=1734, maxDocs=44421)
                0.021705309 = queryNorm
              0.1988768 = fieldWeight in 4644, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.242705 = idf(docFreq=1734, maxDocs=44421)
                0.046875 = fieldNorm(doc=4644)
          0.022370046 = weight(abstract_txt:what in 4644) [ClassicSimilarity], result of:
            0.022370046 = score(doc=4644,freq=1.0), product of:
              0.11078764 = queryWeight, product of:
                1.1849254 = boost
                4.3075895 = idf(docFreq=1625, maxDocs=44421)
                0.021705309 = queryNorm
              0.20191826 = fieldWeight in 4644, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3075895 = idf(docFreq=1625, maxDocs=44421)
                0.046875 = fieldNorm(doc=4644)
          0.028810605 = weight(abstract_txt:years in 4644) [ClassicSimilarity], result of:
            0.028810605 = score(doc=4644,freq=1.0), product of:
              0.13114396 = queryWeight, product of:
                1.2891977 = boost
                4.686653 = idf(docFreq=1112, maxDocs=44421)
                0.021705309 = queryNorm
              0.21968687 = fieldWeight in 4644, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.686653 = idf(docFreq=1112, maxDocs=44421)
                0.046875 = fieldNorm(doc=4644)
          0.05194238 = weight(abstract_txt:index in 4644) [ClassicSimilarity], result of:
            0.05194238 = score(doc=4644,freq=3.0), product of:
              0.13469584 = queryWeight, product of:
                1.3065392 = boost
                4.7496953 = idf(docFreq=1044, maxDocs=44421)
                0.021705309 = queryNorm
              0.38562718 = fieldWeight in 4644, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.7496953 = idf(docFreq=1044, maxDocs=44421)
                0.046875 = fieldNorm(doc=4644)
          0.04012568 = weight(abstract_txt:made in 4644) [ClassicSimilarity], result of:
            0.04012568 = score(doc=4644,freq=1.0), product of:
              0.1872228 = queryWeight, product of:
                1.8865589 = boost
                4.5721703 = idf(docFreq=1247, maxDocs=44421)
                0.021705309 = queryNorm
              0.21432048 = fieldWeight in 4644, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5721703 = idf(docFreq=1247, maxDocs=44421)
                0.046875 = fieldNorm(doc=4644)
          0.14663982 = weight(abstract_txt:indexing in 4644) [ClassicSimilarity], result of:
            0.14663982 = score(doc=4644,freq=2.0), product of:
              0.5084819 = queryWeight, product of:
                5.3850455 = boost
                4.3503094 = idf(docFreq=1557, maxDocs=44421)
                0.021705309 = queryNorm
              0.28838748 = fieldWeight in 4644, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.3503094 = idf(docFreq=1557, maxDocs=44421)
                0.046875 = fieldNorm(doc=4644)
        0.32 = coord(8/25)
    
  4. Carroll, D.J.; Lele, P.: Human intervention in the networked environment : metadata alternatives (1998) 0.13
    0.1261214 = sum of:
      0.1261214 = product of:
        0.630607 = sum of:
          0.15291296 = weight(abstract_txt:pros in 3221) [ClassicSimilarity], result of:
            0.15291296 = score(doc=3221,freq=1.0), product of:
              0.19951685 = queryWeight, product of:
                1.1243982 = boost
                8.175107 = idf(docFreq=33, maxDocs=44421)
                0.021705309 = queryNorm
              0.7664163 = fieldWeight in 3221, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.175107 = idf(docFreq=33, maxDocs=44421)
                0.09375 = fieldNorm(doc=3221)
          0.15291296 = weight(abstract_txt:cons in 3221) [ClassicSimilarity], result of:
            0.15291296 = score(doc=3221,freq=1.0), product of:
              0.19951685 = queryWeight, product of:
                1.1243982 = boost
                8.175107 = idf(docFreq=33, maxDocs=44421)
                0.021705309 = queryNorm
              0.7664163 = fieldWeight in 3221, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.175107 = idf(docFreq=33, maxDocs=44421)
                0.09375 = fieldNorm(doc=3221)
          0.05742314 = weight(abstract_txt:human in 3221) [ClassicSimilarity], result of:
            0.05742314 = score(doc=3221,freq=1.0), product of:
              0.13084325 = queryWeight, product of:
                1.2877188 = boost
                4.681277 = idf(docFreq=1118, maxDocs=44421)
                0.021705309 = queryNorm
              0.4388697 = fieldWeight in 3221, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.681277 = idf(docFreq=1118, maxDocs=44421)
                0.09375 = fieldNorm(doc=3221)
          0.059977897 = weight(abstract_txt:index in 3221) [ClassicSimilarity], result of:
            0.059977897 = score(doc=3221,freq=1.0), product of:
              0.13469584 = queryWeight, product of:
                1.3065392 = boost
                4.7496953 = idf(docFreq=1044, maxDocs=44421)
                0.021705309 = queryNorm
              0.44528395 = fieldWeight in 3221, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7496953 = idf(docFreq=1044, maxDocs=44421)
                0.09375 = fieldNorm(doc=3221)
          0.20738003 = weight(abstract_txt:indexing in 3221) [ClassicSimilarity], result of:
            0.20738003 = score(doc=3221,freq=1.0), product of:
              0.5084819 = queryWeight, product of:
                5.3850455 = boost
                4.3503094 = idf(docFreq=1557, maxDocs=44421)
                0.021705309 = queryNorm
              0.4078415 = fieldWeight in 3221, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3503094 = idf(docFreq=1557, maxDocs=44421)
                0.09375 = fieldNorm(doc=3221)
        0.2 = coord(5/25)
    
  5. Cleverdon, C.W.; Mills, J.: The testing of index language devices (1985) 0.12
    0.11572106 = sum of:
      0.11572106 = product of:
        0.4821711 = sum of:
          0.058268707 = weight(abstract_txt:hundred in 4643) [ClassicSimilarity], result of:
            0.058268707 = score(doc=4643,freq=1.0), product of:
              0.16646655 = queryWeight, product of:
                1.0270554 = boost
                7.467361 = idf(docFreq=68, maxDocs=44421)
                0.021705309 = queryNorm
              0.35003254 = fieldWeight in 4643, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.467361 = idf(docFreq=68, maxDocs=44421)
                0.046875 = fieldNorm(doc=4643)
          0.018505663 = weight(abstract_txt:terms in 4643) [ClassicSimilarity], result of:
            0.018505663 = score(doc=4643,freq=1.0), product of:
              0.097629964 = queryWeight, product of:
                1.1123384 = boost
                4.043712 = idf(docFreq=2116, maxDocs=44421)
                0.021705309 = queryNorm
              0.189549 = fieldWeight in 4643, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.043712 = idf(docFreq=2116, maxDocs=44421)
                0.046875 = fieldNorm(doc=4643)
          0.022370046 = weight(abstract_txt:what in 4643) [ClassicSimilarity], result of:
            0.022370046 = score(doc=4643,freq=1.0), product of:
              0.11078764 = queryWeight, product of:
                1.1849254 = boost
                4.3075895 = idf(docFreq=1625, maxDocs=44421)
                0.021705309 = queryNorm
              0.20191826 = fieldWeight in 4643, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3075895 = idf(docFreq=1625, maxDocs=44421)
                0.046875 = fieldNorm(doc=4643)
          0.05194238 = weight(abstract_txt:index in 4643) [ClassicSimilarity], result of:
            0.05194238 = score(doc=4643,freq=3.0), product of:
              0.13469584 = queryWeight, product of:
                1.3065392 = boost
                4.7496953 = idf(docFreq=1044, maxDocs=44421)
                0.021705309 = queryNorm
              0.38562718 = fieldWeight in 4643, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.7496953 = idf(docFreq=1044, maxDocs=44421)
                0.046875 = fieldNorm(doc=4643)
          0.056746278 = weight(abstract_txt:made in 4643) [ClassicSimilarity], result of:
            0.056746278 = score(doc=4643,freq=2.0), product of:
              0.1872228 = queryWeight, product of:
                1.8865589 = boost
                4.5721703 = idf(docFreq=1247, maxDocs=44421)
                0.021705309 = queryNorm
              0.30309492 = fieldWeight in 4643, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.5721703 = idf(docFreq=1247, maxDocs=44421)
                0.046875 = fieldNorm(doc=4643)
          0.274338 = weight(abstract_txt:indexing in 4643) [ClassicSimilarity], result of:
            0.274338 = score(doc=4643,freq=7.0), product of:
              0.5084819 = queryWeight, product of:
                5.3850455 = boost
                4.3503094 = idf(docFreq=1557, maxDocs=44421)
                0.021705309 = queryNorm
              0.5395236 = fieldWeight in 4643, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                4.3503094 = idf(docFreq=1557, maxDocs=44421)
                0.046875 = fieldNorm(doc=4643)
        0.24 = coord(6/25)
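
How the similarity scores are derived

The breakdowns above are labelled [ClassicSimilarity], the TF-IDF scoring model in Lucene. The following is a minimal sketch, not the catalog's own code: it assumes tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)), which agrees with the figures in the listing. The function names are illustrative; the constants are copied from the explain tree of the first similar document (doc 208).

```python
import math

MAX_DOCS = 44421          # maxDocs, as shown in every idf line
QUERY_NORM = 0.021705309  # queryNorm, shared by all query terms


def idf(doc_freq, max_docs=MAX_DOCS):
    """Inverse document frequency: 1 + ln(maxDocs / (docFreq + 1))."""
    return 1.0 + math.log(max_docs / (doc_freq + 1.0))


def term_score(freq, doc_freq, boost, field_norm):
    """Score of one matching term: queryWeight * fieldWeight."""
    term_idf = idf(doc_freq)
    query_weight = boost * term_idf * QUERY_NORM
    field_weight = math.sqrt(freq) * term_idf * field_norm
    return query_weight * field_weight


def document_score(terms, field_norm, matched, total):
    """Sum of the term scores, scaled by coord(matched/total)."""
    raw = sum(term_score(freq, doc_freq, boost, field_norm)
              for freq, doc_freq, boost in terms)
    return raw * (matched / total)


# (freq, docFreq, boost) for the five query terms found in doc 208:
# terms, index, made, machine, indexing
TERMS_208 = [
    (2.0, 2116, 1.1123384),
    (2.0, 1044, 1.3065392),
    (1.0, 1247, 1.8865589),
    (1.0, 617, 2.1765504),
    (6.0, 1557, 5.3850455),
]

score_208 = document_score(TERMS_208, field_norm=0.09375, matched=5, total=25)
print(round(score_208, 4))  # 0.1697
```

The result agrees with the 0.16972578 listed for Lancaster (1980) up to floating-point rounding. The coord factor explains why a document that matches only 5 of the 25 query terms keeps just one fifth of its summed term scores.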