Document (#3092)

Editor
Knowledge-based systems development
Author
Milstead, J.L.
Title
Methodologies for subject analysis in bibliographic databases
Source
IEEE proceedings. Pt.A. 28(1992) no.3, pp. 407-432
Year
1992
Abstract
Report on a review of subject analysis undertaken to help database managers determine whether new and little-known capabilities would improve the cost-effectiveness of subject analysis operations. Operational machine-aided and automatic indexing systems were found to form a continuum. Commercial automatic indexing packages were also reviewed. The primary obstacle to the development of automatic indexing is the lack of machine understanding of natural language. Recommendations for action include increasing the power of the indexer interface, studying indexing policies, enriching thesauri, and considering the development of machine-aided indexing.

Similar documents (author)

  1. Milstead, J.L.: Database design : Indexing applications (1989) 5.41
    5.4105906 = sum of:
      5.4105906 = weight(author_txt:milstead in 866) [ClassicSimilarity], result of:
        5.4105906 = fieldWeight in 866, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.656945 = idf(docFreq=20, maxDocs=44421)
          0.625 = fieldNorm(doc=866)
    
  2. Milstead, J.L.: Specifications for thesaurus software (1991) 5.41
    5.4105906 = sum of:
      5.4105906 = weight(author_txt:milstead in 2290) [ClassicSimilarity], result of:
        5.4105906 = fieldWeight in 2290, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.656945 = idf(docFreq=20, maxDocs=44421)
          0.625 = fieldNorm(doc=2290)
    
  3. Milstead, J.L.: Methodologies for subject analysis in bibliographic databases (1992) 5.41
    5.4105906 = sum of:
      5.4105906 = weight(author_txt:milstead in 2310) [ClassicSimilarity], result of:
        5.4105906 = fieldWeight in 2310, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.656945 = idf(docFreq=20, maxDocs=44421)
          0.625 = fieldNorm(doc=2310)
    
  4. Milstead, J.L.: Natural versus inverted word order in subject headings (1980) 5.41
    5.4105906 = sum of:
      5.4105906 = weight(author_txt:milstead in 2866) [ClassicSimilarity], result of:
        5.4105906 = fieldWeight in 2866, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.656945 = idf(docFreq=20, maxDocs=44421)
          0.625 = fieldNorm(doc=2866)
    
  5. Milstead, J.L.: Thesaurus software packages (1990) 5.41
    5.4105906 = sum of:
      5.4105906 = weight(author_txt:milstead in 4867) [ClassicSimilarity], result of:
        5.4105906 = fieldWeight in 4867, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.656945 = idf(docFreq=20, maxDocs=44421)
          0.625 = fieldNorm(doc=4867)
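
The author scores above all follow the same pattern: fieldWeight = tf x idf x fieldNorm. The following is a minimal Python sketch of that arithmetic, not the search engine's own code. It assumes the older ClassicSimilarity forms tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)), which reproduce the listed values. The function names are illustrative only.

```python
import math


def classic_idf(doc_freq, max_docs):
    # Assumed idf form: 1 + ln(maxDocs / (docFreq + 1)).
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def classic_tf(freq):
    # Assumed tf form: square root of the raw term frequency.
    return math.sqrt(freq)


def field_weight(freq, doc_freq, max_docs, field_norm):
    # fieldWeight = tf * idf * fieldNorm, as in the explain trees above.
    return classic_tf(freq) * classic_idf(doc_freq, max_docs) * field_norm


# Values copied from the author_txt:milstead entries.
author_score = field_weight(freq=1.0, doc_freq=20, max_docs=44421, field_norm=0.625)
print(author_score)  # should land close to the listed 5.4105906
```

Because all five records share the same term frequency, document frequency, and field norm, they receive the same score. Small differences in the last digits are expected, since the listing appears to use single-precision floats.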
    

Similar documents (content)

  1. Milstead, J.L.: Methodologies for subject analysis in bibliographic databases (1992) 0.43
    0.42542458 = sum of:
      0.42542458 = product of:
        1.0635614 = sum of:
          0.07522969 = weight(abstract_txt:reviewed in 2310) [ClassicSimilarity], result of:
            0.07522969 = score(doc=2310,freq=1.0), product of:
              0.13156323 = queryWeight, product of:
                6.099349 = idf(docFreq=270, maxDocs=44421)
                0.021570046 = queryNorm
              0.57181394 = fieldWeight in 2310, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.099349 = idf(docFreq=270, maxDocs=44421)
                0.09375 = fieldNorm(doc=2310)
          0.07981482 = weight(abstract_txt:policies in 2310) [ClassicSimilarity], result of:
            0.07981482 = score(doc=2310,freq=1.0), product of:
              0.13685606 = queryWeight, product of:
                1.0199168 = boost
                6.2208285 = idf(docFreq=239, maxDocs=44421)
                0.021570046 = queryNorm
              0.58320266 = fieldWeight in 2310, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2208285 = idf(docFreq=239, maxDocs=44421)
                0.09375 = fieldNorm(doc=2310)
          0.11421291 = weight(abstract_txt:packages in 2310) [ClassicSimilarity], result of:
            0.11421291 = score(doc=2310,freq=1.0), product of:
              0.17378747 = queryWeight, product of:
                1.1493226 = boost
                7.01012 = idf(docFreq=108, maxDocs=44421)
                0.021570046 = queryNorm
              0.6571987 = fieldWeight in 2310, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.01012 = idf(docFreq=108, maxDocs=44421)
                0.09375 = fieldNorm(doc=2310)
          0.03270949 = weight(abstract_txt:were in 2310) [ClassicSimilarity], result of:
            0.03270949 = score(doc=2310,freq=1.0), product of:
              0.095133655 = queryWeight, product of:
                1.2025824 = boost
                3.6674848 = idf(docFreq=3083, maxDocs=44421)
                0.021570046 = queryNorm
              0.3438267 = fieldWeight in 2310, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6674848 = idf(docFreq=3083, maxDocs=44421)
                0.09375 = fieldNorm(doc=2310)
          0.037708506 = weight(abstract_txt:development in 2310) [ClassicSimilarity], result of:
            0.037708506 = score(doc=2310,freq=1.0), product of:
              0.10459507 = queryWeight, product of:
                1.260966 = boost
                3.8455355 = idf(docFreq=2580, maxDocs=44421)
                0.021570046 = queryNorm
              0.36051896 = fieldWeight in 2310, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.8455355 = idf(docFreq=2580, maxDocs=44421)
                0.09375 = fieldNorm(doc=2310)
          0.04819411 = weight(abstract_txt:analysis in 2310) [ClassicSimilarity], result of:
            0.04819411 = score(doc=2310,freq=1.0), product of:
              0.14100832 = queryWeight, product of:
                1.7931463 = boost
                3.6456752 = idf(docFreq=3151, maxDocs=44421)
                0.021570046 = queryNorm
              0.34178203 = fieldWeight in 2310, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6456752 = idf(docFreq=3151, maxDocs=44421)
                0.09375 = fieldNorm(doc=2310)
          0.05945276 = weight(abstract_txt:subject in 2310) [ClassicSimilarity], result of:
            0.05945276 = score(doc=2310,freq=1.0), product of:
              0.16219226 = queryWeight, product of:
                1.923129 = boost
                3.9099448 = idf(docFreq=2419, maxDocs=44421)
                0.021570046 = queryNorm
              0.36655733 = fieldWeight in 2310, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9099448 = idf(docFreq=2419, maxDocs=44421)
                0.09375 = fieldNorm(doc=2310)
          0.19728915 = weight(abstract_txt:automatic in 2310) [ClassicSimilarity], result of:
            0.19728915 = score(doc=2310,freq=2.0), product of:
              0.2864008 = queryWeight, product of:
                2.555527 = boost
                5.1956835 = idf(docFreq=668, maxDocs=44421)
                0.021570046 = queryNorm
              0.68885684 = fieldWeight in 2310, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.1956835 = idf(docFreq=668, maxDocs=44421)
                0.09375 = fieldNorm(doc=2310)
          0.14598975 = weight(abstract_txt:machine in 2310) [ClassicSimilarity], result of:
            0.14598975 = score(doc=2310,freq=1.0), product of:
              0.2952095 = queryWeight, product of:
                2.594529 = boost
                5.274979 = idf(docFreq=617, maxDocs=44421)
                0.021570046 = queryNorm
              0.4945293 = fieldWeight in 2310, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.274979 = idf(docFreq=617, maxDocs=44421)
                0.09375 = fieldNorm(doc=2310)
          0.27296016 = weight(abstract_txt:indexing in 2310) [ClassicSimilarity], result of:
            0.27296016 = score(doc=2310,freq=4.0), product of:
              0.33464 = queryWeight, product of:
                3.566208 = boost
                4.3503094 = idf(docFreq=1557, maxDocs=44421)
                0.021570046 = queryNorm
              0.815683 = fieldWeight in 2310, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.3503094 = idf(docFreq=1557, maxDocs=44421)
                0.09375 = fieldNorm(doc=2310)
        0.4 = coord(10/25)
    
  2. White, H.; Willis, C.; Greenberg, J.: HIVEing : the effect of a semantic web technology on inter-indexer consistency (2014) 0.24
    0.23607185 = sum of:
      0.23607185 = product of:
        0.8431138 = sum of:
          0.05928761 = weight(abstract_txt:studying in 2781) [ClassicSimilarity], result of:
            0.05928761 = score(doc=2781,freq=1.0), product of:
              0.14708842 = queryWeight, product of:
                1.0573578 = boost
                6.449194 = idf(docFreq=190, maxDocs=44421)
                0.021570046 = queryNorm
              0.40307462 = fieldWeight in 2781, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.449194 = idf(docFreq=190, maxDocs=44421)
                0.0625 = fieldNorm(doc=2781)
          0.13240255 = weight(abstract_txt:indexer in 2781) [ClassicSimilarity], result of:
            0.13240255 = score(doc=2781,freq=3.0), product of:
              0.17424475 = queryWeight, product of:
                1.1508337 = boost
                7.019336 = idf(docFreq=107, maxDocs=44421)
                0.021570046 = queryNorm
              0.7598654 = fieldWeight in 2781, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.019336 = idf(docFreq=107, maxDocs=44421)
                0.0625 = fieldNorm(doc=2781)
          0.0308388 = weight(abstract_txt:were in 2781) [ClassicSimilarity], result of:
            0.0308388 = score(doc=2781,freq=2.0), product of:
              0.095133655 = queryWeight, product of:
                1.2025824 = boost
                3.6674848 = idf(docFreq=3083, maxDocs=44421)
                0.021570046 = queryNorm
              0.3241629 = fieldWeight in 2781, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.6674848 = idf(docFreq=3083, maxDocs=44421)
                0.0625 = fieldNorm(doc=2781)
          0.032129407 = weight(abstract_txt:analysis in 2781) [ClassicSimilarity], result of:
            0.032129407 = score(doc=2781,freq=1.0), product of:
              0.14100832 = queryWeight, product of:
                1.7931463 = boost
                3.6456752 = idf(docFreq=3151, maxDocs=44421)
                0.021570046 = queryNorm
              0.2278547 = fieldWeight in 2781, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6456752 = idf(docFreq=3151, maxDocs=44421)
                0.0625 = fieldNorm(doc=2781)
          0.29322135 = weight(abstract_txt:aided in 2781) [ClassicSimilarity], result of:
            0.29322135 = score(doc=2781,freq=2.0), product of:
              0.42697218 = queryWeight, product of:
                2.5476954 = boost
                7.769642 = idf(docFreq=50, maxDocs=44421)
                0.021570046 = queryNorm
              0.6867458 = fieldWeight in 2781, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.769642 = idf(docFreq=50, maxDocs=44421)
                0.0625 = fieldNorm(doc=2781)
          0.13764045 = weight(abstract_txt:machine in 2781) [ClassicSimilarity], result of:
            0.13764045 = score(doc=2781,freq=2.0), product of:
              0.2952095 = queryWeight, product of:
                2.594529 = boost
                5.274979 = idf(docFreq=617, maxDocs=44421)
                0.021570046 = queryNorm
              0.4662467 = fieldWeight in 2781, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.274979 = idf(docFreq=617, maxDocs=44421)
                0.0625 = fieldNorm(doc=2781)
          0.15759362 = weight(abstract_txt:indexing in 2781) [ClassicSimilarity], result of:
            0.15759362 = score(doc=2781,freq=3.0), product of:
              0.33464 = queryWeight, product of:
                3.566208 = boost
                4.3503094 = idf(docFreq=1557, maxDocs=44421)
                0.021570046 = queryNorm
              0.4709348 = fieldWeight in 2781, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.3503094 = idf(docFreq=1557, maxDocs=44421)
                0.0625 = fieldNorm(doc=2781)
        0.28 = coord(7/25)
    
  3. Greenrich, E.: CD-ROM data preparation enhancements (1993) 0.23
    0.22574292 = sum of:
      0.22574292 = product of:
        1.1287146 = sum of:
          0.07602862 = weight(abstract_txt:databases in 7842) [ClassicSimilarity], result of:
            0.07602862 = score(doc=7842,freq=1.0), product of:
              0.13779832 = queryWeight, product of:
                1.4473372 = boost
                4.413907 = idf(docFreq=1461, maxDocs=44421)
                0.021570046 = queryNorm
              0.5517384 = fieldWeight in 7842, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.413907 = idf(docFreq=1461, maxDocs=44421)
                0.125 = fieldNorm(doc=7842)
          0.41467762 = weight(abstract_txt:aided in 7842) [ClassicSimilarity], result of:
            0.41467762 = score(doc=7842,freq=1.0), product of:
              0.42697218 = queryWeight, product of:
                2.5476954 = boost
                7.769642 = idf(docFreq=50, maxDocs=44421)
                0.021570046 = queryNorm
              0.97120523 = fieldWeight in 7842, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.769642 = idf(docFreq=50, maxDocs=44421)
                0.125 = fieldNorm(doc=7842)
          0.18600598 = weight(abstract_txt:automatic in 7842) [ClassicSimilarity], result of:
            0.18600598 = score(doc=7842,freq=1.0), product of:
              0.2864008 = queryWeight, product of:
                2.555527 = boost
                5.1956835 = idf(docFreq=668, maxDocs=44421)
                0.021570046 = queryNorm
              0.64946043 = fieldWeight in 7842, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1956835 = idf(docFreq=668, maxDocs=44421)
                0.125 = fieldNorm(doc=7842)
          0.19465299 = weight(abstract_txt:machine in 7842) [ClassicSimilarity], result of:
            0.19465299 = score(doc=7842,freq=1.0), product of:
              0.2952095 = queryWeight, product of:
                2.594529 = boost
                5.274979 = idf(docFreq=617, maxDocs=44421)
                0.021570046 = queryNorm
              0.6593724 = fieldWeight in 7842, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.274979 = idf(docFreq=617, maxDocs=44421)
                0.125 = fieldNorm(doc=7842)
          0.2573493 = weight(abstract_txt:indexing in 7842) [ClassicSimilarity], result of:
            0.2573493 = score(doc=7842,freq=2.0), product of:
              0.33464 = queryWeight, product of:
                3.566208 = boost
                4.3503094 = idf(docFreq=1557, maxDocs=44421)
                0.021570046 = queryNorm
              0.7690333 = fieldWeight in 7842, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.3503094 = idf(docFreq=1557, maxDocs=44421)
                0.125 = fieldNorm(doc=7842)
        0.2 = coord(5/25)
    
  4. Lancaster, F.W.: Trends in subject indexing from 1957 to 2000 (1980) 0.15
    0.14892945 = sum of:
      0.14892945 = product of:
        0.930809 = sum of:
          0.31100821 = weight(abstract_txt:aided in 208) [ClassicSimilarity], result of:
            0.31100821 = score(doc=208,freq=1.0), product of:
              0.42697218 = queryWeight, product of:
                2.5476954 = boost
                7.769642 = idf(docFreq=50, maxDocs=44421)
                0.021570046 = queryNorm
              0.7284039 = fieldWeight in 208, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.769642 = idf(docFreq=50, maxDocs=44421)
                0.09375 = fieldNorm(doc=208)
          0.13950449 = weight(abstract_txt:automatic in 208) [ClassicSimilarity], result of:
            0.13950449 = score(doc=208,freq=1.0), product of:
              0.2864008 = queryWeight, product of:
                2.555527 = boost
                5.1956835 = idf(docFreq=668, maxDocs=44421)
                0.021570046 = queryNorm
              0.48709533 = fieldWeight in 208, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1956835 = idf(docFreq=668, maxDocs=44421)
                0.09375 = fieldNorm(doc=208)
          0.14598975 = weight(abstract_txt:machine in 208) [ClassicSimilarity], result of:
            0.14598975 = score(doc=208,freq=1.0), product of:
              0.2952095 = queryWeight, product of:
                2.594529 = boost
                5.274979 = idf(docFreq=617, maxDocs=44421)
                0.021570046 = queryNorm
              0.4945293 = fieldWeight in 208, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.274979 = idf(docFreq=617, maxDocs=44421)
                0.09375 = fieldNorm(doc=208)
          0.33430657 = weight(abstract_txt:indexing in 208) [ClassicSimilarity], result of:
            0.33430657 = score(doc=208,freq=6.0), product of:
              0.33464 = queryWeight, product of:
                3.566208 = boost
                4.3503094 = idf(docFreq=1557, maxDocs=44421)
                0.021570046 = queryNorm
              0.9990036 = fieldWeight in 208, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                4.3503094 = idf(docFreq=1557, maxDocs=44421)
                0.09375 = fieldNorm(doc=208)
        0.16 = coord(4/25)
    
  5. Milstead, J.L.: Thesauri in a full-text world (1998) 0.15
    0.14701001 = sum of:
      0.14701001 = product of:
        0.7350501 = sum of:
          0.04751789 = weight(abstract_txt:databases in 3337) [ClassicSimilarity], result of:
            0.04751789 = score(doc=3337,freq=1.0), product of:
              0.13779832 = queryWeight, product of:
                1.4473372 = boost
                4.413907 = idf(docFreq=1461, maxDocs=44421)
                0.021570046 = queryNorm
              0.34483647 = fieldWeight in 3337, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.413907 = idf(docFreq=1461, maxDocs=44421)
                0.078125 = fieldNorm(doc=3337)
          0.056797303 = weight(abstract_txt:analysis in 3337) [ClassicSimilarity], result of:
            0.056797303 = score(doc=3337,freq=2.0), product of:
              0.14100832 = queryWeight, product of:
                1.7931463 = boost
                3.6456752 = idf(docFreq=3151, maxDocs=44421)
                0.021570046 = queryNorm
              0.402794 = fieldWeight in 3337, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.6456752 = idf(docFreq=3151, maxDocs=44421)
                0.078125 = fieldNorm(doc=3337)
          0.2591735 = weight(abstract_txt:aided in 3337) [ClassicSimilarity], result of:
            0.2591735 = score(doc=3337,freq=1.0), product of:
              0.42697218 = queryWeight, product of:
                2.5476954 = boost
                7.769642 = idf(docFreq=50, maxDocs=44421)
                0.021570046 = queryNorm
              0.6070033 = fieldWeight in 3337, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.769642 = idf(docFreq=50, maxDocs=44421)
                0.078125 = fieldNorm(doc=3337)
          0.21071805 = weight(abstract_txt:machine in 3337) [ClassicSimilarity], result of:
            0.21071805 = score(doc=3337,freq=3.0), product of:
              0.2952095 = queryWeight, product of:
                2.594529 = boost
                5.274979 = idf(docFreq=617, maxDocs=44421)
                0.021570046 = queryNorm
              0.71379155 = fieldWeight in 3337, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.274979 = idf(docFreq=617, maxDocs=44421)
                0.078125 = fieldNorm(doc=3337)
          0.16084333 = weight(abstract_txt:indexing in 3337) [ClassicSimilarity], result of:
            0.16084333 = score(doc=3337,freq=2.0), product of:
              0.33464 = queryWeight, product of:
                3.566208 = boost
                4.3503094 = idf(docFreq=1557, maxDocs=44421)
                0.021570046 = queryNorm
              0.48064584 = fieldWeight in 3337, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.3503094 = idf(docFreq=1557, maxDocs=44421)
                0.078125 = fieldNorm(doc=3337)
        0.2 = coord(5/25)
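
The content scores above combine a per-term score (queryWeight x fieldWeight) with a coordination factor, coord = matched terms / query terms. The sketch below recomputes one term ("indexing" in doc 2310) and the final score of the first hit from the listed values. It assumes the same tf and idf forms as ClassicSimilarity in older Lucene versions. The helper names are hypothetical and not part of any library.

```python
import math

MAX_DOCS = 44421          # maxDocs as shown in the listing
QUERY_NORM = 0.021570046  # queryNorm as shown in the listing


def idf(doc_freq, max_docs=MAX_DOCS):
    # Assumed idf form: 1 + ln(maxDocs / (docFreq + 1)).
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(freq, doc_freq, boost, field_norm, query_norm=QUERY_NORM):
    term_idf = idf(doc_freq)
    query_weight = boost * term_idf * query_norm           # queryWeight
    field_weight = math.sqrt(freq) * term_idf * field_norm  # fieldWeight
    return query_weight * field_weight


def coordinated_score(term_scores, total_query_terms):
    # coord rewards documents that match more of the query terms.
    coord = len(term_scores) / total_query_terms
    return coord * sum(term_scores)


# "indexing" in doc 2310: freq=4, docFreq=1557, boost=3.566208, fieldNorm=0.09375
indexing = term_score(freq=4.0, doc_freq=1557, boost=3.566208, field_norm=0.09375)

# Per-term scores for doc 2310, copied from the listing (10 of 25 terms matched).
listed = [0.07522969, 0.07981482, 0.11421291, 0.03270949, 0.037708506,
          0.04819411, 0.05945276, 0.19728915, 0.14598975, 0.27296016]
total = coordinated_score(listed, total_query_terms=25)

print(indexing)  # should be close to the listed 0.27296016
print(total)     # should be close to the listed 0.42542458
```

A term with no boost line in the listing (for example "reviewed") can be treated as boost = 1.0 in this sketch.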