Document (#7843)

Author
Greenrich, E.
Title
CD-ROM data preparation enhancements
Source
Proceedings of the 14th National Online Meeting 1993, New York, 4-6 May 1993. Ed.: M.E. Williams
Imprint
Medford, NJ : Learned Information
Year
1993
Pages
S.159-163
Abstract
Describes a number of improvements to the process of data preparation for the production of CD-ROM databases: imaging, optical character recognition (OCR) for data vcapture and input; automatic indexing (machine aided indexing); field tagging; and search performance enhancing features (data compression and encoding)

Similar documents (content)

  1. Broadhurst, R.: ¬The digitisation of library material (1993) 0.21
    0.20581643 = sum of:
      0.20581643 = product of:
        0.8575685 = sum of:
          0.04584905 = weight(abstract_txt:describes in 6255) [ClassicSimilarity], result of:
            0.04584905 = score(doc=6255,freq=2.0), product of:
              0.067756064 = queryWeight, product of:
                3.82787 = idf(docFreq=2626, maxDocs=44421)
                0.017700722 = queryNorm
              0.6766782 = fieldWeight in 6255, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.82787 = idf(docFreq=2626, maxDocs=44421)
                0.125 = fieldNorm(doc=6255)
          0.038367376 = weight(abstract_txt:process in 6255) [ClassicSimilarity], result of:
            0.038367376 = score(doc=6255,freq=1.0), product of:
              0.07580759 = queryWeight, product of:
                1.0577481 = boost
                4.048922 = idf(docFreq=2105, maxDocs=44421)
                0.017700722 = queryNorm
              0.50611526 = fieldWeight in 6255, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.048922 = idf(docFreq=2105, maxDocs=44421)
                0.125 = fieldNorm(doc=6255)
          0.13211948 = weight(abstract_txt:recognition in 6255) [ClassicSimilarity], result of:
            0.13211948 = score(doc=6255,freq=1.0), product of:
              0.17286849 = queryWeight, product of:
                1.5972903 = boost
                6.114219 = idf(docFreq=266, maxDocs=44421)
                0.017700722 = queryNorm
              0.7642774 = fieldWeight in 6255, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.114219 = idf(docFreq=266, maxDocs=44421)
                0.125 = fieldNorm(doc=6255)
          0.16018559 = weight(abstract_txt:character in 6255) [ClassicSimilarity], result of:
            0.16018559 = score(doc=6255,freq=1.0), product of:
              0.19655627 = queryWeight, product of:
                1.7032146 = boost
                6.519684 = idf(docFreq=177, maxDocs=44421)
                0.017700722 = queryNorm
              0.8149605 = fieldWeight in 6255, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.519684 = idf(docFreq=177, maxDocs=44421)
                0.125 = fieldNorm(doc=6255)
          0.23146008 = weight(abstract_txt:optical in 6255) [ClassicSimilarity], result of:
            0.23146008 = score(doc=6255,freq=1.0), product of:
              0.25122064 = queryWeight, product of:
                1.9255446 = boost
                7.370734 = idf(docFreq=75, maxDocs=44421)
                0.017700722 = queryNorm
              0.9213418 = fieldWeight in 6255, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.370734 = idf(docFreq=75, maxDocs=44421)
                0.125 = fieldNorm(doc=6255)
          0.24958694 = weight(abstract_txt:imaging in 6255) [ClassicSimilarity], result of:
            0.24958694 = score(doc=6255,freq=1.0), product of:
              0.26417142 = queryWeight, product of:
                1.9745532 = boost
                7.558333 = idf(docFreq=62, maxDocs=44421)
                0.017700722 = queryNorm
              0.9447916 = fieldWeight in 6255, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.558333 = idf(docFreq=62, maxDocs=44421)
                0.125 = fieldNorm(doc=6255)
        0.24 = coord(6/25)
    
  2. Guenette, D.R.: Document imaging, CD-ROM, and CD-R : a starting point (1996) 0.18
    0.18002148 = sum of:
      0.18002148 = product of:
        0.7500895 = sum of:
          0.024315132 = weight(abstract_txt:describes in 5054) [ClassicSimilarity], result of:
            0.024315132 = score(doc=5054,freq=1.0), product of:
              0.067756064 = queryWeight, product of:
                3.82787 = idf(docFreq=2626, maxDocs=44421)
                0.017700722 = queryNorm
              0.35886282 = fieldWeight in 5054, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.82787 = idf(docFreq=2626, maxDocs=44421)
                0.09375 = fieldNorm(doc=5054)
          0.028775534 = weight(abstract_txt:process in 5054) [ClassicSimilarity], result of:
            0.028775534 = score(doc=5054,freq=1.0), product of:
              0.07580759 = queryWeight, product of:
                1.0577481 = boost
                4.048922 = idf(docFreq=2105, maxDocs=44421)
                0.017700722 = queryNorm
              0.37958646 = fieldWeight in 5054, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.048922 = idf(docFreq=2105, maxDocs=44421)
                0.09375 = fieldNorm(doc=5054)
          0.3743804 = weight(abstract_txt:imaging in 5054) [ClassicSimilarity], result of:
            0.3743804 = score(doc=5054,freq=4.0), product of:
              0.26417142 = queryWeight, product of:
                1.9745532 = boost
                7.558333 = idf(docFreq=62, maxDocs=44421)
                0.017700722 = queryNorm
              1.4171875 = fieldWeight in 5054, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.558333 = idf(docFreq=62, maxDocs=44421)
                0.09375 = fieldNorm(doc=5054)
          0.1871902 = weight(abstract_txt:compression in 5054) [ClassicSimilarity], result of:
            0.1871902 = score(doc=5054,freq=1.0), product of:
              0.26417142 = queryWeight, product of:
                1.9745532 = boost
                7.558333 = idf(docFreq=62, maxDocs=44421)
                0.017700722 = queryNorm
              0.7085937 = fieldWeight in 5054, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.558333 = idf(docFreq=62, maxDocs=44421)
                0.09375 = fieldNorm(doc=5054)
          0.07138312 = weight(abstract_txt:indexing in 5054) [ClassicSimilarity], result of:
            0.07138312 = score(doc=5054,freq=1.0), product of:
              0.17502663 = queryWeight, product of:
                2.2729661 = boost
                4.3503094 = idf(docFreq=1557, maxDocs=44421)
                0.017700722 = queryNorm
              0.4078415 = fieldWeight in 5054, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3503094 = idf(docFreq=1557, maxDocs=44421)
                0.09375 = fieldNorm(doc=5054)
          0.06404512 = weight(abstract_txt:data in 5054) [ClassicSimilarity], result of:
            0.06404512 = score(doc=5054,freq=1.0), product of:
              0.20513563 = queryWeight, product of:
                3.4799776 = boost
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.017700722 = queryNorm
              0.31220865 = fieldWeight in 5054, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.09375 = fieldNorm(doc=5054)
        0.24 = coord(6/25)
    
  3. Silvester, J.P.; Genuardi, M.T.; Klingbiel, P.H.: Machine-aided indexing at NASA (1994) 0.17
    0.17389986 = sum of:
      0.17389986 = product of:
        0.7245828 = sum of:
          0.024315132 = weight(abstract_txt:describes in 117) [ClassicSimilarity], result of:
            0.024315132 = score(doc=117,freq=1.0), product of:
              0.067756064 = queryWeight, product of:
                3.82787 = idf(docFreq=2626, maxDocs=44421)
                0.017700722 = queryNorm
              0.35886282 = fieldWeight in 117, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.82787 = idf(docFreq=2626, maxDocs=44421)
                0.09375 = fieldNorm(doc=117)
          0.08998753 = weight(abstract_txt:machine in 117) [ClassicSimilarity], result of:
            0.08998753 = score(doc=117,freq=2.0), product of:
              0.12866941 = queryWeight, product of:
                1.3780456 = boost
                5.274979 = idf(docFreq=617, maxDocs=44421)
                0.017700722 = queryNorm
              0.69937 = fieldWeight in 117, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.274979 = idf(docFreq=617, maxDocs=44421)
                0.09375 = fieldNorm(doc=117)
          0.081895754 = weight(abstract_txt:production in 117) [ClassicSimilarity], result of:
            0.081895754 = score(doc=117,freq=1.0), product of:
              0.15224324 = queryWeight, product of:
                1.4989768 = boost
                5.7378883 = idf(docFreq=388, maxDocs=44421)
                0.017700722 = queryNorm
              0.53792703 = fieldWeight in 117, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7378883 = idf(docFreq=388, maxDocs=44421)
                0.09375 = fieldNorm(doc=117)
          0.13987698 = weight(abstract_txt:input in 117) [ClassicSimilarity], result of:
            0.13987698 = score(doc=117,freq=2.0), product of:
              0.17265716 = queryWeight, product of:
                1.5963136 = boost
                6.110481 = idf(docFreq=267, maxDocs=44421)
                0.017700722 = queryNorm
              0.8101429 = fieldWeight in 117, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.110481 = idf(docFreq=267, maxDocs=44421)
                0.09375 = fieldNorm(doc=117)
          0.28755644 = weight(abstract_txt:aided in 117) [ClassicSimilarity], result of:
            0.28755644 = score(doc=117,freq=2.0), product of:
              0.27914885 = queryWeight, product of:
                2.029756 = boost
                7.769642 = idf(docFreq=50, maxDocs=44421)
                0.017700722 = queryNorm
              1.0301187 = fieldWeight in 117, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.769642 = idf(docFreq=50, maxDocs=44421)
                0.09375 = fieldNorm(doc=117)
          0.10095097 = weight(abstract_txt:indexing in 117) [ClassicSimilarity], result of:
            0.10095097 = score(doc=117,freq=2.0), product of:
              0.17502663 = queryWeight, product of:
                2.2729661 = boost
                4.3503094 = idf(docFreq=1557, maxDocs=44421)
                0.017700722 = queryNorm
              0.57677495 = fieldWeight in 117, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.3503094 = idf(docFreq=1557, maxDocs=44421)
                0.09375 = fieldNorm(doc=117)
        0.24 = coord(6/25)
    
  4. Lancaster, F.W.: Trends in subject indexing from 1957 to 2000 (1980) 0.16
    0.16402176 = sum of:
      0.16402176 = product of:
        0.683424 = sum of:
          0.06080414 = weight(abstract_txt:automatic in 208) [ClassicSimilarity], result of:
            0.06080414 = score(doc=208,freq=1.0), product of:
              0.12483006 = queryWeight, product of:
                1.3573302 = boost
                5.1956835 = idf(docFreq=668, maxDocs=44421)
                0.017700722 = queryNorm
              0.48709533 = fieldWeight in 208, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1956835 = idf(docFreq=668, maxDocs=44421)
                0.09375 = fieldNorm(doc=208)
          0.0636308 = weight(abstract_txt:machine in 208) [ClassicSimilarity], result of:
            0.0636308 = score(doc=208,freq=1.0), product of:
              0.12866941 = queryWeight, product of:
                1.3780456 = boost
                5.274979 = idf(docFreq=617, maxDocs=44421)
                0.017700722 = queryNorm
              0.4945293 = fieldWeight in 208, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.274979 = idf(docFreq=617, maxDocs=44421)
                0.09375 = fieldNorm(doc=208)
          0.081895754 = weight(abstract_txt:production in 208) [ClassicSimilarity], result of:
            0.081895754 = score(doc=208,freq=1.0), product of:
              0.15224324 = queryWeight, product of:
                1.4989768 = boost
                5.7378883 = idf(docFreq=388, maxDocs=44421)
                0.017700722 = queryNorm
              0.53792703 = fieldWeight in 208, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7378883 = idf(docFreq=388, maxDocs=44421)
                0.09375 = fieldNorm(doc=208)
          0.09890796 = weight(abstract_txt:input in 208) [ClassicSimilarity], result of:
            0.09890796 = score(doc=208,freq=1.0), product of:
              0.17265716 = queryWeight, product of:
                1.5963136 = boost
                6.110481 = idf(docFreq=267, maxDocs=44421)
                0.017700722 = queryNorm
              0.57285756 = fieldWeight in 208, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.110481 = idf(docFreq=267, maxDocs=44421)
                0.09375 = fieldNorm(doc=208)
          0.20333311 = weight(abstract_txt:aided in 208) [ClassicSimilarity], result of:
            0.20333311 = score(doc=208,freq=1.0), product of:
              0.27914885 = queryWeight, product of:
                2.029756 = boost
                7.769642 = idf(docFreq=50, maxDocs=44421)
                0.017700722 = queryNorm
              0.7284039 = fieldWeight in 208, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.769642 = idf(docFreq=50, maxDocs=44421)
                0.09375 = fieldNorm(doc=208)
          0.17485222 = weight(abstract_txt:indexing in 208) [ClassicSimilarity], result of:
            0.17485222 = score(doc=208,freq=6.0), product of:
              0.17502663 = queryWeight, product of:
                2.2729661 = boost
                4.3503094 = idf(docFreq=1557, maxDocs=44421)
                0.017700722 = queryNorm
              0.9990036 = fieldWeight in 208, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                4.3503094 = idf(docFreq=1557, maxDocs=44421)
                0.09375 = fieldNorm(doc=208)
        0.24 = coord(6/25)
    
  5. Milstead, J.L.: Methodologies for subject analysis in bibliographic databases (1992) 0.14
    0.13999629 = sum of:
      0.13999629 = product of:
        0.69998145 = sum of:
          0.037279893 = weight(abstract_txt:databases in 3091) [ClassicSimilarity], result of:
            0.037279893 = score(doc=3091,freq=1.0), product of:
              0.09009075 = queryWeight, product of:
                1.1530975 = boost
                4.413907 = idf(docFreq=1461, maxDocs=44421)
                0.017700722 = queryNorm
              0.4138038 = fieldWeight in 3091, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.413907 = idf(docFreq=1461, maxDocs=44421)
                0.09375 = fieldNorm(doc=3091)
          0.10531586 = weight(abstract_txt:automatic in 3091) [ClassicSimilarity], result of:
            0.10531586 = score(doc=3091,freq=3.0), product of:
              0.12483006 = queryWeight, product of:
                1.3573302 = boost
                5.1956835 = idf(docFreq=668, maxDocs=44421)
                0.017700722 = queryNorm
              0.8436738 = fieldWeight in 3091, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.1956835 = idf(docFreq=668, maxDocs=44421)
                0.09375 = fieldNorm(doc=3091)
          0.11021177 = weight(abstract_txt:machine in 3091) [ClassicSimilarity], result of:
            0.11021177 = score(doc=3091,freq=3.0), product of:
              0.12866941 = queryWeight, product of:
                1.3780456 = boost
                5.274979 = idf(docFreq=617, maxDocs=44421)
                0.017700722 = queryNorm
              0.85654986 = fieldWeight in 3091, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.274979 = idf(docFreq=617, maxDocs=44421)
                0.09375 = fieldNorm(doc=3091)
          0.28755644 = weight(abstract_txt:aided in 3091) [ClassicSimilarity], result of:
            0.28755644 = score(doc=3091,freq=2.0), product of:
              0.27914885 = queryWeight, product of:
                2.029756 = boost
                7.769642 = idf(docFreq=50, maxDocs=44421)
                0.017700722 = queryNorm
              1.0301187 = fieldWeight in 3091, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.769642 = idf(docFreq=50, maxDocs=44421)
                0.09375 = fieldNorm(doc=3091)
          0.15961751 = weight(abstract_txt:indexing in 3091) [ClassicSimilarity], result of:
            0.15961751 = score(doc=3091,freq=5.0), product of:
              0.17502663 = queryWeight, product of:
                2.2729661 = boost
                4.3503094 = idf(docFreq=1557, maxDocs=44421)
                0.017700722 = queryNorm
              0.9119613 = fieldWeight in 3091, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.3503094 = idf(docFreq=1557, maxDocs=44421)
                0.09375 = fieldNorm(doc=3091)
        0.2 = coord(5/25)