Document (#11878)

Author
Boeri, R.J.
Hensel, M.
Title
Set up a winning text retrieval system : carefully
Source
CD-ROM professional. 8(1995) no.8, S.67-68
Year
1995
Abstract
Considers some of the practical issues involved when a company plans to develop an in house computerized document management system: conversion of paper to electronic form via optical character recognition (OCR) or rekeying; coding of document elements using SGML; indexing for information searching and retrieval (including proximity searching); and hybrid CD-ROM and online information retrieval systems
Theme
Dokumentenmanagement
Aid
SGML

Similar documents (content)

  1. Thiel, T.J.: Automated indexing of document image management systems (1992) 0.39
    0.39040855 = sum of:
      0.39040855 = product of:
        1.0844681 = sum of:
          0.07476662 = weight(abstract_txt:indexing in 3048) [ClassicSimilarity], result of:
            0.07476662 = score(doc=3048,freq=4.0), product of:
              0.09166137 = queryWeight, product of:
                4.3503094 = idf(docFreq=1557, maxDocs=44421)
                0.021070082 = queryNorm
              0.815683 = fieldWeight in 3048, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.3503094 = idf(docFreq=1557, maxDocs=44421)
                0.09375 = fieldNorm(doc=3048)
          0.012852891 = weight(abstract_txt:information in 3048) [ClassicSimilarity], result of:
            0.012852891 = score(doc=3048,freq=1.0), product of:
              0.056677636 = queryWeight, product of:
                1.1120586 = boost
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.021070082 = queryNorm
              0.22677183 = fieldWeight in 3048, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.09375 = fieldNorm(doc=3048)
          0.1467761 = weight(abstract_txt:recognition in 3048) [ClassicSimilarity], result of:
            0.1467761 = score(doc=3048,freq=2.0), product of:
              0.18106231 = queryWeight, product of:
                1.4054676 = boost
                6.114219 = idf(docFreq=266, maxDocs=44421)
                0.021070082 = queryNorm
              0.81063855 = fieldWeight in 3048, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.114219 = idf(docFreq=266, maxDocs=44421)
                0.09375 = fieldNorm(doc=3048)
          0.17795572 = weight(abstract_txt:character in 3048) [ClassicSimilarity], result of:
            0.17795572 = score(doc=3048,freq=2.0), product of:
              0.20587288 = queryWeight, product of:
                1.4986713 = boost
                6.519684 = idf(docFreq=177, maxDocs=44421)
                0.021070082 = queryNorm
              0.8643961 = fieldWeight in 3048, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.519684 = idf(docFreq=177, maxDocs=44421)
                0.09375 = fieldNorm(doc=3048)
          0.034842514 = weight(abstract_txt:system in 3048) [ClassicSimilarity], result of:
            0.034842514 = score(doc=3048,freq=1.0), product of:
              0.11019219 = queryWeight, product of:
                1.5505909 = boost
                3.372775 = idf(docFreq=4140, maxDocs=44421)
                0.021070082 = queryNorm
              0.31619766 = fieldWeight in 3048, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.372775 = idf(docFreq=4140, maxDocs=44421)
                0.09375 = fieldNorm(doc=3048)
          0.19835298 = weight(abstract_txt:coding in 3048) [ClassicSimilarity], result of:
            0.19835298 = score(doc=3048,freq=2.0), product of:
              0.22131814 = queryWeight, product of:
                1.5538723 = boost
                6.759825 = idf(docFreq=139, maxDocs=44421)
                0.021070082 = queryNorm
              0.89623463 = fieldWeight in 3048, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.759825 = idf(docFreq=139, maxDocs=44421)
                0.09375 = fieldNorm(doc=3048)
          0.257137 = weight(abstract_txt:optical in 3048) [ClassicSimilarity], result of:
            0.257137 = score(doc=3048,freq=2.0), product of:
              0.2631283 = queryWeight, product of:
                1.6943011 = boost
                7.370734 = idf(docFreq=75, maxDocs=44421)
                0.021070082 = queryNorm
              0.9772305 = fieldWeight in 3048, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.370734 = idf(docFreq=75, maxDocs=44421)
                0.09375 = fieldNorm(doc=3048)
          0.124548815 = weight(abstract_txt:document in 3048) [ClassicSimilarity], result of:
            0.124548815 = score(doc=3048,freq=3.0), product of:
              0.17862017 = queryWeight, product of:
                1.9741814 = boost
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.021070082 = queryNorm
              0.697283 = fieldWeight in 3048, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.09375 = fieldNorm(doc=3048)
          0.057235487 = weight(abstract_txt:retrieval in 3048) [ClassicSimilarity], result of:
            0.057235487 = score(doc=3048,freq=1.0), product of:
              0.17561105 = queryWeight, product of:
                2.3974159 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.021070082 = queryNorm
              0.3259219 = fieldWeight in 3048, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.09375 = fieldNorm(doc=3048)
        0.36 = coord(9/25)
    
  2. Ramsden, A.: ELINOR electronic library system (1998) 0.22
    0.2195183 = sum of:
      0.2195183 = product of:
        0.9146596 = sum of:
          0.13838184 = weight(abstract_txt:recognition in 2403) [ClassicSimilarity], result of:
            0.13838184 = score(doc=2403,freq=1.0), product of:
              0.18106231 = queryWeight, product of:
                1.4054676 = boost
                6.114219 = idf(docFreq=266, maxDocs=44421)
                0.021070082 = queryNorm
              0.7642774 = fieldWeight in 2403, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.114219 = idf(docFreq=266, maxDocs=44421)
                0.125 = fieldNorm(doc=2403)
          0.16279177 = weight(abstract_txt:computerized in 2403) [ClassicSimilarity], result of:
            0.16279177 = score(doc=2403,freq=1.0), product of:
              0.20177327 = queryWeight, product of:
                1.4836745 = boost
                6.4544435 = idf(docFreq=189, maxDocs=44421)
                0.021070082 = queryNorm
              0.80680543 = fieldWeight in 2403, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.4544435 = idf(docFreq=189, maxDocs=44421)
                0.125 = fieldNorm(doc=2403)
          0.16777825 = weight(abstract_txt:character in 2403) [ClassicSimilarity], result of:
            0.16777825 = score(doc=2403,freq=1.0), product of:
              0.20587288 = queryWeight, product of:
                1.4986713 = boost
                6.519684 = idf(docFreq=177, maxDocs=44421)
                0.021070082 = queryNorm
              0.8149605 = fieldWeight in 2403, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.519684 = idf(docFreq=177, maxDocs=44421)
                0.125 = fieldNorm(doc=2403)
          0.2424311 = weight(abstract_txt:optical in 2403) [ClassicSimilarity], result of:
            0.2424311 = score(doc=2403,freq=1.0), product of:
              0.2631283 = queryWeight, product of:
                1.6943011 = boost
                7.370734 = idf(docFreq=75, maxDocs=44421)
                0.021070082 = queryNorm
              0.9213418 = fieldWeight in 2403, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.370734 = idf(docFreq=75, maxDocs=44421)
                0.125 = fieldNorm(doc=2403)
          0.09535237 = weight(abstract_txt:searching in 2403) [ClassicSimilarity], result of:
            0.09535237 = score(doc=2403,freq=1.0), product of:
              0.17796707 = queryWeight, product of:
                1.970569 = boost
                4.2862926 = idf(docFreq=1660, maxDocs=44421)
                0.021070082 = queryNorm
              0.53578657 = fieldWeight in 2403, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2862926 = idf(docFreq=1660, maxDocs=44421)
                0.125 = fieldNorm(doc=2403)
          0.10792426 = weight(abstract_txt:retrieval in 2403) [ClassicSimilarity], result of:
            0.10792426 = score(doc=2403,freq=2.0), product of:
              0.17561105 = queryWeight, product of:
                2.3974159 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.021070082 = queryNorm
              0.6145642 = fieldWeight in 2403, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.125 = fieldNorm(doc=2403)
        0.24 = coord(6/25)
    
  3. Broadhurst, R.: ¬The digitisation of library material (1993) 0.21
    0.20962395 = sum of:
      0.20962395 = product of:
        0.8734331 = sum of:
          0.08051379 = weight(abstract_txt:considers in 6255) [ClassicSimilarity], result of:
            0.08051379 = score(doc=6255,freq=1.0), product of:
              0.1261892 = queryWeight, product of:
                1.1733239 = boost
                5.104322 = idf(docFreq=732, maxDocs=44421)
                0.021070082 = queryNorm
              0.63804024 = fieldWeight in 6255, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.104322 = idf(docFreq=732, maxDocs=44421)
                0.125 = fieldNorm(doc=6255)
          0.13838184 = weight(abstract_txt:recognition in 6255) [ClassicSimilarity], result of:
            0.13838184 = score(doc=6255,freq=1.0), product of:
              0.18106231 = queryWeight, product of:
                1.4054676 = boost
                6.114219 = idf(docFreq=266, maxDocs=44421)
                0.021070082 = queryNorm
              0.7642774 = fieldWeight in 6255, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.114219 = idf(docFreq=266, maxDocs=44421)
                0.125 = fieldNorm(doc=6255)
          0.14845039 = weight(abstract_txt:conversion in 6255) [ClassicSimilarity], result of:
            0.14845039 = score(doc=6255,freq=1.0), product of:
              0.18974176 = queryWeight, product of:
                1.4387597 = boost
                6.25905 = idf(docFreq=230, maxDocs=44421)
                0.021070082 = queryNorm
              0.78238124 = fieldWeight in 6255, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.25905 = idf(docFreq=230, maxDocs=44421)
                0.125 = fieldNorm(doc=6255)
          0.16777825 = weight(abstract_txt:character in 6255) [ClassicSimilarity], result of:
            0.16777825 = score(doc=6255,freq=1.0), product of:
              0.20587288 = queryWeight, product of:
                1.4986713 = boost
                6.519684 = idf(docFreq=177, maxDocs=44421)
                0.021070082 = queryNorm
              0.8149605 = fieldWeight in 6255, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.519684 = idf(docFreq=177, maxDocs=44421)
                0.125 = fieldNorm(doc=6255)
          0.2424311 = weight(abstract_txt:optical in 6255) [ClassicSimilarity], result of:
            0.2424311 = score(doc=6255,freq=1.0), product of:
              0.2631283 = queryWeight, product of:
                1.6943011 = boost
                7.370734 = idf(docFreq=75, maxDocs=44421)
                0.021070082 = queryNorm
              0.9213418 = fieldWeight in 6255, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.370734 = idf(docFreq=75, maxDocs=44421)
                0.125 = fieldNorm(doc=6255)
          0.09587772 = weight(abstract_txt:document in 6255) [ClassicSimilarity], result of:
            0.09587772 = score(doc=6255,freq=1.0), product of:
              0.17862017 = queryWeight, product of:
                1.9741814 = boost
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.021070082 = queryNorm
              0.53676873 = fieldWeight in 6255, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.125 = fieldNorm(doc=6255)
        0.24 = coord(6/25)
    
  4. Initiatives for access (1994) 0.17
    0.16555734 = sum of:
      0.16555734 = product of:
        0.5912762 = sum of:
          0.010710742 = weight(abstract_txt:information in 3905) [ClassicSimilarity], result of:
            0.010710742 = score(doc=3905,freq=1.0), product of:
              0.056677636 = queryWeight, product of:
                1.1120586 = boost
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.021070082 = queryNorm
              0.18897653 = fieldWeight in 3905, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.078125 = fieldNorm(doc=3905)
          0.053117696 = weight(abstract_txt:involved in 3905) [ClassicSimilarity], result of:
            0.053117696 = score(doc=3905,freq=1.0), product of:
              0.13082221 = queryWeight, product of:
                1.1946689 = boost
                5.1971793 = idf(docFreq=667, maxDocs=44421)
                0.021070082 = queryNorm
              0.40602964 = fieldWeight in 3905, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1971793 = idf(docFreq=667, maxDocs=44421)
                0.078125 = fieldNorm(doc=3905)
          0.08648865 = weight(abstract_txt:recognition in 3905) [ClassicSimilarity], result of:
            0.08648865 = score(doc=3905,freq=1.0), product of:
              0.18106231 = queryWeight, product of:
                1.4054676 = boost
                6.114219 = idf(docFreq=266, maxDocs=44421)
                0.021070082 = queryNorm
              0.47767338 = fieldWeight in 3905, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.114219 = idf(docFreq=266, maxDocs=44421)
                0.078125 = fieldNorm(doc=3905)
          0.0927815 = weight(abstract_txt:conversion in 3905) [ClassicSimilarity], result of:
            0.0927815 = score(doc=3905,freq=1.0), product of:
              0.18974176 = queryWeight, product of:
                1.4387597 = boost
                6.25905 = idf(docFreq=230, maxDocs=44421)
                0.021070082 = queryNorm
              0.48898828 = fieldWeight in 3905, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.25905 = idf(docFreq=230, maxDocs=44421)
                0.078125 = fieldNorm(doc=3905)
          0.104861416 = weight(abstract_txt:character in 3905) [ClassicSimilarity], result of:
            0.104861416 = score(doc=3905,freq=1.0), product of:
              0.20587288 = queryWeight, product of:
                1.4986713 = boost
                6.519684 = idf(docFreq=177, maxDocs=44421)
                0.021070082 = queryNorm
              0.5093503 = fieldWeight in 3905, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.519684 = idf(docFreq=177, maxDocs=44421)
                0.078125 = fieldNorm(doc=3905)
          0.029035429 = weight(abstract_txt:system in 3905) [ClassicSimilarity], result of:
            0.029035429 = score(doc=3905,freq=1.0), product of:
              0.11019219 = queryWeight, product of:
                1.5505909 = boost
                3.372775 = idf(docFreq=4140, maxDocs=44421)
                0.021070082 = queryNorm
              0.26349807 = fieldWeight in 3905, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.372775 = idf(docFreq=4140, maxDocs=44421)
                0.078125 = fieldNorm(doc=3905)
          0.21428083 = weight(abstract_txt:optical in 3905) [ClassicSimilarity], result of:
            0.21428083 = score(doc=3905,freq=2.0), product of:
              0.2631283 = queryWeight, product of:
                1.6943011 = boost
                7.370734 = idf(docFreq=75, maxDocs=44421)
                0.021070082 = queryNorm
              0.8143587 = fieldWeight in 3905, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.370734 = idf(docFreq=75, maxDocs=44421)
                0.078125 = fieldNorm(doc=3905)
        0.28 = coord(7/25)
    
  5. Taylor, S.L.: Integrating natural language understanding with document structure analysis (1994) 0.16
    0.15943517 = sum of:
      0.15943517 = product of:
        0.6643132 = sum of:
          0.10378637 = weight(abstract_txt:recognition in 1862) [ClassicSimilarity], result of:
            0.10378637 = score(doc=1862,freq=1.0), product of:
              0.18106231 = queryWeight, product of:
                1.4054676 = boost
                6.114219 = idf(docFreq=266, maxDocs=44421)
                0.021070082 = queryNorm
              0.57320803 = fieldWeight in 1862, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.114219 = idf(docFreq=266, maxDocs=44421)
                0.09375 = fieldNorm(doc=1862)
          0.12583369 = weight(abstract_txt:character in 1862) [ClassicSimilarity], result of:
            0.12583369 = score(doc=1862,freq=1.0), product of:
              0.20587288 = queryWeight, product of:
                1.4986713 = boost
                6.519684 = idf(docFreq=177, maxDocs=44421)
                0.021070082 = queryNorm
              0.61122036 = fieldWeight in 1862, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.519684 = idf(docFreq=177, maxDocs=44421)
                0.09375 = fieldNorm(doc=1862)
          0.034842514 = weight(abstract_txt:system in 1862) [ClassicSimilarity], result of:
            0.034842514 = score(doc=1862,freq=1.0), product of:
              0.11019219 = queryWeight, product of:
                1.5505909 = boost
                3.372775 = idf(docFreq=4140, maxDocs=44421)
                0.021070082 = queryNorm
              0.31619766 = fieldWeight in 1862, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.372775 = idf(docFreq=4140, maxDocs=44421)
                0.09375 = fieldNorm(doc=1862)
          0.18182333 = weight(abstract_txt:optical in 1862) [ClassicSimilarity], result of:
            0.18182333 = score(doc=1862,freq=1.0), product of:
              0.2631283 = queryWeight, product of:
                1.6943011 = boost
                7.370734 = idf(docFreq=75, maxDocs=44421)
                0.021070082 = queryNorm
              0.6910063 = fieldWeight in 1862, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.370734 = idf(docFreq=75, maxDocs=44421)
                0.09375 = fieldNorm(doc=1862)
          0.16079183 = weight(abstract_txt:document in 1862) [ClassicSimilarity], result of:
            0.16079183 = score(doc=1862,freq=5.0), product of:
              0.17862017 = queryWeight, product of:
                1.9741814 = boost
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.021070082 = queryNorm
              0.9001885 = fieldWeight in 1862, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.09375 = fieldNorm(doc=1862)
          0.057235487 = weight(abstract_txt:retrieval in 1862) [ClassicSimilarity], result of:
            0.057235487 = score(doc=1862,freq=1.0), product of:
              0.17561105 = queryWeight, product of:
                2.3974159 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.021070082 = queryNorm
              0.3259219 = fieldWeight in 1862, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.09375 = fieldNorm(doc=1862)
        0.24 = coord(6/25)