Document (#16031)

Author
Alexander, M.
Title
Retrieving digital data with fuzzy matching
Source
New library world. 97(1996) no.1131, pp.28-31
Year
1996
Abstract
Briefly describes the Excalibur EFS system, which uses adaptive pattern recognition technology as an aid to automatic indexing, and how it is being tested at the British Library for the indexing and retrieval of scanned images from the library's holdings. Notes that Excalibur EFS can support a wide degree of fuzzy searching, compensate for errors introduced by OCR conversion of scanned images, reduce indexing costs, and require far less storage space than more traditional indexes.
Theme
Automatic indexing
Object
Excalibur EFS

Similar documents (author)

  1. Alexander, M.: Automatic indexing of document images using Excalibur EFS (1995) 5.58
    5.5805492 = sum of:
      5.5805492 = weight(author_txt:alexander in 1979) [ClassicSimilarity], result of:
        5.5805492 = fieldWeight in 1979, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.928879 = idf(docFreq=15, maxDocs=44421)
          0.625 = fieldNorm(doc=1979)
    
  2. Alexander, M.: Retrieving digital data with fuzzy matching (1997) 5.58
    5.5805492 = sum of:
      5.5805492 = weight(author_txt:alexander in 151) [ClassicSimilarity], result of:
        5.5805492 = fieldWeight in 151, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.928879 = idf(docFreq=15, maxDocs=44421)
          0.625 = fieldNorm(doc=151)
    
  3. Alexander, J.: Customs and excise process 2.5 million documents (1997) 5.58
    5.5805492 = sum of:
      5.5805492 = weight(author_txt:alexander in 3427) [ClassicSimilarity], result of:
        5.5805492 = fieldWeight in 3427, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.928879 = idf(docFreq=15, maxDocs=44421)
          0.625 = fieldNorm(doc=3427)
    
  4. Alexander, M.: Digitising books, manuscripts and scholarly materials : preparation, handling, scanning, recognition, compression, storage formats (1998) 5.58
    5.5805492 = sum of:
      5.5805492 = weight(author_txt:alexander in 4686) [ClassicSimilarity], result of:
        5.5805492 = fieldWeight in 4686, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.928879 = idf(docFreq=15, maxDocs=44421)
          0.625 = fieldNorm(doc=4686)
    
  5. Alexander, K.: Kompendium der visuellen Information und Kommunikation (2007) 5.58
    5.5805492 = sum of:
      5.5805492 = weight(author_txt:alexander in 1647) [ClassicSimilarity], result of:
        5.5805492 = fieldWeight in 1647, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.928879 = idf(docFreq=15, maxDocs=44421)
          0.625 = fieldNorm(doc=1647)
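
The author scores listed above can be checked by hand. The following is a minimal sketch, assuming the classic Lucene TF-IDF definitions tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)), which agree with the numbers in the listing. The function names are illustrative and not part of any library, and fieldNorm is simply taken from the listing as given.

```python
import math


def classic_idf(doc_freq: int, max_docs: int) -> float:
    """Inverse document frequency in the form that matches the listing (assumed)."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def classic_tf(freq: float) -> float:
    """Term frequency factor: square root of the raw term count."""
    return math.sqrt(freq)


def field_weight(freq: float, doc_freq: int, max_docs: int, field_norm: float) -> float:
    """fieldWeight = tf * idf * fieldNorm, as broken down in each explanation."""
    return classic_tf(freq) * classic_idf(doc_freq, max_docs) * field_norm


# Values copied from the author_txt:alexander explanations above.
author_score = field_weight(freq=1.0, doc_freq=15, max_docs=44421, field_norm=0.625)
print(f"{author_score:.4f}")
```

Because all five records contain the term once in an author field with the same fieldNorm, they receive the same score of about 5.58, so the author ranking alone cannot separate them.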
    

Similar documents (content)

  1. Alexander, M.: Automatic indexing of document images using Excalibur EFS (1995) 0.76
    0.76484233 = sum of:
      0.76484233 = product of:
        1.9121058 = sum of:
          0.096900694 = weight(abstract_txt:indexes in 1979) [ClassicSimilarity], result of:
            0.096900694 = score(doc=1979,freq=2.0), product of:
              0.10922854 = queryWeight, product of:
                1.0135162 = boost
                5.735321 = idf(docFreq=389, maxDocs=44421)
                0.018790904 = queryNorm
              0.8871372 = fieldWeight in 1979, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.735321 = idf(docFreq=389, maxDocs=44421)
                0.109375 = fieldNorm(doc=1979)
          0.07609744 = weight(abstract_txt:british in 1979) [ClassicSimilarity], result of:
            0.07609744 = score(doc=1979,freq=1.0), product of:
              0.11714081 = queryWeight, product of:
                1.049583 = boost
                5.9394164 = idf(docFreq=317, maxDocs=44421)
                0.018790904 = queryNorm
              0.6496237 = fieldWeight in 1979, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9394164 = idf(docFreq=317, maxDocs=44421)
                0.109375 = fieldNorm(doc=1979)
          0.11740233 = weight(abstract_txt:recognition in 1979) [ClassicSimilarity], result of:
            0.11740233 = score(doc=1979,freq=2.0), product of:
              0.12413741 = queryWeight, product of:
                1.0804732 = boost
                6.114219 = idf(docFreq=266, maxDocs=44421)
                0.018790904 = queryNorm
              0.945745 = fieldWeight in 1979, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.114219 = idf(docFreq=266, maxDocs=44421)
                0.109375 = fieldNorm(doc=1979)
          0.124656156 = weight(abstract_txt:pattern in 1979) [ClassicSimilarity], result of:
            0.124656156 = score(doc=1979,freq=2.0), product of:
              0.12919945 = queryWeight, product of:
                1.1022826 = boost
                6.2376356 = idf(docFreq=235, maxDocs=44421)
                0.018790904 = queryNorm
              0.96483505 = fieldWeight in 1979, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.2376356 = idf(docFreq=235, maxDocs=44421)
                0.109375 = fieldNorm(doc=1979)
          0.17100178 = weight(abstract_txt:adaptive in 1979) [ClassicSimilarity], result of:
            0.17100178 = score(doc=1979,freq=2.0), product of:
              0.15950902 = queryWeight, product of:
                1.2247721 = boost
                6.930783 = idf(docFreq=117, maxDocs=44421)
                0.018790904 = queryNorm
              1.0720508 = fieldWeight in 1979, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.930783 = idf(docFreq=117, maxDocs=44421)
                0.109375 = fieldNorm(doc=1979)
          0.1162064 = weight(abstract_txt:images in 1979) [ClassicSimilarity], result of:
            0.1162064 = score(doc=1979,freq=1.0), product of:
              0.19571534 = queryWeight, product of:
                1.9186249 = boost
                5.428591 = idf(docFreq=529, maxDocs=44421)
                0.018790904 = queryNorm
              0.59375215 = fieldWeight in 1979, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.428591 = idf(docFreq=529, maxDocs=44421)
                0.109375 = fieldNorm(doc=1979)
          0.08970578 = weight(abstract_txt:indexing in 1979) [ClassicSimilarity], result of:
            0.08970578 = score(doc=1979,freq=1.0), product of:
              0.18853076 = queryWeight, product of:
                2.3062925 = boost
                4.3503094 = idf(docFreq=1557, maxDocs=44421)
                0.018790904 = queryNorm
              0.4758151 = fieldWeight in 1979, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3503094 = idf(docFreq=1557, maxDocs=44421)
                0.109375 = fieldNorm(doc=1979)
          0.23262282 = weight(abstract_txt:fuzzy in 1979) [ClassicSimilarity], result of:
            0.23262282 = score(doc=1979,freq=1.0), product of:
              0.31086588 = queryWeight, product of:
                2.4180439 = boost
                6.8416553 = idf(docFreq=128, maxDocs=44421)
                0.018790904 = queryNorm
              0.74830604 = fieldWeight in 1979, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.8416553 = idf(docFreq=128, maxDocs=44421)
                0.109375 = fieldNorm(doc=1979)
          0.3808914 = weight(abstract_txt:scanned in 1979) [ClassicSimilarity], result of:
            0.3808914 = score(doc=1979,freq=1.0), product of:
              0.431856 = queryWeight, product of:
                2.850015 = boost
                8.063882 = idf(docFreq=37, maxDocs=44421)
                0.018790904 = queryNorm
              0.8819871 = fieldWeight in 1979, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.063882 = idf(docFreq=37, maxDocs=44421)
                0.109375 = fieldNorm(doc=1979)
          0.506621 = weight(abstract_txt:excalibur in 1979) [ClassicSimilarity], result of:
            0.506621 = score(doc=1979,freq=1.0), product of:
              0.52230835 = queryWeight, product of:
                3.134304 = boost
                8.868255 = idf(docFreq=16, maxDocs=44421)
                0.018790904 = queryNorm
              0.96996534 = fieldWeight in 1979, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.868255 = idf(docFreq=16, maxDocs=44421)
                0.109375 = fieldNorm(doc=1979)
        0.4 = coord(10/25)
    
  2. Alexander, M.: Digitising books, manuscripts and scholarly materials : preparation, handling, scanning, recognition, compression, storage formats (1998) 0.23
    0.23255782 = sum of:
      0.23255782 = product of:
        0.9689909 = sum of:
          0.07236512 = weight(abstract_txt:storage in 4686) [ClassicSimilarity], result of:
            0.07236512 = score(doc=4686,freq=1.0), product of:
              0.11327856 = queryWeight, product of:
                1.032135 = boost
                5.8406816 = idf(docFreq=350, maxDocs=44421)
                0.018790904 = queryNorm
              0.6388245 = fieldWeight in 4686, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8406816 = idf(docFreq=350, maxDocs=44421)
                0.109375 = fieldNorm(doc=4686)
          0.07609744 = weight(abstract_txt:british in 4686) [ClassicSimilarity], result of:
            0.07609744 = score(doc=4686,freq=1.0), product of:
              0.11714081 = queryWeight, product of:
                1.049583 = boost
                5.9394164 = idf(docFreq=317, maxDocs=44421)
                0.018790904 = queryNorm
              0.6496237 = fieldWeight in 4686, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9394164 = idf(docFreq=317, maxDocs=44421)
                0.109375 = fieldNorm(doc=4686)
          0.11468499 = weight(abstract_txt:library's in 4686) [ClassicSimilarity], result of:
            0.11468499 = score(doc=4686,freq=2.0), product of:
              0.12221445 = queryWeight, product of:
                1.0720719 = boost
                6.066678 = idf(docFreq=279, maxDocs=44421)
                0.018790904 = queryNorm
              0.9383914 = fieldWeight in 4686, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.066678 = idf(docFreq=279, maxDocs=44421)
                0.109375 = fieldNorm(doc=4686)
          0.083015986 = weight(abstract_txt:recognition in 4686) [ClassicSimilarity], result of:
            0.083015986 = score(doc=4686,freq=1.0), product of:
              0.12413741 = queryWeight, product of:
                1.0804732 = boost
                6.114219 = idf(docFreq=266, maxDocs=44421)
                0.018790904 = queryNorm
              0.6687427 = fieldWeight in 4686, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.114219 = idf(docFreq=266, maxDocs=44421)
                0.109375 = fieldNorm(doc=4686)
          0.1162064 = weight(abstract_txt:images in 4686) [ClassicSimilarity], result of:
            0.1162064 = score(doc=4686,freq=1.0), product of:
              0.19571534 = queryWeight, product of:
                1.9186249 = boost
                5.428591 = idf(docFreq=529, maxDocs=44421)
                0.018790904 = queryNorm
              0.59375215 = fieldWeight in 4686, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.428591 = idf(docFreq=529, maxDocs=44421)
                0.109375 = fieldNorm(doc=4686)
          0.506621 = weight(abstract_txt:excalibur in 4686) [ClassicSimilarity], result of:
            0.506621 = score(doc=4686,freq=1.0), product of:
              0.52230835 = queryWeight, product of:
                3.134304 = boost
                8.868255 = idf(docFreq=16, maxDocs=44421)
                0.018790904 = queryNorm
              0.96996534 = fieldWeight in 4686, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.868255 = idf(docFreq=16, maxDocs=44421)
                0.109375 = fieldNorm(doc=4686)
        0.24 = coord(6/25)
    
  3. Townsend, J.: Multimedia - myth or reality? (1994) 0.21
    0.20511144 = sum of:
      0.20511144 = product of:
        0.85463107 = sum of:
          0.061846398 = weight(abstract_txt:briefly in 728) [ClassicSimilarity], result of:
            0.061846398 = score(doc=728,freq=1.0), product of:
              0.11305826 = queryWeight, product of:
                1.0311309 = boost
                5.8349996 = idf(docFreq=352, maxDocs=44421)
                0.018790904 = queryNorm
              0.5470312 = fieldWeight in 728, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8349996 = idf(docFreq=352, maxDocs=44421)
                0.09375 = fieldNorm(doc=728)
          0.07115656 = weight(abstract_txt:recognition in 728) [ClassicSimilarity], result of:
            0.07115656 = score(doc=728,freq=1.0), product of:
              0.12413741 = queryWeight, product of:
                1.0804732 = boost
                6.114219 = idf(docFreq=266, maxDocs=44421)
                0.018790904 = queryNorm
              0.57320803 = fieldWeight in 728, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.114219 = idf(docFreq=266, maxDocs=44421)
                0.09375 = fieldNorm(doc=728)
          0.10684813 = weight(abstract_txt:pattern in 728) [ClassicSimilarity], result of:
            0.10684813 = score(doc=728,freq=2.0), product of:
              0.12919945 = queryWeight, product of:
                1.1022826 = boost
                6.2376356 = idf(docFreq=235, maxDocs=44421)
                0.018790904 = queryNorm
              0.82700145 = fieldWeight in 728, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.2376356 = idf(docFreq=235, maxDocs=44421)
                0.09375 = fieldNorm(doc=728)
          0.103642724 = weight(abstract_txt:adaptive in 728) [ClassicSimilarity], result of:
            0.103642724 = score(doc=728,freq=1.0), product of:
              0.15950902 = queryWeight, product of:
                1.2247721 = boost
                6.930783 = idf(docFreq=117, maxDocs=44421)
                0.018790904 = queryNorm
              0.6497609 = fieldWeight in 728, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.930783 = idf(docFreq=117, maxDocs=44421)
                0.09375 = fieldNorm(doc=728)
          0.07689067 = weight(abstract_txt:indexing in 728) [ClassicSimilarity], result of:
            0.07689067 = score(doc=728,freq=1.0), product of:
              0.18853076 = queryWeight, product of:
                2.3062925 = boost
                4.3503094 = idf(docFreq=1557, maxDocs=44421)
                0.018790904 = queryNorm
              0.4078415 = fieldWeight in 728, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3503094 = idf(docFreq=1557, maxDocs=44421)
                0.09375 = fieldNorm(doc=728)
          0.43424657 = weight(abstract_txt:excalibur in 728) [ClassicSimilarity], result of:
            0.43424657 = score(doc=728,freq=1.0), product of:
              0.52230835 = queryWeight, product of:
                3.134304 = boost
                8.868255 = idf(docFreq=16, maxDocs=44421)
                0.018790904 = queryNorm
              0.83139884 = fieldWeight in 728, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.868255 = idf(docFreq=16, maxDocs=44421)
                0.09375 = fieldNorm(doc=728)
        0.24 = coord(6/25)
    
  4. Picture content retrieval (1996) 0.20
    0.19905208 = sum of:
      0.19905208 = product of:
        0.99526036 = sum of:
          0.082461864 = weight(abstract_txt:briefly in 44) [ClassicSimilarity], result of:
            0.082461864 = score(doc=44,freq=1.0), product of:
              0.11305826 = queryWeight, product of:
                1.0311309 = boost
                5.8349996 = idf(docFreq=352, maxDocs=44421)
                0.018790904 = queryNorm
              0.72937495 = fieldWeight in 44, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8349996 = idf(docFreq=352, maxDocs=44421)
                0.125 = fieldNorm(doc=44)
          0.09487542 = weight(abstract_txt:recognition in 44) [ClassicSimilarity], result of:
            0.09487542 = score(doc=44,freq=1.0), product of:
              0.12413741 = queryWeight, product of:
                1.0804732 = boost
                6.114219 = idf(docFreq=266, maxDocs=44421)
                0.018790904 = queryNorm
              0.7642774 = fieldWeight in 44, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.114219 = idf(docFreq=266, maxDocs=44421)
                0.125 = fieldNorm(doc=44)
          0.100737385 = weight(abstract_txt:pattern in 44) [ClassicSimilarity], result of:
            0.100737385 = score(doc=44,freq=1.0), product of:
              0.12919945 = queryWeight, product of:
                1.1022826 = boost
                6.2376356 = idf(docFreq=235, maxDocs=44421)
                0.018790904 = queryNorm
              0.77970445 = fieldWeight in 44, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2376356 = idf(docFreq=235, maxDocs=44421)
                0.125 = fieldNorm(doc=44)
          0.1381903 = weight(abstract_txt:adaptive in 44) [ClassicSimilarity], result of:
            0.1381903 = score(doc=44,freq=1.0), product of:
              0.15950902 = queryWeight, product of:
                1.2247721 = boost
                6.930783 = idf(docFreq=117, maxDocs=44421)
                0.018790904 = queryNorm
              0.86634785 = fieldWeight in 44, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.930783 = idf(docFreq=117, maxDocs=44421)
                0.125 = fieldNorm(doc=44)
          0.5789954 = weight(abstract_txt:excalibur in 44) [ClassicSimilarity], result of:
            0.5789954 = score(doc=44,freq=1.0), product of:
              0.52230835 = queryWeight, product of:
                3.134304 = boost
                8.868255 = idf(docFreq=16, maxDocs=44421)
                0.018790904 = queryNorm
              1.1085318 = fieldWeight in 44, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.868255 = idf(docFreq=16, maxDocs=44421)
                0.125 = fieldNorm(doc=44)
        0.2 = coord(5/25)
    
  5. Brown, S.: Developments in information retrieval systems : RetrievalWare from Excalibur (1996) 0.18
    0.18255971 = sum of:
      0.18255971 = product of:
        1.1409982 = sum of:
          0.118594274 = weight(abstract_txt:recognition in 5020) [ClassicSimilarity], result of:
            0.118594274 = score(doc=5020,freq=1.0), product of:
              0.12413741 = queryWeight, product of:
                1.0804732 = boost
                6.114219 = idf(docFreq=266, maxDocs=44421)
                0.018790904 = queryNorm
              0.95534676 = fieldWeight in 5020, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.114219 = idf(docFreq=266, maxDocs=44421)
                0.15625 = fieldNorm(doc=5020)
          0.12592173 = weight(abstract_txt:pattern in 5020) [ClassicSimilarity], result of:
            0.12592173 = score(doc=5020,freq=1.0), product of:
              0.12919945 = queryWeight, product of:
                1.1022826 = boost
                6.2376356 = idf(docFreq=235, maxDocs=44421)
                0.018790904 = queryNorm
              0.9746306 = fieldWeight in 5020, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2376356 = idf(docFreq=235, maxDocs=44421)
                0.15625 = fieldNorm(doc=5020)
          0.17273788 = weight(abstract_txt:adaptive in 5020) [ClassicSimilarity], result of:
            0.17273788 = score(doc=5020,freq=1.0), product of:
              0.15950902 = queryWeight, product of:
                1.2247721 = boost
                6.930783 = idf(docFreq=117, maxDocs=44421)
                0.018790904 = queryNorm
              1.0829349 = fieldWeight in 5020, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.930783 = idf(docFreq=117, maxDocs=44421)
                0.15625 = fieldNorm(doc=5020)
          0.72374433 = weight(abstract_txt:excalibur in 5020) [ClassicSimilarity], result of:
            0.72374433 = score(doc=5020,freq=1.0), product of:
              0.52230835 = queryWeight, product of:
                3.134304 = boost
                8.868255 = idf(docFreq=16, maxDocs=44421)
                0.018790904 = queryNorm
              1.3856648 = fieldWeight in 5020, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.868255 = idf(docFreq=16, maxDocs=44421)
                0.15625 = fieldNorm(doc=5020)
        0.16 = coord(4/25)
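
The content scores combine a per-term score with a coordination factor. The sketch below is one way to reproduce them, assuming each term score is queryWeight * fieldWeight, with queryWeight = boost * idf * queryNorm and fieldWeight = sqrt(freq) * idf * fieldNorm, and that the document score is the sum of term scores multiplied by coord = matched terms / query terms. The function names are illustrative; boost, queryNorm and fieldNorm are copied from the listing rather than derived.

```python
import math


def classic_idf(doc_freq: int, max_docs: int) -> float:
    """Inverse document frequency in the form that matches the listing (assumed)."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(freq, doc_freq, max_docs, field_norm, boost, query_norm):
    """Score of one matching term: queryWeight * fieldWeight."""
    idf = classic_idf(doc_freq, max_docs)
    query_weight = boost * idf * query_norm
    field_weight = math.sqrt(freq) * idf * field_norm
    return query_weight * field_weight


def document_score(term_scores, matched, total):
    """Sum of term scores scaled by the coordination factor matched/total."""
    return sum(term_scores) * (matched / total)


# abstract_txt:excalibur in document 1979, values copied from entry 1.
excalibur = term_score(freq=1.0, doc_freq=16, max_docs=44421,
                       field_norm=0.109375, boost=3.134304,
                       query_norm=0.018790904)

# The ten term scores listed for entry 1, in the order shown.
entry_1_terms = [
    0.096900694, 0.07609744, 0.11740233, 0.124656156, 0.17100178,
    0.1162064, 0.08970578, 0.23262282, 0.3808914, 0.506621,
]
entry_1_score = document_score(entry_1_terms, matched=10, total=25)

print(f"{excalibur:.4f} {entry_1_score:.4f}")
```

The coordination factor explains most of the gap between entry 1 and the rest: entry 1 matches 10 of the 25 query terms, while entries 2 to 5 match between 4 and 6, so their summed term scores are scaled down more heavily.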