Document (#1499)

Author
Ballard, T.
Lifshin, A.
Title
Prediction of OPAC spelling errors through a keyword inventory
Source
Information technology and libraries. 11(1992), p.139-145
Year
1992
Abstract
In order to find and correct spelling errors in the online public access catalog at Adelphi University, a visual inspection was performed of the 117,000 keywords indexed in the system. More than 1,000 errors were found. Certain long but common words such as administration, education, and commercial were found to generate many different misspellings. Most of the records were derived from bibliographic utilities, so the findings can be generalized to other OPACs. The same misspellings were also found in substantial numbers in CD-ROM databases. Misspellings were analyzed by the machine-readable catalog (MARC) field in which they were found, part of speech, and type of mistake. Lists of commonly misspelled root words and specific mistakes are included.
Theme
OPAC

Similar documents (author)

  1. Ballard, P.I.: Bound withs versus an online catalog : a practical solution (1992) 5.62
    5.620886 = sum of:
      5.620886 = weight(author_txt:ballard in 2967) [ClassicSimilarity], result of:
        5.620886 = fieldWeight in 2967, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.993418 = idf(docFreq=14, maxDocs=44421)
          0.625 = fieldNorm(doc=2967)
    
  2. Ballard, T.: OCLC's EPIC : report from the field (1991) 5.62
    5.620886 = sum of:
      5.620886 = weight(author_txt:ballard in 4858) [ClassicSimilarity], result of:
        5.620886 = fieldWeight in 4858, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.993418 = idf(docFreq=14, maxDocs=44421)
          0.625 = fieldNorm(doc=4858)
    
  3. Ballard, T.: Using FirstSearch in a bibliographic construction (1993) 5.62
    5.620886 = sum of:
      5.620886 = weight(author_txt:ballard in 7308) [ClassicSimilarity], result of:
        5.620886 = fieldWeight in 7308, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.993418 = idf(docFreq=14, maxDocs=44421)
          0.625 = fieldNorm(doc=7308)
    
  4. Ballard, T.: Comparative searching styles of patrons and staff (1994) 5.62
    5.620886 = sum of:
      5.620886 = weight(author_txt:ballard in 115) [ClassicSimilarity], result of:
        5.620886 = fieldWeight in 115, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.993418 = idf(docFreq=14, maxDocs=44421)
          0.625 = fieldNorm(doc=115)
    
  5. Ballard, T.: Library systems : transaction log fever; analyzing patron searches can reveal solutions to increase search success (1996) 5.62
    5.620886 = sum of:
      5.620886 = weight(author_txt:ballard in 5829) [ClassicSimilarity], result of:
        5.620886 = fieldWeight in 5829, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.993418 = idf(docFreq=14, maxDocs=44421)
          0.625 = fieldNorm(doc=5829)
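
The author scores above all follow the same arithmetic: fieldWeight = tf * idf * fieldNorm. The following Python sketch reproduces that calculation under two assumptions read off the listing rather than taken from any configuration: that tf is the square root of the term frequency, and that idf is 1 + ln(maxDocs / (docFreq + 1)). The function names are hypothetical and are not part of Lucene.

```python
import math


def classic_idf(doc_freq: int, max_docs: int) -> float:
    """Assumed idf formula: 1 + ln(maxDocs / (docFreq + 1))."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def field_weight(freq: float, doc_freq: int, max_docs: int, field_norm: float) -> float:
    """fieldWeight = tf * idf * fieldNorm, with tf assumed to be sqrt(freq)."""
    tf = math.sqrt(freq)
    return tf * classic_idf(doc_freq, max_docs) * field_norm


# Values copied from the listing for author_txt:ballard
# (freq=1.0, docFreq=14, maxDocs=44421, fieldNorm=0.625).
author_score = field_weight(freq=1.0, doc_freq=14, max_docs=44421, field_norm=0.625)
print(author_score)
```

Under these assumptions the result agrees with the listed 5.620886 to within rounding. All five author matches share the same freq, docFreq, and fieldNorm, which is why they receive identical scores.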
    

Similar documents (content)

  1. Drabenstott, K.M.; Weller, M.S.: Handling spelling errors in online catalog searches (1996) 0.41
    0.41376448 = sum of:
      0.41376448 = product of:
        1.4777303 = sum of:
          0.18558769 = weight(abstract_txt:misspelled in 6973) [ClassicSimilarity], result of:
            0.18558769 = score(doc=6973,freq=2.0), product of:
              0.21821651 = queryWeight, product of:
                1.5765942 = boost
                9.622026 = idf(docFreq=7, maxDocs=44421)
                0.01438471 = queryNorm
              0.850475 = fieldWeight in 6973, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.622026 = idf(docFreq=7, maxDocs=44421)
                0.0625 = fieldNorm(doc=6973)
          0.06401182 = weight(abstract_txt:words in 6973) [ClassicSimilarity], result of:
            0.06401182 = score(doc=6973,freq=2.0), product of:
              0.13521917 = queryWeight, product of:
                1.755134 = boost
                5.355831 = idf(docFreq=569, maxDocs=44421)
                0.01438471 = queryNorm
              0.47339305 = fieldWeight in 6973, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.355831 = idf(docFreq=569, maxDocs=44421)
                0.0625 = fieldNorm(doc=6973)
          0.29591462 = weight(abstract_txt:spelling in 6973) [ClassicSimilarity], result of:
            0.29591462 = score(doc=6973,freq=5.0), product of:
              0.27647918 = queryWeight, product of:
                2.5097032 = boost
                7.6584163 = idf(docFreq=56, maxDocs=44421)
                0.01438471 = queryNorm
              1.0702962 = fieldWeight in 6973, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                7.6584163 = idf(docFreq=56, maxDocs=44421)
                0.0625 = fieldNorm(doc=6973)
          0.052685574 = weight(abstract_txt:found in 6973) [ClassicSimilarity], result of:
            0.052685574 = score(doc=6973,freq=1.0), product of:
              0.18851502 = queryWeight, product of:
                2.9307523 = boost
                4.4716287 = idf(docFreq=1379, maxDocs=44421)
                0.01438471 = queryNorm
              0.2794768 = fieldWeight in 6973, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4716287 = idf(docFreq=1379, maxDocs=44421)
                0.0625 = fieldNorm(doc=6973)
          0.24816857 = weight(abstract_txt:errors in 6973) [ClassicSimilarity], result of:
            0.24816857 = score(doc=6973,freq=4.0), product of:
              0.3031911 = queryWeight, product of:
                3.2188072 = boost
                6.548176 = idf(docFreq=172, maxDocs=44421)
                0.01438471 = queryNorm
              0.818522 = fieldWeight in 6973, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.548176 = idf(docFreq=172, maxDocs=44421)
                0.0625 = fieldNorm(doc=6973)
          0.061660316 = weight(abstract_txt:were in 6973) [ClassicSimilarity], result of:
            0.061660316 = score(doc=6973,freq=2.0), product of:
              0.19021398 = queryWeight, product of:
                3.6055622 = boost
                3.6674848 = idf(docFreq=3083, maxDocs=44421)
                0.01438471 = queryNorm
              0.3241629 = fieldWeight in 6973, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.6674848 = idf(docFreq=3083, maxDocs=44421)
                0.0625 = fieldNorm(doc=6973)
          0.5697018 = weight(abstract_txt:misspellings in 6973) [ClassicSimilarity], result of:
            0.5697018 = score(doc=6973,freq=3.0), product of:
              0.58071524 = queryWeight, product of:
                4.454699 = boost
                9.06241 = idf(docFreq=13, maxDocs=44421)
                0.01438471 = queryNorm
              0.9810347 = fieldWeight in 6973, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                9.06241 = idf(docFreq=13, maxDocs=44421)
                0.0625 = fieldNorm(doc=6973)
        0.28 = coord(7/25)
    
  2. Randall, N.B.: Spelling errors in the database : shadow or substance? (1999) 0.26
    0.2550973 = sum of:
      0.2550973 = product of:
        1.2754865 = sum of:
          0.05548762 = weight(abstract_txt:correct in 231) [ClassicSimilarity], result of:
            0.05548762 = score(doc=231,freq=1.0), product of:
              0.10593893 = queryWeight, product of:
                1.0985098 = boost
                6.704255 = idf(docFreq=147, maxDocs=44421)
                0.01438471 = queryNorm
              0.5237699 = fieldWeight in 231, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.704255 = idf(docFreq=147, maxDocs=44421)
                0.078125 = fieldNorm(doc=231)
          0.23394105 = weight(abstract_txt:spelling in 231) [ClassicSimilarity], result of:
            0.23394105 = score(doc=231,freq=2.0), product of:
              0.27647918 = queryWeight, product of:
                2.5097032 = boost
                7.6584163 = idf(docFreq=56, maxDocs=44421)
                0.01438471 = queryNorm
              0.8461434 = fieldWeight in 231, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.6584163 = idf(docFreq=56, maxDocs=44421)
                0.078125 = fieldNorm(doc=231)
          0.3102107 = weight(abstract_txt:errors in 231) [ClassicSimilarity], result of:
            0.3102107 = score(doc=231,freq=4.0), product of:
              0.3031911 = queryWeight, product of:
                3.2188072 = boost
                6.548176 = idf(docFreq=172, maxDocs=44421)
                0.01438471 = queryNorm
              1.0231525 = fieldWeight in 231, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.548176 = idf(docFreq=172, maxDocs=44421)
                0.078125 = fieldNorm(doc=231)
          0.094397694 = weight(abstract_txt:were in 231) [ClassicSimilarity], result of:
            0.094397694 = score(doc=231,freq=3.0), product of:
              0.19021398 = queryWeight, product of:
                3.6055622 = boost
                3.6674848 = idf(docFreq=3083, maxDocs=44421)
                0.01438471 = queryNorm
              0.49627107 = fieldWeight in 231, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.6674848 = idf(docFreq=3083, maxDocs=44421)
                0.078125 = fieldNorm(doc=231)
          0.58144945 = weight(abstract_txt:misspellings in 231) [ClassicSimilarity], result of:
            0.58144945 = score(doc=231,freq=2.0), product of:
              0.58071524 = queryWeight, product of:
                4.454699 = boost
                9.06241 = idf(docFreq=13, maxDocs=44421)
                0.01438471 = queryNorm
              1.0012643 = fieldWeight in 231, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.06241 = idf(docFreq=13, maxDocs=44421)
                0.078125 = fieldNorm(doc=231)
        0.2 = coord(5/25)
    
  3. Tüür-Fröhlich, T.: The non-trivial effects of trivial errors in scientific communication and evaluation (2016) 0.24
    0.24487148 = sum of:
      0.24487148 = product of:
        0.874541 = sum of:
          0.041437864 = weight(abstract_txt:indexed in 4137) [ClassicSimilarity], result of:
            0.041437864 = score(doc=4137,freq=2.0), product of:
              0.08779054 = queryWeight, product of:
                6.1030455 = idf(docFreq=269, maxDocs=44421)
                0.01438471 = queryNorm
              0.47200832 = fieldWeight in 4137, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.1030455 = idf(docFreq=269, maxDocs=44421)
                0.0546875 = fieldNorm(doc=4137)
          0.038841337 = weight(abstract_txt:correct in 4137) [ClassicSimilarity], result of:
            0.038841337 = score(doc=4137,freq=1.0), product of:
              0.10593893 = queryWeight, product of:
                1.0985098 = boost
                6.704255 = idf(docFreq=147, maxDocs=44421)
                0.01438471 = queryNorm
              0.36663896 = fieldWeight in 4137, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.704255 = idf(docFreq=147, maxDocs=44421)
                0.0546875 = fieldNorm(doc=4137)
          0.10702156 = weight(abstract_txt:mistake in 4137) [ClassicSimilarity], result of:
            0.10702156 = score(doc=4137,freq=1.0), product of:
              0.20821258 = queryWeight, product of:
                1.5400316 = boost
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.01438471 = queryNorm
              0.5140014 = fieldWeight in 4137, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.0546875 = fieldNorm(doc=4137)
          0.04609988 = weight(abstract_txt:found in 4137) [ClassicSimilarity], result of:
            0.04609988 = score(doc=4137,freq=1.0), product of:
              0.18851502 = queryWeight, product of:
                2.9307523 = boost
                4.4716287 = idf(docFreq=1379, maxDocs=44421)
                0.01438471 = queryNorm
              0.2445422 = fieldWeight in 4137, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4716287 = idf(docFreq=1379, maxDocs=44421)
                0.0546875 = fieldNorm(doc=4137)
          0.28725913 = weight(abstract_txt:errors in 4137) [ClassicSimilarity], result of:
            0.28725913 = score(doc=4137,freq=7.0), product of:
              0.3031911 = queryWeight, product of:
                3.2188072 = boost
                6.548176 = idf(docFreq=172, maxDocs=44421)
                0.01438471 = queryNorm
              0.9474524 = fieldWeight in 4137, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                6.548176 = idf(docFreq=172, maxDocs=44421)
                0.0546875 = fieldNorm(doc=4137)
          0.06607839 = weight(abstract_txt:were in 4137) [ClassicSimilarity], result of:
            0.06607839 = score(doc=4137,freq=3.0), product of:
              0.19021398 = queryWeight, product of:
                3.6055622 = boost
                3.6674848 = idf(docFreq=3083, maxDocs=44421)
                0.01438471 = queryNorm
              0.34738976 = fieldWeight in 4137, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.6674848 = idf(docFreq=3083, maxDocs=44421)
                0.0546875 = fieldNorm(doc=4137)
          0.28780282 = weight(abstract_txt:misspellings in 4137) [ClassicSimilarity], result of:
            0.28780282 = score(doc=4137,freq=1.0), product of:
              0.58071524 = queryWeight, product of:
                4.454699 = boost
                9.06241 = idf(docFreq=13, maxDocs=44421)
                0.01438471 = queryNorm
              0.49560058 = fieldWeight in 4137, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.06241 = idf(docFreq=13, maxDocs=44421)
                0.0546875 = fieldNorm(doc=4137)
        0.28 = coord(7/25)
    
  4. Ballard, T.: Spelling and typographical errors in library databases (1992) 0.22
    0.21505219 = sum of:
      0.21505219 = product of:
        1.3440762 = sum of:
          0.30577588 = weight(abstract_txt:adelphi in 6971) [ClassicSimilarity], result of:
            0.30577588 = score(doc=6971,freq=1.0), product of:
              0.20821258 = queryWeight, product of:
                1.5400316 = boost
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.01438471 = queryNorm
              1.4685755 = fieldWeight in 6971, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.15625 = fieldNorm(doc=6971)
          0.4678821 = weight(abstract_txt:spelling in 6971) [ClassicSimilarity], result of:
            0.4678821 = score(doc=6971,freq=2.0), product of:
              0.27647918 = queryWeight, product of:
                2.5097032 = boost
                7.6584163 = idf(docFreq=56, maxDocs=44421)
                0.01438471 = queryNorm
              1.6922868 = fieldWeight in 6971, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.6584163 = idf(docFreq=56, maxDocs=44421)
                0.15625 = fieldNorm(doc=6971)
          0.13171393 = weight(abstract_txt:found in 6971) [ClassicSimilarity], result of:
            0.13171393 = score(doc=6971,freq=1.0), product of:
              0.18851502 = queryWeight, product of:
                2.9307523 = boost
                4.4716287 = idf(docFreq=1379, maxDocs=44421)
                0.01438471 = queryNorm
              0.69869196 = fieldWeight in 6971, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4716287 = idf(docFreq=1379, maxDocs=44421)
                0.15625 = fieldNorm(doc=6971)
          0.43870422 = weight(abstract_txt:errors in 6971) [ClassicSimilarity], result of:
            0.43870422 = score(doc=6971,freq=2.0), product of:
              0.3031911 = queryWeight, product of:
                3.2188072 = boost
                6.548176 = idf(docFreq=172, maxDocs=44421)
                0.01438471 = queryNorm
              1.4469562 = fieldWeight in 6971, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.548176 = idf(docFreq=172, maxDocs=44421)
                0.15625 = fieldNorm(doc=6971)
        0.16 = coord(4/25)
    
  5. Berget, G.; Sandnes, F.E.: Do autocomplete functions reduce the impact of dyslexia on information-searching behavior? : the case of Google (2016) 0.16
    0.16399942 = sum of:
      0.16399942 = product of:
        1.0249964 = sum of:
          0.23394105 = weight(abstract_txt:spelling in 4112) [ClassicSimilarity], result of:
            0.23394105 = score(doc=4112,freq=2.0), product of:
              0.27647918 = queryWeight, product of:
                2.5097032 = boost
                7.6584163 = idf(docFreq=56, maxDocs=44421)
                0.01438471 = queryNorm
              0.8461434 = fieldWeight in 4112, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.6584163 = idf(docFreq=56, maxDocs=44421)
                0.078125 = fieldNorm(doc=4112)
          0.15510535 = weight(abstract_txt:errors in 4112) [ClassicSimilarity], result of:
            0.15510535 = score(doc=4112,freq=1.0), product of:
              0.3031911 = queryWeight, product of:
                3.2188072 = boost
                6.548176 = idf(docFreq=172, maxDocs=44421)
                0.01438471 = queryNorm
              0.51157624 = fieldWeight in 4112, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.548176 = idf(docFreq=172, maxDocs=44421)
                0.078125 = fieldNorm(doc=4112)
          0.054500535 = weight(abstract_txt:were in 4112) [ClassicSimilarity], result of:
            0.054500535 = score(doc=4112,freq=1.0), product of:
              0.19021398 = queryWeight, product of:
                3.6055622 = boost
                3.6674848 = idf(docFreq=3083, maxDocs=44421)
                0.01438471 = queryNorm
              0.28652224 = fieldWeight in 4112, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6674848 = idf(docFreq=3083, maxDocs=44421)
                0.078125 = fieldNorm(doc=4112)
          0.58144945 = weight(abstract_txt:misspellings in 4112) [ClassicSimilarity], result of:
            0.58144945 = score(doc=4112,freq=2.0), product of:
              0.58071524 = queryWeight, product of:
                4.454699 = boost
                9.06241 = idf(docFreq=13, maxDocs=44421)
                0.01438471 = queryNorm
              1.0012643 = fieldWeight in 4112, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.06241 = idf(docFreq=13, maxDocs=44421)
                0.078125 = fieldNorm(doc=4112)
        0.16 = coord(4/25)
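
The content scores combine more factors than the author scores. Each matching term contributes queryWeight * fieldWeight, where queryWeight = boost * idf * queryNorm and fieldWeight = tf * idf * fieldNorm; the contributions are summed and multiplied by coord (matched terms divided by query terms). The Python sketch below recomputes the first content match (doc 6973) from the values shown in the listing. It assumes tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)), both inferred from the listed numbers; the function and variable names are hypothetical.

```python
import math

MAX_DOCS = 44421
QUERY_NORM = 0.01438471   # queryNorm as listed
FIELD_NORM = 0.0625       # fieldNorm(doc=6973) as listed
QUERY_TERMS = 25          # denominator of coord(7/25)

# (term, boost, docFreq, termFreq) copied from the explanation for doc 6973.
TERMS = [
    ("misspelled", 1.5765942, 7, 2.0),
    ("words", 1.755134, 569, 2.0),
    ("spelling", 2.5097032, 56, 5.0),
    ("found", 2.9307523, 1379, 1.0),
    ("errors", 3.2188072, 172, 4.0),
    ("were", 3.6055622, 3083, 2.0),
    ("misspellings", 4.454699, 13, 3.0),
]


def classic_idf(doc_freq: int, max_docs: int = MAX_DOCS) -> float:
    """Assumed idf formula: 1 + ln(maxDocs / (docFreq + 1))."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(boost: float, doc_freq: int, freq: float) -> float:
    """Per-term contribution: queryWeight * fieldWeight."""
    idf = classic_idf(doc_freq)
    query_weight = boost * idf * QUERY_NORM
    fld_weight = math.sqrt(freq) * idf * FIELD_NORM
    return query_weight * fld_weight


term_scores = {term: term_score(boost, df, freq) for term, boost, df, freq in TERMS}
coord = len(TERMS) / QUERY_TERMS             # 7 matched terms out of 25
total = coord * sum(term_scores.values())    # final document score

for term, score in term_scores.items():
    print(f"{term:13s} {score:.6f}")
print(f"total         {total:.6f}")
```

Because idf enters both queryWeight and fieldWeight, rare terms such as "misspellings" (docFreq=13) dominate the sum, while common terms such as "were" contribute little despite their higher boost.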