Document (#14791)

Author
Hosono, K.
Title
Information retrieval functions in digital libraries
Source
Pharmaceutical library bulletin [=Yakugaku Toshokan]. 41(1996) no.2, S.91-99
Year
1996
Abstract
Information retrieval functions in digital libraries have a different context from those which apply to searching commercial databases or OPACs. Different methods of browsing in this context are described, but the retrieval function should also include ordinary Boolean searching. Conversion of printed materials to electronic format using OCR can result in errors, which may cause problems for keyword searching. The n-gram method of approximate or fuzzy matching to reduce this problem is described
Footnote
[In japanisch]

Similar documents (content)

  1. Longshu, L.; Xia, Z.: On an aproximate fuzzy information retrieval agent (1998) 0.26
    0.25603792 = sum of:
      0.25603792 = product of:
        1.6002371 = sum of:
          0.2278828 = weight(abstract_txt:matching in 4294) [ClassicSimilarity], result of:
            0.2278828 = score(doc=4294,freq=2.0), product of:
              0.17068559 = queryWeight, product of:
                1.0825661 = boost
                6.0419855 = idf(docFreq=286, maxDocs=44421)
                0.026095327 = queryNorm
              1.3351028 = fieldWeight in 4294, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.0419855 = idf(docFreq=286, maxDocs=44421)
                0.15625 = fieldNorm(doc=4294)
          0.52314985 = weight(abstract_txt:fuzzy in 4294) [ClassicSimilarity], result of:
            0.52314985 = score(doc=4294,freq=5.0), product of:
              0.2188567 = queryWeight, product of:
                1.225846 = boost
                6.8416553 = idf(docFreq=128, maxDocs=44421)
                0.026095327 = queryNorm
              2.390376 = fieldWeight in 4294, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                6.8416553 = idf(docFreq=128, maxDocs=44421)
                0.15625 = fieldNorm(doc=4294)
          0.7189716 = weight(abstract_txt:approximate in 4294) [ClassicSimilarity], result of:
            0.7189716 = score(doc=4294,freq=4.0), product of:
              0.29142064 = queryWeight, product of:
                1.414543 = boost
                7.894805 = idf(docFreq=44, maxDocs=44421)
                0.026095327 = queryNorm
              2.4671266 = fieldWeight in 4294, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.894805 = idf(docFreq=44, maxDocs=44421)
                0.15625 = fieldNorm(doc=4294)
          0.13023284 = weight(abstract_txt:retrieval in 4294) [ClassicSimilarity], result of:
            0.13023284 = score(doc=4294,freq=2.0), product of:
              0.16952871 = queryWeight, product of:
                1.8686943 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.026095327 = queryNorm
              0.7682052 = fieldWeight in 4294, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.15625 = fieldNorm(doc=4294)
        0.16 = coord(4/25)
    
  2. Alexander, M.: Retrieving digital data with fuzzy matching (1996) 0.22
    0.21568893 = sum of:
      0.21568893 = product of:
        0.7703176 = sum of:
          0.025302297 = weight(abstract_txt:which in 30) [ClassicSimilarity], result of:
            0.025302297 = score(doc=30,freq=1.0), product of:
              0.07939328 = queryWeight, product of:
                1.04415 = boost
                2.9137893 = idf(docFreq=6552, maxDocs=44421)
                0.026095327 = queryNorm
              0.3186957 = fieldWeight in 30, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.9137893 = idf(docFreq=6552, maxDocs=44421)
                0.109375 = fieldNorm(doc=30)
          0.12539518 = weight(abstract_txt:conversion in 30) [ClassicSimilarity], result of:
            0.12539518 = score(doc=30,freq=1.0), product of:
              0.18316999 = queryWeight, product of:
                1.1214584 = boost
                6.25905 = idf(docFreq=230, maxDocs=44421)
                0.026095327 = queryNorm
              0.6845836 = fieldWeight in 30, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.25905 = idf(docFreq=230, maxDocs=44421)
                0.109375 = fieldNorm(doc=30)
          0.12698355 = weight(abstract_txt:reduce in 30) [ClassicSimilarity], result of:
            0.12698355 = score(doc=30,freq=1.0), product of:
              0.18471356 = queryWeight, product of:
                1.1261737 = boost
                6.285367 = idf(docFreq=224, maxDocs=44421)
                0.026095327 = queryNorm
              0.68746203 = fieldWeight in 30, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.285367 = idf(docFreq=224, maxDocs=44421)
                0.109375 = fieldNorm(doc=30)
          0.14358747 = weight(abstract_txt:errors in 30) [ClassicSimilarity], result of:
            0.14358747 = score(doc=30,freq=1.0), product of:
              0.20048328 = queryWeight, product of:
                1.1732622 = boost
                6.548176 = idf(docFreq=172, maxDocs=44421)
                0.026095327 = queryNorm
              0.7162067 = fieldWeight in 30, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.548176 = idf(docFreq=172, maxDocs=44421)
                0.109375 = fieldNorm(doc=30)
          0.1637718 = weight(abstract_txt:fuzzy in 30) [ClassicSimilarity], result of:
            0.1637718 = score(doc=30,freq=1.0), product of:
              0.2188567 = queryWeight, product of:
                1.225846 = boost
                6.8416553 = idf(docFreq=128, maxDocs=44421)
                0.026095327 = queryNorm
              0.74830604 = fieldWeight in 30, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.8416553 = idf(docFreq=128, maxDocs=44421)
                0.109375 = fieldNorm(doc=30)
          0.06446197 = weight(abstract_txt:retrieval in 30) [ClassicSimilarity], result of:
            0.06446197 = score(doc=30,freq=1.0), product of:
              0.16952871 = queryWeight, product of:
                1.8686943 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.026095327 = queryNorm
              0.3802422 = fieldWeight in 30, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.109375 = fieldNorm(doc=30)
          0.12081538 = weight(abstract_txt:searching in 30) [ClassicSimilarity], result of:
            0.12081538 = score(doc=30,freq=1.0), product of:
              0.2577047 = queryWeight, product of:
                2.303975 = boost
                4.2862926 = idf(docFreq=1660, maxDocs=44421)
                0.026095327 = queryNorm
              0.46881324 = fieldWeight in 30, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2862926 = idf(docFreq=1660, maxDocs=44421)
                0.109375 = fieldNorm(doc=30)
        0.28 = coord(7/25)
    
  3. Ensor, P.: User characteristics of keyword searching in an OPAC (1992) 0.19
    0.18621427 = sum of:
      0.18621427 = product of:
        0.7758928 = sum of:
          0.04089469 = weight(abstract_txt:which in 2277) [ClassicSimilarity], result of:
            0.04089469 = score(doc=2277,freq=2.0), product of:
              0.07939328 = queryWeight, product of:
                1.04415 = boost
                2.9137893 = idf(docFreq=6552, maxDocs=44421)
                0.026095327 = queryNorm
              0.51509005 = fieldWeight in 2277, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.9137893 = idf(docFreq=6552, maxDocs=44421)
                0.125 = fieldNorm(doc=2277)
          0.11962686 = weight(abstract_txt:opacs in 2277) [ClassicSimilarity], result of:
            0.11962686 = score(doc=2277,freq=1.0), product of:
              0.16238962 = queryWeight, product of:
                1.05593 = boost
                5.8933253 = idf(docFreq=332, maxDocs=44421)
                0.026095327 = queryNorm
              0.73666567 = fieldWeight in 2277, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8933253 = idf(docFreq=332, maxDocs=44421)
                0.125 = fieldNorm(doc=2277)
          0.18199155 = weight(abstract_txt:keyword in 2277) [ClassicSimilarity], result of:
            0.18199155 = score(doc=2277,freq=2.0), product of:
              0.1704891 = queryWeight, product of:
                1.0819428 = boost
                6.038507 = idf(docFreq=287, maxDocs=44421)
                0.026095327 = queryNorm
              1.0674673 = fieldWeight in 2277, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.038507 = idf(docFreq=287, maxDocs=44421)
                0.125 = fieldNorm(doc=2277)
          0.14301226 = weight(abstract_txt:boolean in 2277) [ClassicSimilarity], result of:
            0.14301226 = score(doc=2277,freq=1.0), product of:
              0.18291725 = queryWeight, product of:
                1.1206844 = boost
                6.25473 = idf(docFreq=231, maxDocs=44421)
                0.026095327 = queryNorm
              0.7818413 = fieldWeight in 2277, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.25473 = idf(docFreq=231, maxDocs=44421)
                0.125 = fieldNorm(doc=2277)
          0.095100336 = weight(abstract_txt:context in 2277) [ClassicSimilarity], result of:
            0.095100336 = score(doc=2277,freq=1.0), product of:
              0.17557818 = queryWeight, product of:
                1.5527669 = boost
                4.333128 = idf(docFreq=1584, maxDocs=44421)
                0.026095327 = queryNorm
              0.541641 = fieldWeight in 2277, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.333128 = idf(docFreq=1584, maxDocs=44421)
                0.125 = fieldNorm(doc=2277)
          0.19526713 = weight(abstract_txt:searching in 2277) [ClassicSimilarity], result of:
            0.19526713 = score(doc=2277,freq=2.0), product of:
              0.2577047 = queryWeight, product of:
                2.303975 = boost
                4.2862926 = idf(docFreq=1660, maxDocs=44421)
                0.026095327 = queryNorm
              0.7577166 = fieldWeight in 2277, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.2862926 = idf(docFreq=1660, maxDocs=44421)
                0.125 = fieldNorm(doc=2277)
        0.24 = coord(6/25)
    
  4. Borgman, C.L.; Hirsh, S.G.; Walter, V.A.; Gallagher, A.L.: Childrens searching behavior on browsing and keyword online catalogs : the Science Library Catalog project (1995) 0.16
    0.16192932 = sum of:
      0.16192932 = product of:
        0.5060291 = sum of:
          0.04445303 = weight(abstract_txt:browsing in 2659) [ClassicSimilarity], result of:
            0.04445303 = score(doc=2659,freq=1.0), product of:
              0.14564246 = queryWeight, product of:
                5.58117 = idf(docFreq=454, maxDocs=44421)
                0.026095327 = queryNorm
              0.30522025 = fieldWeight in 2659, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.58117 = idf(docFreq=454, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2659)
          0.09064991 = weight(abstract_txt:opacs in 2659) [ClassicSimilarity], result of:
            0.09064991 = score(doc=2659,freq=3.0), product of:
              0.16238962 = queryWeight, product of:
                1.05593 = boost
                5.8933253 = idf(docFreq=332, maxDocs=44421)
                0.026095327 = queryNorm
              0.5582248 = fieldWeight in 2659, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.8933253 = idf(docFreq=332, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2659)
          0.13790815 = weight(abstract_txt:keyword in 2659) [ClassicSimilarity], result of:
            0.13790815 = score(doc=2659,freq=6.0), product of:
              0.1704891 = queryWeight, product of:
                1.0819428 = boost
                6.038507 = idf(docFreq=287, maxDocs=44421)
                0.026095327 = queryNorm
              0.80889714 = fieldWeight in 2659, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                6.038507 = idf(docFreq=287, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2659)
          0.06256786 = weight(abstract_txt:boolean in 2659) [ClassicSimilarity], result of:
            0.06256786 = score(doc=2659,freq=1.0), product of:
              0.18291725 = queryWeight, product of:
                1.1206844 = boost
                6.25473 = idf(docFreq=231, maxDocs=44421)
                0.026095327 = queryNorm
              0.34205556 = fieldWeight in 2659, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.25473 = idf(docFreq=231, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2659)
          0.025067078 = weight(abstract_txt:different in 2659) [ClassicSimilarity], result of:
            0.025067078 = score(doc=2659,freq=1.0), product of:
              0.12524669 = queryWeight, product of:
                1.3114572 = boost
                3.6597328 = idf(docFreq=3107, maxDocs=44421)
                0.026095327 = queryNorm
              0.20014164 = fieldWeight in 2659, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6597328 = idf(docFreq=3107, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2659)
          0.027722742 = weight(abstract_txt:libraries in 2659) [ClassicSimilarity], result of:
            0.027722742 = score(doc=2659,freq=1.0), product of:
              0.13394338 = queryWeight, product of:
                1.3562247 = boost
                3.78466 = idf(docFreq=2742, maxDocs=44421)
                0.026095327 = queryNorm
              0.2069736 = fieldWeight in 2659, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.78466 = idf(docFreq=2742, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2659)
          0.032230984 = weight(abstract_txt:retrieval in 2659) [ClassicSimilarity], result of:
            0.032230984 = score(doc=2659,freq=1.0), product of:
              0.16952871 = queryWeight, product of:
                1.8686943 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.026095327 = queryNorm
              0.1901211 = fieldWeight in 2659, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2659)
          0.08542937 = weight(abstract_txt:searching in 2659) [ClassicSimilarity], result of:
            0.08542937 = score(doc=2659,freq=2.0), product of:
              0.2577047 = queryWeight, product of:
                2.303975 = boost
                4.2862926 = idf(docFreq=1660, maxDocs=44421)
                0.026095327 = queryNorm
              0.331501 = fieldWeight in 2659, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.2862926 = idf(docFreq=1660, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2659)
        0.32 = coord(8/25)
    
  5. Tenopir, C.: Common end user errors (1997) 0.16
    0.16150934 = sum of:
      0.16150934 = product of:
        0.6729556 = sum of:
          0.08428694 = weight(abstract_txt:commercial in 1410) [ClassicSimilarity], result of:
            0.08428694 = score(doc=1410,freq=1.0), product of:
              0.1557657 = queryWeight, product of:
                1.0341699 = boost
                5.7718782 = idf(docFreq=375, maxDocs=44421)
                0.026095327 = queryNorm
              0.5411136 = fieldWeight in 1410, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7718782 = idf(docFreq=375, maxDocs=44421)
                0.09375 = fieldNorm(doc=1410)
          0.021687683 = weight(abstract_txt:which in 1410) [ClassicSimilarity], result of:
            0.021687683 = score(doc=1410,freq=1.0), product of:
              0.07939328 = queryWeight, product of:
                1.04415 = boost
                2.9137893 = idf(docFreq=6552, maxDocs=44421)
                0.026095327 = queryNorm
              0.27316773 = fieldWeight in 1410, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.9137893 = idf(docFreq=6552, maxDocs=44421)
                0.09375 = fieldNorm(doc=1410)
          0.10725919 = weight(abstract_txt:boolean in 1410) [ClassicSimilarity], result of:
            0.10725919 = score(doc=1410,freq=1.0), product of:
              0.18291725 = queryWeight, product of:
                1.1206844 = boost
                6.25473 = idf(docFreq=231, maxDocs=44421)
                0.026095327 = queryNorm
              0.58638096 = fieldWeight in 1410, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.25473 = idf(docFreq=231, maxDocs=44421)
                0.09375 = fieldNorm(doc=1410)
          0.36922494 = weight(abstract_txt:errors in 1410) [ClassicSimilarity], result of:
            0.36922494 = score(doc=1410,freq=9.0), product of:
              0.20048328 = queryWeight, product of:
                1.1732622 = boost
                6.548176 = idf(docFreq=172, maxDocs=44421)
                0.026095327 = queryNorm
              1.8416744 = fieldWeight in 1410, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                6.548176 = idf(docFreq=172, maxDocs=44421)
                0.09375 = fieldNorm(doc=1410)
          0.042972133 = weight(abstract_txt:different in 1410) [ClassicSimilarity], result of:
            0.042972133 = score(doc=1410,freq=1.0), product of:
              0.12524669 = queryWeight, product of:
                1.3114572 = boost
                3.6597328 = idf(docFreq=3107, maxDocs=44421)
                0.026095327 = queryNorm
              0.34309995 = fieldWeight in 1410, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6597328 = idf(docFreq=3107, maxDocs=44421)
                0.09375 = fieldNorm(doc=1410)
          0.0475247 = weight(abstract_txt:libraries in 1410) [ClassicSimilarity], result of:
            0.0475247 = score(doc=1410,freq=1.0), product of:
              0.13394338 = queryWeight, product of:
                1.3562247 = boost
                3.78466 = idf(docFreq=2742, maxDocs=44421)
                0.026095327 = queryNorm
              0.35481188 = fieldWeight in 1410, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.78466 = idf(docFreq=2742, maxDocs=44421)
                0.09375 = fieldNorm(doc=1410)
        0.24 = coord(6/25)