Document (#17510)

Author
Anzai, H.
Yamamoto, T.
Ishizuka, H.
Title
Experimental service of cataloguing database through WWW
Source
ULIS. 15(1996) no.2, S.1-16
Year
1996
Abstract
An information retrieval system for a cataloguing database through the WWW is developed, and experimentally served to Japan MARC and ULIS (Univeristy of Library and Information Science) OPAC data. Since Japanese words are not separated by obvious delimiters, ensuring the same segmentation between the query and the database is a problem. The present system solves the problem by using the multiple hash screening technique for processing both book titles and query strings, based on the same dictionary and using similar algorithms. Database management is handled by ADABAS, reducing management chores and and response time. The effectiveness of the multiple hash screening technique for a Japanese text based information system is examined, and the limitation of the Web's hypertext environment for a bibliographic information retrieval service is discussed
Footnote
[In Japanisch]

Similar documents (content)

  1. Rorvig, M.; Smith, M.M.; Uemura, A.: ¬The N-gram hypothesis applied to matched sets of visualized Japanese-English technical documents (1999) 0.12
    0.11922228 = sum of:
      0.11922228 = product of:
        0.7451393 = sum of:
          0.13770276 = weight(abstract_txt:japan in 675) [ClassicSimilarity], result of:
            0.13770276 = score(doc=675,freq=1.0), product of:
              0.16550235 = queryWeight, product of:
                1.0245558 = boost
                7.607123 = idf(docFreq=59, maxDocs=44421)
                0.021234797 = queryNorm
              0.83202904 = fieldWeight in 675, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.607123 = idf(docFreq=59, maxDocs=44421)
                0.109375 = fieldNorm(doc=675)
          0.017709034 = weight(abstract_txt:information in 675) [ClassicSimilarity], result of:
            0.017709034 = score(doc=675,freq=1.0), product of:
              0.066935875 = queryWeight, product of:
                1.3031458 = boost
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.021234797 = queryNorm
              0.26456714 = fieldWeight in 675, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.109375 = fieldNorm(doc=675)
          0.109542415 = weight(abstract_txt:technique in 675) [ClassicSimilarity], result of:
            0.109542415 = score(doc=675,freq=1.0), product of:
              0.17902234 = queryWeight, product of:
                1.5069615 = boost
                5.5944448 = idf(docFreq=448, maxDocs=44421)
                0.021234797 = queryNorm
              0.6118924 = fieldWeight in 675, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5944448 = idf(docFreq=448, maxDocs=44421)
                0.109375 = fieldNorm(doc=675)
          0.4801851 = weight(abstract_txt:japanese in 675) [ClassicSimilarity], result of:
            0.4801851 = score(doc=675,freq=3.0), product of:
              0.33246893 = queryWeight, product of:
                2.0536387 = boost
                7.62393 = idf(docFreq=58, maxDocs=44421)
                0.021234797 = queryNorm
              1.4443007 = fieldWeight in 675, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.62393 = idf(docFreq=58, maxDocs=44421)
                0.109375 = fieldNorm(doc=675)
        0.16 = coord(4/25)
    
  2. Yang, C.C.; Li, K.W.: ¬A heuristic method based on a statistical approach for chinese text segmentation (2005) 0.11
    0.111874044 = sum of:
      0.111874044 = product of:
        0.5593702 = sum of:
          0.26845405 = weight(abstract_txt:segmentation in 5580) [ClassicSimilarity], result of:
            0.26845405 = score(doc=5580,freq=9.0), product of:
              0.1803157 = queryWeight, product of:
                1.0694249 = boost
                7.9402676 = idf(docFreq=42, maxDocs=44421)
                0.021234797 = queryNorm
              1.4888002 = fieldWeight in 5580, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                7.9402676 = idf(docFreq=42, maxDocs=44421)
                0.0625 = fieldNorm(doc=5580)
          0.0448346 = weight(abstract_txt:problem in 5580) [ClassicSimilarity], result of:
            0.0448346 = score(doc=5580,freq=2.0), product of:
              0.11374787 = queryWeight, product of:
                1.2012134 = boost
                4.4593854 = idf(docFreq=1396, maxDocs=44421)
                0.021234797 = queryNorm
              0.3941577 = fieldWeight in 5580, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.4593854 = idf(docFreq=1396, maxDocs=44421)
                0.0625 = fieldNorm(doc=5580)
          0.017527396 = weight(abstract_txt:information in 5580) [ClassicSimilarity], result of:
            0.017527396 = score(doc=5580,freq=3.0), product of:
              0.066935875 = queryWeight, product of:
                1.3031458 = boost
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.021234797 = queryNorm
              0.26185355 = fieldWeight in 5580, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.0625 = fieldNorm(doc=5580)
          0.16595852 = weight(abstract_txt:delimiters in 5580) [ClassicSimilarity], result of:
            0.16595852 = score(doc=5580,freq=1.0), product of:
              0.27218705 = queryWeight, product of:
                1.3139149 = boost
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.021234797 = queryNorm
              0.6097223 = fieldWeight in 5580, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.0625 = fieldNorm(doc=5580)
          0.062595665 = weight(abstract_txt:technique in 5580) [ClassicSimilarity], result of:
            0.062595665 = score(doc=5580,freq=1.0), product of:
              0.17902234 = queryWeight, product of:
                1.5069615 = boost
                5.5944448 = idf(docFreq=448, maxDocs=44421)
                0.021234797 = queryNorm
              0.3496528 = fieldWeight in 5580, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5944448 = idf(docFreq=448, maxDocs=44421)
                0.0625 = fieldNorm(doc=5580)
        0.2 = coord(5/25)
    
  3. Sormunen, E.; Kekäläinen, J.; Koivisto, J.; Järvelin, K.: Document text characteristics affect the ranking of the most relevant documents by expanded structured queries (2001) 0.11
    0.11083241 = sum of:
      0.11083241 = product of:
        0.4618017 = sum of:
          0.022876684 = weight(abstract_txt:through in 5487) [ClassicSimilarity], result of:
            0.022876684 = score(doc=5487,freq=1.0), product of:
              0.09151096 = queryWeight, product of:
                1.0774201 = boost
                3.9998152 = idf(docFreq=2211, maxDocs=44421)
                0.021234797 = queryNorm
              0.24998845 = fieldWeight in 5487, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9998152 = idf(docFreq=2211, maxDocs=44421)
                0.0625 = fieldNorm(doc=5487)
          0.054337613 = weight(abstract_txt:query in 5487) [ClassicSimilarity], result of:
            0.054337613 = score(doc=5487,freq=2.0), product of:
              0.12930088 = queryWeight, product of:
                1.2807055 = boost
                4.754492 = idf(docFreq=1039, maxDocs=44421)
                0.021234797 = queryNorm
              0.42024165 = fieldWeight in 5487, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.754492 = idf(docFreq=1039, maxDocs=44421)
                0.0625 = fieldNorm(doc=5487)
          0.020238895 = weight(abstract_txt:information in 5487) [ClassicSimilarity], result of:
            0.020238895 = score(doc=5487,freq=4.0), product of:
              0.066935875 = queryWeight, product of:
                1.3031458 = boost
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.021234797 = queryNorm
              0.30236244 = fieldWeight in 5487, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.0625 = fieldNorm(doc=5487)
          0.02909656 = weight(abstract_txt:system in 5487) [ClassicSimilarity], result of:
            0.02909656 = score(doc=5487,freq=2.0), product of:
              0.09760213 = queryWeight, product of:
                1.3627739 = boost
                3.372775 = idf(docFreq=4140, maxDocs=44421)
                0.021234797 = queryNorm
              0.298114 = fieldWeight in 5487, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.372775 = idf(docFreq=4140, maxDocs=44421)
                0.0625 = fieldNorm(doc=5487)
          0.062595665 = weight(abstract_txt:technique in 5487) [ClassicSimilarity], result of:
            0.062595665 = score(doc=5487,freq=1.0), product of:
              0.17902234 = queryWeight, product of:
                1.5069615 = boost
                5.5944448 = idf(docFreq=448, maxDocs=44421)
                0.021234797 = queryNorm
              0.3496528 = fieldWeight in 5487, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5944448 = idf(docFreq=448, maxDocs=44421)
                0.0625 = fieldNorm(doc=5487)
          0.2726563 = weight(abstract_txt:screening in 5487) [ClassicSimilarity], result of:
            0.2726563 = score(doc=5487,freq=1.0), product of:
              0.47747955 = queryWeight, product of:
                2.461081 = boost
                9.1365185 = idf(docFreq=12, maxDocs=44421)
                0.021234797 = queryNorm
              0.5710324 = fieldWeight in 5487, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.1365185 = idf(docFreq=12, maxDocs=44421)
                0.0625 = fieldNorm(doc=5487)
        0.24 = coord(6/25)
    
  4. Cathro, W.S.: ¬The development of national bibliographic and document access services in Australia (1996) 0.09
    0.09152503 = sum of:
      0.09152503 = product of:
        0.45762515 = sum of:
          0.04852877 = weight(abstract_txt:through in 3579) [ClassicSimilarity], result of:
            0.04852877 = score(doc=3579,freq=2.0), product of:
              0.09151096 = queryWeight, product of:
                1.0774201 = boost
                3.9998152 = idf(docFreq=2211, maxDocs=44421)
                0.021234797 = queryNorm
              0.53030556 = fieldWeight in 3579, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.9998152 = idf(docFreq=2211, maxDocs=44421)
                0.09375 = fieldNorm(doc=3579)
          0.102833405 = weight(abstract_txt:service in 3579) [ClassicSimilarity], result of:
            0.102833405 = score(doc=3579,freq=4.0), product of:
              0.11982655 = queryWeight, product of:
                1.232892 = boost
                4.576989 = idf(docFreq=1241, maxDocs=44421)
                0.021234797 = queryNorm
              0.85818547 = fieldWeight in 3579, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.576989 = idf(docFreq=1241, maxDocs=44421)
                0.09375 = fieldNorm(doc=3579)
          0.015179171 = weight(abstract_txt:information in 3579) [ClassicSimilarity], result of:
            0.015179171 = score(doc=3579,freq=1.0), product of:
              0.066935875 = queryWeight, product of:
                1.3031458 = boost
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.021234797 = queryNorm
              0.22677183 = fieldWeight in 3579, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.09375 = fieldNorm(doc=3579)
          0.0534538 = weight(abstract_txt:system in 3579) [ClassicSimilarity], result of:
            0.0534538 = score(doc=3579,freq=3.0), product of:
              0.09760213 = queryWeight, product of:
                1.3627739 = boost
                3.372775 = idf(docFreq=4140, maxDocs=44421)
                0.021234797 = queryNorm
              0.5476704 = fieldWeight in 3579, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.372775 = idf(docFreq=4140, maxDocs=44421)
                0.09375 = fieldNorm(doc=3579)
          0.23762998 = weight(abstract_txt:japanese in 3579) [ClassicSimilarity], result of:
            0.23762998 = score(doc=3579,freq=1.0), product of:
              0.33246893 = queryWeight, product of:
                2.0536387 = boost
                7.62393 = idf(docFreq=58, maxDocs=44421)
                0.021234797 = queryNorm
              0.71474344 = fieldWeight in 3579, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.62393 = idf(docFreq=58, maxDocs=44421)
                0.09375 = fieldNorm(doc=3579)
        0.2 = coord(5/25)
    
  5. Peng, F.; Huang, X.: Machine learning for Asian language text classification (2007) 0.09
    0.09017261 = sum of:
      0.09017261 = product of:
        0.56357884 = sum of:
          0.23675421 = weight(abstract_txt:segmentation in 1831) [ClassicSimilarity], result of:
            0.23675421 = score(doc=1831,freq=7.0), product of:
              0.1803157 = queryWeight, product of:
                1.0694249 = boost
                7.9402676 = idf(docFreq=42, maxDocs=44421)
                0.021234797 = queryNorm
              1.3129983 = fieldWeight in 1831, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                7.9402676 = idf(docFreq=42, maxDocs=44421)
                0.0625 = fieldNorm(doc=1831)
          0.038122114 = weight(abstract_txt:same in 1831) [ClassicSimilarity], result of:
            0.038122114 = score(doc=1831,freq=1.0), product of:
              0.1286261 = queryWeight, product of:
                1.2773592 = boost
                4.7420692 = idf(docFreq=1052, maxDocs=44421)
                0.021234797 = queryNorm
              0.29637933 = fieldWeight in 1831, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7420692 = idf(docFreq=1052, maxDocs=44421)
                0.0625 = fieldNorm(doc=1831)
          0.014311059 = weight(abstract_txt:information in 1831) [ClassicSimilarity], result of:
            0.014311059 = score(doc=1831,freq=2.0), product of:
              0.066935875 = queryWeight, product of:
                1.3031458 = boost
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.021234797 = queryNorm
              0.21380253 = fieldWeight in 1831, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.0625 = fieldNorm(doc=1831)
          0.27439147 = weight(abstract_txt:japanese in 1831) [ClassicSimilarity], result of:
            0.27439147 = score(doc=1831,freq=3.0), product of:
              0.33246893 = queryWeight, product of:
                2.0536387 = boost
                7.62393 = idf(docFreq=58, maxDocs=44421)
                0.021234797 = queryNorm
              0.82531464 = fieldWeight in 1831, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.62393 = idf(docFreq=58, maxDocs=44421)
                0.0625 = fieldNorm(doc=1831)
        0.16 = coord(4/25)