Document (#30435)

Author
Arsenault, C.
Title
Word division in the transcription of Chinese script in the title fields of bibliographic Records
Source
Cataloging and classification quarterly. 32(2001) no.3, S.109-137
Year
2001
Abstract
Recently, the Library of Congress adopted the pinyin Romanization system for transcribing Chinese data in its bibliographic records. In its canonical form, pinyin aggregates Chinese "words" into single linguistic units, but pinyin entries could be constructed following either a monosyllabic or a polysyllabic pattern. Although the former is easier and less costly to implement, the latter method is potentially more beneficial for end-users, as it reduces ambiguity, and generates a much larger variety of indexable terms. The current study investigates if following the polysyllabic method improves retrieval efficiency and effectiveness in item-specific searching within online bibliographic databases. Analysis of the results revealed that aggregation of monosyllables does improve efficiency significantly (p < .05), especially during keyword searches, while effectiveness remains mainly unaffected.
Theme
Formalerschließung

Similar documents (author)

  1. Arsenault, C.: Testing the impact of syllable aggregation in romanized fields of Chinese language bibliographic records (2000) 5.81
    5.814733 = sum of:
      5.814733 = weight(author_txt:arsenault in 1087) [ClassicSimilarity], result of:
        5.814733 = fieldWeight in 1087, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.303573 = idf(docFreq=10, maxDocs=44421)
          0.625 = fieldNorm(doc=1087)
    
  2. Arsenault, C.: Aggregation consistency and frequency of Chinese words and characters (2006) 5.81
    5.814733 = sum of:
      5.814733 = weight(author_txt:arsenault in 734) [ClassicSimilarity], result of:
        5.814733 = fieldWeight in 734, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.303573 = idf(docFreq=10, maxDocs=44421)
          0.625 = fieldNorm(doc=734)
    
  3. Jacobs, C.; Arsenault, C.: Words can't describe it : streamlining PRECIS just for laughs! (1994) 4.65
    4.6517863 = sum of:
      4.6517863 = weight(author_txt:arsenault in 2266) [ClassicSimilarity], result of:
        4.6517863 = fieldWeight in 2266, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.303573 = idf(docFreq=10, maxDocs=44421)
          0.5 = fieldNorm(doc=2266)
    
  4. Arsenault, C.; Leide, J.E.: Format integration and the design of cataloging and classification curricula (2002) 4.65
    4.6517863 = sum of:
      4.6517863 = weight(author_txt:arsenault in 456) [ClassicSimilarity], result of:
        4.6517863 = fieldWeight in 456, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.303573 = idf(docFreq=10, maxDocs=44421)
          0.5 = fieldNorm(doc=456)
    
  5. Arsenault, C.; Ménard, E.: Searching titles with initial articles in library catalogs : a case study and search behavior analysis (2007) 4.65
    4.6517863 = sum of:
      4.6517863 = weight(author_txt:arsenault in 3264) [ClassicSimilarity], result of:
        4.6517863 = fieldWeight in 3264, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.303573 = idf(docFreq=10, maxDocs=44421)
          0.5 = fieldNorm(doc=3264)
    

Similar documents (content)

  1. Arsenault, C.: Testing the impact of syllable aggregation in romanized fields of Chinese language bibliographic records (2000) 0.54
    0.53685546 = sum of:
      0.53685546 = product of:
        1.6776733 = sum of:
          0.07867105 = weight(abstract_txt:script in 1087) [ClassicSimilarity], result of:
            0.07867105 = score(doc=1087,freq=1.0), product of:
              0.15805735 = queryWeight, product of:
                1.2129453 = boost
                7.963798 = idf(docFreq=41, maxDocs=44421)
                0.016362634 = queryNorm
              0.49773738 = fieldWeight in 1087, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.963798 = idf(docFreq=41, maxDocs=44421)
                0.0625 = fieldNorm(doc=1087)
          0.099432416 = weight(abstract_txt:aggregates in 1087) [ClassicSimilarity], result of:
            0.099432416 = score(doc=1087,freq=1.0), product of:
              0.18476656 = queryWeight, product of:
                1.3114314 = boost
                8.610425 = idf(docFreq=21, maxDocs=44421)
                0.016362634 = queryNorm
              0.53815156 = fieldWeight in 1087, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.610425 = idf(docFreq=21, maxDocs=44421)
                0.0625 = fieldNorm(doc=1087)
          0.14534014 = weight(abstract_txt:romanization in 1087) [ClassicSimilarity], result of:
            0.14534014 = score(doc=1087,freq=2.0), product of:
              0.18887962 = queryWeight, product of:
                1.3259479 = boost
                8.705735 = idf(docFreq=19, maxDocs=44421)
                0.016362634 = queryNorm
              0.76948553 = fieldWeight in 1087, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.705735 = idf(docFreq=19, maxDocs=44421)
                0.0625 = fieldNorm(doc=1087)
          0.046725616 = weight(abstract_txt:records in 1087) [ClassicSimilarity], result of:
            0.046725616 = score(doc=1087,freq=3.0), product of:
              0.09756132 = queryWeight, product of:
                1.347683 = boost
                4.42422 = idf(docFreq=1446, maxDocs=44421)
                0.016362634 = queryNorm
              0.47893587 = fieldWeight in 1087, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.42422 = idf(docFreq=1446, maxDocs=44421)
                0.0625 = fieldNorm(doc=1087)
          0.028364567 = weight(abstract_txt:method in 1087) [ClassicSimilarity], result of:
            0.028364567 = score(doc=1087,freq=1.0), product of:
              0.10087855 = queryWeight, product of:
                1.370403 = boost
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.016362634 = queryNorm
              0.2811754 = fieldWeight in 1087, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.0625 = fieldNorm(doc=1087)
          0.05004021 = weight(abstract_txt:bibliographic in 1087) [ClassicSimilarity], result of:
            0.05004021 = score(doc=1087,freq=2.0), product of:
              0.13381802 = queryWeight, product of:
                1.9330888 = boost
                4.230674 = idf(docFreq=1755, maxDocs=44421)
                0.016362634 = queryNorm
              0.37394226 = fieldWeight in 1087, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.230674 = idf(docFreq=1755, maxDocs=44421)
                0.0625 = fieldNorm(doc=1087)
          0.30895555 = weight(abstract_txt:chinese in 1087) [ClassicSimilarity], result of:
            0.30895555 = score(doc=1087,freq=7.0), product of:
              0.2966264 = queryWeight, product of:
                2.8780572 = boost
                6.2987905 = idf(docFreq=221, maxDocs=44421)
                0.016362634 = queryNorm
              1.0415646 = fieldWeight in 1087, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                6.2987905 = idf(docFreq=221, maxDocs=44421)
                0.0625 = fieldNorm(doc=1087)
          0.9201437 = weight(abstract_txt:pinyin in 1087) [ClassicSimilarity], result of:
            0.9201437 = score(doc=1087,freq=7.0), product of:
              0.6140206 = queryWeight, product of:
                4.1408167 = boost
                9.06241 = idf(docFreq=13, maxDocs=44421)
                0.016362634 = queryNorm
              1.4985552 = fieldWeight in 1087, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                9.06241 = idf(docFreq=13, maxDocs=44421)
                0.0625 = fieldNorm(doc=1087)
        0.32 = coord(8/25)
    
  2. Groom, L.: Converting Wade-Giles cataloging to Pinyin : the development and implementation of a conversion program for the Australian National CJK Service (1997) 0.31
    0.31001672 = sum of:
      0.31001672 = product of:
        1.5500836 = sum of:
          0.07310162 = weight(abstract_txt:division in 1597) [ClassicSimilarity], result of:
            0.07310162 = score(doc=1597,freq=1.0), product of:
              0.114858165 = queryWeight, product of:
                1.0339864 = boost
                6.7888126 = idf(docFreq=135, maxDocs=44421)
                0.016362634 = queryNorm
              0.6364512 = fieldWeight in 1597, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.7888126 = idf(docFreq=135, maxDocs=44421)
                0.09375 = fieldNorm(doc=1597)
          0.21801022 = weight(abstract_txt:romanization in 1597) [ClassicSimilarity], result of:
            0.21801022 = score(doc=1597,freq=2.0), product of:
              0.18887962 = queryWeight, product of:
                1.3259479 = boost
                8.705735 = idf(docFreq=19, maxDocs=44421)
                0.016362634 = queryNorm
              1.1542283 = fieldWeight in 1597, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.705735 = idf(docFreq=19, maxDocs=44421)
                0.09375 = fieldNorm(doc=1597)
          0.04046557 = weight(abstract_txt:records in 1597) [ClassicSimilarity], result of:
            0.04046557 = score(doc=1597,freq=1.0), product of:
              0.09756132 = queryWeight, product of:
                1.347683 = boost
                4.42422 = idf(docFreq=1446, maxDocs=44421)
                0.016362634 = queryNorm
              0.41477063 = fieldWeight in 1597, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.42422 = idf(docFreq=1446, maxDocs=44421)
                0.09375 = fieldNorm(doc=1597)
          0.17516133 = weight(abstract_txt:chinese in 1597) [ClassicSimilarity], result of:
            0.17516133 = score(doc=1597,freq=1.0), product of:
              0.2966264 = queryWeight, product of:
                2.8780572 = boost
                6.2987905 = idf(docFreq=221, maxDocs=44421)
                0.016362634 = queryNorm
              0.5905116 = fieldWeight in 1597, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2987905 = idf(docFreq=221, maxDocs=44421)
                0.09375 = fieldNorm(doc=1597)
          1.043345 = weight(abstract_txt:pinyin in 1597) [ClassicSimilarity], result of:
            1.043345 = score(doc=1597,freq=4.0), product of:
              0.6140206 = queryWeight, product of:
                4.1408167 = boost
                9.06241 = idf(docFreq=13, maxDocs=44421)
                0.016362634 = queryNorm
              1.699202 = fieldWeight in 1597, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                9.06241 = idf(docFreq=13, maxDocs=44421)
                0.09375 = fieldNorm(doc=1597)
        0.2 = coord(5/25)
    
  3. LC to convert to Pinyin for romanization of Chinese (1997) 0.24
    0.23772155 = sum of:
      0.23772155 = product of:
        1.4857597 = sum of:
          0.2569275 = weight(abstract_txt:romanization in 2095) [ClassicSimilarity], result of:
            0.2569275 = score(doc=2095,freq=1.0), product of:
              0.18887962 = queryWeight, product of:
                1.3259479 = boost
                8.705735 = idf(docFreq=19, maxDocs=44421)
                0.016362634 = queryNorm
              1.3602711 = fieldWeight in 2095, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.705735 = idf(docFreq=19, maxDocs=44421)
                0.15625 = fieldNorm(doc=2095)
          0.06744262 = weight(abstract_txt:records in 2095) [ClassicSimilarity], result of:
            0.06744262 = score(doc=2095,freq=1.0), product of:
              0.09756132 = queryWeight, product of:
                1.347683 = boost
                4.42422 = idf(docFreq=1446, maxDocs=44421)
                0.016362634 = queryNorm
              0.6912844 = fieldWeight in 2095, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.42422 = idf(docFreq=1446, maxDocs=44421)
                0.15625 = fieldNorm(doc=2095)
          0.29193553 = weight(abstract_txt:chinese in 2095) [ClassicSimilarity], result of:
            0.29193553 = score(doc=2095,freq=1.0), product of:
              0.2966264 = queryWeight, product of:
                2.8780572 = boost
                6.2987905 = idf(docFreq=221, maxDocs=44421)
                0.016362634 = queryNorm
              0.984186 = fieldWeight in 2095, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2987905 = idf(docFreq=221, maxDocs=44421)
                0.15625 = fieldNorm(doc=2095)
          0.8694541 = weight(abstract_txt:pinyin in 2095) [ClassicSimilarity], result of:
            0.8694541 = score(doc=2095,freq=1.0), product of:
              0.6140206 = queryWeight, product of:
                4.1408167 = boost
                9.06241 = idf(docFreq=13, maxDocs=44421)
                0.016362634 = queryNorm
              1.4160016 = fieldWeight in 2095, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.06241 = idf(docFreq=13, maxDocs=44421)
                0.15625 = fieldNorm(doc=2095)
        0.16 = coord(4/25)
    
  4. Li, Y.: Consistency versus inconsistency : issues in Chinese cataloging in OCLC (2004) 0.20
    0.2016482 = sum of:
      0.2016482 = product of:
        1.2603014 = sum of:
          0.18167517 = weight(abstract_txt:romanization in 657) [ClassicSimilarity], result of:
            0.18167517 = score(doc=657,freq=2.0), product of:
              0.18887962 = queryWeight, product of:
                1.3259479 = boost
                8.705735 = idf(docFreq=19, maxDocs=44421)
                0.016362634 = queryNorm
              0.9618569 = fieldWeight in 657, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.705735 = idf(docFreq=19, maxDocs=44421)
                0.078125 = fieldNorm(doc=657)
          0.03372131 = weight(abstract_txt:records in 657) [ClassicSimilarity], result of:
            0.03372131 = score(doc=657,freq=1.0), product of:
              0.09756132 = queryWeight, product of:
                1.347683 = boost
                4.42422 = idf(docFreq=1446, maxDocs=44421)
                0.016362634 = queryNorm
              0.3456422 = fieldWeight in 657, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.42422 = idf(docFreq=1446, maxDocs=44421)
                0.078125 = fieldNorm(doc=657)
          0.29193553 = weight(abstract_txt:chinese in 657) [ClassicSimilarity], result of:
            0.29193553 = score(doc=657,freq=4.0), product of:
              0.2966264 = queryWeight, product of:
                2.8780572 = boost
                6.2987905 = idf(docFreq=221, maxDocs=44421)
                0.016362634 = queryNorm
              0.984186 = fieldWeight in 657, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.2987905 = idf(docFreq=221, maxDocs=44421)
                0.078125 = fieldNorm(doc=657)
          0.7529693 = weight(abstract_txt:pinyin in 657) [ClassicSimilarity], result of:
            0.7529693 = score(doc=657,freq=3.0), product of:
              0.6140206 = queryWeight, product of:
                4.1408167 = boost
                9.06241 = idf(docFreq=13, maxDocs=44421)
                0.016362634 = queryNorm
              1.2262933 = fieldWeight in 657, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                9.06241 = idf(docFreq=13, maxDocs=44421)
                0.078125 = fieldNorm(doc=657)
        0.16 = coord(4/25)
    
  5. Studwell, W.E.; Wang, R.; Wu, H.: ¬A tale of two decades : the controversy over the choice of a Chinese language romanization system in American cataloging practice (1993) 0.20
    0.19726098 = sum of:
      0.19726098 = product of:
        1.6438415 = sum of:
          0.205542 = weight(abstract_txt:romanization in 7953) [ClassicSimilarity], result of:
            0.205542 = score(doc=7953,freq=1.0), product of:
              0.18887962 = queryWeight, product of:
                1.3259479 = boost
                8.705735 = idf(docFreq=19, maxDocs=44421)
                0.016362634 = queryNorm
              1.0882169 = fieldWeight in 7953, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.705735 = idf(docFreq=19, maxDocs=44421)
                0.125 = fieldNorm(doc=7953)
          0.23354843 = weight(abstract_txt:chinese in 7953) [ClassicSimilarity], result of:
            0.23354843 = score(doc=7953,freq=1.0), product of:
              0.2966264 = queryWeight, product of:
                2.8780572 = boost
                6.2987905 = idf(docFreq=221, maxDocs=44421)
                0.016362634 = queryNorm
              0.7873488 = fieldWeight in 7953, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2987905 = idf(docFreq=221, maxDocs=44421)
                0.125 = fieldNorm(doc=7953)
          1.204751 = weight(abstract_txt:pinyin in 7953) [ClassicSimilarity], result of:
            1.204751 = score(doc=7953,freq=3.0), product of:
              0.6140206 = queryWeight, product of:
                4.1408167 = boost
                9.06241 = idf(docFreq=13, maxDocs=44421)
                0.016362634 = queryNorm
              1.9620694 = fieldWeight in 7953, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                9.06241 = idf(docFreq=13, maxDocs=44421)
                0.125 = fieldNorm(doc=7953)
        0.12 = coord(3/25)