Document (#38888)

Author
Pope, J.T.
Holley, R.P.
Title
Google Book Search and metadata
Source
Cataloging and classification quarterly. 49(2011) no.1, S.1-13
Year
2011
Abstract
This article summarizes published documents on metadata provided by Google for books scanned as part of the Google Book Search (GBS) project and provides suggestions for improvement. The faulty, misleading, and confusing metadata in current Google records can pose potentially serious problems for users of GBS. Google admits that it took data, which proved to be inaccurate, from many sources and is attempting to correct errors. Some argue that metadata is not needed with keyword searching; but optical character recognition (OCR) errors, synonym control, and materials in foreign languages make reliable metadata a requirement for academic researchers. The authors recommend that users should be able to submit error reports to Google to correct faulty metadata.
Theme
Formalerschließung
Metadaten
Object
Google Book Search

Similar documents (author)

  1. Holley, R.P.: Report from the section on classification and indexing : 1988-89 (1989) 5.35
    5.353733 = sum of:
      5.353733 = weight(author_txt:holley in 424) [ClassicSimilarity], result of:
        5.353733 = fieldWeight in 424, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.565973 = idf(docFreq=22, maxDocs=44421)
          0.625 = fieldNorm(doc=424)
    
  2. Holley, R.P.: Subject access in the online catalog (1989) 5.35
    5.353733 = sum of:
      5.353733 = weight(author_txt:holley in 442) [ClassicSimilarity], result of:
        5.353733 = fieldWeight in 442, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.565973 = idf(docFreq=22, maxDocs=44421)
          0.625 = fieldNorm(doc=442)
    
  3. Holley, R.P.: Entwicklung und Fortschritt bei Klassifikation und Indexierung (1987) 5.35
    5.353733 = sum of:
      5.353733 = weight(author_txt:holley in 928) [ClassicSimilarity], result of:
        5.353733 = fieldWeight in 928, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.565973 = idf(docFreq=22, maxDocs=44421)
          0.625 = fieldNorm(doc=928)
    
  4. Holley, E.G.: ¬The trend to LC : thoughts on changing library classification schemes (1967) 5.35
    5.353733 = sum of:
      5.353733 = weight(author_txt:holley in 1712) [ClassicSimilarity], result of:
        5.353733 = fieldWeight in 1712, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.565973 = idf(docFreq=22, maxDocs=44421)
          0.625 = fieldNorm(doc=1712)
    
  5. Holley, R.P.: Classification in the USA (1985) 5.35
    5.353733 = sum of:
      5.353733 = weight(author_txt:holley in 1729) [ClassicSimilarity], result of:
        5.353733 = fieldWeight in 1729, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.565973 = idf(docFreq=22, maxDocs=44421)
          0.625 = fieldNorm(doc=1729)
    

Similar documents (content)

  1. Dawson, A.; Hamilton, V.: Optimising metadata to make high-value content more accessible to Google users (2006) 0.11
    0.10812544 = sum of:
      0.10812544 = product of:
        0.5406272 = sum of:
          0.0106972875 = weight(abstract_txt:that in 598) [ClassicSimilarity], result of:
            0.0106972875 = score(doc=598,freq=3.0), product of:
              0.041784365 = queryWeight, product of:
                1.0506608 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.01681636 = queryNorm
              0.25601172 = fieldWeight in 598, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.0625 = fieldNorm(doc=598)
          0.019984607 = weight(abstract_txt:users in 598) [ClassicSimilarity], result of:
            0.019984607 = score(doc=598,freq=2.0), product of:
              0.06338139 = queryWeight, product of:
                1.056552 = boost
                3.5672934 = idf(docFreq=3408, maxDocs=44421)
                0.01681636 = queryNorm
              0.31530717 = fieldWeight in 598, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5672934 = idf(docFreq=3408, maxDocs=44421)
                0.0625 = fieldNorm(doc=598)
          0.033975665 = weight(abstract_txt:search in 598) [ClassicSimilarity], result of:
            0.033975665 = score(doc=598,freq=5.0), product of:
              0.0665217 = queryWeight, product of:
                1.0824097 = boost
                3.654598 = idf(docFreq=3123, maxDocs=44421)
                0.01681636 = queryNorm
              0.5107456 = fieldWeight in 598, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.654598 = idf(docFreq=3123, maxDocs=44421)
                0.0625 = fieldNorm(doc=598)
          0.18789297 = weight(abstract_txt:metadata in 598) [ClassicSimilarity], result of:
            0.18789297 = score(doc=598,freq=3.0), product of:
              0.35572556 = queryWeight, product of:
                4.33539 = boost
                4.87927 = idf(docFreq=917, maxDocs=44421)
                0.01681636 = queryNorm
              0.52819645 = fieldWeight in 598, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.87927 = idf(docFreq=917, maxDocs=44421)
                0.0625 = fieldNorm(doc=598)
          0.28807664 = weight(abstract_txt:google in 598) [ClassicSimilarity], result of:
            0.28807664 = score(doc=598,freq=4.0), product of:
              0.42973474 = queryWeight, product of:
                4.7650876 = boost
                5.3628736 = idf(docFreq=565, maxDocs=44421)
                0.01681636 = queryNorm
              0.6703592 = fieldWeight in 598, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.3628736 = idf(docFreq=565, maxDocs=44421)
                0.0625 = fieldNorm(doc=598)
        0.2 = coord(5/25)
    
  2. Dawson, A.: Creating metadata that work for digital libraries and Google (2004) 0.11
    0.10799681 = sum of:
      0.10799681 = product of:
        0.67498004 = sum of:
          0.021196876 = weight(abstract_txt:users in 5762) [ClassicSimilarity], result of:
            0.021196876 = score(doc=5762,freq=1.0), product of:
              0.06338139 = queryWeight, product of:
                1.056552 = boost
                3.5672934 = idf(docFreq=3408, maxDocs=44421)
                0.01681636 = queryNorm
              0.33443376 = fieldWeight in 5762, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.5672934 = idf(docFreq=3408, maxDocs=44421)
                0.09375 = fieldNorm(doc=5762)
          0.022791568 = weight(abstract_txt:search in 5762) [ClassicSimilarity], result of:
            0.022791568 = score(doc=5762,freq=1.0), product of:
              0.0665217 = queryWeight, product of:
                1.0824097 = boost
                3.654598 = idf(docFreq=3123, maxDocs=44421)
                0.01681636 = queryNorm
              0.34261855 = fieldWeight in 5762, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.654598 = idf(docFreq=3123, maxDocs=44421)
                0.09375 = fieldNorm(doc=5762)
          0.3254402 = weight(abstract_txt:metadata in 5762) [ClassicSimilarity], result of:
            0.3254402 = score(doc=5762,freq=4.0), product of:
              0.35572556 = queryWeight, product of:
                4.33539 = boost
                4.87927 = idf(docFreq=917, maxDocs=44421)
                0.01681636 = queryNorm
              0.9148631 = fieldWeight in 5762, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.87927 = idf(docFreq=917, maxDocs=44421)
                0.09375 = fieldNorm(doc=5762)
          0.3055514 = weight(abstract_txt:google in 5762) [ClassicSimilarity], result of:
            0.3055514 = score(doc=5762,freq=2.0), product of:
              0.42973474 = queryWeight, product of:
                4.7650876 = boost
                5.3628736 = idf(docFreq=565, maxDocs=44421)
                0.01681636 = queryNorm
              0.71102333 = fieldWeight in 5762, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.3628736 = idf(docFreq=565, maxDocs=44421)
                0.09375 = fieldNorm(doc=5762)
        0.16 = coord(4/25)
    
  3. Lewandowski, D.: Evaluating the retrieval effectiveness of web search engines using a representative query sample (2015) 0.11
    0.10534735 = sum of:
      0.10534735 = product of:
        0.52673674 = sum of:
          0.013371609 = weight(abstract_txt:that in 3157) [ClassicSimilarity], result of:
            0.013371609 = score(doc=3157,freq=3.0), product of:
              0.041784365 = queryWeight, product of:
                1.0506608 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.01681636 = queryNorm
              0.32001466 = fieldWeight in 3157, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.078125 = fieldNorm(doc=3157)
          0.017664064 = weight(abstract_txt:users in 3157) [ClassicSimilarity], result of:
            0.017664064 = score(doc=3157,freq=1.0), product of:
              0.06338139 = queryWeight, product of:
                1.056552 = boost
                3.5672934 = idf(docFreq=3408, maxDocs=44421)
                0.01681636 = queryNorm
              0.2786948 = fieldWeight in 3157, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.5672934 = idf(docFreq=3408, maxDocs=44421)
                0.078125 = fieldNorm(doc=3157)
          0.037985947 = weight(abstract_txt:search in 3157) [ClassicSimilarity], result of:
            0.037985947 = score(doc=3157,freq=4.0), product of:
              0.0665217 = queryWeight, product of:
                1.0824097 = boost
                3.654598 = idf(docFreq=3123, maxDocs=44421)
                0.01681636 = queryNorm
              0.5710309 = fieldWeight in 3157, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.654598 = idf(docFreq=3123, maxDocs=44421)
                0.078125 = fieldNorm(doc=3157)
          0.20308895 = weight(abstract_txt:correct in 3157) [ClassicSimilarity], result of:
            0.20308895 = score(doc=3157,freq=3.0), product of:
              0.22386445 = queryWeight, product of:
                1.9856495 = boost
                6.704255 = idf(docFreq=147, maxDocs=44421)
                0.01681636 = queryNorm
              0.9071961 = fieldWeight in 3157, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.704255 = idf(docFreq=147, maxDocs=44421)
                0.078125 = fieldNorm(doc=3157)
          0.25462615 = weight(abstract_txt:google in 3157) [ClassicSimilarity], result of:
            0.25462615 = score(doc=3157,freq=2.0), product of:
              0.42973474 = queryWeight, product of:
                4.7650876 = boost
                5.3628736 = idf(docFreq=565, maxDocs=44421)
                0.01681636 = queryNorm
              0.5925194 = fieldWeight in 3157, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.3628736 = idf(docFreq=565, maxDocs=44421)
                0.078125 = fieldNorm(doc=3157)
        0.2 = coord(5/25)
    
  4. Golderman, G.M.; Connolly, B.: Between the book covers : going beyond OPAC keyword searching with the deep linking capabilities of Google Scholar and Google Book Search (2004/05) 0.10
    0.10476088 = sum of:
      0.10476088 = product of:
        0.43650368 = sum of:
          0.017468598 = weight(abstract_txt:that in 1731) [ClassicSimilarity], result of:
            0.017468598 = score(doc=1731,freq=8.0), product of:
              0.041784365 = queryWeight, product of:
                1.0506608 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.01681636 = queryNorm
              0.41806543 = fieldWeight in 1731, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.0625 = fieldNorm(doc=1731)
          0.014131251 = weight(abstract_txt:users in 1731) [ClassicSimilarity], result of:
            0.014131251 = score(doc=1731,freq=1.0), product of:
              0.06338139 = queryWeight, product of:
                1.056552 = boost
                3.5672934 = idf(docFreq=3408, maxDocs=44421)
                0.01681636 = queryNorm
              0.22295584 = fieldWeight in 1731, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.5672934 = idf(docFreq=3408, maxDocs=44421)
                0.0625 = fieldNorm(doc=1731)
          0.015194379 = weight(abstract_txt:search in 1731) [ClassicSimilarity], result of:
            0.015194379 = score(doc=1731,freq=1.0), product of:
              0.0665217 = queryWeight, product of:
                1.0824097 = boost
                3.654598 = idf(docFreq=3123, maxDocs=44421)
                0.01681636 = queryNorm
              0.22841237 = fieldWeight in 1731, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.654598 = idf(docFreq=3123, maxDocs=44421)
                0.0625 = fieldNorm(doc=1731)
          0.06597333 = weight(abstract_txt:attempting in 1731) [ClassicSimilarity], result of:
            0.06597333 = score(doc=1731,freq=1.0), product of:
              0.14052176 = queryWeight, product of:
                1.112415 = boost
                7.5118127 = idf(docFreq=65, maxDocs=44421)
                0.01681636 = queryNorm
              0.4694883 = fieldWeight in 1731, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5118127 = idf(docFreq=65, maxDocs=44421)
                0.0625 = fieldNorm(doc=1731)
          0.035659492 = weight(abstract_txt:book in 1731) [ClassicSimilarity], result of:
            0.035659492 = score(doc=1731,freq=1.0), product of:
              0.11747842 = queryWeight, product of:
                1.4384311 = boost
                4.8566523 = idf(docFreq=938, maxDocs=44421)
                0.01681636 = queryNorm
              0.30354077 = fieldWeight in 1731, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8566523 = idf(docFreq=938, maxDocs=44421)
                0.0625 = fieldNorm(doc=1731)
          0.28807664 = weight(abstract_txt:google in 1731) [ClassicSimilarity], result of:
            0.28807664 = score(doc=1731,freq=4.0), product of:
              0.42973474 = queryWeight, product of:
                4.7650876 = boost
                5.3628736 = idf(docFreq=565, maxDocs=44421)
                0.01681636 = queryNorm
              0.6703592 = fieldWeight in 1731, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.3628736 = idf(docFreq=565, maxDocs=44421)
                0.0625 = fieldNorm(doc=1731)
        0.24 = coord(6/25)
    
  5. Zakaria, M.S.: Measuring typographical errors in online catalogs of academic libraries using Ballard's list : a case study from Egypt (2023) 0.10
    0.10274161 = sum of:
      0.10274161 = product of:
        0.42809004 = sum of:
          0.070974916 = weight(abstract_txt:error in 2186) [ClassicSimilarity], result of:
            0.070974916 = score(doc=2186,freq=2.0), product of:
              0.11710028 = queryWeight, product of:
                1.0154861 = boost
                6.8572807 = idf(docFreq=126, maxDocs=44421)
                0.01681636 = queryNorm
              0.6061037 = fieldWeight in 2186, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.8572807 = idf(docFreq=126, maxDocs=44421)
                0.0625 = fieldNorm(doc=2186)
          0.051818043 = weight(abstract_txt:serious in 2186) [ClassicSimilarity], result of:
            0.051818043 = score(doc=2186,freq=1.0), product of:
              0.1196241 = queryWeight, product of:
                1.026371 = boost
                6.930783 = idf(docFreq=117, maxDocs=44421)
                0.01681636 = queryNorm
              0.43317392 = fieldWeight in 2186, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.930783 = idf(docFreq=117, maxDocs=44421)
                0.0625 = fieldNorm(doc=2186)
          0.0106972875 = weight(abstract_txt:that in 2186) [ClassicSimilarity], result of:
            0.0106972875 = score(doc=2186,freq=3.0), product of:
              0.041784365 = queryWeight, product of:
                1.0506608 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.01681636 = queryNorm
              0.25601172 = fieldWeight in 2186, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.0625 = fieldNorm(doc=2186)
          0.014131251 = weight(abstract_txt:users in 2186) [ClassicSimilarity], result of:
            0.014131251 = score(doc=2186,freq=1.0), product of:
              0.06338139 = queryWeight, product of:
                1.056552 = boost
                3.5672934 = idf(docFreq=3408, maxDocs=44421)
                0.01681636 = queryNorm
              0.22295584 = fieldWeight in 2186, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.5672934 = idf(docFreq=3408, maxDocs=44421)
                0.0625 = fieldNorm(doc=2186)
          0.06637641 = weight(abstract_txt:pose in 2186) [ClassicSimilarity], result of:
            0.06637641 = score(doc=2186,freq=1.0), product of:
              0.14109355 = queryWeight, product of:
                1.1146759 = boost
                7.5270805 = idf(docFreq=64, maxDocs=44421)
                0.01681636 = queryNorm
              0.47044253 = fieldWeight in 2186, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5270805 = idf(docFreq=64, maxDocs=44421)
                0.0625 = fieldNorm(doc=2186)
          0.21409214 = weight(abstract_txt:errors in 2186) [ClassicSimilarity], result of:
            0.21409214 = score(doc=2186,freq=6.0), product of:
              0.21356237 = queryWeight, product of:
                1.9394224 = boost
                6.548176 = idf(docFreq=172, maxDocs=44421)
                0.01681636 = queryNorm
              1.0024806 = fieldWeight in 2186, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                6.548176 = idf(docFreq=172, maxDocs=44421)
                0.0625 = fieldNorm(doc=2186)
        0.24 = coord(6/25)