Document (#43496)

Author
Roy, D.
Bhatia, S.
Jain, P.
Title
Information asymmetry in Wikipedia across different languages : a statistical analysis
Source
Journal of the Association for Information Science and Technology. 73(2022) no.3, S.347-361
Year
2022
Abstract
Wikipedia is the largest web-based open encyclopedia covering more than 300 languages. Different language editions of Wikipedia differ significantly in terms of their information coverage. In this article, we compare the information coverage in English Wikipedia (most exhaustive) and Wikipedias in 8 other widely spoken languages, namely Arabic, German, Hindi, Korean, Portuguese, Russian, Spanish, and Turkish. We analyze variations in different language editions of Wikipedia in terms of the number of topics covered as well as the amount of information discussed about different topics. Further, as a step towards bridging the information gap, we present WikiCompare-a browser plugin that allows Wikipedia readers to have a comprehensive overview of topics by incorporating missing information from Wikipedia page in other language.
Content
Vgl.: https://asistdl.onlinelibrary.wiley.com/doi/10.1002/asi.24553.
Theme
Multilinguale Probleme
Object
Wikipedia

Similar documents (author)

  1. Jain, H.C.: Colon Classification : a review article (1964) 5.87
    5.874302 = sum of:
      5.874302 = weight(author_txt:jain in 1951) [ClassicSimilarity], result of:
        5.874302 = fieldWeight in 1951, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.398883 = idf(docFreq=9, maxDocs=44421)
          0.625 = fieldNorm(doc=1951)
    
  2. Jain, A.K.: Image data compression : a review (1981) 5.87
    5.874302 = sum of:
      5.874302 = weight(author_txt:jain in 310) [ClassicSimilarity], result of:
        5.874302 = fieldWeight in 310, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.398883 = idf(docFreq=9, maxDocs=44421)
          0.625 = fieldNorm(doc=310)
    
  3. Jain, R.: Visual information retrieval in digital libraries (1997) 5.87
    5.874302 = sum of:
      5.874302 = weight(author_txt:jain in 760) [ClassicSimilarity], result of:
        5.874302 = fieldWeight in 760, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.398883 = idf(docFreq=9, maxDocs=44421)
          0.625 = fieldNorm(doc=760)
    
  4. Jain, P.: ¬An empirical study of knowledge management in academic libraries in East and Southern Africa (2007) 5.87
    5.874302 = sum of:
      5.874302 = weight(author_txt:jain in 1864) [ClassicSimilarity], result of:
        5.874302 = fieldWeight in 1864, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.398883 = idf(docFreq=9, maxDocs=44421)
          0.625 = fieldNorm(doc=1864)
    
  5. Das, A.; Jain, A.: Indexing the World Wide Web : the journey so far (2012) 4.70
    4.6994414 = sum of:
      4.6994414 = weight(author_txt:jain in 1095) [ClassicSimilarity], result of:
        4.6994414 = fieldWeight in 1095, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.398883 = idf(docFreq=9, maxDocs=44421)
          0.5 = fieldNorm(doc=1095)
    

Similar documents (content)

  1. Callahan, E.S.; Herring, S.C.: Cultural bias in Wikipedia content on famous persons (2011) 0.23
    0.23252062 = sum of:
      0.23252062 = product of:
        0.96883595 = sum of:
          0.058831535 = weight(abstract_txt:encyclopedia in 764) [ClassicSimilarity], result of:
            0.058831535 = score(doc=764,freq=1.0), product of:
              0.10930654 = queryWeight, product of:
                6.889283 = idf(docFreq=122, maxDocs=44421)
                0.01586617 = queryNorm
              0.53822523 = fieldWeight in 764, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.889283 = idf(docFreq=122, maxDocs=44421)
                0.078125 = fieldNorm(doc=764)
          0.05539123 = weight(abstract_txt:language in 764) [ClassicSimilarity], result of:
            0.05539123 = score(doc=764,freq=2.0), product of:
              0.12019796 = queryWeight, product of:
                1.8162938 = boost
                4.1709876 = idf(docFreq=1863, maxDocs=44421)
                0.01586617 = queryNorm
              0.46083337 = fieldWeight in 764, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.1709876 = idf(docFreq=1863, maxDocs=44421)
                0.078125 = fieldNorm(doc=764)
          0.12751834 = weight(abstract_txt:editions in 764) [ClassicSimilarity], result of:
            0.12751834 = score(doc=764,freq=1.0), product of:
              0.23065583 = queryWeight, product of:
                2.0543487 = boost
                7.0764947 = idf(docFreq=101, maxDocs=44421)
                0.01586617 = queryNorm
              0.55285114 = fieldWeight in 764, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.0764947 = idf(docFreq=101, maxDocs=44421)
                0.078125 = fieldNorm(doc=764)
          0.035277374 = weight(abstract_txt:different in 764) [ClassicSimilarity], result of:
            0.035277374 = score(doc=764,freq=1.0), product of:
              0.123383425 = queryWeight, product of:
                2.1248846 = boost
                3.6597328 = idf(docFreq=3107, maxDocs=44421)
                0.01586617 = queryNorm
              0.28591663 = fieldWeight in 764, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6597328 = idf(docFreq=3107, maxDocs=44421)
                0.078125 = fieldNorm(doc=764)
          0.07544693 = weight(abstract_txt:languages in 764) [ClassicSimilarity], result of:
            0.07544693 = score(doc=764,freq=1.0), product of:
              0.18608333 = queryWeight, product of:
                2.2599108 = boost
                5.189722 = idf(docFreq=672, maxDocs=44421)
                0.01586617 = queryNorm
              0.40544704 = fieldWeight in 764, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.189722 = idf(docFreq=672, maxDocs=44421)
                0.078125 = fieldNorm(doc=764)
          0.61637056 = weight(abstract_txt:wikipedia in 764) [ClassicSimilarity], result of:
            0.61637056 = score(doc=764,freq=4.0), product of:
              0.6306861 = queryWeight, product of:
                6.3552494 = boost
                6.25473 = idf(docFreq=231, maxDocs=44421)
                0.01586617 = queryNorm
              0.9773016 = fieldWeight in 764, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.25473 = idf(docFreq=231, maxDocs=44421)
                0.078125 = fieldNorm(doc=764)
        0.24 = coord(6/25)
    
  2. Hara, N.; Shachaf, P.; Hew, K.F.: Cross-cultural analysis of the Wikipedia community (2010) 0.22
    0.2155507 = sum of:
      0.2155507 = product of:
        1.0777535 = sum of:
          0.017735498 = weight(abstract_txt:other in 1) [ClassicSimilarity], result of:
            0.017735498 = score(doc=1,freq=2.0), product of:
              0.057026267 = queryWeight, product of:
                1.0214789 = boost
                3.5186288 = idf(docFreq=3578, maxDocs=44421)
                0.01586617 = queryNorm
              0.31100577 = fieldWeight in 1, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5186288 = idf(docFreq=3578, maxDocs=44421)
                0.0625 = fieldNorm(doc=1)
          0.17476144 = weight(abstract_txt:wikipedias in 1) [ClassicSimilarity], result of:
            0.17476144 = score(doc=1,freq=2.0), product of:
              0.20803341 = queryWeight, product of:
                1.3795692 = boost
                9.504243 = idf(docFreq=8, maxDocs=44421)
                0.01586617 = queryNorm
              0.8400643 = fieldWeight in 1, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.504243 = idf(docFreq=8, maxDocs=44421)
                0.0625 = fieldNorm(doc=1)
          0.028221898 = weight(abstract_txt:different in 1) [ClassicSimilarity], result of:
            0.028221898 = score(doc=1,freq=1.0), product of:
              0.123383425 = queryWeight, product of:
                2.1248846 = boost
                3.6597328 = idf(docFreq=3107, maxDocs=44421)
                0.01586617 = queryNorm
              0.2287333 = fieldWeight in 1, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6597328 = idf(docFreq=3107, maxDocs=44421)
                0.0625 = fieldNorm(doc=1)
          0.15969107 = weight(abstract_txt:languages in 1) [ClassicSimilarity], result of:
            0.15969107 = score(doc=1,freq=7.0), product of:
              0.18608333 = queryWeight, product of:
                2.2599108 = boost
                5.189722 = idf(docFreq=672, maxDocs=44421)
                0.01586617 = queryNorm
              0.8581696 = fieldWeight in 1, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                5.189722 = idf(docFreq=672, maxDocs=44421)
                0.0625 = fieldNorm(doc=1)
          0.69734365 = weight(abstract_txt:wikipedia in 1) [ClassicSimilarity], result of:
            0.69734365 = score(doc=1,freq=8.0), product of:
              0.6306861 = queryWeight, product of:
                6.3552494 = boost
                6.25473 = idf(docFreq=231, maxDocs=44421)
                0.01586617 = queryNorm
              1.1056905 = fieldWeight in 1, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                6.25473 = idf(docFreq=231, maxDocs=44421)
                0.0625 = fieldNorm(doc=1)
        0.2 = coord(5/25)
    
  3. Zhao, D.; Strotmann, A.: Mapping knowledge domains on Wikipedia : an author bibliographic coupling analysis of traditional Chinese medicine (2022) 0.20
    0.19786236 = sum of:
      0.19786236 = product of:
        0.82442653 = sum of:
          0.009405669 = weight(abstract_txt:other in 1609) [ClassicSimilarity], result of:
            0.009405669 = score(doc=1609,freq=1.0), product of:
              0.057026267 = queryWeight, product of:
                1.0214789 = boost
                3.5186288 = idf(docFreq=3578, maxDocs=44421)
                0.01586617 = queryNorm
              0.16493572 = fieldWeight in 1609, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.5186288 = idf(docFreq=3578, maxDocs=44421)
                0.046875 = fieldNorm(doc=1609)
          0.04070016 = weight(abstract_txt:missing in 1609) [ClassicSimilarity], result of:
            0.04070016 = score(doc=1609,freq=1.0), product of:
              0.12019025 = queryWeight, product of:
                1.0486041 = boost
                7.2241306 = idf(docFreq=87, maxDocs=44421)
                0.01586617 = queryNorm
              0.33863112 = fieldWeight in 1609, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.2241306 = idf(docFreq=87, maxDocs=44421)
                0.046875 = fieldNorm(doc=1609)
          0.012964599 = weight(abstract_txt:information in 1609) [ClassicSimilarity], result of:
            0.012964599 = score(doc=1609,freq=2.0), product of:
              0.08085092 = queryWeight, product of:
                2.106663 = boost
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.01586617 = queryNorm
              0.1603519 = fieldWeight in 1609, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.046875 = fieldNorm(doc=1609)
          0.021166423 = weight(abstract_txt:different in 1609) [ClassicSimilarity], result of:
            0.021166423 = score(doc=1609,freq=1.0), product of:
              0.123383425 = queryWeight, product of:
                2.1248846 = boost
                3.6597328 = idf(docFreq=3107, maxDocs=44421)
                0.01586617 = queryNorm
              0.17154998 = fieldWeight in 1609, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6597328 = idf(docFreq=3107, maxDocs=44421)
                0.046875 = fieldNorm(doc=1609)
          0.073483 = weight(abstract_txt:topics in 1609) [ClassicSimilarity], result of:
            0.073483 = score(doc=1609,freq=3.0), product of:
              0.17820904 = queryWeight, product of:
                2.2115788 = boost
                5.078731 = idf(docFreq=751, maxDocs=44421)
                0.01586617 = queryNorm
              0.4123416 = fieldWeight in 1609, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.078731 = idf(docFreq=751, maxDocs=44421)
                0.046875 = fieldNorm(doc=1609)
          0.6667067 = weight(abstract_txt:wikipedia in 1609) [ClassicSimilarity], result of:
            0.6667067 = score(doc=1609,freq=13.0), product of:
              0.6306861 = queryWeight, product of:
                6.3552494 = boost
                6.25473 = idf(docFreq=231, maxDocs=44421)
                0.01586617 = queryNorm
              1.0571133 = fieldWeight in 1609, product of:
                3.6055512 = tf(freq=13.0), with freq of:
                  13.0 = termFreq=13.0
                6.25473 = idf(docFreq=231, maxDocs=44421)
                0.046875 = fieldNorm(doc=1609)
        0.24 = coord(6/25)
    
  4. Fallis, D.: Toward an epistemology of Wikipedia (2008) 0.18
    0.17547569 = sum of:
      0.17547569 = product of:
        0.8773784 = sum of:
          0.04118208 = weight(abstract_txt:encyclopedia in 3010) [ClassicSimilarity], result of:
            0.04118208 = score(doc=3010,freq=1.0), product of:
              0.10930654 = queryWeight, product of:
                6.889283 = idf(docFreq=122, maxDocs=44421)
                0.01586617 = queryNorm
              0.37675768 = fieldWeight in 3010, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.889283 = idf(docFreq=122, maxDocs=44421)
                0.0546875 = fieldNorm(doc=3010)
          0.01551856 = weight(abstract_txt:other in 3010) [ClassicSimilarity], result of:
            0.01551856 = score(doc=3010,freq=2.0), product of:
              0.057026267 = queryWeight, product of:
                1.0214789 = boost
                3.5186288 = idf(docFreq=3578, maxDocs=44421)
                0.01586617 = queryNorm
              0.27213004 = fieldWeight in 3010, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5186288 = idf(docFreq=3578, maxDocs=44421)
                0.0546875 = fieldNorm(doc=3010)
          0.016655467 = weight(abstract_txt:terms in 3010) [ClassicSimilarity], result of:
            0.016655467 = score(doc=3010,freq=1.0), product of:
              0.07531622 = queryWeight, product of:
                1.1739137 = boost
                4.043712 = idf(docFreq=2116, maxDocs=44421)
                0.01586617 = queryNorm
              0.2211405 = fieldWeight in 3010, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.043712 = idf(docFreq=2116, maxDocs=44421)
                0.0546875 = fieldNorm(doc=3010)
          0.026197901 = weight(abstract_txt:information in 3010) [ClassicSimilarity], result of:
            0.026197901 = score(doc=3010,freq=6.0), product of:
              0.08085092 = queryWeight, product of:
                2.106663 = boost
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.01586617 = queryNorm
              0.32402724 = fieldWeight in 3010, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.0546875 = fieldNorm(doc=3010)
          0.7778244 = weight(abstract_txt:wikipedia in 3010) [ClassicSimilarity], result of:
            0.7778244 = score(doc=3010,freq=13.0), product of:
              0.6306861 = queryWeight, product of:
                6.3552494 = boost
                6.25473 = idf(docFreq=231, maxDocs=44421)
                0.01586617 = queryNorm
              1.2332988 = fieldWeight in 3010, product of:
                3.6055512 = tf(freq=13.0), with freq of:
                  13.0 = termFreq=13.0
                6.25473 = idf(docFreq=231, maxDocs=44421)
                0.0546875 = fieldNorm(doc=3010)
        0.2 = coord(5/25)
    
  5. Ménard, E.; Khashman, N.; Kochkina, S.; Torres-Moreno, J.-M.; Velazquez-Morales, P.; Zhou, F.; Jourlin, P.; Rawat, P.; Peinl, P.; Linhares Pontes, E.; Brunetti., I.: ¬A second life for TIIARA : from bilingual to multilingual! (2016) 0.16
    0.15699363 = sum of:
      0.15699363 = product of:
        0.56069154 = sum of:
          0.0486398 = weight(abstract_txt:spanish in 3834) [ClassicSimilarity], result of:
            0.0486398 = score(doc=3834,freq=1.0), product of:
              0.111731045 = queryWeight, product of:
                1.0110296 = boost
                6.965269 = idf(docFreq=113, maxDocs=44421)
                0.01586617 = queryNorm
              0.43532932 = fieldWeight in 3834, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.965269 = idf(docFreq=113, maxDocs=44421)
                0.0625 = fieldNorm(doc=3834)
          0.06254769 = weight(abstract_txt:arabic in 3834) [ClassicSimilarity], result of:
            0.06254769 = score(doc=3834,freq=1.0), product of:
              0.13212557 = queryWeight, product of:
                1.099437 = boost
                7.574333 = idf(docFreq=61, maxDocs=44421)
                0.01586617 = queryNorm
              0.47339582 = fieldWeight in 3834, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.574333 = idf(docFreq=61, maxDocs=44421)
                0.0625 = fieldNorm(doc=3834)
          0.07023767 = weight(abstract_txt:russian in 3834) [ClassicSimilarity], result of:
            0.07023767 = score(doc=3834,freq=1.0), product of:
              0.14274451 = queryWeight, product of:
                1.1427642 = boost
                7.872826 = idf(docFreq=45, maxDocs=44421)
                0.01586617 = queryNorm
              0.49205163 = fieldWeight in 3834, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.872826 = idf(docFreq=45, maxDocs=44421)
                0.0625 = fieldNorm(doc=3834)
          0.08785445 = weight(abstract_txt:portuguese in 3834) [ClassicSimilarity], result of:
            0.08785445 = score(doc=3834,freq=1.0), product of:
              0.16571248 = queryWeight, product of:
                1.2312735 = boost
                8.482592 = idf(docFreq=24, maxDocs=44421)
                0.01586617 = queryNorm
              0.530162 = fieldWeight in 3834, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.482592 = idf(docFreq=24, maxDocs=44421)
                0.0625 = fieldNorm(doc=3834)
          0.12822647 = weight(abstract_txt:hindi in 3834) [ClassicSimilarity], result of:
            0.12822647 = score(doc=3834,freq=1.0), product of:
              0.21322156 = queryWeight, product of:
                1.3966658 = boost
                9.622026 = idf(docFreq=7, maxDocs=44421)
                0.01586617 = queryNorm
              0.60137665 = fieldWeight in 3834, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.622026 = idf(docFreq=7, maxDocs=44421)
                0.0625 = fieldNorm(doc=3834)
          0.028221898 = weight(abstract_txt:different in 3834) [ClassicSimilarity], result of:
            0.028221898 = score(doc=3834,freq=1.0), product of:
              0.123383425 = queryWeight, product of:
                2.1248846 = boost
                3.6597328 = idf(docFreq=3107, maxDocs=44421)
                0.01586617 = queryNorm
              0.2287333 = fieldWeight in 3834, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6597328 = idf(docFreq=3107, maxDocs=44421)
                0.0625 = fieldNorm(doc=3834)
          0.13496359 = weight(abstract_txt:languages in 3834) [ClassicSimilarity], result of:
            0.13496359 = score(doc=3834,freq=5.0), product of:
              0.18608333 = queryWeight, product of:
                2.2599108 = boost
                5.189722 = idf(docFreq=672, maxDocs=44421)
                0.01586617 = queryNorm
              0.7252857 = fieldWeight in 3834, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.189722 = idf(docFreq=672, maxDocs=44421)
                0.0625 = fieldNorm(doc=3834)
        0.28 = coord(7/25)