Document (#34139)

How (much) to trust Wikipedia
Wikipedia is a collaborative encyclopedia: anyone can contribute to its articles simply by clicking an "edit" button. The open nature of Wikipedia has been key to its success, but it has a flip side: if anyone can edit, how can readers know whether to trust its content? To help answer this question, we have developed a reputation system for Wikipedia authors and a trust system for Wikipedia text. Authors gain reputation when their contributions are long-lived, and they lose reputation when their contributions are undone in short order. Each word in Wikipedia is assigned a trust value that depends on the reputation of its author, as well as on the reputation of the authors who subsequently revised the text where the word appears. To validate our algorithms, we show that reputation and trust have good predictive value: higher-reputation authors are more likely to make lasting contributions, and higher-trust text is less likely to be edited. Trust can be visualized via an intuitive coloring of the text background. The coloring provides an effective way of spotting attempts to tamper with Wikipedia information. A trust-colored version of the entire English Wikipedia can be browsed at
Video at: ..\videos\How (Much) to Trust Wikipedia.flv.

Similar documents (content)

  1. Fallis, D.: Toward an epistemology of Wikipedia (2008) 0.11
    0.11412355 = sum of:
      0.11412355 = product of:
        0.7132722 = sum of:
          0.04707309 = weight(abstract_txt:likely in 3010) [ClassicSimilarity], result of:
            0.04707309 = score(doc=3010,freq=3.0), product of:
              0.08759851 = queryWeight, product of:
                1.5730826 = boost
                5.673189 = idf(docFreq=414, maxDocs=44421)
                0.009815625 = queryNorm
              0.5373732 = fieldWeight in 3010, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.673189 = idf(docFreq=414, maxDocs=44421)
                0.0546875 = fieldNorm(doc=3010)
          0.05777783 = weight(abstract_txt:anyone in 3010) [ClassicSimilarity], result of:
            0.05777783 = score(doc=3010,freq=1.0), product of:
              0.14483143 = queryWeight, product of:
                2.0227144 = boost
                7.2947483 = idf(docFreq=81, maxDocs=44421)
                0.009815625 = queryNorm
              0.39893156 = fieldWeight in 3010, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.2947483 = idf(docFreq=81, maxDocs=44421)
                0.0546875 = fieldNorm(doc=3010)
          0.08314514 = weight(abstract_txt:edit in 3010) [ClassicSimilarity], result of:
            0.08314514 = score(doc=3010,freq=1.0), product of:
              0.18460634 = queryWeight, product of:
                2.2836337 = boost
                8.235732 = idf(docFreq=31, maxDocs=44421)
                0.009815625 = queryNorm
              0.4503916 = fieldWeight in 3010, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.235732 = idf(docFreq=31, maxDocs=44421)
                0.0546875 = fieldNorm(doc=3010)
          0.5252761 = weight(abstract_txt:wikipedia in 3010) [ClassicSimilarity], result of:
            0.5252761 = score(doc=3010,freq=13.0), product of:
              0.4259115 = queryWeight, product of:
                6.9373374 = boost
                6.25473 = idf(docFreq=231, maxDocs=44421)
                0.009815625 = queryNorm
              1.2332988 = fieldWeight in 3010, product of:
                3.6055512 = tf(freq=13.0), with freq of:
                  13.0 = termFreq=13.0
                6.25473 = idf(docFreq=231, maxDocs=44421)
                0.0546875 = fieldNorm(doc=3010)
        0.16 = coord(4/25)
  2. Lucassen, T.; Schraagen, J.M.: Factual accuracy and trust in information : the role of expertise (2011) 0.10
    0.09625361 = sum of:
      0.09625361 = product of:
        0.8021134 = sum of:
          0.034058414 = weight(abstract_txt:validate in 475) [ClassicSimilarity], result of:
            0.034058414 = score(doc=475,freq=1.0), product of:
              0.07393221 = queryWeight, product of:
                1.021892 = boost
                7.370734 = idf(docFreq=75, maxDocs=44421)
                0.009815625 = queryNorm
              0.4606709 = fieldWeight in 475, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.370734 = idf(docFreq=75, maxDocs=44421)
                0.0625 = fieldNorm(doc=475)
          0.16649759 = weight(abstract_txt:wikipedia in 475) [ClassicSimilarity], result of:
            0.16649759 = score(doc=475,freq=1.0), product of:
              0.4259115 = queryWeight, product of:
                6.9373374 = boost
                6.25473 = idf(docFreq=231, maxDocs=44421)
                0.009815625 = queryNorm
              0.39092064 = fieldWeight in 475, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.25473 = idf(docFreq=231, maxDocs=44421)
                0.0625 = fieldNorm(doc=475)
          0.6015574 = weight(abstract_txt:trust in 475) [ClassicSimilarity], result of:
            0.6015574 = score(doc=475,freq=7.0), product of:
              0.5242431 = queryWeight, product of:
                7.6966105 = boost
                6.939294 = idf(docFreq=116, maxDocs=44421)
                0.009815625 = queryNorm
              1.1474779 = fieldWeight in 475, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                6.939294 = idf(docFreq=116, maxDocs=44421)
                0.0625 = fieldNorm(doc=475)
        0.12 = coord(3/25)
  3. Teplitskiy, M.; Lu, G.; Duede, E.: Amplifying the impact of open access : Wikipedia and the diffusion of science (2017) 0.10
    0.09588839 = sum of:
      0.09588839 = product of:
        0.5993025 = sum of:
          0.017757967 = weight(abstract_txt:value in 4782) [ClassicSimilarity], result of:
            0.017757967 = score(doc=4782,freq=1.0), product of:
              0.05200154 = queryWeight, product of:
                1.2120241 = boost
                4.3710623 = idf(docFreq=1525, maxDocs=44421)
                0.009815625 = queryNorm
              0.34148926 = fieldWeight in 4782, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3710623 = idf(docFreq=1525, maxDocs=44421)
                0.078125 = fieldNorm(doc=4782)
          0.032926586 = weight(abstract_txt:higher in 4782) [ClassicSimilarity], result of:
            0.032926586 = score(doc=4782,freq=1.0), product of:
              0.07848474 = queryWeight, product of:
                1.4890035 = boost
                5.3699656 = idf(docFreq=561, maxDocs=44421)
                0.009815625 = queryNorm
              0.41952854 = fieldWeight in 4782, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.3699656 = idf(docFreq=561, maxDocs=44421)
                0.078125 = fieldNorm(doc=4782)
          0.03882523 = weight(abstract_txt:likely in 4782) [ClassicSimilarity], result of:
            0.03882523 = score(doc=4782,freq=1.0), product of:
              0.08759851 = queryWeight, product of:
                1.5730826 = boost
                5.673189 = idf(docFreq=414, maxDocs=44421)
                0.009815625 = queryNorm
              0.4432179 = fieldWeight in 4782, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.673189 = idf(docFreq=414, maxDocs=44421)
                0.078125 = fieldNorm(doc=4782)
          0.5097927 = weight(abstract_txt:wikipedia in 4782) [ClassicSimilarity], result of:
            0.5097927 = score(doc=4782,freq=6.0), product of:
              0.4259115 = queryWeight, product of:
                6.9373374 = boost
                6.25473 = idf(docFreq=231, maxDocs=44421)
                0.009815625 = queryNorm
              1.1969452 = fieldWeight in 4782, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                6.25473 = idf(docFreq=231, maxDocs=44421)
                0.078125 = fieldNorm(doc=4782)
        0.16 = coord(4/25)
  4. Lucassen, T.; Muilwijk, R.; Noordzij, M.L.; Schraagen, J.M.: Topic familiarity and information skills in online credibility evaluation (2013) 0.08
    0.08361847 = sum of:
      0.08361847 = product of:
        0.52261543 = sum of:
          0.012123752 = weight(abstract_txt:when in 1608) [ClassicSimilarity], result of:
            0.012123752 = score(doc=1608,freq=1.0), product of:
              0.04678631 = queryWeight, product of:
                1.1496418 = boost
                4.1460857 = idf(docFreq=1910, maxDocs=44421)
                0.009815625 = queryNorm
              0.25913036 = fieldWeight in 1608, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1460857 = idf(docFreq=1910, maxDocs=44421)
                0.0625 = fieldNorm(doc=1608)
          0.022448162 = weight(abstract_txt:text in 1608) [ClassicSimilarity], result of:
            0.022448162 = score(doc=1608,freq=1.0), product of:
              0.088884205 = queryWeight, product of:
                2.240941 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.009815625 = queryNorm
              0.25255513 = fieldWeight in 1608, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.0625 = fieldNorm(doc=1608)
          0.16649759 = weight(abstract_txt:wikipedia in 1608) [ClassicSimilarity], result of:
            0.16649759 = score(doc=1608,freq=1.0), product of:
              0.4259115 = queryWeight, product of:
                6.9373374 = boost
                6.25473 = idf(docFreq=231, maxDocs=44421)
                0.009815625 = queryNorm
              0.39092064 = fieldWeight in 1608, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.25473 = idf(docFreq=231, maxDocs=44421)
                0.0625 = fieldNorm(doc=1608)
          0.32154593 = weight(abstract_txt:trust in 1608) [ClassicSimilarity], result of:
            0.32154593 = score(doc=1608,freq=2.0), product of:
              0.5242431 = queryWeight, product of:
                7.6966105 = boost
                6.939294 = idf(docFreq=116, maxDocs=44421)
                0.009815625 = queryNorm
              0.6133527 = fieldWeight in 1608, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.939294 = idf(docFreq=116, maxDocs=44421)
                0.0625 = fieldNorm(doc=1608)
        0.16 = coord(4/25)
  5. Zhao, D.; Strotmann, A.: Mapping knowledge domains on Wikipedia : an author bibliographic coupling analysis of traditional Chinese medicine (2022) 0.08
    0.08318685 = sum of:
      0.08318685 = product of:
        0.51991785 = sum of:
          0.033449218 = weight(abstract_txt:visualized in 1609) [ClassicSimilarity], result of:
            0.033449218 = score(doc=1609,freq=1.0), product of:
              0.08849129 = queryWeight, product of:
                1.1179912 = boost
                8.063882 = idf(docFreq=37, maxDocs=44421)
                0.009815625 = queryNorm
              0.37799448 = fieldWeight in 1609, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.063882 = idf(docFreq=37, maxDocs=44421)
                0.046875 = fieldNorm(doc=1609)
          0.01065478 = weight(abstract_txt:value in 1609) [ClassicSimilarity], result of:
            0.01065478 = score(doc=1609,freq=1.0), product of:
              0.05200154 = queryWeight, product of:
                1.2120241 = boost
                4.3710623 = idf(docFreq=1525, maxDocs=44421)
                0.009815625 = queryNorm
              0.20489354 = fieldWeight in 1609, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3710623 = idf(docFreq=1525, maxDocs=44421)
                0.046875 = fieldNorm(doc=1609)
          0.025577174 = weight(abstract_txt:authors in 1609) [ClassicSimilarity], result of:
            0.025577174 = score(doc=1609,freq=1.0), product of:
              0.11746223 = queryWeight, product of:
                2.576127 = boost
                4.6452923 = idf(docFreq=1159, maxDocs=44421)
                0.009815625 = queryNorm
              0.21774808 = fieldWeight in 1609, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6452923 = idf(docFreq=1159, maxDocs=44421)
                0.046875 = fieldNorm(doc=1609)
          0.45023668 = weight(abstract_txt:wikipedia in 1609) [ClassicSimilarity], result of:
            0.45023668 = score(doc=1609,freq=13.0), product of:
              0.4259115 = queryWeight, product of:
                6.9373374 = boost
                6.25473 = idf(docFreq=231, maxDocs=44421)
                0.009815625 = queryNorm
              1.0571133 = fieldWeight in 1609, product of:
                3.6055512 = tf(freq=13.0), with freq of:
                  13.0 = termFreq=13.0
                6.25473 = idf(docFreq=231, maxDocs=44421)
                0.046875 = fieldNorm(doc=1609)
        0.16 = coord(4/25)
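
The score trees above follow the classic TF-IDF pattern: each term score is queryWeight times fieldWeight, the term scores are summed, and the sum is multiplied by the coord factor. The sketch below recomputes the score of the first entry (doc 3010) from the listed values. The function names are illustrative, not part of any library, and the idf formula 1 + ln(maxDocs / (docFreq + 1)) is an assumption chosen because it reproduces the idf values shown in the listing. Small differences in the last digits are expected, since the listing appears to use single-precision arithmetic.

```python
import math

# Values copied from the explain tree for doc 3010.
MAX_DOCS = 44421
QUERY_NORM = 0.009815625
FIELD_NORM_3010 = 0.0546875
QUERY_TERM_COUNT = 25  # denominator of coord(4/25)

# term -> (termFreq, docFreq, boost), as listed for doc 3010
TERMS_3010 = {
    "likely": (3.0, 414, 1.5730826),
    "anyone": (1.0, 81, 2.0227144),
    "edit": (1.0, 31, 2.2836337),
    "wikipedia": (13.0, 231, 6.9373374),
}


def idf(doc_freq, max_docs):
    """Inverse document frequency (assumed formula, matches the listing)."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(freq, doc_freq, boost, field_norm,
               max_docs=MAX_DOCS, query_norm=QUERY_NORM):
    """score = queryWeight * fieldWeight for a single matching term."""
    term_idf = idf(doc_freq, max_docs)
    query_weight = boost * term_idf * query_norm
    field_weight = math.sqrt(freq) * term_idf * field_norm  # tf = sqrt(freq)
    return query_weight * field_weight


def document_score(terms, field_norm, query_term_count):
    """Sum the term scores, then scale by coord = matched / total query terms."""
    total = sum(term_score(f, df, b, field_norm) for f, df, b in terms.values())
    coord = len(terms) / query_term_count
    return coord * total


if __name__ == "__main__":
    for name, (f, df, b) in TERMS_3010.items():
        print(name, round(term_score(f, df, b, FIELD_NORM_3010), 6))
    print("doc 3010:", round(document_score(TERMS_3010, FIELD_NORM_3010,
                                            QUERY_TERM_COUNT), 6))
```

The same functions can be applied to the other entries by substituting their termFreq, docFreq, boost, and fieldNorm values; the coord factor changes with the number of matching terms (for example 3/25 for the second entry).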