Document (#37377)

Author
Egghe, L.
Guns, R.
Title
Applications of the generalized law of Benford to informetric data
Source
Journal of the American Society for Information Science and Technology. 63(2012) no.8, S.1662-1665
Year
2012
Series
Brief communication
Abstract
In a previous work (Egghe, 2011), the first author showed that Benford's law (describing the logarithmic distribution of the numbers 1, 2, ... , 9 as first digits of data in decimal form) is related to the classical law of Zipf with exponent 1. The work of Campanario and Coslado (2011), however, shows that Benford's law does not always fit practical data in a statistical sense. In this article, we use a generalization of Benford's law related to the general law of Zipf with exponent ? > 0. Using data from Campanario and Coslado, we apply nonlinear least squares to determine the optimal ? and show that this generalized law of Benford fits the data better than the classical law of Benford.
Theme
Informetrie
Object
Zipf-Gesetz
Benford-Gesetz

Similar documents (author)

  1. Egghe, L.; Guns, R.; Rousseau, R.: Thoughts on uncitedness : Nobel laureates and Fields medalists as case studies (2011) 4.40
    4.398363 = sum of:
      4.398363 = sum of:
        1.651527 = weight(author_txt:egghe in 994) [ClassicSimilarity], result of:
          1.651527 = score(doc=994,freq=1.0), product of:
            0.58020127 = queryWeight, product of:
              7.590594 = idf(docFreq=60, maxDocs=44421)
              0.07643688 = queryNorm
            2.8464727 = fieldWeight in 994, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              7.590594 = idf(docFreq=60, maxDocs=44421)
              0.375 = fieldNorm(doc=994)
        2.7468362 = weight(author_txt:guns in 994) [ClassicSimilarity], result of:
          2.7468362 = score(doc=994,freq=1.0), product of:
            0.8144731 = queryWeight, product of:
              1.1848109 = boost
              8.993418 = idf(docFreq=14, maxDocs=44421)
              0.07643688 = queryNorm
            3.3725317 = fieldWeight in 994, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.993418 = idf(docFreq=14, maxDocs=44421)
              0.375 = fieldNorm(doc=994)
    
  2. Rousseau, R.; Egghe, L.; Guns, R.: Becoming metric-wise : a bibliometric guide for researchers (2018) 4.40
    4.398363 = sum of:
      4.398363 = sum of:
        1.651527 = weight(author_txt:egghe in 226) [ClassicSimilarity], result of:
          1.651527 = score(doc=226,freq=1.0), product of:
            0.58020127 = queryWeight, product of:
              7.590594 = idf(docFreq=60, maxDocs=44421)
              0.07643688 = queryNorm
            2.8464727 = fieldWeight in 226, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              7.590594 = idf(docFreq=60, maxDocs=44421)
              0.375 = fieldNorm(doc=226)
        2.7468362 = weight(author_txt:guns in 226) [ClassicSimilarity], result of:
          2.7468362 = score(doc=226,freq=1.0), product of:
            0.8144731 = queryWeight, product of:
              1.1848109 = boost
              8.993418 = idf(docFreq=14, maxDocs=44421)
              0.07643688 = queryNorm
            3.3725317 = fieldWeight in 226, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.993418 = idf(docFreq=14, maxDocs=44421)
              0.375 = fieldNorm(doc=226)
    
  3. Egghe, L.; Guns, R.; Rousseau, R.; Leuven, K.U.: Erratum (2012) 3.67
    3.6653028 = sum of:
      3.6653028 = sum of:
        1.3762726 = weight(author_txt:egghe in 992) [ClassicSimilarity], result of:
          1.3762726 = score(doc=992,freq=1.0), product of:
            0.58020127 = queryWeight, product of:
              7.590594 = idf(docFreq=60, maxDocs=44421)
              0.07643688 = queryNorm
            2.3720605 = fieldWeight in 992, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              7.590594 = idf(docFreq=60, maxDocs=44421)
              0.3125 = fieldNorm(doc=992)
        2.28903 = weight(author_txt:guns in 992) [ClassicSimilarity], result of:
          2.28903 = score(doc=992,freq=1.0), product of:
            0.8144731 = queryWeight, product of:
              1.1848109 = boost
              8.993418 = idf(docFreq=14, maxDocs=44421)
              0.07643688 = queryNorm
            2.810443 = fieldWeight in 992, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.993418 = idf(docFreq=14, maxDocs=44421)
              0.3125 = fieldNorm(doc=992)
    
  4. Guns, R.: ¬The three dimensions of informetrics : a conceptual view (2013) 2.29
    2.28903 = sum of:
      2.28903 = product of:
        4.57806 = sum of:
          4.57806 = weight(author_txt:guns in 1398) [ClassicSimilarity], result of:
            4.57806 = score(doc=1398,freq=1.0), product of:
              0.8144731 = queryWeight, product of:
                1.1848109 = boost
                8.993418 = idf(docFreq=14, maxDocs=44421)
                0.07643688 = queryNorm
              5.620886 = fieldWeight in 1398, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.993418 = idf(docFreq=14, maxDocs=44421)
                0.625 = fieldNorm(doc=1398)
        0.5 = coord(1/2)
    
  5. Guns, R.: Tracing the origins of the semantic web (2013) 2.29
    2.28903 = sum of:
      2.28903 = product of:
        4.57806 = sum of:
          4.57806 = weight(author_txt:guns in 2093) [ClassicSimilarity], result of:
            4.57806 = score(doc=2093,freq=1.0), product of:
              0.8144731 = queryWeight, product of:
                1.1848109 = boost
                8.993418 = idf(docFreq=14, maxDocs=44421)
                0.07643688 = queryNorm
              5.620886 = fieldWeight in 2093, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.993418 = idf(docFreq=14, maxDocs=44421)
                0.625 = fieldNorm(doc=2093)
        0.5 = coord(1/2)
    

Similar documents (content)

  1. Shan, S.: On the generalized Zipf distribution : part I (2005) 0.21
    0.21043327 = sum of:
      0.21043327 = product of:
        1.315208 = sum of:
          0.12253832 = weight(abstract_txt:informetric in 2061) [ClassicSimilarity], result of:
            0.12253832 = score(doc=2061,freq=1.0), product of:
              0.16736677 = queryWeight, product of:
                1.3410039 = boost
                7.809647 = idf(docFreq=48, maxDocs=44421)
                0.015981141 = queryNorm
              0.7321544 = fieldWeight in 2061, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.809647 = idf(docFreq=48, maxDocs=44421)
                0.09375 = fieldNorm(doc=2061)
          0.14370911 = weight(abstract_txt:generalization in 2061) [ClassicSimilarity], result of:
            0.14370911 = score(doc=2061,freq=1.0), product of:
              0.18612762 = queryWeight, product of:
                1.4141674 = boost
                8.235732 = idf(docFreq=31, maxDocs=44421)
                0.015981141 = queryNorm
              0.77209985 = fieldWeight in 2061, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.235732 = idf(docFreq=31, maxDocs=44421)
                0.09375 = fieldNorm(doc=2061)
          0.31450343 = weight(abstract_txt:generalized in 2061) [ClassicSimilarity], result of:
            0.31450343 = score(doc=2061,freq=3.0), product of:
              0.2740782 = queryWeight, product of:
                2.4268763 = boost
                7.0667386 = idf(docFreq=102, maxDocs=44421)
                0.015981141 = queryNorm
              1.1474953 = fieldWeight in 2061, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.0667386 = idf(docFreq=102, maxDocs=44421)
                0.09375 = fieldNorm(doc=2061)
          0.7344572 = weight(abstract_txt:zipf in 2061) [ClassicSimilarity], result of:
            0.7344572 = score(doc=2061,freq=5.0), product of:
              0.40689805 = queryWeight, product of:
                2.957013 = boost
                8.610425 = idf(docFreq=21, maxDocs=44421)
                0.015981141 = queryNorm
              1.8050153 = fieldWeight in 2061, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                8.610425 = idf(docFreq=21, maxDocs=44421)
                0.09375 = fieldNorm(doc=2061)
        0.16 = coord(4/25)
    
  2. Milojevic, S.: Power law distributions in information science : making the case for logarithmic binning (2010) 0.14
    0.13731208 = sum of:
      0.13731208 = product of:
        0.85820055 = sum of:
          0.010208378 = weight(abstract_txt:that in 113) [ClassicSimilarity], result of:
            0.010208378 = score(doc=113,freq=1.0), product of:
              0.046043273 = queryWeight, product of:
                1.2182577 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.015981141 = queryNorm
              0.22171268 = fieldWeight in 113, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.09375 = fieldNorm(doc=113)
          0.2929827 = weight(abstract_txt:logarithmic in 113) [ClassicSimilarity], result of:
            0.2929827 = score(doc=113,freq=2.0), product of:
              0.23752315 = queryWeight, product of:
                1.5975276 = boost
                9.303573 = idf(docFreq=10, maxDocs=44421)
                0.015981141 = queryNorm
              1.2334911 = fieldWeight in 113, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.303573 = idf(docFreq=10, maxDocs=44421)
                0.09375 = fieldNorm(doc=113)
          0.047508206 = weight(abstract_txt:data in 113) [ClassicSimilarity], result of:
            0.047508206 = score(doc=113,freq=1.0), product of:
              0.15216812 = queryWeight, product of:
                2.859185 = boost
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.015981141 = queryNorm
              0.31220865 = fieldWeight in 113, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.09375 = fieldNorm(doc=113)
          0.50750124 = weight(abstract_txt:exponent in 113) [ClassicSimilarity], result of:
            0.50750124 = score(doc=113,freq=2.0), product of:
              0.4316311 = queryWeight, product of:
                3.0455573 = boost
                8.868255 = idf(docFreq=16, maxDocs=44421)
                0.015981141 = queryNorm
              1.1757755 = fieldWeight in 113, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.868255 = idf(docFreq=16, maxDocs=44421)
                0.09375 = fieldNorm(doc=113)
        0.16 = coord(4/25)
    
  3. Egghe, L.: Zipfian and Lotkaian continuous concentration theory (2005) 0.13
    0.13271317 = sum of:
      0.13271317 = product of:
        0.8294573 = sum of:
          0.044288523 = weight(abstract_txt:apply in 4678) [ClassicSimilarity], result of:
            0.044288523 = score(doc=4678,freq=1.0), product of:
              0.09589654 = queryWeight, product of:
                1.0150721 = boost
                5.9115076 = idf(docFreq=326, maxDocs=44421)
                0.015981141 = queryNorm
              0.46183652 = fieldWeight in 4678, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9115076 = idf(docFreq=326, maxDocs=44421)
                0.078125 = fieldNorm(doc=4678)
          0.012030689 = weight(abstract_txt:that in 4678) [ClassicSimilarity], result of:
            0.012030689 = score(doc=4678,freq=2.0), product of:
              0.046043273 = queryWeight, product of:
                1.2182577 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.015981141 = queryNorm
              0.2612909 = fieldWeight in 4678, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.078125 = fieldNorm(doc=4678)
          0.47409007 = weight(abstract_txt:zipf in 4678) [ClassicSimilarity], result of:
            0.47409007 = score(doc=4678,freq=3.0), product of:
              0.40689805 = queryWeight, product of:
                2.957013 = boost
                8.610425 = idf(docFreq=21, maxDocs=44421)
                0.015981141 = queryNorm
              1.1651323 = fieldWeight in 4678, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.610425 = idf(docFreq=21, maxDocs=44421)
                0.078125 = fieldNorm(doc=4678)
          0.299048 = weight(abstract_txt:exponent in 4678) [ClassicSimilarity], result of:
            0.299048 = score(doc=4678,freq=1.0), product of:
              0.4316311 = queryWeight, product of:
                3.0455573 = boost
                8.868255 = idf(docFreq=16, maxDocs=44421)
                0.015981141 = queryNorm
              0.6928324 = fieldWeight in 4678, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.868255 = idf(docFreq=16, maxDocs=44421)
                0.078125 = fieldNorm(doc=4678)
        0.16 = coord(4/25)
    
  4. Sarabia, J.M.; Sarabia, M.: Explicit expressions for the Leimkuhler curve in parametric families (2008) 0.12
    0.1224994 = sum of:
      0.1224994 = product of:
        0.5104142 = sum of:
          0.018665152 = weight(abstract_txt:work in 3120) [ClassicSimilarity], result of:
            0.018665152 = score(doc=3120,freq=1.0), product of:
              0.07880972 = queryWeight, product of:
                1.3013686 = boost
                3.7894108 = idf(docFreq=2729, maxDocs=44421)
                0.015981141 = queryNorm
              0.23683818 = fieldWeight in 3120, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.7894108 = idf(docFreq=2729, maxDocs=44421)
                0.0625 = fieldNorm(doc=3120)
          0.08169221 = weight(abstract_txt:informetric in 3120) [ClassicSimilarity], result of:
            0.08169221 = score(doc=3120,freq=1.0), product of:
              0.16736677 = queryWeight, product of:
                1.3410039 = boost
                7.809647 = idf(docFreq=48, maxDocs=44421)
                0.015981141 = queryNorm
              0.48810294 = fieldWeight in 3120, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.809647 = idf(docFreq=48, maxDocs=44421)
                0.0625 = fieldNorm(doc=3120)
          0.03497108 = weight(abstract_txt:first in 3120) [ClassicSimilarity], result of:
            0.03497108 = score(doc=3120,freq=2.0), product of:
              0.095065184 = queryWeight, product of:
                1.4292927 = boost
                4.1619086 = idf(docFreq=1880, maxDocs=44421)
                0.015981141 = queryNorm
              0.36786422 = fieldWeight in 3120, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.1619086 = idf(docFreq=1880, maxDocs=44421)
                0.0625 = fieldNorm(doc=3120)
          0.13374461 = weight(abstract_txt:classical in 3120) [ClassicSimilarity], result of:
            0.13374461 = score(doc=3120,freq=2.0), product of:
              0.23248754 = queryWeight, product of:
                2.2351685 = boost
                6.5085106 = idf(docFreq=179, maxDocs=44421)
                0.015981141 = queryNorm
              0.5752765 = fieldWeight in 3120, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.5085106 = idf(docFreq=179, maxDocs=44421)
                0.0625 = fieldNorm(doc=3120)
          0.20966896 = weight(abstract_txt:generalized in 3120) [ClassicSimilarity], result of:
            0.20966896 = score(doc=3120,freq=3.0), product of:
              0.2740782 = queryWeight, product of:
                2.4268763 = boost
                7.0667386 = idf(docFreq=102, maxDocs=44421)
                0.015981141 = queryNorm
              0.7649969 = fieldWeight in 3120, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.0667386 = idf(docFreq=102, maxDocs=44421)
                0.0625 = fieldNorm(doc=3120)
          0.031672135 = weight(abstract_txt:data in 3120) [ClassicSimilarity], result of:
            0.031672135 = score(doc=3120,freq=1.0), product of:
              0.15216812 = queryWeight, product of:
                2.859185 = boost
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.015981141 = queryNorm
              0.20813909 = fieldWeight in 3120, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.0625 = fieldNorm(doc=3120)
        0.24 = coord(6/25)
    
  5. Burrell, Q.L.: "Ambiguity" ans scientometric measurement : a dissenting view (2001) 0.12
    0.11858952 = sum of:
      0.11858952 = product of:
        0.5929476 = sum of:
          0.017013961 = weight(abstract_txt:that in 981) [ClassicSimilarity], result of:
            0.017013961 = score(doc=981,freq=4.0), product of:
              0.046043273 = queryWeight, product of:
                1.2182577 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.015981141 = queryNorm
              0.3695211 = fieldWeight in 981, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.078125 = fieldNorm(doc=981)
          0.1444128 = weight(abstract_txt:informetric in 981) [ClassicSimilarity], result of:
            0.1444128 = score(doc=981,freq=2.0), product of:
              0.16736677 = queryWeight, product of:
                1.3410039 = boost
                7.809647 = idf(docFreq=48, maxDocs=44421)
                0.015981141 = queryNorm
              0.8628523 = fieldWeight in 981, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.809647 = idf(docFreq=48, maxDocs=44421)
                0.078125 = fieldNorm(doc=981)
          0.11821466 = weight(abstract_txt:classical in 981) [ClassicSimilarity], result of:
            0.11821466 = score(doc=981,freq=1.0), product of:
              0.23248754 = queryWeight, product of:
                2.2351685 = boost
                6.5085106 = idf(docFreq=179, maxDocs=44421)
                0.015981141 = queryNorm
              0.5084774 = fieldWeight in 981, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5085106 = idf(docFreq=179, maxDocs=44421)
                0.078125 = fieldNorm(doc=981)
          0.03959017 = weight(abstract_txt:data in 981) [ClassicSimilarity], result of:
            0.03959017 = score(doc=981,freq=1.0), product of:
              0.15216812 = queryWeight, product of:
                2.859185 = boost
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.015981141 = queryNorm
              0.26017386 = fieldWeight in 981, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.078125 = fieldNorm(doc=981)
          0.27371603 = weight(abstract_txt:zipf in 981) [ClassicSimilarity], result of:
            0.27371603 = score(doc=981,freq=1.0), product of:
              0.40689805 = queryWeight, product of:
                2.957013 = boost
                8.610425 = idf(docFreq=21, maxDocs=44421)
                0.015981141 = queryNorm
              0.67268944 = fieldWeight in 981, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.610425 = idf(docFreq=21, maxDocs=44421)
                0.078125 = fieldNorm(doc=981)
        0.2 = coord(5/25)