Document (#31008)

Author
Leydesdorff, L.
Bensman, S.
Title
Classification and Powerlaws : the logarithmic transformation
Source
Journal of the American Society for Information Science and Technology. 57(2006) no.11, S.1470-1486
Year
2006
Abstract
Logarithmic transformation of the data has been recommended by the literature in the case of highly skewed distributions such as those commonly found in information science. The purpose of the transformation is to make the data conform to the lognormal law of error for inferential purposes. How does this transformation affect the analysis? We factor analyze and visualize the citation environment of the Journal of the American Chemical Society (JACS) before and after a logarithmic transformation. The transformation strongly reduces the variance necessary for classificatory purposes and therefore is counterproductive to the purposes of the descriptive statistics. We recommend against the logarithmic transformation when sets cannot be defined unambiguously. The intellectual organization of the sciences is reflected in the curvilinear parts of the citation distributions while negative powerlaws fit excellently to the tails of the distributions.
Theme
Informetrie

Similar documents (author)

  1. Bensman, S.J.; Leydesdorff, L.: Definition and identification of journals as bibliographic and subject entities : librarianship versus ISI Journal Citation Reports methods and their effect on citation measures (2009) 5.82
    5.817269 = sum of:
      5.817269 = sum of:
        1.8908194 = weight(author_txt:leydesdorff in 3840) [ClassicSimilarity], result of:
          1.8908194 = score(doc=3840,freq=1.0), product of:
            0.5234732 = queryWeight, product of:
              7.2241306 = idf(docFreq=87, maxDocs=44421)
              0.072461754 = queryNorm
            3.6120653 = fieldWeight in 3840, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              7.2241306 = idf(docFreq=87, maxDocs=44421)
              0.5 = fieldNorm(doc=3840)
        3.9264493 = weight(author_txt:bensman in 3840) [ClassicSimilarity], result of:
          3.9264493 = score(doc=3840,freq=1.0), product of:
            0.85204214 = queryWeight, product of:
              1.2758021 = boost
              9.216561 = idf(docFreq=11, maxDocs=44421)
              0.072461754 = queryNorm
            4.6082807 = fieldWeight in 3840, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              9.216561 = idf(docFreq=11, maxDocs=44421)
              0.5 = fieldNorm(doc=3840)
    
  2. Bensman, S.J.: Garfield and the impact factors (2007) 2.45
    2.4540308 = sum of:
      2.4540308 = product of:
        4.9080615 = sum of:
          4.9080615 = weight(author_txt:bensman in 4679) [ClassicSimilarity], result of:
            4.9080615 = score(doc=4679,freq=1.0), product of:
              0.85204214 = queryWeight, product of:
                1.2758021 = boost
                9.216561 = idf(docFreq=11, maxDocs=44421)
                0.072461754 = queryNorm
              5.7603507 = fieldWeight in 4679, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.216561 = idf(docFreq=11, maxDocs=44421)
                0.625 = fieldNorm(doc=4679)
        0.5 = coord(1/2)
    
  3. Bensman, S.J.: Probability distributions in library and information science : a historical and practitioner viewpoint (2000) 2.45
    2.4540308 = sum of:
      2.4540308 = product of:
        4.9080615 = sum of:
          4.9080615 = weight(author_txt:bensman in 5859) [ClassicSimilarity], result of:
            4.9080615 = score(doc=5859,freq=1.0), product of:
              0.85204214 = queryWeight, product of:
                1.2758021 = boost
                9.216561 = idf(docFreq=11, maxDocs=44421)
                0.072461754 = queryNorm
              5.7603507 = fieldWeight in 5859, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.216561 = idf(docFreq=11, maxDocs=44421)
                0.625 = fieldNorm(doc=5859)
        0.5 = coord(1/2)
    
  4. Bensman, S.J.: Urquhart's and Garfield's laws : the British controversy over their validity (2001) 2.45
    2.4540308 = sum of:
      2.4540308 = product of:
        4.9080615 = sum of:
          4.9080615 = weight(author_txt:bensman in 26) [ClassicSimilarity], result of:
            4.9080615 = score(doc=26,freq=1.0), product of:
              0.85204214 = queryWeight, product of:
                1.2758021 = boost
                9.216561 = idf(docFreq=11, maxDocs=44421)
                0.072461754 = queryNorm
              5.7603507 = fieldWeight in 26, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.216561 = idf(docFreq=11, maxDocs=44421)
                0.625 = fieldNorm(doc=26)
        0.5 = coord(1/2)
    
  5. Bensman, S.J.: Urquhart and probability : the transition from librarianship to library and information science (2005) 2.45
    2.4540308 = sum of:
      2.4540308 = product of:
        4.9080615 = sum of:
          4.9080615 = weight(author_txt:bensman in 4311) [ClassicSimilarity], result of:
            4.9080615 = score(doc=4311,freq=1.0), product of:
              0.85204214 = queryWeight, product of:
                1.2758021 = boost
                9.216561 = idf(docFreq=11, maxDocs=44421)
                0.072461754 = queryNorm
              5.7603507 = fieldWeight in 4311, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.216561 = idf(docFreq=11, maxDocs=44421)
                0.625 = fieldNorm(doc=4311)
        0.5 = coord(1/2)
    

Similar documents (content)

  1. Milojevic, S.: Power law distributions in information science : making the case for logarithmic binning (2010) 0.18
    0.18281265 = sum of:
      0.18281265 = product of:
        1.1425791 = sum of:
          0.012285822 = weight(abstract_txt:data in 113) [ClassicSimilarity], result of:
            0.012285822 = score(doc=113,freq=1.0), product of:
              0.039351318 = queryWeight, product of:
                1.060424 = boost
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.011143101 = queryNorm
              0.31220865 = fieldWeight in 113, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.09375 = fieldNorm(doc=113)
          0.14816746 = weight(abstract_txt:tails in 113) [ClassicSimilarity], result of:
            0.14816746 = score(doc=113,freq=1.0), product of:
              0.16425365 = queryWeight, product of:
                1.5319424 = boost
                9.622026 = idf(docFreq=7, maxDocs=44421)
                0.011143101 = queryNorm
              0.902065 = fieldWeight in 113, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.622026 = idf(docFreq=7, maxDocs=44421)
                0.09375 = fieldNorm(doc=113)
          0.22446048 = weight(abstract_txt:distributions in 113) [ClassicSimilarity], result of:
            0.22446048 = score(doc=113,freq=2.0), product of:
              0.24801055 = queryWeight, product of:
                3.2604728 = boost
                6.82627 = idf(docFreq=130, maxDocs=44421)
                0.011143101 = queryNorm
              0.9050441 = fieldWeight in 113, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.82627 = idf(docFreq=130, maxDocs=44421)
                0.09375 = fieldNorm(doc=113)
          0.75766534 = weight(abstract_txt:logarithmic in 113) [ClassicSimilarity], result of:
            0.75766534 = score(doc=113,freq=2.0), product of:
              0.6142447 = queryWeight, product of:
                5.9249625 = boost
                9.303573 = idf(docFreq=10, maxDocs=44421)
                0.011143101 = queryNorm
              1.2334911 = fieldWeight in 113, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.303573 = idf(docFreq=10, maxDocs=44421)
                0.09375 = fieldNorm(doc=113)
        0.16 = coord(4/25)
    
  2. Leydesdorff, L.: Similarity measures, author cocitation Analysis, and information theory (2005) 0.15
    0.14672953 = sum of:
      0.14672953 = product of:
        1.2227461 = sum of:
          0.012285822 = weight(abstract_txt:data in 4471) [ClassicSimilarity], result of:
            0.012285822 = score(doc=4471,freq=1.0), product of:
              0.039351318 = queryWeight, product of:
                1.060424 = boost
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.011143101 = queryNorm
              0.31220865 = fieldWeight in 4471, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.09375 = fieldNorm(doc=4471)
          0.75766534 = weight(abstract_txt:logarithmic in 4471) [ClassicSimilarity], result of:
            0.75766534 = score(doc=4471,freq=2.0), product of:
              0.6142447 = queryWeight, product of:
                5.9249625 = boost
                9.303573 = idf(docFreq=10, maxDocs=44421)
                0.011143101 = queryNorm
              1.2334911 = fieldWeight in 4471, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.303573 = idf(docFreq=10, maxDocs=44421)
                0.09375 = fieldNorm(doc=4471)
          0.45279503 = weight(abstract_txt:transformation in 4471) [ClassicSimilarity], result of:
            0.45279503 = score(doc=4471,freq=2.0), product of:
              0.52517444 = queryWeight, product of:
                7.247458 = boost
                6.5029707 = idf(docFreq=180, maxDocs=44421)
                0.011143101 = queryNorm
              0.86218023 = fieldWeight in 4471, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.5029707 = idf(docFreq=180, maxDocs=44421)
                0.09375 = fieldNorm(doc=4471)
        0.12 = coord(3/25)
    
  3. Bensman, S.J.; Smolinsky, L.J.; Pudovkin, A.I.: Mean citation rate per article in mathematics journals : differences from the scientific model (2010) 0.10
    0.10410714 = sum of:
      0.10410714 = product of:
        0.3718112 = sum of:
          0.034289137 = weight(abstract_txt:negative in 582) [ClassicSimilarity], result of:
            0.034289137 = score(doc=582,freq=2.0), product of:
              0.070387624 = queryWeight, product of:
                1.0028433 = boost
                6.2987905 = idf(docFreq=221, maxDocs=44421)
                0.011143101 = queryNorm
              0.4871472 = fieldWeight in 582, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.2987905 = idf(docFreq=221, maxDocs=44421)
                0.0546875 = fieldNorm(doc=582)
          0.0071667293 = weight(abstract_txt:data in 582) [ClassicSimilarity], result of:
            0.0071667293 = score(doc=582,freq=1.0), product of:
              0.039351318 = queryWeight, product of:
                1.060424 = boost
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.011143101 = queryNorm
              0.18212171 = fieldWeight in 582, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.0546875 = fieldNorm(doc=582)
          0.031284254 = weight(abstract_txt:error in 582) [ClassicSimilarity], result of:
            0.031284254 = score(doc=582,freq=1.0), product of:
              0.083423 = queryWeight, product of:
                1.0917616 = boost
                6.8572807 = idf(docFreq=126, maxDocs=44421)
                0.011143101 = queryNorm
              0.37500754 = fieldWeight in 582, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.8572807 = idf(docFreq=126, maxDocs=44421)
                0.0546875 = fieldNorm(doc=582)
          0.06587409 = weight(abstract_txt:variance in 582) [ClassicSimilarity], result of:
            0.06587409 = score(doc=582,freq=2.0), product of:
              0.108776495 = queryWeight, product of:
                1.2466726 = boost
                7.8302665 = idf(docFreq=47, maxDocs=44421)
                0.011143101 = queryNorm
              0.60559124 = fieldWeight in 582, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.8302665 = idf(docFreq=47, maxDocs=44421)
                0.0546875 = fieldNorm(doc=582)
          0.05687634 = weight(abstract_txt:skewed in 582) [ClassicSimilarity], result of:
            0.05687634 = score(doc=582,freq=1.0), product of:
              0.12426716 = queryWeight, product of:
                1.3324873 = boost
                8.369263 = idf(docFreq=27, maxDocs=44421)
                0.011143101 = queryNorm
              0.45769405 = fieldWeight in 582, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.369263 = idf(docFreq=27, maxDocs=44421)
                0.0546875 = fieldNorm(doc=582)
          0.045385383 = weight(abstract_txt:citation in 582) [ClassicSimilarity], result of:
            0.045385383 = score(doc=582,freq=4.0), product of:
              0.08485341 = queryWeight, product of:
                1.5571647 = boost
                4.890223 = idf(docFreq=907, maxDocs=44421)
                0.011143101 = queryNorm
              0.5348681 = fieldWeight in 582, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.890223 = idf(docFreq=907, maxDocs=44421)
                0.0546875 = fieldNorm(doc=582)
          0.13093528 = weight(abstract_txt:distributions in 582) [ClassicSimilarity], result of:
            0.13093528 = score(doc=582,freq=2.0), product of:
              0.24801055 = queryWeight, product of:
                3.2604728 = boost
                6.82627 = idf(docFreq=130, maxDocs=44421)
                0.011143101 = queryNorm
              0.5279424 = fieldWeight in 582, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.82627 = idf(docFreq=130, maxDocs=44421)
                0.0546875 = fieldNorm(doc=582)
        0.28 = coord(7/25)
    
  4. Leydesdorff, L.; Zhou, P.; Bornmann, L.: How can journal impact factors be normalized across fields of science? : An assessment in terms of percentile ranks and fractional counts (2013) 0.10
    0.09787323 = sum of:
      0.09787323 = product of:
        0.48936617 = sum of:
          0.05323431 = weight(abstract_txt:variance in 1532) [ClassicSimilarity], result of:
            0.05323431 = score(doc=1532,freq=1.0), product of:
              0.108776495 = queryWeight, product of:
                1.2466726 = boost
                7.8302665 = idf(docFreq=47, maxDocs=44421)
                0.011143101 = queryNorm
              0.48939165 = fieldWeight in 1532, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.8302665 = idf(docFreq=47, maxDocs=44421)
                0.0625 = fieldNorm(doc=1532)
          0.06500153 = weight(abstract_txt:skewed in 1532) [ClassicSimilarity], result of:
            0.06500153 = score(doc=1532,freq=1.0), product of:
              0.12426716 = queryWeight, product of:
                1.3324873 = boost
                8.369263 = idf(docFreq=27, maxDocs=44421)
                0.011143101 = queryNorm
              0.5230789 = fieldWeight in 1532, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.369263 = idf(docFreq=27, maxDocs=44421)
                0.0625 = fieldNorm(doc=1532)
          0.051869012 = weight(abstract_txt:citation in 1532) [ClassicSimilarity], result of:
            0.051869012 = score(doc=1532,freq=4.0), product of:
              0.08485341 = queryWeight, product of:
                1.5571647 = boost
                4.890223 = idf(docFreq=907, maxDocs=44421)
                0.011143101 = queryNorm
              0.6112779 = fieldWeight in 1532, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.890223 = idf(docFreq=907, maxDocs=44421)
                0.0625 = fieldNorm(doc=1532)
          0.105811685 = weight(abstract_txt:distributions in 1532) [ClassicSimilarity], result of:
            0.105811685 = score(doc=1532,freq=1.0), product of:
              0.24801055 = queryWeight, product of:
                3.2604728 = boost
                6.82627 = idf(docFreq=130, maxDocs=44421)
                0.011143101 = queryNorm
              0.42664188 = fieldWeight in 1532, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.82627 = idf(docFreq=130, maxDocs=44421)
                0.0625 = fieldNorm(doc=1532)
          0.21344963 = weight(abstract_txt:transformation in 1532) [ClassicSimilarity], result of:
            0.21344963 = score(doc=1532,freq=1.0), product of:
              0.52517444 = queryWeight, product of:
                7.247458 = boost
                6.5029707 = idf(docFreq=180, maxDocs=44421)
                0.011143101 = queryNorm
              0.40643567 = fieldWeight in 1532, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5029707 = idf(docFreq=180, maxDocs=44421)
                0.0625 = fieldNorm(doc=1532)
        0.2 = coord(5/25)
    
  5. Bensman, S.J.: Distributional differences of the impact factor in the sciences versus the social sciences : an analysis of the probabilistic structure of the 2005 journal citation reports (2008) 0.10
    0.097370185 = sum of:
      0.097370185 = product of:
        0.48685092 = sum of:
          0.034637257 = weight(abstract_txt:negative in 2953) [ClassicSimilarity], result of:
            0.034637257 = score(doc=2953,freq=1.0), product of:
              0.070387624 = queryWeight, product of:
                1.0028433 = boost
                6.2987905 = idf(docFreq=221, maxDocs=44421)
                0.011143101 = queryNorm
              0.492093 = fieldWeight in 2953, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2987905 = idf(docFreq=221, maxDocs=44421)
                0.078125 = fieldNorm(doc=2953)
          0.09410585 = weight(abstract_txt:variance in 2953) [ClassicSimilarity], result of:
            0.09410585 = score(doc=2953,freq=2.0), product of:
              0.108776495 = queryWeight, product of:
                1.2466726 = boost
                7.8302665 = idf(docFreq=47, maxDocs=44421)
                0.011143101 = queryNorm
              0.86513036 = fieldWeight in 2953, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.8302665 = idf(docFreq=47, maxDocs=44421)
                0.078125 = fieldNorm(doc=2953)
          0.11490755 = weight(abstract_txt:skewed in 2953) [ClassicSimilarity], result of:
            0.11490755 = score(doc=2953,freq=2.0), product of:
              0.12426716 = queryWeight, product of:
                1.3324873 = boost
                8.369263 = idf(docFreq=27, maxDocs=44421)
                0.011143101 = queryNorm
              0.92468154 = fieldWeight in 2953, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.369263 = idf(docFreq=27, maxDocs=44421)
                0.078125 = fieldNorm(doc=2953)
          0.056149855 = weight(abstract_txt:citation in 2953) [ClassicSimilarity], result of:
            0.056149855 = score(doc=2953,freq=3.0), product of:
              0.08485341 = queryWeight, product of:
                1.5571647 = boost
                4.890223 = idf(docFreq=907, maxDocs=44421)
                0.011143101 = queryNorm
              0.6617277 = fieldWeight in 2953, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.890223 = idf(docFreq=907, maxDocs=44421)
                0.078125 = fieldNorm(doc=2953)
          0.1870504 = weight(abstract_txt:distributions in 2953) [ClassicSimilarity], result of:
            0.1870504 = score(doc=2953,freq=2.0), product of:
              0.24801055 = queryWeight, product of:
                3.2604728 = boost
                6.82627 = idf(docFreq=130, maxDocs=44421)
                0.011143101 = queryNorm
              0.75420344 = fieldWeight in 2953, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.82627 = idf(docFreq=130, maxDocs=44421)
                0.078125 = fieldNorm(doc=2953)
        0.2 = coord(5/25)