Document (#30124)

Yoshikane, F.
Kageura, K.
Tsuji, K.
¬A method for the comparative analysis of concentration of author productivity, giving consideration to the effect of sample size dependency of statistical measures
Journal of the American Society for Information Science and Technology. 54(2003) no.6, pp.519-527
Studies of the concentration of author productivity based upon counts of papers by individual authors will produce measures that change systematically with sample size. Yoshikane, Kageura, and Tsuji seek a statistical framework which will avoid this scale effect problem. Using the number of authors in a field as an absolute concentration measure, and Gini's index as a relative concentration measure, they describe four literatures from both viewpoints with measures insensitive to one another. Both measures will increase with sample size. They then plot profiles of the two measures on the basis of a Monte-Carlo simulation of 1000 trials for 20 equally spaced intervals and compare the characteristics of the literatures. Using data from conferences hosted by four academic societies between 1992 and 1997, they find a coefficient of loss exceeding 0.15, indicating that the measures will depend highly on sample size. The simulation shows that a larger sample size leads to lower absolute concentration and higher relative concentration. Comparisons made at the same sample size present quite different results from those of the original data and allow direct comparison of population characteristics.
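
The procedure summarized in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the names `gini`, `profile` and `authorships` are hypothetical, and the sketch assumes that the sampling unit is a single author credit on a paper, drawn without replacement. The sampling scheme of the original study may differ.

```python
import random
from collections import Counter


def gini(counts):
    """Gini index of non-negative counts (0.0 means a perfectly even distribution)."""
    xs = sorted(counts)
    n = len(xs)
    total = sum(xs)
    if n == 0 or total == 0:
        return 0.0
    # Rank-based formula: G = 2 * sum(i * x_i) / (n * total) - (n + 1) / n, ranks from 1
    weighted = sum(i * x for i, x in enumerate(xs, start=1))
    return 2.0 * weighted / (n * total) - (n + 1.0) / n


def profile(authorships, sizes, trials=1000, seed=0):
    """Average number of distinct authors and average Gini index per sample size.

    authorships: one author name per author credit (assumed sampling unit).
    sizes: sample sizes to evaluate, each no larger than len(authorships).
    """
    rng = random.Random(seed)
    rows = []
    for size in sizes:
        author_sum = 0.0
        gini_sum = 0.0
        for _ in range(trials):
            sample = rng.sample(authorships, size)  # without replacement
            counts = Counter(sample)
            author_sum += len(counts)               # absolute measure
            gini_sum += gini(list(counts.values()))  # relative measure
        rows.append((size, author_sum / trials, gini_sum / trials))
    return rows


if __name__ == "__main__":
    # Toy data, invented for illustration only.
    toy = ["a"] * 6 + ["b"] * 3 + ["c"]
    sizes = [len(toy) * k // 5 for k in range(1, 6)]  # 5 equally spaced sizes
    for size, n_authors, g in profile(toy, sizes, trials=200):
        print(size, round(n_authors, 2), round(g, 3))
```

Reading both profiles at a common sample size is what makes literatures of different sizes comparable in such a scheme.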

Similar documents (author)

  1. Kageura, K.: Terminological semantics : an examination of 'concept' and 'meaning' in the study of terms (1995) 5.94
    5.9401517 = sum of:
      5.9401517 = weight(author_txt:kageura in 4629) [ClassicSimilarity], result of:
        5.9401517 = fieldWeight in 4629, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.504243 = idf(docFreq=8, maxDocs=44421)
          0.625 = fieldNorm(doc=4629)
  2. Kageura, K.: Theories of terminology : a quest for a framework for the study of term formation (1999) 5.94
    5.9401517 = sum of:
      5.9401517 = weight(author_txt:kageura in 290) [ClassicSimilarity], result of:
        5.9401517 = fieldWeight in 290, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.504243 = idf(docFreq=8, maxDocs=44421)
          0.625 = fieldNorm(doc=290)
  3. Kageura, K.: ¬The dynamics of terminology : a descriptive theory of term formation and terminological growth (2002) 5.94
    5.9401517 = sum of:
      5.9401517 = weight(author_txt:kageura in 2787) [ClassicSimilarity], result of:
        5.9401517 = fieldWeight in 2787, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.504243 = idf(docFreq=8, maxDocs=44421)
          0.625 = fieldNorm(doc=2787)
  4. Fukuda, M.; Kageura, K.: Research into 'see also' references in the dictionary of terminology : using semantic relations between entries (1993) 4.75
    4.7521214 = sum of:
      4.7521214 = weight(author_txt:kageura in 1118) [ClassicSimilarity], result of:
        4.7521214 = fieldWeight in 1118, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.504243 = idf(docFreq=8, maxDocs=44421)
          0.5 = fieldNorm(doc=1118)
  5. Tsuji, K.; Kageura, K.: Analysis of word structure of medical synonyms (1996) 4.75
    4.7521214 = sum of:
      4.7521214 = weight(author_txt:kageura in 6406) [ClassicSimilarity], result of:
        4.7521214 = fieldWeight in 6406, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.504243 = idf(docFreq=8, maxDocs=44421)
          0.5 = fieldNorm(doc=6406)
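
The author scores above can be checked by hand. The sketch below uses the formulas that reproduce the figures shown, tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)); the function names are hypothetical, and the idf variant is inferred from the numbers in this record rather than taken from the configuration of the system that produced them.

```python
import math


def classic_tf(freq):
    """Term-frequency factor: square root of the raw frequency."""
    return math.sqrt(freq)


def classic_idf(doc_freq, max_docs):
    """Inverse document frequency as it appears in the explanations above."""
    return 1.0 + math.log(max_docs / (doc_freq + 1.0))


def field_weight(freq, doc_freq, max_docs, field_norm):
    """fieldWeight = tf * idf * fieldNorm."""
    return classic_tf(freq) * classic_idf(doc_freq, max_docs) * field_norm


if __name__ == "__main__":
    # Hits 1-3 (fieldNorm 0.625) and hits 4-5 (fieldNorm 0.5)
    print(field_weight(1.0, 8, 44421, 0.625))
    print(field_weight(1.0, 8, 44421, 0.5))
```

The two values differ only through fieldNorm, which is lower for the records with two authors in the author field.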

Similar documents (content)

  1. Burrell, Q.L.: Measuring similarity of concentration between different informetric distributions : two new approaches (2005) 0.27
    0.26517284 = sum of:
      0.26517284 = product of:
        1.1048869 = sum of:
          0.080668285 = weight(abstract_txt:measure in 4410) [ClassicSimilarity], result of:
            0.080668285 = score(doc=4410,freq=4.0), product of:
              0.09497079 = queryWeight, product of:
                1.2934582 = boost
                5.4361663 = idf(docFreq=525, maxDocs=44421)
                0.013506565 = queryNorm
              0.849401 = fieldWeight in 4410, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.4361663 = idf(docFreq=525, maxDocs=44421)
                0.078125 = fieldNorm(doc=4410)
          0.09867468 = weight(abstract_txt:relative in 4410) [ClassicSimilarity], result of:
            0.09867468 = score(doc=4410,freq=3.0), product of:
              0.11955605 = queryWeight, product of:
                1.451253 = boost
                6.099349 = idf(docFreq=270, maxDocs=44421)
                0.013506565 = queryNorm
              0.8253424 = fieldWeight in 4410, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.099349 = idf(docFreq=270, maxDocs=44421)
                0.078125 = fieldNorm(doc=4410)
          0.11145683 = weight(abstract_txt:productivity in 4410) [ClassicSimilarity], result of:
            0.11145683 = score(doc=4410,freq=2.0), product of:
              0.14843488 = queryWeight, product of:
                1.6170571 = boost
                6.7961926 = idf(docFreq=134, maxDocs=44421)
                0.013506565 = queryNorm
              0.7508803 = fieldWeight in 4410, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.7961926 = idf(docFreq=134, maxDocs=44421)
                0.078125 = fieldNorm(doc=4410)
          0.028914055 = weight(abstract_txt:will in 4410) [ClassicSimilarity], result of:
            0.028914055 = score(doc=4410,freq=1.0), product of:
              0.09584236 = queryWeight, product of:
                1.8376006 = boost
                3.8615482 = idf(docFreq=2539, maxDocs=44421)
                0.013506565 = queryNorm
              0.30168346 = fieldWeight in 4410, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.8615482 = idf(docFreq=2539, maxDocs=44421)
                0.078125 = fieldNorm(doc=4410)
          0.12024662 = weight(abstract_txt:measures in 4410) [ClassicSimilarity], result of:
            0.12024662 = score(doc=4410,freq=1.0), product of:
              0.28372473 = queryWeight, product of:
                3.8722787 = boost
                5.424824 = idf(docFreq=531, maxDocs=44421)
                0.013506565 = queryNorm
              0.4238144 = fieldWeight in 4410, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.424824 = idf(docFreq=531, maxDocs=44421)
                0.078125 = fieldNorm(doc=4410)
          0.6649264 = weight(abstract_txt:concentration in 4410) [ClassicSimilarity], result of:
            0.6649264 = score(doc=4410,freq=3.0), product of:
              0.6151635 = queryWeight, product of:
                5.701817 = boost
                7.9878955 = idf(docFreq=40, maxDocs=44421)
                0.013506565 = queryNorm
              1.0808938 = fieldWeight in 4410, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.9878955 = idf(docFreq=40, maxDocs=44421)
                0.078125 = fieldNorm(doc=4410)
        0.24 = coord(6/25)
  2. Burrell, Q.L.: On Egghe's version of continuous concentration theory (2006) 0.15
    0.14747436 = sum of:
      0.14747436 = product of:
        1.228953 = sum of:
          0.059979856 = weight(abstract_txt:statistical in 5903) [ClassicSimilarity], result of:
            0.059979856 = score(doc=5903,freq=1.0), product of:
              0.098868914 = queryWeight, product of:
                1.3197366 = boost
                5.5466094 = idf(docFreq=470, maxDocs=44421)
                0.013506565 = queryNorm
              0.6066604 = fieldWeight in 5903, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5466094 = idf(docFreq=470, maxDocs=44421)
                0.109375 = fieldNorm(doc=5903)
          0.23807615 = weight(abstract_txt:measures in 5903) [ClassicSimilarity], result of:
            0.23807615 = score(doc=5903,freq=2.0), product of:
              0.28372473 = queryWeight, product of:
                3.8722787 = boost
                5.424824 = idf(docFreq=531, maxDocs=44421)
                0.013506565 = queryNorm
              0.83910966 = fieldWeight in 5903, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.424824 = idf(docFreq=531, maxDocs=44421)
                0.109375 = fieldNorm(doc=5903)
          0.930897 = weight(abstract_txt:concentration in 5903) [ClassicSimilarity], result of:
            0.930897 = score(doc=5903,freq=3.0), product of:
              0.6151635 = queryWeight, product of:
                5.701817 = boost
                7.9878955 = idf(docFreq=40, maxDocs=44421)
                0.013506565 = queryNorm
              1.5132513 = fieldWeight in 5903, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.9878955 = idf(docFreq=40, maxDocs=44421)
                0.109375 = fieldNorm(doc=5903)
        0.12 = coord(3/25)
  3. Egghe, L.: Zipfian and Lotkaian continuous concentration theory (2005) 0.13
    0.12965035 = sum of:
      0.12965035 = product of:
        1.0804197 = sum of:
          0.01982505 = weight(abstract_txt:they in 4678) [ClassicSimilarity], result of:
            0.01982505 = score(doc=4678,freq=1.0), product of:
              0.06770927 = queryWeight, product of:
                1.3376025 = boost
                3.7477977 = idf(docFreq=2845, maxDocs=44421)
                0.013506565 = queryNorm
              0.2927967 = fieldWeight in 4678, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.7477977 = idf(docFreq=2845, maxDocs=44421)
                0.078125 = fieldNorm(doc=4678)
          0.12024662 = weight(abstract_txt:measures in 4678) [ClassicSimilarity], result of:
            0.12024662 = score(doc=4678,freq=1.0), product of:
              0.28372473 = queryWeight, product of:
                3.8722787 = boost
                5.424824 = idf(docFreq=531, maxDocs=44421)
                0.013506565 = queryNorm
              0.4238144 = fieldWeight in 4678, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.424824 = idf(docFreq=531, maxDocs=44421)
                0.078125 = fieldNorm(doc=4678)
          0.940348 = weight(abstract_txt:concentration in 4678) [ClassicSimilarity], result of:
            0.940348 = score(doc=4678,freq=6.0), product of:
              0.6151635 = queryWeight, product of:
                5.701817 = boost
                7.9878955 = idf(docFreq=40, maxDocs=44421)
                0.013506565 = queryNorm
              1.5286148 = fieldWeight in 4678, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                7.9878955 = idf(docFreq=40, maxDocs=44421)
                0.078125 = fieldNorm(doc=4678)
        0.12 = coord(3/25)
  4. Boyack, K.W.; Klavans, R.: Co-citation analysis, bibliographic coupling, and direct citation : which citation approach represents the research front most accurately? (2010) 0.13
    0.12767284 = sum of:
      0.12767284 = product of:
        0.6383642 = sum of:
          0.0396018 = weight(abstract_txt:four in 111) [ClassicSimilarity], result of:
            0.0396018 = score(doc=111,freq=2.0), product of:
              0.08640684 = queryWeight, product of:
                1.2337621 = boost
                5.1852746 = idf(docFreq=675, maxDocs=44421)
                0.013506565 = queryNorm
              0.45831785 = fieldWeight in 111, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.1852746 = idf(docFreq=675, maxDocs=44421)
                0.0625 = fieldNorm(doc=111)
          0.032267313 = weight(abstract_txt:measure in 111) [ClassicSimilarity], result of:
            0.032267313 = score(doc=111,freq=1.0), product of:
              0.09497079 = queryWeight, product of:
                1.2934582 = boost
                5.4361663 = idf(docFreq=525, maxDocs=44421)
                0.013506565 = queryNorm
              0.3397604 = fieldWeight in 111, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4361663 = idf(docFreq=525, maxDocs=44421)
                0.0625 = fieldNorm(doc=111)
          0.13604352 = weight(abstract_txt:measures in 111) [ClassicSimilarity], result of:
            0.13604352 = score(doc=111,freq=2.0), product of:
              0.28372473 = queryWeight, product of:
                3.8722787 = boost
                5.424824 = idf(docFreq=531, maxDocs=44421)
                0.013506565 = queryNorm
              0.47949123 = fieldWeight in 111, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.424824 = idf(docFreq=531, maxDocs=44421)
                0.0625 = fieldNorm(doc=111)
          0.12333521 = weight(abstract_txt:size in 111) [ClassicSimilarity], result of:
            0.12333521 = score(doc=111,freq=1.0), product of:
              0.33484718 = queryWeight, product of:
                4.206698 = boost
                5.8933253 = idf(docFreq=332, maxDocs=44421)
                0.013506565 = queryNorm
              0.36833283 = fieldWeight in 111, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8933253 = idf(docFreq=332, maxDocs=44421)
                0.0625 = fieldNorm(doc=111)
          0.30711636 = weight(abstract_txt:concentration in 111) [ClassicSimilarity], result of:
            0.30711636 = score(doc=111,freq=1.0), product of:
              0.6151635 = queryWeight, product of:
                5.701817 = boost
                7.9878955 = idf(docFreq=40, maxDocs=44421)
                0.013506565 = queryNorm
              0.49924347 = fieldWeight in 111, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.9878955 = idf(docFreq=40, maxDocs=44421)
                0.0625 = fieldNorm(doc=111)
        0.2 = coord(5/25)
  5. Larivière, V.; Gingras, Y.; Archambault, E.: ¬The decline in the concentration of citations, 1900-2007 (2009) 0.11
    0.108984366 = sum of:
      0.108984366 = product of:
        0.90820307 = sum of:
          0.035003375 = weight(abstract_txt:four in 3763) [ClassicSimilarity], result of:
            0.035003375 = score(doc=3763,freq=1.0), product of:
              0.08640684 = queryWeight, product of:
                1.2337621 = boost
                5.1852746 = idf(docFreq=675, maxDocs=44421)
                0.013506565 = queryNorm
              0.40509957 = fieldWeight in 3763, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1852746 = idf(docFreq=675, maxDocs=44421)
                0.078125 = fieldNorm(doc=3763)
          0.20827326 = weight(abstract_txt:measures in 3763) [ClassicSimilarity], result of:
            0.20827326 = score(doc=3763,freq=3.0), product of:
              0.28372473 = queryWeight, product of:
                3.8722787 = boost
                5.424824 = idf(docFreq=531, maxDocs=44421)
                0.013506565 = queryNorm
              0.7340681 = fieldWeight in 3763, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.424824 = idf(docFreq=531, maxDocs=44421)
                0.078125 = fieldNorm(doc=3763)
          0.6649264 = weight(abstract_txt:concentration in 3763) [ClassicSimilarity], result of:
            0.6649264 = score(doc=3763,freq=3.0), product of:
              0.6151635 = queryWeight, product of:
                5.701817 = boost
                7.9878955 = idf(docFreq=40, maxDocs=44421)
                0.013506565 = queryNorm
              1.0808938 = fieldWeight in 3763, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.9878955 = idf(docFreq=40, maxDocs=44421)
                0.078125 = fieldNorm(doc=3763)
        0.12 = coord(3/25)
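
The content scores combine more factors than the author scores. As a hedged sketch, the following recomputes the score of the first content hit (doc 4410) from the boosts, frequencies and document frequencies listed in its explanation. Each term score is queryWeight (boost * idf * queryNorm) times fieldWeight (tf * idf * fieldNorm), and the sum is scaled by coord (matched terms / query terms). The names `TERMS`, `term_score` and `document_score` are hypothetical.

```python
import math

MAX_DOCS = 44421
QUERY_NORM = 0.013506565
FIELD_NORM = 0.078125      # fieldNorm(doc=4410)
QUERY_TERMS = 25           # denominator of coord(6/25)

# term: (boost, freq in doc 4410, docFreq), copied from the explanation above
TERMS = {
    "measure": (1.2934582, 4.0, 525),
    "relative": (1.451253, 3.0, 270),
    "productivity": (1.6170571, 2.0, 134),
    "will": (1.8376006, 1.0, 2539),
    "measures": (3.8722787, 1.0, 531),
    "concentration": (5.701817, 3.0, 40),
}


def idf(doc_freq, max_docs=MAX_DOCS):
    """idf variant that reproduces the values in this record."""
    return 1.0 + math.log(max_docs / (doc_freq + 1.0))


def term_score(boost, freq, doc_freq, field_norm=FIELD_NORM, query_norm=QUERY_NORM):
    """Score of one matching term: queryWeight * fieldWeight."""
    term_idf = idf(doc_freq)
    query_weight = boost * term_idf * query_norm
    field_weight = math.sqrt(freq) * term_idf * field_norm
    return query_weight * field_weight


def document_score(terms, query_terms=QUERY_TERMS):
    """Sum of term scores, scaled by the fraction of query terms matched."""
    total = sum(term_score(*values) for values in terms.values())
    coord = len(terms) / query_terms
    return total * coord


if __name__ == "__main__":
    print(document_score(TERMS))  # compare with 0.26517284 in the listing
```

The coord factor explains why hit 1, matching six of the 25 query terms, outranks hits whose summed term scores are similar or higher but which match only three terms.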