Document (#34010)

Author
Egghe, L.
Ravichandra Rao, I.K.
Title
¬The influence of the broadness of a query of a topic on its h-index : models and examples of the h-index of n-grams
Source
Journal of the American Society for Information Science and Technology. 59(2008) no.10, p.1688-1693
Year
2008
Series
Brief communication
Abstract
The article studies the influence of the query formulation of a topic on its h-index. To generate purely random sets of documents, we used N-grams (N variable) to measure this influence: strings of zeros, truncated at the end. The databases used are WoS and Scopus. The formula h = T^(1/α), proved in Egghe and Rousseau (2006), where T is the number of retrieved documents and α is Lotka's exponent, is confirmed to be a concavely increasing function of T. We also give a formula for the relation between h and N, the length of the N-gram: h = D·10^(-N/α), where D is a constant; this convexly decreasing function is found in our experiments. Nonlinear regression on h = T^(1/α) gives an estimate of α, which can then be used to estimate the h-index of the entire database (Web of Science [WoS] and Scopus): h = S^(1/α), where S is the total number of documents in the database.
Theme
Informetrics
Object
h-index
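
The two models named in the abstract, h = T^(1/α) and h = D·10^(-N/α), can be illustrated with a small sketch. Everything below is an assumption made for illustration: the data are synthetic (generated from the model itself, not taken from the article), the database size `S` is hypothetical, and the fit uses a simple log-linear least-squares line through the origin rather than the nonlinear regression the authors describe.

```python
import math

def h_from_T(T, alpha):
    """Model from the abstract: h = T**(1/alpha)."""
    return T ** (1.0 / alpha)

def h_from_N(N, D, alpha):
    """Model from the abstract: h = D * 10**(-N/alpha)."""
    return D * 10.0 ** (-N / alpha)

def fit_alpha(Ts, hs):
    """Estimate alpha from (T, h) pairs.

    Taking logs gives ln h = (1/alpha) * ln T, a line through the origin,
    so the least-squares slope is sum(x*y) / sum(x*x) and alpha = 1/slope.
    This is a stand-in for the article's nonlinear regression.
    """
    num = sum(math.log(T) * math.log(h) for T, h in zip(Ts, hs))
    den = sum(math.log(T) ** 2 for T in Ts)
    return den / num

# Synthetic, noise-free data generated from the model (illustration only).
alpha_true = 2.0
Ts = [100, 1_000, 10_000, 100_000]
hs = [h_from_T(T, alpha_true) for T in Ts]

alpha_hat = fit_alpha(Ts, hs)

# Extrapolate to a whole database of S documents (S is hypothetical).
S = 1_000_000
h_db = h_from_T(S, alpha_hat)

# h shrinks as the N-gram gets longer (D = 100.0 is an arbitrary constant).
h_by_length = [h_from_N(N, 100.0, alpha_hat) for N in range(1, 5)]
```

With α > 1 the first model grows ever more slowly in T (concave), and the second falls off ever more slowly in N (convex), which matches the shapes the abstract reports.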

Similar documents (author)

  1. Egghe, L.; Ravichandra Rao, I.K.: Duality revisited : construction of fractional frequency distributions based on two dual Lotka laws (2002) 5.29
    5.2915335 = sum of:
      5.2915335 = sum of:
        1.7858565 = weight(author_txt:egghe in 2006) [ClassicSimilarity], result of:
          1.7858565 = score(doc=2006,freq=1.0), product of:
            0.53776526 = queryWeight, product of:
              7.590594 = idf(docFreq=60, maxDocs=44421)
              0.070846274 = queryNorm
            3.3208847 = fieldWeight in 2006, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              7.590594 = idf(docFreq=60, maxDocs=44421)
              0.4375 = fieldNorm(doc=2006)
        3.5056772 = weight(author_txt:ravichandra in 2006) [ClassicSimilarity], result of:
          3.5056772 = score(doc=2006,freq=1.0), product of:
            0.84309465 = queryWeight, product of:
              1.252108 = boost
              9.504243 = idf(docFreq=8, maxDocs=44421)
              0.070846274 = queryNorm
            4.1581063 = fieldWeight in 2006, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              9.504243 = idf(docFreq=8, maxDocs=44421)
              0.4375 = fieldNorm(doc=2006)
    
  2. Egghe, L.; Ravichandra Rao, I.K.: Study of different h-indices for groups of authors (2008) 5.29
    5.2915335 = sum of:
      5.2915335 = sum of:
        1.7858565 = weight(author_txt:egghe in 2878) [ClassicSimilarity], result of:
          1.7858565 = score(doc=2878,freq=1.0), product of:
            0.53776526 = queryWeight, product of:
              7.590594 = idf(docFreq=60, maxDocs=44421)
              0.070846274 = queryNorm
            3.3208847 = fieldWeight in 2878, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              7.590594 = idf(docFreq=60, maxDocs=44421)
              0.4375 = fieldNorm(doc=2878)
        3.5056772 = weight(author_txt:ravichandra in 2878) [ClassicSimilarity], result of:
          3.5056772 = score(doc=2878,freq=1.0), product of:
            0.84309465 = queryWeight, product of:
              1.252108 = boost
              9.504243 = idf(docFreq=8, maxDocs=44421)
              0.070846274 = queryNorm
            4.1581063 = fieldWeight in 2878, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              9.504243 = idf(docFreq=8, maxDocs=44421)
              0.4375 = fieldNorm(doc=2878)
    
  3. Rao, I.K.Ravichandra -> Ravichandra Rao, I.K.: 1.75
    1.7528386 = sum of:
      1.7528386 = product of:
        3.5056772 = sum of:
          3.5056772 = weight(author_txt:ravichandra in 240) [ClassicSimilarity], result of:
            3.5056772 = score(doc=240,freq=1.0), product of:
              0.84309465 = queryWeight, product of:
                1.252108 = boost
                9.504243 = idf(docFreq=8, maxDocs=44421)
                0.070846274 = queryNorm
              4.1581063 = fieldWeight in 240, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.504243 = idf(docFreq=8, maxDocs=44421)
                0.4375 = fieldNorm(doc=240)
        0.5 = coord(1/2)
    
  4. Rao, I.K.R. -> Ravichandra Rao, I.K.: 1.75
    1.7528386 = sum of:
      1.7528386 = product of:
        3.5056772 = sum of:
          3.5056772 = weight(author_txt:ravichandra in 2794) [ClassicSimilarity], result of:
            3.5056772 = score(doc=2794,freq=1.0), product of:
              0.84309465 = queryWeight, product of:
                1.252108 = boost
                9.504243 = idf(docFreq=8, maxDocs=44421)
                0.070846274 = queryNorm
              4.1581063 = fieldWeight in 2794, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.504243 = idf(docFreq=8, maxDocs=44421)
                0.4375 = fieldNorm(doc=2794)
        0.5 = coord(1/2)
    
  5. Ravichandra Rao, I.K.; Neelameghan, A.: From librametry to informetrics : an overview and Ranganathan's contributions (1992) 1.75
    1.7528386 = sum of:
      1.7528386 = product of:
        3.5056772 = sum of:
          3.5056772 = weight(author_txt:ravichandra in 2963) [ClassicSimilarity], result of:
            3.5056772 = score(doc=2963,freq=1.0), product of:
              0.84309465 = queryWeight, product of:
                1.252108 = boost
                9.504243 = idf(docFreq=8, maxDocs=44421)
                0.070846274 = queryNorm
              4.1581063 = fieldWeight in 2963, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.504243 = idf(docFreq=8, maxDocs=44421)
                0.4375 = fieldNorm(doc=2963)
        0.5 = coord(1/2)
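
The score trees above follow a TF-IDF pattern: each term's score is queryWeight × fieldWeight, the term scores are summed, and the sum is scaled by a coord factor when only some query terms match. A minimal sketch that reproduces the numbers listed for the author matches is given below. The function names are hypothetical, and the idf expression 1 + ln(maxDocs / (docFreq + 1)) is assumed because it matches the idf values printed here; `queryNorm` and `fieldNorm` are copied from the listing rather than derived.

```python
import math

def idf(doc_freq, max_docs):
    # Assumed form; it reproduces the idf values shown in the listing.
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def tf(freq):
    # The listing shows tf = 1.0 for freq 1, 1.4142135 for 2, 1.7320508 for 3.
    return math.sqrt(freq)

def term_score(freq, doc_freq, max_docs, query_norm, field_norm, boost=1.0):
    """score = queryWeight * fieldWeight, as in the explain tree."""
    i = idf(doc_freq, max_docs)
    query_weight = boost * i * query_norm
    field_weight = tf(freq) * i * field_norm
    return query_weight * field_weight

MAX_DOCS = 44421
QUERY_NORM = 0.070846274   # copied from the listing
FIELD_NORM = 0.4375        # copied from the listing

egghe = term_score(1.0, 60, MAX_DOCS, QUERY_NORM, FIELD_NORM)
ravichandra = term_score(1.0, 8, MAX_DOCS, QUERY_NORM, FIELD_NORM,
                         boost=1.252108)

# Entries 1 and 2 match both author terms: the scores are simply summed.
both_terms = egghe + ravichandra

# Entries 3 to 5 match one of two terms, so coord(1/2) halves the sum.
one_term = ravichandra * (1 / 2)
```

The rarer term (`ravichandra`, docFreq 8) has the larger idf and carries a boost, which is why it contributes about twice as much as `egghe` to the combined score.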
    

Similar documents (content)

  1. Egghe, L.: ¬A new short proof of Naranan's theorem, explaining Lotka's law and Zipf's law (2010) 0.13
    0.1297984 = sum of:
      0.1297984 = product of:
        0.64899194 = sum of:
          0.040130503 = weight(abstract_txt:number in 419) [ClassicSimilarity], result of:
            0.040130503 = score(doc=419,freq=2.0), product of:
              0.07318836 = queryWeight, product of:
                1.0618664 = boost
                4.1356745 = idf(docFreq=1930, maxDocs=44421)
                0.016665785 = queryNorm
              0.54831815 = fieldWeight in 419, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.1356745 = idf(docFreq=1930, maxDocs=44421)
                0.09375 = fieldNorm(doc=419)
          0.2217813 = weight(abstract_txt:lotka's in 419) [ClassicSimilarity], result of:
            0.2217813 = score(doc=419,freq=3.0), product of:
              0.15862383 = queryWeight, product of:
                1.1053965 = boost
                8.610425 = idf(docFreq=21, maxDocs=44421)
                0.016665785 = queryNorm
              1.3981588 = fieldWeight in 419, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.610425 = idf(docFreq=21, maxDocs=44421)
                0.09375 = fieldNorm(doc=419)
          0.1398959 = weight(abstract_txt:exponent in 419) [ClassicSimilarity], result of:
            0.1398959 = score(doc=419,freq=1.0), product of:
              0.16826569 = queryWeight, product of:
                1.1384964 = boost
                8.868255 = idf(docFreq=16, maxDocs=44421)
                0.016665785 = queryNorm
              0.83139884 = fieldWeight in 419, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.868255 = idf(docFreq=16, maxDocs=44421)
                0.09375 = fieldNorm(doc=419)
          0.0694147 = weight(abstract_txt:function in 419) [ClassicSimilarity], result of:
            0.0694147 = score(doc=419,freq=1.0), product of:
              0.13287294 = queryWeight, product of:
                1.4307611 = boost
                5.5724173 = idf(docFreq=458, maxDocs=44421)
                0.016665785 = queryNorm
              0.5224141 = fieldWeight in 419, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5724173 = idf(docFreq=458, maxDocs=44421)
                0.09375 = fieldNorm(doc=419)
          0.17776948 = weight(abstract_txt:formula in 419) [ClassicSimilarity], result of:
            0.17776948 = score(doc=419,freq=1.0), product of:
              0.2487179 = queryWeight, product of:
                1.9575028 = boost
                7.62393 = idf(docFreq=58, maxDocs=44421)
                0.016665785 = queryNorm
              0.71474344 = fieldWeight in 419, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.62393 = idf(docFreq=58, maxDocs=44421)
                0.09375 = fieldNorm(doc=419)
        0.2 = coord(5/25)
    
  2. Burrell, Q.L.: Formulae for the h-index : a lack of robustness in Lotkaian informetrics? (2013) 0.12
    0.11509674 = sum of:
      0.11509674 = product of:
        0.5754837 = sum of:
          0.12477588 = weight(abstract_txt:egghe in 1977) [ClassicSimilarity], result of:
            0.12477588 = score(doc=1977,freq=2.0), product of:
              0.16215494 = queryWeight, product of:
                1.1176324 = boost
                8.705735 = idf(docFreq=19, maxDocs=44421)
                0.016665785 = queryNorm
              0.76948553 = fieldWeight in 1977, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.705735 = idf(docFreq=19, maxDocs=44421)
                0.0625 = fieldNorm(doc=1977)
          0.13755886 = weight(abstract_txt:rousseau in 1977) [ClassicSimilarity], result of:
            0.13755886 = score(doc=1977,freq=2.0), product of:
              0.17304888 = queryWeight, product of:
                1.1545647 = boost
                8.993418 = idf(docFreq=14, maxDocs=44421)
                0.016665785 = queryNorm
              0.7949133 = fieldWeight in 1977, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.993418 = idf(docFreq=14, maxDocs=44421)
                0.0625 = fieldNorm(doc=1977)
          0.046276465 = weight(abstract_txt:function in 1977) [ClassicSimilarity], result of:
            0.046276465 = score(doc=1977,freq=1.0), product of:
              0.13287294 = queryWeight, product of:
                1.4307611 = boost
                5.5724173 = idf(docFreq=458, maxDocs=44421)
                0.016665785 = queryNorm
              0.34827608 = fieldWeight in 1977, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5724173 = idf(docFreq=458, maxDocs=44421)
                0.0625 = fieldNorm(doc=1977)
          0.16760269 = weight(abstract_txt:formula in 1977) [ClassicSimilarity], result of:
            0.16760269 = score(doc=1977,freq=2.0), product of:
              0.2487179 = queryWeight, product of:
                1.9575028 = boost
                7.62393 = idf(docFreq=58, maxDocs=44421)
                0.016665785 = queryNorm
              0.67386657 = fieldWeight in 1977, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.62393 = idf(docFreq=58, maxDocs=44421)
                0.0625 = fieldNorm(doc=1977)
          0.0992698 = weight(abstract_txt:index in 1977) [ClassicSimilarity], result of:
            0.0992698 = score(doc=1977,freq=3.0), product of:
              0.1930682 = queryWeight, product of:
                2.4390419 = boost
                4.7496953 = idf(docFreq=1044, maxDocs=44421)
                0.016665785 = queryNorm
              0.5141696 = fieldWeight in 1977, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.7496953 = idf(docFreq=1044, maxDocs=44421)
                0.0625 = fieldNorm(doc=1977)
        0.2 = coord(5/25)
    
  3. Carterette, B.; Can, F.: Comparing inverted files and signature files for searching a large lexicon (2005) 0.11
    0.108800374 = sum of:
      0.108800374 = product of:
        0.5440019 = sum of:
          0.083678655 = weight(abstract_txt:gram in 2029) [ClassicSimilarity], result of:
            0.083678655 = score(doc=2029,freq=1.0), product of:
              0.13489303 = queryWeight, product of:
                1.0193624 = boost
                7.9402676 = idf(docFreq=42, maxDocs=44421)
                0.016665785 = queryNorm
              0.62033343 = fieldWeight in 2029, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.9402676 = idf(docFreq=42, maxDocs=44421)
                0.078125 = fieldNorm(doc=2029)
          0.023647126 = weight(abstract_txt:number in 2029) [ClassicSimilarity], result of:
            0.023647126 = score(doc=2029,freq=1.0), product of:
              0.07318836 = queryWeight, product of:
                1.0618664 = boost
                4.1356745 = idf(docFreq=1930, maxDocs=44421)
                0.016665785 = queryNorm
              0.32309955 = fieldWeight in 2029, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1356745 = idf(docFreq=1930, maxDocs=44421)
                0.078125 = fieldNorm(doc=2029)
          0.051442668 = weight(abstract_txt:where in 2029) [ClassicSimilarity], result of:
            0.051442668 = score(doc=2029,freq=1.0), product of:
              0.14065953 = queryWeight, product of:
                1.802931 = boost
                4.681277 = idf(docFreq=1118, maxDocs=44421)
                0.016665785 = queryNorm
              0.36572474 = fieldWeight in 2029, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.681277 = idf(docFreq=1118, maxDocs=44421)
                0.078125 = fieldNorm(doc=2029)
          0.2611462 = weight(abstract_txt:grams in 2029) [ClassicSimilarity], result of:
            0.2611462 = score(doc=2029,freq=2.0), product of:
              0.28807276 = queryWeight, product of:
                2.1066868 = boost
                8.20496 = idf(docFreq=32, maxDocs=44421)
                0.016665785 = queryNorm
              0.90652853 = fieldWeight in 2029, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.20496 = idf(docFreq=32, maxDocs=44421)
                0.078125 = fieldNorm(doc=2029)
          0.12408725 = weight(abstract_txt:index in 2029) [ClassicSimilarity], result of:
            0.12408725 = score(doc=2029,freq=3.0), product of:
              0.1930682 = queryWeight, product of:
                2.4390419 = boost
                4.7496953 = idf(docFreq=1044, maxDocs=44421)
                0.016665785 = queryNorm
              0.642712 = fieldWeight in 2029, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.7496953 = idf(docFreq=1044, maxDocs=44421)
                0.078125 = fieldNorm(doc=2029)
        0.2 = coord(5/25)
    
  4. Cohen, J.D.: Highlights: language- and domain-independent automatic indexing terms for abstracting (1995) 0.11
    0.10677495 = sum of:
      0.10677495 = product of:
        0.53387475 = sum of:
          0.05178087 = weight(abstract_txt:topic in 1861) [ClassicSimilarity], result of:
            0.05178087 = score(doc=1861,freq=1.0), product of:
              0.10929035 = queryWeight, product of:
                1.2975968 = boost
                5.053779 = idf(docFreq=770, maxDocs=44421)
                0.016665785 = queryNorm
              0.47379178 = fieldWeight in 1861, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.053779 = idf(docFreq=770, maxDocs=44421)
                0.09375 = fieldNorm(doc=1861)
          0.0694147 = weight(abstract_txt:function in 1861) [ClassicSimilarity], result of:
            0.0694147 = score(doc=1861,freq=1.0), product of:
              0.13287294 = queryWeight, product of:
                1.4307611 = boost
                5.5724173 = idf(docFreq=458, maxDocs=44421)
                0.016665785 = queryNorm
              0.5224141 = fieldWeight in 1861, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5724173 = idf(docFreq=458, maxDocs=44421)
                0.09375 = fieldNorm(doc=1861)
          0.04218457 = weight(abstract_txt:documents in 1861) [ClassicSimilarity], result of:
            0.04218457 = score(doc=1861,freq=1.0), product of:
              0.109127715 = queryWeight, product of:
                1.5880423 = boost
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.016665785 = queryNorm
              0.38656145 = fieldWeight in 1861, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.09375 = fieldNorm(doc=1861)
          0.2215899 = weight(abstract_txt:grams in 1861) [ClassicSimilarity], result of:
            0.2215899 = score(doc=1861,freq=1.0), product of:
              0.28807276 = queryWeight, product of:
                2.1066868 = boost
                8.20496 = idf(docFreq=32, maxDocs=44421)
                0.016665785 = queryNorm
              0.769215 = fieldWeight in 1861, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.20496 = idf(docFreq=32, maxDocs=44421)
                0.09375 = fieldNorm(doc=1861)
          0.1489047 = weight(abstract_txt:index in 1861) [ClassicSimilarity], result of:
            0.1489047 = score(doc=1861,freq=3.0), product of:
              0.1930682 = queryWeight, product of:
                2.4390419 = boost
                4.7496953 = idf(docFreq=1044, maxDocs=44421)
                0.016665785 = queryNorm
              0.77125436 = fieldWeight in 1861, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.7496953 = idf(docFreq=1044, maxDocs=44421)
                0.09375 = fieldNorm(doc=1861)
        0.2 = coord(5/25)
    
  5. Bodoff, D.: Test theory for evaluating reliability of IR test collections (2008) 0.10
    0.09882232 = sum of:
      0.09882232 = product of:
        0.4941116 = sum of:
          0.06579964 = weight(abstract_txt:estimation in 3085) [ClassicSimilarity], result of:
            0.06579964 = score(doc=3085,freq=1.0), product of:
              0.13335279 = queryWeight, product of:
                1.0135261 = boost
                7.894805 = idf(docFreq=44, maxDocs=44421)
                0.016665785 = queryNorm
              0.4934253 = fieldWeight in 3085, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.894805 = idf(docFreq=44, maxDocs=44421)
                0.0625 = fieldNorm(doc=3085)
          0.018917702 = weight(abstract_txt:number in 3085) [ClassicSimilarity], result of:
            0.018917702 = score(doc=3085,freq=1.0), product of:
              0.07318836 = queryWeight, product of:
                1.0618664 = boost
                4.1356745 = idf(docFreq=1930, maxDocs=44421)
                0.016665785 = queryNorm
              0.25847965 = fieldWeight in 3085, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1356745 = idf(docFreq=1930, maxDocs=44421)
                0.0625 = fieldNorm(doc=3085)
          0.046276465 = weight(abstract_txt:function in 3085) [ClassicSimilarity], result of:
            0.046276465 = score(doc=3085,freq=1.0), product of:
              0.13287294 = queryWeight, product of:
                1.4307611 = boost
                5.5724173 = idf(docFreq=458, maxDocs=44421)
                0.016665785 = queryNorm
              0.34827608 = fieldWeight in 3085, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5724173 = idf(docFreq=458, maxDocs=44421)
                0.0625 = fieldNorm(doc=3085)
          0.041154135 = weight(abstract_txt:where in 3085) [ClassicSimilarity], result of:
            0.041154135 = score(doc=3085,freq=1.0), product of:
              0.14065953 = queryWeight, product of:
                1.802931 = boost
                4.681277 = idf(docFreq=1118, maxDocs=44421)
                0.016665785 = queryNorm
              0.2925798 = fieldWeight in 3085, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.681277 = idf(docFreq=1118, maxDocs=44421)
                0.0625 = fieldNorm(doc=3085)
          0.32196367 = weight(abstract_txt:alpha in 3085) [ClassicSimilarity], result of:
            0.32196367 = score(doc=3085,freq=1.0), product of:
              0.610114 = queryWeight, product of:
                4.335801 = boost
                8.443371 = idf(docFreq=25, maxDocs=44421)
                0.016665785 = queryNorm
              0.5277107 = fieldWeight in 3085, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.443371 = idf(docFreq=25, maxDocs=44421)
                0.0625 = fieldNorm(doc=3085)
        0.2 = coord(5/25)