Document (#38646)

Author
Zhu, X.
Turney, P.
Lemire, D.
Vellino, A.
Title
Measuring academic influence : not all citations are equal
Source
Journal of the Association for Information Science and Technology. 66(2015) no.2, S.408-427
Year
2015
Abstract
The importance of a research article is routinely measured by counting how many times it has been cited. However, treating all citations with equal weight ignores the wide variety of functions that citations perform. We want to automatically identify the subset of references in a bibliography that have a central academic influence on the citing paper. For this purpose, we examine the effectiveness of a variety of features for determining the academic influence of a citation. By asking authors to identify the key references in their own work, we created a data set in which citations were labeled according to their academic influence. Using automatic feature selection with supervised machine learning, we found a model for predicting academic influence that achieves good performance on this data set using only four features. The best features, among those we evaluated, were those based on the number of times a reference is mentioned in the body of a citing paper. The performance of these features inspired us to design an influence-primed h-index (the hip-index). Unlike the conventional h-index, it weights citations by how many times a reference is mentioned. According to our experiments, the hip-index is a better indicator of researcher performance than the conventional h-index.
Content
Vgl.: http://onlinelibrary.wiley.com/doi/10.1002/asi.23179/abstract.
Theme
Informetrie

Similar documents (content)

  1. Wan, X.; Liu, F.: WL-index : leveraging citation mention number to quantify an individual's scientific impact (2014) 0.32
    0.32086113 = sum of:
      0.32086113 = product of:
        1.002691 = sum of:
          0.037301887 = weight(abstract_txt:reference in 2549) [ClassicSimilarity], result of:
            0.037301887 = score(doc=2549,freq=2.0), product of:
              0.094515234 = queryWeight, product of:
                1.1959053 = boost
                4.4651284 = idf(docFreq=1388, maxDocs=44421)
                0.01769991 = queryNorm
              0.39466533 = fieldWeight in 2549, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.4651284 = idf(docFreq=1388, maxDocs=44421)
                0.0625 = fieldNorm(doc=2549)
          0.06039495 = weight(abstract_txt:according in 2549) [ClassicSimilarity], result of:
            0.06039495 = score(doc=2549,freq=2.0), product of:
              0.13032119 = queryWeight, product of:
                1.404279 = boost
                5.2431293 = idf(docFreq=637, maxDocs=44421)
                0.01769991 = queryNorm
              0.46343154 = fieldWeight in 2549, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.2431293 = idf(docFreq=637, maxDocs=44421)
                0.0625 = fieldNorm(doc=2549)
          0.052002493 = weight(abstract_txt:references in 2549) [ClassicSimilarity], result of:
            0.052002493 = score(doc=2549,freq=1.0), product of:
              0.1486075 = queryWeight, product of:
                1.4995683 = boost
                5.598909 = idf(docFreq=446, maxDocs=44421)
                0.01769991 = queryNorm
              0.3499318 = fieldWeight in 2549, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.598909 = idf(docFreq=446, maxDocs=44421)
                0.0625 = fieldNorm(doc=2549)
          0.15371454 = weight(abstract_txt:citing in 2549) [ClassicSimilarity], result of:
            0.15371454 = score(doc=2549,freq=3.0), product of:
              0.21222384 = queryWeight, product of:
                1.7920206 = boost
                6.690832 = idf(docFreq=149, maxDocs=44421)
                0.01769991 = queryNorm
              0.72430384 = fieldWeight in 2549, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.690832 = idf(docFreq=149, maxDocs=44421)
                0.0625 = fieldNorm(doc=2549)
          0.1987469 = weight(abstract_txt:mentioned in 2549) [ClassicSimilarity], result of:
            0.1987469 = score(doc=2549,freq=4.0), product of:
              0.2288433 = queryWeight, product of:
                1.8608656 = boost
                6.9478774 = idf(docFreq=115, maxDocs=44421)
                0.01769991 = queryNorm
              0.8684847 = fieldWeight in 2549, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.9478774 = idf(docFreq=115, maxDocs=44421)
                0.0625 = fieldNorm(doc=2549)
          0.09853901 = weight(abstract_txt:times in 2549) [ClassicSimilarity], result of:
            0.09853901 = score(doc=2549,freq=1.0), product of:
              0.26049167 = queryWeight, product of:
                2.4315794 = boost
                6.0524936 = idf(docFreq=283, maxDocs=44421)
                0.01769991 = queryNorm
              0.37828085 = fieldWeight in 2549, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.0524936 = idf(docFreq=283, maxDocs=44421)
                0.0625 = fieldNorm(doc=2549)
          0.1774746 = weight(abstract_txt:index in 2549) [ClassicSimilarity], result of:
            0.1774746 = score(doc=2549,freq=5.0), product of:
              0.26736555 = queryWeight, product of:
                3.1803038 = boost
                4.7496953 = idf(docFreq=1044, maxDocs=44421)
                0.01769991 = queryNorm
              0.6637901 = fieldWeight in 2549, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.7496953 = idf(docFreq=1044, maxDocs=44421)
                0.0625 = fieldNorm(doc=2549)
          0.22451663 = weight(abstract_txt:citations in 2549) [ClassicSimilarity], result of:
            0.22451663 = score(doc=2549,freq=4.0), product of:
              0.33688653 = queryWeight, product of:
                3.5699136 = boost
                5.331567 = idf(docFreq=583, maxDocs=44421)
                0.01769991 = queryNorm
              0.66644585 = fieldWeight in 2549, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.331567 = idf(docFreq=583, maxDocs=44421)
                0.0625 = fieldNorm(doc=2549)
        0.32 = coord(8/25)
    
  2. Cronin, B.; Weaver-Wozniak, S.: Online access to acknowledgements (1993) 0.20
    0.20024705 = sum of:
      0.20024705 = product of:
        0.83436275 = sum of:
          0.060618192 = weight(abstract_txt:variety in 7826) [ClassicSimilarity], result of:
            0.060618192 = score(doc=7826,freq=1.0), product of:
              0.12561238 = queryWeight, product of:
                1.3786757 = boost
                5.1475344 = idf(docFreq=701, maxDocs=44421)
                0.01769991 = queryNorm
              0.48258135 = fieldWeight in 7826, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1475344 = idf(docFreq=701, maxDocs=44421)
                0.09375 = fieldNorm(doc=7826)
          0.06572861 = weight(abstract_txt:performance in 7826) [ClassicSimilarity], result of:
            0.06572861 = score(doc=7826,freq=1.0), product of:
              0.15176228 = queryWeight, product of:
                1.8559806 = boost
                4.619759 = idf(docFreq=1189, maxDocs=44421)
                0.01769991 = queryNorm
              0.43310243 = fieldWeight in 7826, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.619759 = idf(docFreq=1189, maxDocs=44421)
                0.09375 = fieldNorm(doc=7826)
          0.16018714 = weight(abstract_txt:academic in 7826) [ClassicSimilarity], result of:
            0.16018714 = score(doc=7826,freq=2.0), product of:
              0.25863397 = queryWeight, product of:
                3.1279418 = boost
                4.6714945 = idf(docFreq=1129, maxDocs=44421)
                0.01769991 = queryNorm
              0.6193585 = fieldWeight in 7826, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.6714945 = idf(docFreq=1129, maxDocs=44421)
                0.09375 = fieldNorm(doc=7826)
          0.11905359 = weight(abstract_txt:index in 7826) [ClassicSimilarity], result of:
            0.11905359 = score(doc=7826,freq=1.0), product of:
              0.26736555 = queryWeight, product of:
                3.1803038 = boost
                4.7496953 = idf(docFreq=1044, maxDocs=44421)
                0.01769991 = queryNorm
              0.44528395 = fieldWeight in 7826, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7496953 = idf(docFreq=1044, maxDocs=44421)
                0.09375 = fieldNorm(doc=7826)
          0.23813583 = weight(abstract_txt:citations in 7826) [ClassicSimilarity], result of:
            0.23813583 = score(doc=7826,freq=2.0), product of:
              0.33688653 = queryWeight, product of:
                3.5699136 = boost
                5.331567 = idf(docFreq=583, maxDocs=44421)
                0.01769991 = queryNorm
              0.7068725 = fieldWeight in 7826, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.331567 = idf(docFreq=583, maxDocs=44421)
                0.09375 = fieldNorm(doc=7826)
          0.19063935 = weight(abstract_txt:influence in 7826) [ClassicSimilarity], result of:
            0.19063935 = score(doc=7826,freq=1.0), product of:
              0.38887727 = queryWeight, product of:
                4.2015815 = boost
                5.229121 = idf(docFreq=646, maxDocs=44421)
                0.01769991 = queryNorm
              0.4902301 = fieldWeight in 7826, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.229121 = idf(docFreq=646, maxDocs=44421)
                0.09375 = fieldNorm(doc=7826)
        0.24 = coord(6/25)
    
  3. Wan, X.; Liu, F.: Are all literature citations equally important? : automatic citation strength estimation and its applications (2014) 0.17
    0.16560906 = sum of:
      0.16560906 = product of:
        0.8280453 = sum of:
          0.1130803 = weight(abstract_txt:labeled in 2350) [ClassicSimilarity], result of:
            0.1130803 = score(doc=2350,freq=2.0), product of:
              0.13541162 = queryWeight, product of:
                1.0121826 = boost
                7.558333 = idf(docFreq=62, maxDocs=44421)
                0.01769991 = queryNorm
              0.83508563 = fieldWeight in 2350, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.558333 = idf(docFreq=62, maxDocs=44421)
                0.078125 = fieldNorm(doc=2350)
          0.06934257 = weight(abstract_txt:features in 2350) [ClassicSimilarity], result of:
            0.06934257 = score(doc=2350,freq=1.0), product of:
              0.1954765 = queryWeight, product of:
                2.4322498 = boost
                4.5406218 = idf(docFreq=1287, maxDocs=44421)
                0.01769991 = queryNorm
              0.3547361 = fieldWeight in 2350, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5406218 = idf(docFreq=1287, maxDocs=44421)
                0.078125 = fieldNorm(doc=2350)
          0.140306 = weight(abstract_txt:index in 2350) [ClassicSimilarity], result of:
            0.140306 = score(doc=2350,freq=2.0), product of:
              0.26736555 = queryWeight, product of:
                3.1803038 = boost
                4.7496953 = idf(docFreq=1044, maxDocs=44421)
                0.01769991 = queryNorm
              0.52477217 = fieldWeight in 2350, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.7496953 = idf(docFreq=1044, maxDocs=44421)
                0.078125 = fieldNorm(doc=2350)
          0.2806458 = weight(abstract_txt:citations in 2350) [ClassicSimilarity], result of:
            0.2806458 = score(doc=2350,freq=4.0), product of:
              0.33688653 = queryWeight, product of:
                3.5699136 = boost
                5.331567 = idf(docFreq=583, maxDocs=44421)
                0.01769991 = queryNorm
              0.8330573 = fieldWeight in 2350, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.331567 = idf(docFreq=583, maxDocs=44421)
                0.078125 = fieldNorm(doc=2350)
          0.22467063 = weight(abstract_txt:influence in 2350) [ClassicSimilarity], result of:
            0.22467063 = score(doc=2350,freq=2.0), product of:
              0.38887727 = queryWeight, product of:
                4.2015815 = boost
                5.229121 = idf(docFreq=646, maxDocs=44421)
                0.01769991 = queryNorm
              0.57774174 = fieldWeight in 2350, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.229121 = idf(docFreq=646, maxDocs=44421)
                0.078125 = fieldNorm(doc=2350)
        0.2 = coord(5/25)
    
  4. González, L.; Campanario, J.M.: Structure of the impact factor of journals included in the Social Sciences Citation Index : citations from documents labeled "Editorial Material" (2007) 0.13
    0.12870164 = sum of:
      0.12870164 = product of:
        0.6435082 = sum of:
          0.095951825 = weight(abstract_txt:labeled in 1075) [ClassicSimilarity], result of:
            0.095951825 = score(doc=1075,freq=1.0), product of:
              0.13541162 = queryWeight, product of:
                1.0121826 = boost
                7.558333 = idf(docFreq=62, maxDocs=44421)
                0.01769991 = queryNorm
              0.7085937 = fieldWeight in 1075, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.558333 = idf(docFreq=62, maxDocs=44421)
                0.09375 = fieldNorm(doc=1075)
          0.030179791 = weight(abstract_txt:many in 1075) [ClassicSimilarity], result of:
            0.030179791 = score(doc=1075,freq=1.0), product of:
              0.07890562 = queryWeight, product of:
                1.0926973 = boost
                4.0797825 = idf(docFreq=2041, maxDocs=44421)
                0.01769991 = queryNorm
              0.3824796 = fieldWeight in 1075, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0797825 = idf(docFreq=2041, maxDocs=44421)
                0.09375 = fieldNorm(doc=1075)
          0.16018714 = weight(abstract_txt:academic in 1075) [ClassicSimilarity], result of:
            0.16018714 = score(doc=1075,freq=2.0), product of:
              0.25863397 = queryWeight, product of:
                3.1279418 = boost
                4.6714945 = idf(docFreq=1129, maxDocs=44421)
                0.01769991 = queryNorm
              0.6193585 = fieldWeight in 1075, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.6714945 = idf(docFreq=1129, maxDocs=44421)
                0.09375 = fieldNorm(doc=1075)
          0.11905359 = weight(abstract_txt:index in 1075) [ClassicSimilarity], result of:
            0.11905359 = score(doc=1075,freq=1.0), product of:
              0.26736555 = queryWeight, product of:
                3.1803038 = boost
                4.7496953 = idf(docFreq=1044, maxDocs=44421)
                0.01769991 = queryNorm
              0.44528395 = fieldWeight in 1075, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7496953 = idf(docFreq=1044, maxDocs=44421)
                0.09375 = fieldNorm(doc=1075)
          0.23813583 = weight(abstract_txt:citations in 1075) [ClassicSimilarity], result of:
            0.23813583 = score(doc=1075,freq=2.0), product of:
              0.33688653 = queryWeight, product of:
                3.5699136 = boost
                5.331567 = idf(docFreq=583, maxDocs=44421)
                0.01769991 = queryNorm
              0.7068725 = fieldWeight in 1075, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.331567 = idf(docFreq=583, maxDocs=44421)
                0.09375 = fieldNorm(doc=1075)
        0.2 = coord(5/25)
    
  5. Walters, W.H.: Google Scholar coverage of a multidisciplinary field (2007) 0.12
    0.12365268 = sum of:
      0.12365268 = product of:
        0.5152195 = sum of:
          0.025149826 = weight(abstract_txt:many in 1928) [ClassicSimilarity], result of:
            0.025149826 = score(doc=1928,freq=1.0), product of:
              0.07890562 = queryWeight, product of:
                1.0926973 = boost
                4.0797825 = idf(docFreq=2041, maxDocs=44421)
                0.01769991 = queryNorm
              0.318733 = fieldWeight in 1928, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0797825 = idf(docFreq=2041, maxDocs=44421)
                0.078125 = fieldNorm(doc=1928)
          0.032970518 = weight(abstract_txt:reference in 1928) [ClassicSimilarity], result of:
            0.032970518 = score(doc=1928,freq=1.0), product of:
              0.094515234 = queryWeight, product of:
                1.1959053 = boost
                4.4651284 = idf(docFreq=1388, maxDocs=44421)
                0.01769991 = queryNorm
              0.34883815 = fieldWeight in 1928, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4651284 = idf(docFreq=1388, maxDocs=44421)
                0.078125 = fieldNorm(doc=1928)
          0.123173766 = weight(abstract_txt:times in 1928) [ClassicSimilarity], result of:
            0.123173766 = score(doc=1928,freq=1.0), product of:
              0.26049167 = queryWeight, product of:
                2.4315794 = boost
                6.0524936 = idf(docFreq=283, maxDocs=44421)
                0.01769991 = queryNorm
              0.47285107 = fieldWeight in 1928, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.0524936 = idf(docFreq=283, maxDocs=44421)
                0.078125 = fieldNorm(doc=1928)
          0.09439118 = weight(abstract_txt:academic in 1928) [ClassicSimilarity], result of:
            0.09439118 = score(doc=1928,freq=1.0), product of:
              0.25863397 = queryWeight, product of:
                3.1279418 = boost
                4.6714945 = idf(docFreq=1129, maxDocs=44421)
                0.01769991 = queryNorm
              0.3649605 = fieldWeight in 1928, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6714945 = idf(docFreq=1129, maxDocs=44421)
                0.078125 = fieldNorm(doc=1928)
          0.09921131 = weight(abstract_txt:index in 1928) [ClassicSimilarity], result of:
            0.09921131 = score(doc=1928,freq=1.0), product of:
              0.26736555 = queryWeight, product of:
                3.1803038 = boost
                4.7496953 = idf(docFreq=1044, maxDocs=44421)
                0.01769991 = queryNorm
              0.37106994 = fieldWeight in 1928, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7496953 = idf(docFreq=1044, maxDocs=44421)
                0.078125 = fieldNorm(doc=1928)
          0.1403229 = weight(abstract_txt:citations in 1928) [ClassicSimilarity], result of:
            0.1403229 = score(doc=1928,freq=1.0), product of:
              0.33688653 = queryWeight, product of:
                3.5699136 = boost
                5.331567 = idf(docFreq=583, maxDocs=44421)
                0.01769991 = queryNorm
              0.41652864 = fieldWeight in 1928, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.331567 = idf(docFreq=583, maxDocs=44421)
                0.078125 = fieldNorm(doc=1928)
        0.24 = coord(6/25)