Document (#36349)

Author
Ding, Y.
Title
Topic-based PageRank on author cocitation networks
Source
Journal of the American Society for Information Science and Technology. 62(2011) no.3, S.449-466
Year
2011
Abstract
Ranking authors is vital for identifying a researcher's impact and standing within a scientific field. There are many different ranking methods (e.g., citations, publications, h-index, PageRank, and weighted PageRank), but most of them are topic-independent. This paper proposes topic-dependent ranks based on the combination of a topic model and a weighted PageRank algorithm. The author-conference-topic (ACT) model was used to extract topic distribution of individual authors. Two ways for combining the ACT model with the PageRank algorithm are proposed: simple combination (I_PR) or using a topic distribution as a weighted vector for PageRank (PR_t). Information retrieval was chosen as the test field and representative authors for different topics at different time phases were identified. Principal component analysis (PCA) was applied to analyze the ranking difference between I_PR and PR_t.
Theme
Retrievalalgorithmen
Object
PageRank

Similar documents (author)

  1. Ding, Y.: Visualization of intellectual structure in information retrieval : author cocitation analysis (1998) 4.76
    4.7649565 = sum of:
      4.7649565 = weight(author_txt:ding in 3792) [ClassicSimilarity], result of:
        4.7649565 = fieldWeight in 3792, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.62393 = idf(docFreq=58, maxDocs=44421)
          0.625 = fieldNorm(doc=3792)
    
  2. Ding, Y.: Scholarly communication and bibliometrics : Part 1: The scholarly communication model: literature review (1998) 4.76
    4.7649565 = sum of:
      4.7649565 = weight(author_txt:ding in 4995) [ClassicSimilarity], result of:
        4.7649565 = fieldWeight in 4995, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.62393 = idf(docFreq=58, maxDocs=44421)
          0.625 = fieldNorm(doc=4995)
    
  3. Ding, C.H.Q.: ¬A probabilistic model for Latent Semantic Indexing (2005) 4.76
    4.7649565 = sum of:
      4.7649565 = weight(author_txt:ding in 4459) [ClassicSimilarity], result of:
        4.7649565 = fieldWeight in 4459, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.62393 = idf(docFreq=58, maxDocs=44421)
          0.625 = fieldNorm(doc=4459)
    
  4. Ding, Y.: ¬A review of ontologies with the Semantic Web in view (2001) 4.76
    4.7649565 = sum of:
      4.7649565 = weight(author_txt:ding in 5152) [ClassicSimilarity], result of:
        4.7649565 = fieldWeight in 5152, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.62393 = idf(docFreq=58, maxDocs=44421)
          0.625 = fieldNorm(doc=5152)
    
  5. Ding, Y.: Applying weighted PageRank to author citation networks (2011) 4.76
    4.7649565 = sum of:
      4.7649565 = weight(author_txt:ding in 188) [ClassicSimilarity], result of:
        4.7649565 = fieldWeight in 188, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.62393 = idf(docFreq=58, maxDocs=44421)
          0.625 = fieldNorm(doc=188)
    

Similar documents (content)

  1. Ding, Y.; Yan, E.; Frazho, A.; Caverlee, J.: PageRank for ranking authors in co-citation networks (2009) 0.58
    0.58175427 = sum of:
      0.58175427 = product of:
        1.615984 = sum of:
          0.008250928 = weight(abstract_txt:based in 148) [ClassicSimilarity], result of:
            0.008250928 = score(doc=148,freq=1.0), product of:
              0.041473992 = queryWeight, product of:
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.013029535 = queryNorm
              0.1989422 = fieldWeight in 148, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.0625 = fieldNorm(doc=148)
          0.05523446 = weight(abstract_txt:ranks in 148) [ClassicSimilarity], result of:
            0.05523446 = score(doc=148,freq=1.0), product of:
              0.11692411 = queryWeight, product of:
                1.187269 = boost
                7.558333 = idf(docFreq=62, maxDocs=44421)
                0.013029535 = queryNorm
              0.4723958 = fieldWeight in 148, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.558333 = idf(docFreq=62, maxDocs=44421)
                0.0625 = fieldNorm(doc=148)
          0.05472953 = weight(abstract_txt:author in 148) [ClassicSimilarity], result of:
            0.05472953 = score(doc=148,freq=3.0), product of:
              0.10151917 = queryWeight, product of:
                1.564538 = boost
                4.980042 = idf(docFreq=829, maxDocs=44421)
                0.013029535 = queryNorm
              0.53910536 = fieldWeight in 148, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.980042 = idf(docFreq=829, maxDocs=44421)
                0.0625 = fieldNorm(doc=148)
          0.032580823 = weight(abstract_txt:different in 148) [ClassicSimilarity], result of:
            0.032580823 = score(doc=148,freq=3.0), product of:
              0.08223791 = queryWeight, product of:
                1.7246213 = boost
                3.6597328 = idf(docFreq=3107, maxDocs=44421)
                0.013029535 = queryNorm
              0.39617768 = fieldWeight in 148, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.6597328 = idf(docFreq=3107, maxDocs=44421)
                0.0625 = fieldNorm(doc=148)
          0.04750426 = weight(abstract_txt:algorithm in 148) [ClassicSimilarity], result of:
            0.04750426 = score(doc=148,freq=1.0), product of:
              0.13322806 = queryWeight, product of:
                1.7922969 = boost
                5.7050157 = idf(docFreq=401, maxDocs=44421)
                0.013029535 = queryNorm
              0.35656348 = fieldWeight in 148, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7050157 = idf(docFreq=401, maxDocs=44421)
                0.0625 = fieldNorm(doc=148)
          0.08601565 = weight(abstract_txt:authors in 148) [ClassicSimilarity], result of:
            0.08601565 = score(doc=148,freq=5.0), product of:
              0.13249497 = queryWeight, product of:
                2.1890588 = boost
                4.6452923 = idf(docFreq=1159, maxDocs=44421)
                0.013029535 = queryNorm
              0.64919937 = fieldWeight in 148, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.6452923 = idf(docFreq=1159, maxDocs=44421)
                0.0625 = fieldNorm(doc=148)
          0.09513897 = weight(abstract_txt:ranking in 148) [ClassicSimilarity], result of:
            0.09513897 = score(doc=148,freq=2.0), product of:
              0.19232395 = queryWeight, product of:
                2.6373904 = boost
                5.5966744 = idf(docFreq=447, maxDocs=44421)
                0.013029535 = queryNorm
              0.4946808 = fieldWeight in 148, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.5966744 = idf(docFreq=447, maxDocs=44421)
                0.0625 = fieldNorm(doc=148)
          0.22293097 = weight(abstract_txt:weighted in 148) [ClassicSimilarity], result of:
            0.22293097 = score(doc=148,freq=3.0), product of:
              0.2963996 = queryWeight, product of:
                3.2741344 = boost
                6.9478774 = idf(docFreq=115, maxDocs=44421)
                0.013029535 = queryNorm
              0.7521298 = fieldWeight in 148, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.9478774 = idf(docFreq=115, maxDocs=44421)
                0.0625 = fieldNorm(doc=148)
          1.0135983 = weight(abstract_txt:pagerank in 148) [ClassicSimilarity], result of:
            1.0135983 = score(doc=148,freq=9.0), product of:
              0.710631 = queryWeight, product of:
                7.169598 = boost
                7.607123 = idf(docFreq=59, maxDocs=44421)
                0.013029535 = queryNorm
              1.4263356 = fieldWeight in 148, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                7.607123 = idf(docFreq=59, maxDocs=44421)
                0.0625 = fieldNorm(doc=148)
        0.36 = coord(9/25)
    
  2. Liu, X.; Zhang, J.; Guo, C.: Full-text citation analysis : a new method to enhance scholarly networks (2013) 0.43
    0.4284931 = sum of:
      0.4284931 = product of:
        1.339041 = sum of:
          0.04468647 = weight(abstract_txt:author in 2044) [ClassicSimilarity], result of:
            0.04468647 = score(doc=2044,freq=2.0), product of:
              0.10151917 = queryWeight, product of:
                1.564538 = boost
                4.980042 = idf(docFreq=829, maxDocs=44421)
                0.013029535 = queryNorm
              0.44017768 = fieldWeight in 2044, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.980042 = idf(docFreq=829, maxDocs=44421)
                0.0625 = fieldNorm(doc=2044)
          0.018810548 = weight(abstract_txt:different in 2044) [ClassicSimilarity], result of:
            0.018810548 = score(doc=2044,freq=1.0), product of:
              0.08223791 = queryWeight, product of:
                1.7246213 = boost
                3.6597328 = idf(docFreq=3107, maxDocs=44421)
                0.013029535 = queryNorm
              0.2287333 = fieldWeight in 2044, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6597328 = idf(docFreq=3107, maxDocs=44421)
                0.0625 = fieldNorm(doc=2044)
          0.07768064 = weight(abstract_txt:distribution in 2044) [ClassicSimilarity], result of:
            0.07768064 = score(doc=2044,freq=3.0), product of:
              0.12821597 = queryWeight, product of:
                1.7582603 = boost
                5.5966744 = idf(docFreq=447, maxDocs=44421)
                0.013029535 = queryNorm
              0.6058578 = fieldWeight in 2044, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.5966744 = idf(docFreq=447, maxDocs=44421)
                0.0625 = fieldNorm(doc=2044)
          0.04750426 = weight(abstract_txt:algorithm in 2044) [ClassicSimilarity], result of:
            0.04750426 = score(doc=2044,freq=1.0), product of:
              0.13322806 = queryWeight, product of:
                1.7922969 = boost
                5.7050157 = idf(docFreq=401, maxDocs=44421)
                0.013029535 = queryNorm
              0.35656348 = fieldWeight in 2044, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7050157 = idf(docFreq=401, maxDocs=44421)
                0.0625 = fieldNorm(doc=2044)
          0.03428688 = weight(abstract_txt:model in 2044) [ClassicSimilarity], result of:
            0.03428688 = score(doc=2044,freq=2.0), product of:
              0.09739718 = queryWeight, product of:
                1.8768557 = boost
                3.9827821 = idf(docFreq=2249, maxDocs=44421)
                0.013029535 = queryNorm
              0.35203153 = fieldWeight in 2044, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.9827821 = idf(docFreq=2249, maxDocs=44421)
                0.0625 = fieldNorm(doc=2044)
          0.13454682 = weight(abstract_txt:ranking in 2044) [ClassicSimilarity], result of:
            0.13454682 = score(doc=2044,freq=4.0), product of:
              0.19232395 = queryWeight, product of:
                2.6373904 = boost
                5.5966744 = idf(docFreq=447, maxDocs=44421)
                0.013029535 = queryNorm
              0.6995843 = fieldWeight in 2044, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.5966744 = idf(docFreq=447, maxDocs=44421)
                0.0625 = fieldNorm(doc=2044)
          0.30579323 = weight(abstract_txt:topic in 2044) [ClassicSimilarity], result of:
            0.30579323 = score(doc=2044,freq=7.0), product of:
              0.365917 = queryWeight, product of:
                5.5569615 = boost
                5.053779 = idf(docFreq=770, maxDocs=44421)
                0.013029535 = queryNorm
              0.83569014 = fieldWeight in 2044, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                5.053779 = idf(docFreq=770, maxDocs=44421)
                0.0625 = fieldNorm(doc=2044)
          0.6757322 = weight(abstract_txt:pagerank in 2044) [ClassicSimilarity], result of:
            0.6757322 = score(doc=2044,freq=4.0), product of:
              0.710631 = queryWeight, product of:
                7.169598 = boost
                7.607123 = idf(docFreq=59, maxDocs=44421)
                0.013029535 = queryNorm
              0.95089036 = fieldWeight in 2044, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.607123 = idf(docFreq=59, maxDocs=44421)
                0.0625 = fieldNorm(doc=2044)
        0.32 = coord(8/25)
    
  3. Ding, Y.: Applying weighted PageRank to author citation networks (2011) 0.36
    0.3569039 = sum of:
      0.3569039 = product of:
        1.4870996 = sum of:
          0.06716297 = weight(abstract_txt:principal in 188) [ClassicSimilarity], result of:
            0.06716297 = score(doc=188,freq=1.0), product of:
              0.10165368 = queryWeight, product of:
                1.1070281 = boost
                7.0475073 = idf(docFreq=104, maxDocs=44421)
                0.013029535 = queryNorm
              0.6607038 = fieldWeight in 188, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.0475073 = idf(docFreq=104, maxDocs=44421)
                0.09375 = fieldNorm(doc=188)
          0.04907421 = weight(abstract_txt:field in 188) [ClassicSimilarity], result of:
            0.04907421 = score(doc=188,freq=2.0), product of:
              0.08246545 = queryWeight, product of:
                1.4100941 = boost
                4.4884357 = idf(docFreq=1356, maxDocs=44421)
                0.013029535 = queryNorm
              0.5950881 = fieldWeight in 188, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.4884357 = idf(docFreq=1356, maxDocs=44421)
                0.09375 = fieldNorm(doc=188)
          0.06702971 = weight(abstract_txt:author in 188) [ClassicSimilarity], result of:
            0.06702971 = score(doc=188,freq=2.0), product of:
              0.10151917 = queryWeight, product of:
                1.564538 = boost
                4.980042 = idf(docFreq=829, maxDocs=44421)
                0.013029535 = queryNorm
              0.6602665 = fieldWeight in 188, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.980042 = idf(docFreq=829, maxDocs=44421)
                0.09375 = fieldNorm(doc=188)
          0.039903197 = weight(abstract_txt:different in 188) [ClassicSimilarity], result of:
            0.039903197 = score(doc=188,freq=2.0), product of:
              0.08223791 = queryWeight, product of:
                1.7246213 = boost
                3.6597328 = idf(docFreq=3107, maxDocs=44421)
                0.013029535 = queryNorm
              0.48521662 = fieldWeight in 188, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.6597328 = idf(docFreq=3107, maxDocs=44421)
                0.09375 = fieldNorm(doc=188)
          0.38612774 = weight(abstract_txt:weighted in 188) [ClassicSimilarity], result of:
            0.38612774 = score(doc=188,freq=4.0), product of:
              0.2963996 = queryWeight, product of:
                3.2741344 = boost
                6.9478774 = idf(docFreq=115, maxDocs=44421)
                0.013029535 = queryNorm
              1.302727 = fieldWeight in 188, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.9478774 = idf(docFreq=115, maxDocs=44421)
                0.09375 = fieldNorm(doc=188)
          0.8778019 = weight(abstract_txt:pagerank in 188) [ClassicSimilarity], result of:
            0.8778019 = score(doc=188,freq=3.0), product of:
              0.710631 = queryWeight, product of:
                7.169598 = boost
                7.607123 = idf(docFreq=59, maxDocs=44421)
                0.013029535 = queryNorm
              1.2352428 = fieldWeight in 188, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.607123 = idf(docFreq=59, maxDocs=44421)
                0.09375 = fieldNorm(doc=188)
        0.24 = coord(6/25)
    
  4. Yan, E.; Ding, Y.: Discovering author impact : a PageRank perspective (2011) 0.32
    0.32016355 = sum of:
      0.32016355 = product of:
        1.6008177 = sum of:
          0.08209429 = weight(abstract_txt:author in 3704) [ClassicSimilarity], result of:
            0.08209429 = score(doc=3704,freq=3.0), product of:
              0.10151917 = queryWeight, product of:
                1.564538 = boost
                4.980042 = idf(docFreq=829, maxDocs=44421)
                0.013029535 = queryNorm
              0.808658 = fieldWeight in 3704, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.980042 = idf(docFreq=829, maxDocs=44421)
                0.09375 = fieldNorm(doc=3704)
          0.028215822 = weight(abstract_txt:different in 3704) [ClassicSimilarity], result of:
            0.028215822 = score(doc=3704,freq=1.0), product of:
              0.08223791 = queryWeight, product of:
                1.7246213 = boost
                3.6597328 = idf(docFreq=3107, maxDocs=44421)
                0.013029535 = queryNorm
              0.34309995 = fieldWeight in 3704, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6597328 = idf(docFreq=3107, maxDocs=44421)
                0.09375 = fieldNorm(doc=3704)
          0.14251278 = weight(abstract_txt:algorithm in 3704) [ClassicSimilarity], result of:
            0.14251278 = score(doc=3704,freq=4.0), product of:
              0.13322806 = queryWeight, product of:
                1.7922969 = boost
                5.7050157 = idf(docFreq=401, maxDocs=44421)
                0.013029535 = queryNorm
              1.0696905 = fieldWeight in 3704, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.7050157 = idf(docFreq=401, maxDocs=44421)
                0.09375 = fieldNorm(doc=3704)
          0.33439645 = weight(abstract_txt:weighted in 3704) [ClassicSimilarity], result of:
            0.33439645 = score(doc=3704,freq=3.0), product of:
              0.2963996 = queryWeight, product of:
                3.2741344 = boost
                6.9478774 = idf(docFreq=115, maxDocs=44421)
                0.013029535 = queryNorm
              1.1281947 = fieldWeight in 3704, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.9478774 = idf(docFreq=115, maxDocs=44421)
                0.09375 = fieldNorm(doc=3704)
          1.0135983 = weight(abstract_txt:pagerank in 3704) [ClassicSimilarity], result of:
            1.0135983 = score(doc=3704,freq=4.0), product of:
              0.710631 = queryWeight, product of:
                7.169598 = boost
                7.607123 = idf(docFreq=59, maxDocs=44421)
                0.013029535 = queryNorm
              1.4263356 = fieldWeight in 3704, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.607123 = idf(docFreq=59, maxDocs=44421)
                0.09375 = fieldNorm(doc=3704)
        0.2 = coord(5/25)
    
  5. Bryan, K.; Leise, T.: ¬The $25.000.000.000 eigenvector : the linear algebra behind Google 0.25
    0.24745253 = sum of:
      0.24745253 = product of:
        1.2372626 = sum of:
          0.082851686 = weight(abstract_txt:ranks in 2353) [ClassicSimilarity], result of:
            0.082851686 = score(doc=2353,freq=1.0), product of:
              0.11692411 = queryWeight, product of:
                1.187269 = boost
                7.558333 = idf(docFreq=62, maxDocs=44421)
                0.013029535 = queryNorm
              0.7085937 = fieldWeight in 2353, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.558333 = idf(docFreq=62, maxDocs=44421)
                0.09375 = fieldNorm(doc=2353)
          0.07125639 = weight(abstract_txt:algorithm in 2353) [ClassicSimilarity], result of:
            0.07125639 = score(doc=2353,freq=1.0), product of:
              0.13322806 = queryWeight, product of:
                1.7922969 = boost
                5.7050157 = idf(docFreq=401, maxDocs=44421)
                0.013029535 = queryNorm
              0.53484523 = fieldWeight in 2353, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7050157 = idf(docFreq=401, maxDocs=44421)
                0.09375 = fieldNorm(doc=2353)
          0.19306387 = weight(abstract_txt:weighted in 2353) [ClassicSimilarity], result of:
            0.19306387 = score(doc=2353,freq=1.0), product of:
              0.2963996 = queryWeight, product of:
                3.2741344 = boost
                6.9478774 = idf(docFreq=115, maxDocs=44421)
                0.013029535 = queryNorm
              0.6513635 = fieldWeight in 2353, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9478774 = idf(docFreq=115, maxDocs=44421)
                0.09375 = fieldNorm(doc=2353)
          0.17336847 = weight(abstract_txt:topic in 2353) [ClassicSimilarity], result of:
            0.17336847 = score(doc=2353,freq=1.0), product of:
              0.365917 = queryWeight, product of:
                5.5569615 = boost
                5.053779 = idf(docFreq=770, maxDocs=44421)
                0.013029535 = queryNorm
              0.47379178 = fieldWeight in 2353, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.053779 = idf(docFreq=770, maxDocs=44421)
                0.09375 = fieldNorm(doc=2353)
          0.7167222 = weight(abstract_txt:pagerank in 2353) [ClassicSimilarity], result of:
            0.7167222 = score(doc=2353,freq=2.0), product of:
              0.710631 = queryWeight, product of:
                7.169598 = boost
                7.607123 = idf(docFreq=59, maxDocs=44421)
                0.013029535 = queryNorm
              1.0085715 = fieldWeight in 2353, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.607123 = idf(docFreq=59, maxDocs=44421)
                0.09375 = fieldNorm(doc=2353)
        0.2 = coord(5/25)