Document (#36485)

Author
Shibata, N.
Kajikawa, Y.
Sakata, I.
Title
Measuring relatedness between communities in a citation network
Source
Journal of the American Society for Information Science and Technology. 62(2011) no.7, S.1360-1369
Year
2011
Abstract
As academic disciplines are segmented and specialized, it becomes more difficult to capture relevant research areas precisely by common retrieval strategies using either keywords or journal categories. This paper proposes a method of measuring the relatedness among sets of academic papers in order to detect unrelated communities which are not related to target topic. A citation network, extracted by given keywords, is divided into communities based on the density of links. We measured and compared four measures of relatedness between two communities in a citation network for three large-scale citation datasets. We used both link and semantic similarities. The topological distance from the center in a citation network is a more efficient measure for removing the unrelated communities than the other three measures: the ratio of the number of intercluster links over the all links, the ratio of the number of common terms over all terms, cosine similarity of tf-idf vectors.

Similar documents (author)

  1. Shibata, N.; Kajikawa, Y.; Matsushima, K.: Topological analysis of citation networks to discover the future core articles (2007) 3.72
    3.7161405 = sum of:
      3.7161405 = weight(author_txt:kajikawa in 1286) [ClassicSimilarity], result of:
        3.7161405 = fieldWeight in 1286, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.909708 = idf(docFreq=5, maxDocs=44421)
          0.375 = fieldNorm(doc=1286)
    
  2. Shibata, N.; Kajikawa, Y.; Sakata, I.: Link prediction in citation networks (2012) 3.72
    3.7161405 = sum of:
      3.7161405 = weight(author_txt:kajikawa in 964) [ClassicSimilarity], result of:
        3.7161405 = fieldWeight in 964, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.909708 = idf(docFreq=5, maxDocs=44421)
          0.375 = fieldNorm(doc=964)
    
  3. Shibata, N.; Kajikawa, Y.; Takeda, Y.; Matsushima, K.: Comparative study on methods of detecting research fronts using different types of citation (2009) 3.10
    3.0967836 = sum of:
      3.0967836 = weight(author_txt:kajikawa in 3743) [ClassicSimilarity], result of:
        3.0967836 = fieldWeight in 3743, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.909708 = idf(docFreq=5, maxDocs=44421)
          0.3125 = fieldNorm(doc=3743)
    
  4. Tashiro, H.; Lau, A.; Mori, J.; Fujii, N.; Kajikawa, Y.: E-mail networks and leadership performance (2012) 3.10
    3.0967836 = sum of:
      3.0967836 = weight(author_txt:kajikawa in 1077) [ClassicSimilarity], result of:
        3.0967836 = fieldWeight in 1077, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.909708 = idf(docFreq=5, maxDocs=44421)
          0.3125 = fieldNorm(doc=1077)
    

Similar documents (content)

  1. Klavans, R.; Boyack, K.W.: Identifying a better measure of relatedness for mapping science (2006) 0.20
    0.19933431 = sum of:
      0.19933431 = product of:
        0.99667156 = sum of:
          0.020806316 = weight(abstract_txt:between in 252) [ClassicSimilarity], result of:
            0.020806316 = score(doc=252,freq=3.0), product of:
              0.055591118 = queryWeight, product of:
                3.4573963 = idf(docFreq=3804, maxDocs=44421)
                0.016078897 = queryNorm
              0.3742741 = fieldWeight in 252, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.4573963 = idf(docFreq=3804, maxDocs=44421)
                0.0625 = fieldNorm(doc=252)
          0.096399575 = weight(abstract_txt:cosine in 252) [ClassicSimilarity], result of:
            0.096399575 = score(doc=252,freq=2.0), product of:
              0.14037155 = queryWeight, product of:
                1.1236262 = boost
                7.769642 = idf(docFreq=50, maxDocs=44421)
                0.016078897 = queryNorm
              0.6867458 = fieldWeight in 252, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.769642 = idf(docFreq=50, maxDocs=44421)
                0.0625 = fieldNorm(doc=252)
          0.12277024 = weight(abstract_txt:measures in 252) [ClassicSimilarity], result of:
            0.12277024 = score(doc=252,freq=7.0), product of:
              0.1368606 = queryWeight, product of:
                1.569049 = boost
                5.424824 = idf(docFreq=531, maxDocs=44421)
                0.016078897 = queryNorm
              0.89704597 = fieldWeight in 252, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                5.424824 = idf(docFreq=531, maxDocs=44421)
                0.0625 = fieldNorm(doc=252)
          0.082937784 = weight(abstract_txt:measuring in 252) [ClassicSimilarity], result of:
            0.082937784 = score(doc=252,freq=1.0), product of:
              0.20156601 = queryWeight, product of:
                1.904171 = boost
                6.5834737 = idf(docFreq=166, maxDocs=44421)
                0.016078897 = queryNorm
              0.4114671 = fieldWeight in 252, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5834737 = idf(docFreq=166, maxDocs=44421)
                0.0625 = fieldNorm(doc=252)
          0.6737576 = weight(abstract_txt:relatedness in 252) [ClassicSimilarity], result of:
            0.6737576 = score(doc=252,freq=8.0), product of:
              0.4662139 = queryWeight, product of:
                3.5467906 = boost
                8.175107 = idf(docFreq=33, maxDocs=44421)
                0.016078897 = queryNorm
              1.4451684 = fieldWeight in 252, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                8.175107 = idf(docFreq=33, maxDocs=44421)
                0.0625 = fieldNorm(doc=252)
        0.2 = coord(5/25)
    
  2. Shibata, N.; Kajikawa, Y.; Takeda, Y.; Matsushima, K.: Comparative study on methods of detecting research fronts using different types of citation (2009) 0.19
    0.19053724 = sum of:
      0.19053724 = product of:
        0.68049014 = sum of:
          0.073445894 = weight(abstract_txt:detect in 3743) [ClassicSimilarity], result of:
            0.073445894 = score(doc=3743,freq=2.0), product of:
              0.117095634 = queryWeight, product of:
                1.0262488 = boost
                7.0962973 = idf(docFreq=99, maxDocs=44421)
                0.016078897 = queryNorm
              0.62723 = fieldWeight in 3743, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.0962973 = idf(docFreq=99, maxDocs=44421)
                0.0625 = fieldNorm(doc=3743)
          0.06922314 = weight(abstract_txt:density in 3743) [ClassicSimilarity], result of:
            0.06922314 = score(doc=3743,freq=1.0), product of:
              0.14182079 = queryWeight, product of:
                1.1294116 = boost
                7.809647 = idf(docFreq=48, maxDocs=44421)
                0.016078897 = queryNorm
              0.48810294 = fieldWeight in 3743, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.809647 = idf(docFreq=48, maxDocs=44421)
                0.0625 = fieldNorm(doc=3743)
          0.07856144 = weight(abstract_txt:topological in 3743) [ClassicSimilarity], result of:
            0.07856144 = score(doc=3743,freq=1.0), product of:
              0.15430452 = queryWeight, product of:
                1.1780714 = boost
                8.146119 = idf(docFreq=34, maxDocs=44421)
                0.016078897 = queryNorm
              0.50913244 = fieldWeight in 3743, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.146119 = idf(docFreq=34, maxDocs=44421)
                0.0625 = fieldNorm(doc=3743)
          0.042973466 = weight(abstract_txt:three in 3743) [ClassicSimilarity], result of:
            0.042973466 = score(doc=3743,freq=3.0), product of:
              0.090158954 = queryWeight, product of:
                1.2735082 = boost
                4.4030223 = idf(docFreq=1477, maxDocs=44421)
                0.016078897 = queryNorm
              0.47664115 = fieldWeight in 3743, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.4030223 = idf(docFreq=1477, maxDocs=44421)
                0.0625 = fieldNorm(doc=3743)
          0.046402793 = weight(abstract_txt:measures in 3743) [ClassicSimilarity], result of:
            0.046402793 = score(doc=3743,freq=1.0), product of:
              0.1368606 = queryWeight, product of:
                1.569049 = boost
                5.424824 = idf(docFreq=531, maxDocs=44421)
                0.016078897 = queryNorm
              0.3390515 = fieldWeight in 3743, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.424824 = idf(docFreq=531, maxDocs=44421)
                0.0625 = fieldNorm(doc=3743)
          0.1149455 = weight(abstract_txt:network in 3743) [ClassicSimilarity], result of:
            0.1149455 = score(doc=3743,freq=4.0), product of:
              0.19886896 = queryWeight, product of:
                2.6748276 = boost
                4.6239696 = idf(docFreq=1184, maxDocs=44421)
                0.016078897 = queryNorm
              0.5779962 = fieldWeight in 3743, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.6239696 = idf(docFreq=1184, maxDocs=44421)
                0.0625 = fieldNorm(doc=3743)
          0.2549379 = weight(abstract_txt:citation in 3743) [ClassicSimilarity], result of:
            0.2549379 = score(doc=3743,freq=9.0), product of:
              0.27803817 = queryWeight, product of:
                3.5360591 = boost
                4.890223 = idf(docFreq=907, maxDocs=44421)
                0.016078897 = queryNorm
              0.91691685 = fieldWeight in 3743, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                4.890223 = idf(docFreq=907, maxDocs=44421)
                0.0625 = fieldNorm(doc=3743)
        0.28 = coord(7/25)
    
  3. Serpa, F.G.; Graves, A.M.; Javier, A.: Statistical common author networks (2013) 0.17
    0.17307666 = sum of:
      0.17307666 = product of:
        0.6181309 = sum of:
          0.012012533 = weight(abstract_txt:between in 2133) [ClassicSimilarity], result of:
            0.012012533 = score(doc=2133,freq=1.0), product of:
              0.055591118 = queryWeight, product of:
                3.4573963 = idf(docFreq=3804, maxDocs=44421)
                0.016078897 = queryNorm
              0.21608727 = fieldWeight in 2133, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4573963 = idf(docFreq=3804, maxDocs=44421)
                0.0625 = fieldNorm(doc=2133)
          0.020560144 = weight(abstract_txt:number in 2133) [ClassicSimilarity], result of:
            0.020560144 = score(doc=2133,freq=1.0), product of:
              0.0795426 = queryWeight, product of:
                1.1961818 = boost
                4.1356745 = idf(docFreq=1930, maxDocs=44421)
                0.016078897 = queryNorm
              0.25847965 = fieldWeight in 2133, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1356745 = idf(docFreq=1930, maxDocs=44421)
                0.0625 = fieldNorm(doc=2133)
          0.045595378 = weight(abstract_txt:common in 2133) [ClassicSimilarity], result of:
            0.045595378 = score(doc=2133,freq=2.0), product of:
              0.107362576 = queryWeight, product of:
                1.3897086 = boost
                4.8047733 = idf(docFreq=988, maxDocs=44421)
                0.016078897 = queryNorm
              0.42468596 = fieldWeight in 2133, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.8047733 = idf(docFreq=988, maxDocs=44421)
                0.0625 = fieldNorm(doc=2133)
          0.082937784 = weight(abstract_txt:measuring in 2133) [ClassicSimilarity], result of:
            0.082937784 = score(doc=2133,freq=1.0), product of:
              0.20156601 = queryWeight, product of:
                1.904171 = boost
                6.5834737 = idf(docFreq=166, maxDocs=44421)
                0.016078897 = queryNorm
              0.4114671 = fieldWeight in 2133, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5834737 = idf(docFreq=166, maxDocs=44421)
                0.0625 = fieldNorm(doc=2133)
          0.06267349 = weight(abstract_txt:links in 2133) [ClassicSimilarity], result of:
            0.06267349 = score(doc=2133,freq=1.0), product of:
              0.19142649 = queryWeight, product of:
                2.2727096 = boost
                5.238438 = idf(docFreq=640, maxDocs=44421)
                0.016078897 = queryNorm
              0.32740238 = fieldWeight in 2133, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.238438 = idf(docFreq=640, maxDocs=44421)
                0.0625 = fieldNorm(doc=2133)
          0.05747275 = weight(abstract_txt:network in 2133) [ClassicSimilarity], result of:
            0.05747275 = score(doc=2133,freq=1.0), product of:
              0.19886896 = queryWeight, product of:
                2.6748276 = boost
                4.6239696 = idf(docFreq=1184, maxDocs=44421)
                0.016078897 = queryNorm
              0.2889981 = fieldWeight in 2133, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6239696 = idf(docFreq=1184, maxDocs=44421)
                0.0625 = fieldNorm(doc=2133)
          0.3368788 = weight(abstract_txt:relatedness in 2133) [ClassicSimilarity], result of:
            0.3368788 = score(doc=2133,freq=2.0), product of:
              0.4662139 = queryWeight, product of:
                3.5467906 = boost
                8.175107 = idf(docFreq=33, maxDocs=44421)
                0.016078897 = queryNorm
              0.7225842 = fieldWeight in 2133, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.175107 = idf(docFreq=33, maxDocs=44421)
                0.0625 = fieldNorm(doc=2133)
        0.28 = coord(7/25)
    
  4. Thelwall, M.: Extracting macroscopic information from Web links (2001) 0.17
    0.16780151 = sum of:
      0.16780151 = product of:
        0.52437973 = sum of:
          0.012012533 = weight(abstract_txt:between in 851) [ClassicSimilarity], result of:
            0.012012533 = score(doc=851,freq=1.0), product of:
              0.055591118 = queryWeight, product of:
                3.4573963 = idf(docFreq=3804, maxDocs=44421)
                0.016078897 = queryNorm
              0.21608727 = fieldWeight in 851, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4573963 = idf(docFreq=3804, maxDocs=44421)
                0.0625 = fieldNorm(doc=851)
          0.022198088 = weight(abstract_txt:over in 851) [ClassicSimilarity], result of:
            0.022198088 = score(doc=851,freq=1.0), product of:
              0.083712965 = queryWeight, product of:
                1.2271388 = boost
                4.242705 = idf(docFreq=1734, maxDocs=44421)
                0.016078897 = queryNorm
              0.26516905 = fieldWeight in 851, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.242705 = idf(docFreq=1734, maxDocs=44421)
                0.0625 = fieldNorm(doc=851)
          0.059263133 = weight(abstract_txt:academic in 851) [ClassicSimilarity], result of:
            0.059263133 = score(doc=851,freq=4.0), product of:
              0.10148895 = queryWeight, product of:
                1.3511597 = boost
                4.6714945 = idf(docFreq=1129, maxDocs=44421)
                0.016078897 = queryNorm
              0.5839368 = fieldWeight in 851, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.6714945 = idf(docFreq=1129, maxDocs=44421)
                0.0625 = fieldNorm(doc=851)
          0.046402793 = weight(abstract_txt:measures in 851) [ClassicSimilarity], result of:
            0.046402793 = score(doc=851,freq=1.0), product of:
              0.1368606 = queryWeight, product of:
                1.569049 = boost
                5.424824 = idf(docFreq=531, maxDocs=44421)
                0.016078897 = queryNorm
              0.3390515 = fieldWeight in 851, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.424824 = idf(docFreq=531, maxDocs=44421)
                0.0625 = fieldNorm(doc=851)
          0.082937784 = weight(abstract_txt:measuring in 851) [ClassicSimilarity], result of:
            0.082937784 = score(doc=851,freq=1.0), product of:
              0.20156601 = queryWeight, product of:
                1.904171 = boost
                6.5834737 = idf(docFreq=166, maxDocs=44421)
                0.016078897 = queryNorm
              0.4114671 = fieldWeight in 851, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5834737 = idf(docFreq=166, maxDocs=44421)
                0.0625 = fieldNorm(doc=851)
          0.1279524 = weight(abstract_txt:ratio in 851) [ClassicSimilarity], result of:
            0.1279524 = score(doc=851,freq=1.0), product of:
              0.26912123 = queryWeight, product of:
                2.200246 = boost
                7.607123 = idf(docFreq=59, maxDocs=44421)
                0.016078897 = queryNorm
              0.47544518 = fieldWeight in 851, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.607123 = idf(docFreq=59, maxDocs=44421)
                0.0625 = fieldNorm(doc=851)
          0.0886337 = weight(abstract_txt:links in 851) [ClassicSimilarity], result of:
            0.0886337 = score(doc=851,freq=2.0), product of:
              0.19142649 = queryWeight, product of:
                2.2727096 = boost
                5.238438 = idf(docFreq=640, maxDocs=44421)
                0.016078897 = queryNorm
              0.4630169 = fieldWeight in 851, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.238438 = idf(docFreq=640, maxDocs=44421)
                0.0625 = fieldNorm(doc=851)
          0.084979296 = weight(abstract_txt:citation in 851) [ClassicSimilarity], result of:
            0.084979296 = score(doc=851,freq=1.0), product of:
              0.27803817 = queryWeight, product of:
                3.5360591 = boost
                4.890223 = idf(docFreq=907, maxDocs=44421)
                0.016078897 = queryNorm
              0.30563894 = fieldWeight in 851, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.890223 = idf(docFreq=907, maxDocs=44421)
                0.0625 = fieldNorm(doc=851)
        0.32 = coord(8/25)
    
  5. Macias-Galindo, D.; Cavedon, L.; Thangarajah, J.; Wong, W.: Effects of domain on measures of semantic relatedness (2015) 0.16
    0.15878682 = sum of:
      0.15878682 = product of:
        0.79393405 = sum of:
          0.033288054 = weight(abstract_txt:terms in 3220) [ClassicSimilarity], result of:
            0.033288054 = score(doc=3220,freq=3.0), product of:
              0.076044455 = queryWeight, product of:
                1.1695831 = boost
                4.043712 = idf(docFreq=2116, maxDocs=44421)
                0.016078897 = queryNorm
              0.43774468 = fieldWeight in 3220, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.043712 = idf(docFreq=2116, maxDocs=44421)
                0.0625 = fieldNorm(doc=3220)
          0.031392835 = weight(abstract_txt:over in 3220) [ClassicSimilarity], result of:
            0.031392835 = score(doc=3220,freq=2.0), product of:
              0.083712965 = queryWeight, product of:
                1.2271388 = boost
                4.242705 = idf(docFreq=1734, maxDocs=44421)
                0.016078897 = queryNorm
              0.37500566 = fieldWeight in 3220, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.242705 = idf(docFreq=1734, maxDocs=44421)
                0.0625 = fieldNorm(doc=3220)
          0.11366317 = weight(abstract_txt:measures in 3220) [ClassicSimilarity], result of:
            0.11366317 = score(doc=3220,freq=6.0), product of:
              0.1368606 = queryWeight, product of:
                1.569049 = boost
                5.424824 = idf(docFreq=531, maxDocs=44421)
                0.016078897 = queryNorm
              0.8305032 = fieldWeight in 3220, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.424824 = idf(docFreq=531, maxDocs=44421)
                0.0625 = fieldNorm(doc=3220)
          0.082937784 = weight(abstract_txt:measuring in 3220) [ClassicSimilarity], result of:
            0.082937784 = score(doc=3220,freq=1.0), product of:
              0.20156601 = queryWeight, product of:
                1.904171 = boost
                6.5834737 = idf(docFreq=166, maxDocs=44421)
                0.016078897 = queryNorm
              0.4114671 = fieldWeight in 3220, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5834737 = idf(docFreq=166, maxDocs=44421)
                0.0625 = fieldNorm(doc=3220)
          0.5326522 = weight(abstract_txt:relatedness in 3220) [ClassicSimilarity], result of:
            0.5326522 = score(doc=3220,freq=5.0), product of:
              0.4662139 = queryWeight, product of:
                3.5467906 = boost
                8.175107 = idf(docFreq=33, maxDocs=44421)
                0.016078897 = queryNorm
              1.142506 = fieldWeight in 3220, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                8.175107 = idf(docFreq=33, maxDocs=44421)
                0.0625 = fieldNorm(doc=3220)
        0.2 = coord(5/25)