Document (#32331)

Author
Liben-Nowell, D.
Kleinberg, J.
Title
The link-prediction problem for social networks
Source
Journal of the American Society for Information Science and Technology. 58(2007) no.7, pp.1019-1031
Year
2007
Abstract
Given a snapshot of a social network, can we infer which new interactions among its members are likely to occur in the near future? We formalize this question as the link-prediction problem, and we develop approaches to link prediction based on measures for analyzing the "proximity" of nodes in a network. Experiments on large coauthorship networks suggest that information about future interactions can be extracted from network topology alone, and that fairly subtle measures for detecting node proximity can outperform more direct measures.
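As a rough illustration of what a proximity-based predictor looks like, the sketch below ranks unlinked node pairs by how many neighbors they share. Common neighbors is one of the simpler proximity measures of this kind; the function name and the toy edge list are hypothetical and are not taken from the paper or its data.

```python
from itertools import combinations


def common_neighbor_scores(edges):
    """Score every non-adjacent node pair by its number of shared neighbors."""
    neighbors = {}
    for u, v in edges:
        neighbors.setdefault(u, set()).add(v)
        neighbors.setdefault(v, set()).add(u)

    scores = {}
    for u, v in combinations(sorted(neighbors), 2):
        if v in neighbors[u]:
            continue  # already linked, so there is nothing to predict
        shared = len(neighbors[u] & neighbors[v])
        if shared:
            scores[(u, v)] = shared
    return scores


# Hypothetical toy coauthorship snapshot (illustrative only).
toy_edges = [("a", "b"), ("a", "c"), ("b", "d"), ("c", "d"), ("d", "e")]

# Highest score first; ties broken alphabetically so the order is stable.
ranked = sorted(
    common_neighbor_scores(toy_edges).items(),
    key=lambda item: (-item[1], item[0]),
)
for pair, score in ranked:
    print(pair, score)
```

Pairs at the top of the ranking are the candidates such a method would predict as future links; evaluating them requires a later snapshot of the same network.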
Theme
Internet

Similar documents (author)

  1. Kleinberg, I.: Making the case for professional indexers : where is the proof? (1993) 6.19
    6.190705 = sum of:
      6.190705 = weight(author_txt:kleinberg in 7766) [ClassicSimilarity], result of:
        6.190705 = fieldWeight in 7766, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.905128 = idf(docFreq=5, maxDocs=44218)
          0.625 = fieldNorm(doc=7766)
    
  2. Kleinberg, I.: For want of an alphabetical index : some notes toward a history of the back-of-the-book index in nineteenth century America (1997) 6.19
    6.190705 = sum of:
      6.190705 = weight(author_txt:kleinberg in 3734) [ClassicSimilarity], result of:
        6.190705 = fieldWeight in 3734, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.905128 = idf(docFreq=5, maxDocs=44218)
          0.625 = fieldNorm(doc=3734)
    
  3. Kleinberg, J.M.: Authoritative sources in a hyperlinked environment (1998) 6.19
    6.190705 = sum of:
      6.190705 = weight(author_txt:kleinberg in 5) [ClassicSimilarity], result of:
        6.190705 = fieldWeight in 5, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.905128 = idf(docFreq=5, maxDocs=44218)
          0.625 = fieldNorm(doc=5)
    
  4. Chakrabarti, S.; Dom, B.; Kumar, S.R.; Raghavan, P.; Rajagopalan, S.; Tomkins, A.; Kleinberg, J.M.; Gibson, D.: Neue Pfade durch den Internet-Dschungel : Die zweite Generation von Web-Suchmaschinen (1999) 2.48
    2.476282 = sum of:
      2.476282 = weight(author_txt:kleinberg in 3) [ClassicSimilarity], result of:
        2.476282 = fieldWeight in 3, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.905128 = idf(docFreq=5, maxDocs=44218)
          0.25 = fieldNorm(doc=3)
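
The author-match scores above can be checked by hand. The sketch below assumes that tf is the square root of the term frequency and that idf is 1 + ln(maxDocs / (docFreq + 1)); both are inferred from the printed numbers and agree with the usual description of ClassicSimilarity, but the function names are illustrative and the fieldNorm values are simply copied from the listing.

```python
import math


def classic_idf(doc_freq, max_docs):
    # Assumption: idf variant inferred from the printed value 9.905128.
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def field_weight(freq, doc_freq, max_docs, field_norm):
    # fieldWeight = tf * idf * fieldNorm, with tf = sqrt(freq).
    tf = math.sqrt(freq)
    return tf * classic_idf(doc_freq, max_docs) * field_norm


# Hits 1-3: fieldNorm 0.625; hit 4 (eight authors): fieldNorm 0.25.
score_single_author = field_weight(1.0, 5, 44218, 0.625)
score_many_authors = field_weight(1.0, 5, 44218, 0.25)

print(round(score_single_author, 6), round(score_many_authors, 6))
```

The only difference between the first three hits and the fourth is fieldNorm: the longer author field of the fourth record lowers its norm, and with it the score.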
    

Similar documents (content)

  1. Hu, D.; Kaza, S.; Chen, H.: Identifying significant facilitators of dark network evolution (2009) 0.21
    0.21225253 = sum of:
      0.21225253 = product of:
        0.7580447 = sum of:
          0.062187944 = weight(abstract_txt:nodes in 2753) [ClassicSimilarity], result of:
            0.062187944 = score(doc=2753,freq=1.0), product of:
              0.14165701 = queryWeight, product of:
                1.140352 = boost
                7.0240583 = idf(docFreq=106, maxDocs=44218)
                0.017685242 = queryNorm
              0.43900365 = fieldWeight in 2753, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.0240583 = idf(docFreq=106, maxDocs=44218)
                0.0625 = fieldNorm(doc=2753)
          0.060207296 = weight(abstract_txt:social in 2753) [ClassicSimilarity], result of:
            0.060207296 = score(doc=2753,freq=5.0), product of:
              0.10214567 = queryWeight, product of:
                1.3694459 = boost
                4.2175875 = idf(docFreq=1770, maxDocs=44218)
                0.017685242 = queryNorm
              0.5894258 = fieldWeight in 2753, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.2175875 = idf(docFreq=1770, maxDocs=44218)
                0.0625 = fieldNorm(doc=2753)
          0.047514852 = weight(abstract_txt:future in 2753) [ClassicSimilarity], result of:
            0.047514852 = score(doc=2753,freq=3.0), product of:
              0.10342442 = queryWeight, product of:
                1.3779912 = boost
                4.243905 = idf(docFreq=1724, maxDocs=44218)
                0.017685242 = queryNorm
              0.45941618 = fieldWeight in 2753, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.243905 = idf(docFreq=1724, maxDocs=44218)
                0.0625 = fieldNorm(doc=2753)
          0.10853723 = weight(abstract_txt:networks in 2753) [ClassicSimilarity], result of:
            0.10853723 = score(doc=2753,freq=5.0), product of:
              0.15130028 = queryWeight, product of:
                1.6666898 = boost
                5.133032 = idf(docFreq=708, maxDocs=44218)
                0.017685242 = queryNorm
              0.717363 = fieldWeight in 2753, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.133032 = idf(docFreq=708, maxDocs=44218)
                0.0625 = fieldNorm(doc=2753)
          0.10642382 = weight(abstract_txt:network in 2753) [ClassicSimilarity], result of:
            0.10642382 = score(doc=2753,freq=4.0), product of:
              0.18413948 = queryWeight, product of:
                2.2519252 = boost
                4.6236176 = idf(docFreq=1179, maxDocs=44218)
                0.017685242 = queryNorm
              0.5779522 = fieldWeight in 2753, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.6236176 = idf(docFreq=1179, maxDocs=44218)
                0.0625 = fieldNorm(doc=2753)
          0.17340383 = weight(abstract_txt:link in 2753) [ClassicSimilarity], result of:
            0.17340383 = score(doc=2753,freq=3.0), product of:
              0.28063363 = queryWeight, product of:
                2.7800357 = boost
                5.707926 = idf(docFreq=398, maxDocs=44218)
                0.017685242 = queryNorm
              0.6179011 = fieldWeight in 2753, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.707926 = idf(docFreq=398, maxDocs=44218)
                0.0625 = fieldNorm(doc=2753)
          0.19976974 = weight(abstract_txt:prediction in 2753) [ClassicSimilarity], result of:
            0.19976974 = score(doc=2753,freq=1.0), product of:
              0.44479594 = queryWeight, product of:
                3.4999428 = boost
                7.1860275 = idf(docFreq=90, maxDocs=44218)
                0.017685242 = queryNorm
              0.44912672 = fieldWeight in 2753, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.1860275 = idf(docFreq=90, maxDocs=44218)
                0.0625 = fieldNorm(doc=2753)
        0.28 = coord(7/25)
    
  2. Zhang, Y.; Wu, M.; Zhang, G.; Lu, J.: Stepping beyond your comfort zone : diffusion-based network analytics for knowledge trajectory recommendation (2023) 0.18
    0.18281363 = sum of:
      0.18281363 = product of:
        0.76172346 = sum of:
          0.062187944 = weight(abstract_txt:nodes in 994) [ClassicSimilarity], result of:
            0.062187944 = score(doc=994,freq=1.0), product of:
              0.14165701 = queryWeight, product of:
                1.140352 = boost
                7.0240583 = idf(docFreq=106, maxDocs=44218)
                0.017685242 = queryNorm
              0.43900365 = fieldWeight in 994, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.0240583 = idf(docFreq=106, maxDocs=44218)
                0.0625 = fieldNorm(doc=994)
          0.02692552 = weight(abstract_txt:social in 994) [ClassicSimilarity], result of:
            0.02692552 = score(doc=994,freq=1.0), product of:
              0.10214567 = queryWeight, product of:
                1.3694459 = boost
                4.2175875 = idf(docFreq=1770, maxDocs=44218)
                0.017685242 = queryNorm
              0.26359922 = fieldWeight in 994, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2175875 = idf(docFreq=1770, maxDocs=44218)
                0.0625 = fieldNorm(doc=994)
          0.15634358 = weight(abstract_txt:interactions in 994) [ClassicSimilarity], result of:
            0.15634358 = score(doc=994,freq=4.0), product of:
              0.207879 = queryWeight, product of:
                1.9536206 = boost
                6.0167146 = idf(docFreq=292, maxDocs=44218)
                0.017685242 = queryNorm
              0.7520893 = fieldWeight in 994, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.0167146 = idf(docFreq=292, maxDocs=44218)
                0.0625 = fieldNorm(doc=994)
          0.09216573 = weight(abstract_txt:network in 994) [ClassicSimilarity], result of:
            0.09216573 = score(doc=994,freq=3.0), product of:
              0.18413948 = queryWeight, product of:
                2.2519252 = boost
                4.6236176 = idf(docFreq=1179, maxDocs=44218)
                0.017685242 = queryNorm
              0.5005213 = fieldWeight in 994, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.6236176 = idf(docFreq=1179, maxDocs=44218)
                0.0625 = fieldNorm(doc=994)
          0.14158362 = weight(abstract_txt:link in 994) [ClassicSimilarity], result of:
            0.14158362 = score(doc=994,freq=2.0), product of:
              0.28063363 = queryWeight, product of:
                2.7800357 = boost
                5.707926 = idf(docFreq=398, maxDocs=44218)
                0.017685242 = queryNorm
              0.5045141 = fieldWeight in 994, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.707926 = idf(docFreq=398, maxDocs=44218)
                0.0625 = fieldNorm(doc=994)
          0.28251708 = weight(abstract_txt:prediction in 994) [ClassicSimilarity], result of:
            0.28251708 = score(doc=994,freq=2.0), product of:
              0.44479594 = queryWeight, product of:
                3.4999428 = boost
                7.1860275 = idf(docFreq=90, maxDocs=44218)
                0.017685242 = queryNorm
              0.6351611 = fieldWeight in 994, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.1860275 = idf(docFreq=90, maxDocs=44218)
                0.0625 = fieldNorm(doc=994)
        0.24 = coord(6/25)
    
  3. Yan, E.; Ding, Y.: Applying centrality measures to impact analysis : a coauthorship network analysis (2009) 0.15
    0.15497632 = sum of:
      0.15497632 = product of:
        0.7748816 = sum of:
          0.19084492 = weight(abstract_txt:coauthorship in 3083) [ClassicSimilarity], result of:
            0.19084492 = score(doc=3083,freq=3.0), product of:
              0.17874774 = queryWeight, product of:
                1.2809737 = boost
                7.890225 = idf(docFreq=44, maxDocs=44218)
                0.017685242 = queryNorm
              1.0676774 = fieldWeight in 3083, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.890225 = idf(docFreq=44, maxDocs=44218)
                0.078125 = fieldNorm(doc=3083)
          0.1330003 = weight(abstract_txt:topology in 3083) [ClassicSimilarity], result of:
            0.1330003 = score(doc=3083,freq=1.0), product of:
              0.20264179 = queryWeight, product of:
                1.3639059 = boost
                8.401051 = idf(docFreq=26, maxDocs=44218)
                0.017685242 = queryNorm
              0.6563321 = fieldWeight in 3083, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.401051 = idf(docFreq=26, maxDocs=44218)
                0.078125 = fieldNorm(doc=3083)
          0.060674153 = weight(abstract_txt:networks in 3083) [ClassicSimilarity], result of:
            0.060674153 = score(doc=3083,freq=1.0), product of:
              0.15130028 = queryWeight, product of:
                1.6666898 = boost
                5.133032 = idf(docFreq=708, maxDocs=44218)
                0.017685242 = queryNorm
              0.4010181 = fieldWeight in 3083, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.133032 = idf(docFreq=708, maxDocs=44218)
                0.078125 = fieldNorm(doc=3083)
          0.1487318 = weight(abstract_txt:network in 3083) [ClassicSimilarity], result of:
            0.1487318 = score(doc=3083,freq=5.0), product of:
              0.18413948 = queryWeight, product of:
                2.2519252 = boost
                4.6236176 = idf(docFreq=1179, maxDocs=44218)
                0.017685242 = queryNorm
              0.80771273 = fieldWeight in 3083, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.6236176 = idf(docFreq=1179, maxDocs=44218)
                0.078125 = fieldNorm(doc=3083)
          0.2416304 = weight(abstract_txt:measures in 3083) [ClassicSimilarity], result of:
            0.2416304 = score(doc=3083,freq=5.0), product of:
              0.25447515 = queryWeight, product of:
                2.6473002 = boost
                5.4353957 = idf(docFreq=523, maxDocs=44218)
                0.017685242 = queryNorm
              0.9495246 = fieldWeight in 3083, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.4353957 = idf(docFreq=523, maxDocs=44218)
                0.078125 = fieldNorm(doc=3083)
        0.2 = coord(5/25)
    
  4. Zhao, S.X.; Ye, F.Y.: Power-law link strength distribution in paper cocitation networks (2013) 0.13
    0.13401608 = sum of:
      0.13401608 = product of:
        0.6700804 = sum of:
          0.109933786 = weight(abstract_txt:nodes in 973) [ClassicSimilarity], result of:
            0.109933786 = score(doc=973,freq=2.0), product of:
              0.14165701 = queryWeight, product of:
                1.140352 = boost
                7.0240583 = idf(docFreq=106, maxDocs=44218)
                0.017685242 = queryNorm
              0.7760561 = fieldWeight in 973, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.0240583 = idf(docFreq=106, maxDocs=44218)
                0.078125 = fieldNorm(doc=973)
          0.14423488 = weight(abstract_txt:node in 973) [ClassicSimilarity], result of:
            0.14423488 = score(doc=973,freq=2.0), product of:
              0.16977124 = queryWeight, product of:
                1.2483948 = boost
                7.689554 = idf(docFreq=54, maxDocs=44218)
                0.017685242 = queryNorm
              0.84958375 = fieldWeight in 973, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.689554 = idf(docFreq=54, maxDocs=44218)
                0.078125 = fieldNorm(doc=973)
          0.105090715 = weight(abstract_txt:networks in 973) [ClassicSimilarity], result of:
            0.105090715 = score(doc=973,freq=3.0), product of:
              0.15130028 = queryWeight, product of:
                1.6666898 = boost
                5.133032 = idf(docFreq=708, maxDocs=44218)
                0.017685242 = queryNorm
              0.6945837 = fieldWeight in 973, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.133032 = idf(docFreq=708, maxDocs=44218)
                0.078125 = fieldNorm(doc=973)
          0.09406625 = weight(abstract_txt:network in 973) [ClassicSimilarity], result of:
            0.09406625 = score(doc=973,freq=2.0), product of:
              0.18413948 = queryWeight, product of:
                2.2519252 = boost
                4.6236176 = idf(docFreq=1179, maxDocs=44218)
                0.017685242 = queryNorm
              0.5108424 = fieldWeight in 973, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.6236176 = idf(docFreq=1179, maxDocs=44218)
                0.078125 = fieldNorm(doc=973)
          0.21675478 = weight(abstract_txt:link in 973) [ClassicSimilarity], result of:
            0.21675478 = score(doc=973,freq=3.0), product of:
              0.28063363 = queryWeight, product of:
                2.7800357 = boost
                5.707926 = idf(docFreq=398, maxDocs=44218)
                0.017685242 = queryNorm
              0.77237636 = fieldWeight in 973, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.707926 = idf(docFreq=398, maxDocs=44218)
                0.078125 = fieldNorm(doc=973)
        0.2 = coord(5/25)
    
  5. Ma, Z.; Sun, A.; Cong, G.: On predicting the popularity of newly emerging hashtags in Twitter (2013) 0.13
    0.13142046 = sum of:
      0.13142046 = product of:
        0.4693588 = sum of:
          0.041936234 = weight(abstract_txt:extracted in 967) [ClassicSimilarity], result of:
            0.041936234 = score(doc=967,freq=1.0), product of:
              0.10893319 = queryWeight, product of:
                6.159553 = idf(docFreq=253, maxDocs=44218)
                0.017685242 = queryNorm
              0.38497207 = fieldWeight in 967, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.159553 = idf(docFreq=253, maxDocs=44218)
                0.0625 = fieldNorm(doc=967)
          0.06098278 = weight(abstract_txt:near in 967) [ClassicSimilarity], result of:
            0.06098278 = score(doc=967,freq=1.0), product of:
              0.13982089 = queryWeight, product of:
                1.1329374 = boost
                6.9783883 = idf(docFreq=111, maxDocs=44218)
                0.017685242 = queryNorm
              0.43614927 = fieldWeight in 967, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9783883 = idf(docFreq=111, maxDocs=44218)
                0.0625 = fieldNorm(doc=967)
          0.08045985 = weight(abstract_txt:outperform in 967) [ClassicSimilarity], result of:
            0.08045985 = score(doc=967,freq=1.0), product of:
              0.1681977 = queryWeight, product of:
                1.2425959 = boost
                7.653836 = idf(docFreq=56, maxDocs=44218)
                0.017685242 = queryNorm
              0.47836474 = fieldWeight in 967, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.653836 = idf(docFreq=56, maxDocs=44218)
                0.0625 = fieldNorm(doc=967)
          0.02692552 = weight(abstract_txt:social in 967) [ClassicSimilarity], result of:
            0.02692552 = score(doc=967,freq=1.0), product of:
              0.10214567 = queryWeight, product of:
                1.3694459 = boost
                4.2175875 = idf(docFreq=1770, maxDocs=44218)
                0.017685242 = queryNorm
              0.26359922 = fieldWeight in 967, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2175875 = idf(docFreq=1770, maxDocs=44218)
                0.0625 = fieldNorm(doc=967)
          0.027432714 = weight(abstract_txt:future in 967) [ClassicSimilarity], result of:
            0.027432714 = score(doc=967,freq=1.0), product of:
              0.10342442 = queryWeight, product of:
                1.3779912 = boost
                4.243905 = idf(docFreq=1724, maxDocs=44218)
                0.017685242 = queryNorm
              0.26524407 = fieldWeight in 967, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.243905 = idf(docFreq=1724, maxDocs=44218)
                0.0625 = fieldNorm(doc=967)
          0.03185198 = weight(abstract_txt:problem in 967) [ClassicSimilarity], result of:
            0.03185198 = score(doc=967,freq=1.0), product of:
              0.114253156 = queryWeight, product of:
                1.4483349 = boost
                4.460548 = idf(docFreq=1388, maxDocs=44218)
                0.017685242 = queryNorm
              0.27878425 = fieldWeight in 967, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.460548 = idf(docFreq=1388, maxDocs=44218)
                0.0625 = fieldNorm(doc=967)
          0.19976974 = weight(abstract_txt:prediction in 967) [ClassicSimilarity], result of:
            0.19976974 = score(doc=967,freq=1.0), product of:
              0.44479594 = queryWeight, product of:
                3.4999428 = boost
                7.1860275 = idf(docFreq=90, maxDocs=44218)
                0.017685242 = queryNorm
              0.44912672 = fieldWeight in 967, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.1860275 = idf(docFreq=90, maxDocs=44218)
                0.0625 = fieldNorm(doc=967)
        0.28 = coord(7/25)
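
The content-match scores above add two factors to the per-term field weight: a query weight (boost * idf * queryNorm) and a coordination factor (matched query terms / total query terms). The sketch below recomputes the first hit (doc 2753) under those assumptions. The boost, docFreq, termFreq, fieldNorm and queryNorm values are copied from the listing; the idf formula is inferred from the printed values, and the helper names are illustrative.

```python
import math

MAX_DOCS = 44218
QUERY_NORM = 0.017685242  # copied from the explanation output


def idf(doc_freq):
    # Assumption: idf = 1 + ln(maxDocs / (docFreq + 1)), inferred from the listing.
    return 1.0 + math.log(MAX_DOCS / (doc_freq + 1))


def term_score(boost, doc_freq, freq, field_norm):
    query_weight = boost * idf(doc_freq) * QUERY_NORM
    field_weight = math.sqrt(freq) * idf(doc_freq) * field_norm
    return query_weight * field_weight


# (boost, docFreq, termFreq) for the seven matching terms of doc 2753.
terms_2753 = [
    (1.140352, 106, 1.0),    # nodes
    (1.3694459, 1770, 5.0),  # social
    (1.3779912, 1724, 3.0),  # future
    (1.6666898, 708, 5.0),   # networks
    (2.2519252, 1179, 4.0),  # network
    (2.7800357, 398, 3.0),   # link
    (3.4999428, 90, 1.0),    # prediction
]

raw = sum(term_score(b, df, f, 0.0625) for b, df, f in terms_2753)
coord = 7 / 25  # 7 of the 25 query terms occur in this abstract
total = raw * coord

print(round(raw, 6), round(total, 6))
```

The result should land close to the 0.7580447 sum and the 0.21225253 final score reported for that hit; small differences in the last digits are expected because the listing was produced with single-precision arithmetic.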