Document (#32331)

Author
Liben-Nowell, D.
Kleinberg, J.
Title
The link-prediction problem for social networks
Source
Journal of the American Society for Information Science and Technology. 58(2007) no.7, pp. 1019-1031
Year
2007
Abstract
Given a snapshot of a social network, can we infer which new interactions among its members are likely to occur in the near future? We formalize this question as the link-prediction problem, and we develop approaches to link prediction based on measures for analyzing the "proximity" of nodes in a network. Experiments on large coauthorship networks suggest that information about future interactions can be extracted from network topology alone, and that fairly subtle measures for detecting node proximity can outperform more direct measures.
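As a rough illustration of what a proximity-based link predictor looks like, the sketch below scores every pair of nodes that is not yet linked by the number of neighbors the two nodes share. The abstract does not name specific measures; common neighbors is assumed here as an example of a simple, direct measure. The function name and the toy edge list are hypothetical and are not data from the paper.

```python
from itertools import combinations


def common_neighbor_scores(edges):
    """Score every non-adjacent node pair by its number of shared neighbors."""
    neighbors = {}
    for u, v in edges:
        neighbors.setdefault(u, set()).add(v)
        neighbors.setdefault(v, set()).add(u)

    scores = {}
    for u, v in combinations(sorted(neighbors), 2):
        if v not in neighbors[u]:  # only pairs without an existing link
            scores[(u, v)] = len(neighbors[u] & neighbors[v])
    return scores


# Hypothetical toy coauthorship snapshot (illustration only).
edges = [("a", "b"), ("a", "c"), ("b", "c"), ("b", "d"), ("c", "d"), ("d", "e")]
scores = common_neighbor_scores(edges)

# Pairs ranked from most to least likely to form a link under this measure.
ranked = sorted(scores.items(), key=lambda item: item[1], reverse=True)
```

Here the pair ("a", "d") ranks first because it shares two neighbors. Subtler measures would replace the set intersection with a weighted or path-based score while keeping the same ranking step.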
Theme
Internet

Similar documents (author)

  1. Kleinberg, I.: Making the case for professional indexers : where is the proof? (1993) 6.19
    6.1935673 = sum of:
      6.1935673 = weight(author_txt:kleinberg in 7765) [ClassicSimilarity], result of:
        6.1935673 = fieldWeight in 7765, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.909708 = idf(docFreq=5, maxDocs=44421)
          0.625 = fieldNorm(doc=7765)
    
  2. Kleinberg, I.: For want of an alphabetical index : some notes toward a history of the back-of-the-book index in nineteenth century America (1997) 6.19
    6.1935673 = sum of:
      6.1935673 = weight(author_txt:kleinberg in 4734) [ClassicSimilarity], result of:
        6.1935673 = fieldWeight in 4734, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.909708 = idf(docFreq=5, maxDocs=44421)
          0.625 = fieldNorm(doc=4734)
    
  3. Kleinberg, J.M.: Authoritative sources in a hyperlinked environment (1998) 6.19
    6.1935673 = sum of:
      6.1935673 = weight(author_txt:kleinberg in 1005) [ClassicSimilarity], result of:
        6.1935673 = fieldWeight in 1005, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.909708 = idf(docFreq=5, maxDocs=44421)
          0.625 = fieldNorm(doc=1005)
    
  4. Chakrabarti, S.; Dom, B.; Kumar, S.R.; Raghavan, P.; Rajagopalan, S.; Tomkins, A.; Kleinberg, J.M.; Gibson, D.: Neue Pfade durch den Internet-Dschungel : Die zweite Generation von Web-Suchmaschinen (1999) 2.48
    2.477427 = sum of:
      2.477427 = weight(author_txt:kleinberg in 1003) [ClassicSimilarity], result of:
        2.477427 = fieldWeight in 1003, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.909708 = idf(docFreq=5, maxDocs=44421)
          0.25 = fieldNorm(doc=1003)
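
The author scores above can be reproduced by hand as tf × idf × fieldNorm. The sketch below assumes the idf form 1 + ln(maxDocs / (docFreq + 1)) and tf = sqrt(freq), which agree with the values shown in the listing; the function names are illustrative and are not Lucene API calls, and fieldNorm is taken directly from the listing rather than recomputed.

```python
import math


def classic_idf(doc_freq, max_docs):
    # Assumed form: 1 + ln(maxDocs / (docFreq + 1)); it matches idf=9.909708 above.
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def field_weight(term_freq, doc_freq, max_docs, field_norm):
    # tf is assumed to be the square root of the raw term frequency.
    tf = math.sqrt(term_freq)
    return tf * classic_idf(doc_freq, max_docs) * field_norm


# Hits 1 to 3 share fieldNorm 0.625; hit 4 has fieldNorm 0.25.
hit_1 = field_weight(1.0, 5, 44421, 0.625)
hit_4 = field_weight(1.0, 5, 44421, 0.25)
```

The only factor that differs between hits 1 to 3 and hit 4 is fieldNorm, which is lower for hit 4's longer author field (eight names), so that hit scores 2.48 instead of 6.19.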
    

Similar documents (content)

  1. Hu, D.; Kaza, S.; Chen, H.: Identifying significant facilitators of dark network evolution (2009) 0.21
    0.21225557 = sum of:
      0.21225557 = product of:
        0.75805557 = sum of:
          0.062357597 = weight(abstract_txt:nodes in 3753) [ClassicSimilarity], result of:
            0.062357597 = score(doc=3753,freq=1.0), product of:
              0.14195089 = queryWeight, product of:
                1.1424239 = boost
                7.028639 = idf(docFreq=106, maxDocs=44421)
                0.017678265 = queryNorm
              0.43928993 = fieldWeight in 3753, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.028639 = idf(docFreq=106, maxDocs=44421)
                0.0625 = fieldNorm(doc=3753)
          0.059637014 = weight(abstract_txt:social in 3753) [ClassicSimilarity], result of:
            0.059637014 = score(doc=3753,freq=5.0), product of:
              0.10152566 = queryWeight, product of:
                1.3663473 = boost
                4.2031517 = idf(docFreq=1804, maxDocs=44421)
                0.017678265 = queryNorm
              0.5874083 = fieldWeight in 3753, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.2031517 = idf(docFreq=1804, maxDocs=44421)
                0.0625 = fieldNorm(doc=3753)
          0.047260728 = weight(abstract_txt:future in 3753) [ClassicSimilarity], result of:
            0.047260728 = score(doc=3753,freq=3.0), product of:
              0.10308174 = queryWeight, product of:
                1.3767785 = boost
                4.23524 = idf(docFreq=1747, maxDocs=44421)
                0.017678265 = queryNorm
              0.45847818 = fieldWeight in 3753, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.23524 = idf(docFreq=1747, maxDocs=44421)
                0.0625 = fieldNorm(doc=3753)
          0.10846541 = weight(abstract_txt:networks in 3753) [ClassicSimilarity], result of:
            0.10846541 = score(doc=3753,freq=5.0), product of:
              0.15127228 = queryWeight, product of:
                1.6678343 = boost
                5.1305847 = idf(docFreq=713, maxDocs=44421)
                0.017678265 = queryNorm
              0.71702105 = fieldWeight in 3753, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.1305847 = idf(docFreq=713, maxDocs=44421)
                0.0625 = fieldNorm(doc=3753)
          0.10652997 = weight(abstract_txt:network in 3753) [ClassicSimilarity], result of:
            0.10652997 = score(doc=3753,freq=4.0), product of:
              0.18430912 = queryWeight, product of:
                2.254718 = boost
                4.6239696 = idf(docFreq=1184, maxDocs=44421)
                0.017678265 = queryNorm
              0.5779962 = fieldWeight in 3753, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.6239696 = idf(docFreq=1184, maxDocs=44421)
                0.0625 = fieldNorm(doc=3753)
          0.17349891 = weight(abstract_txt:link in 3753) [ClassicSimilarity], result of:
            0.17349891 = score(doc=3753,freq=3.0), product of:
              0.28080815 = queryWeight, product of:
                2.7830672 = boost
                5.707506 = idf(docFreq=400, maxDocs=44421)
                0.017678265 = queryNorm
              0.61785567 = fieldWeight in 3753, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.707506 = idf(docFreq=400, maxDocs=44421)
                0.0625 = fieldNorm(doc=3753)
          0.20030591 = weight(abstract_txt:prediction in 3753) [ClassicSimilarity], result of:
            0.20030591 = score(doc=3753,freq=1.0), product of:
              0.44570565 = queryWeight, product of:
                3.5062501 = boost
                7.190608 = idf(docFreq=90, maxDocs=44421)
                0.017678265 = queryNorm
              0.449413 = fieldWeight in 3753, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.190608 = idf(docFreq=90, maxDocs=44421)
                0.0625 = fieldNorm(doc=3753)
        0.28 = coord(7/25)
    
  2. Zhang, Y.; Wu, M.; Zhang, G.; Lu, J.: Stepping beyond your comfort zone : diffusion-based network analytics for knowledge trajectory recommendation (2023) 0.18
    0.18262762 = sum of:
      0.18262762 = product of:
        0.7609484 = sum of:
          0.062357597 = weight(abstract_txt:nodes in 1996) [ClassicSimilarity], result of:
            0.062357597 = score(doc=1996,freq=1.0), product of:
              0.14195089 = queryWeight, product of:
                1.1424239 = boost
                7.028639 = idf(docFreq=106, maxDocs=44421)
                0.017678265 = queryNorm
              0.43928993 = fieldWeight in 1996, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.028639 = idf(docFreq=106, maxDocs=44421)
                0.0625 = fieldNorm(doc=1996)
          0.026670484 = weight(abstract_txt:social in 1996) [ClassicSimilarity], result of:
            0.026670484 = score(doc=1996,freq=1.0), product of:
              0.10152566 = queryWeight, product of:
                1.3663473 = boost
                4.2031517 = idf(docFreq=1804, maxDocs=44421)
                0.017678265 = queryNorm
              0.26269698 = fieldWeight in 1996, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2031517 = idf(docFreq=1804, maxDocs=44421)
                0.0625 = fieldNorm(doc=1996)
          0.15472607 = weight(abstract_txt:interactions in 1996) [ClassicSimilarity], result of:
            0.15472607 = score(doc=1996,freq=4.0), product of:
              0.20649564 = queryWeight, product of:
                1.9486268 = boost
                5.994357 = idf(docFreq=300, maxDocs=44421)
                0.017678265 = queryNorm
              0.74929464 = fieldWeight in 1996, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.994357 = idf(docFreq=300, maxDocs=44421)
                0.0625 = fieldNorm(doc=1996)
          0.09225766 = weight(abstract_txt:network in 1996) [ClassicSimilarity], result of:
            0.09225766 = score(doc=1996,freq=3.0), product of:
              0.18430912 = queryWeight, product of:
                2.254718 = boost
                4.6239696 = idf(docFreq=1184, maxDocs=44421)
                0.017678265 = queryNorm
              0.5005594 = fieldWeight in 1996, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.6239696 = idf(docFreq=1184, maxDocs=44421)
                0.0625 = fieldNorm(doc=1996)
          0.14166126 = weight(abstract_txt:link in 1996) [ClassicSimilarity], result of:
            0.14166126 = score(doc=1996,freq=2.0), product of:
              0.28080815 = queryWeight, product of:
                2.7830672 = boost
                5.707506 = idf(docFreq=400, maxDocs=44421)
                0.017678265 = queryNorm
              0.504477 = fieldWeight in 1996, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.707506 = idf(docFreq=400, maxDocs=44421)
                0.0625 = fieldNorm(doc=1996)
          0.28327534 = weight(abstract_txt:prediction in 1996) [ClassicSimilarity], result of:
            0.28327534 = score(doc=1996,freq=2.0), product of:
              0.44570565 = queryWeight, product of:
                3.5062501 = boost
                7.190608 = idf(docFreq=90, maxDocs=44421)
                0.017678265 = queryNorm
              0.63556594 = fieldWeight in 1996, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.190608 = idf(docFreq=90, maxDocs=44421)
                0.0625 = fieldNorm(doc=1996)
        0.24 = coord(6/25)
    
  3. Yan, E.; Ding, Y.: Applying centrality measures to impact analysis : a coauthorship network analysis (2009) 0.15
    0.15491341 = sum of:
      0.15491341 = product of:
        0.77456707 = sum of:
          0.1913245 = weight(abstract_txt:coauthorship in 70) [ClassicSimilarity], result of:
            0.1913245 = score(doc=70,freq=3.0), product of:
              0.17909296 = queryWeight, product of:
                1.2832092 = boost
                7.894805 = idf(docFreq=44, maxDocs=44421)
                0.017678265 = queryNorm
              1.0682971 = fieldWeight in 70, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.894805 = idf(docFreq=44, maxDocs=44421)
                0.078125 = fieldNorm(doc=70)
          0.13332042 = weight(abstract_txt:topology in 70) [ClassicSimilarity], result of:
            0.13332042 = score(doc=70,freq=1.0), product of:
              0.20301883 = queryWeight, product of:
                1.366238 = boost
                8.405631 = idf(docFreq=26, maxDocs=44421)
                0.017678265 = queryNorm
              0.65668994 = fieldWeight in 70, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.405631 = idf(docFreq=26, maxDocs=44421)
                0.078125 = fieldNorm(doc=70)
          0.060634006 = weight(abstract_txt:networks in 70) [ClassicSimilarity], result of:
            0.060634006 = score(doc=70,freq=1.0), product of:
              0.15127228 = queryWeight, product of:
                1.6678343 = boost
                5.1305847 = idf(docFreq=713, maxDocs=44421)
                0.017678265 = queryNorm
              0.40082693 = fieldWeight in 70, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1305847 = idf(docFreq=713, maxDocs=44421)
                0.078125 = fieldNorm(doc=70)
          0.14888015 = weight(abstract_txt:network in 70) [ClassicSimilarity], result of:
            0.14888015 = score(doc=70,freq=5.0), product of:
              0.18430912 = queryWeight, product of:
                2.254718 = boost
                4.6239696 = idf(docFreq=1184, maxDocs=44421)
                0.017678265 = queryNorm
              0.8077742 = fieldWeight in 70, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.6239696 = idf(docFreq=1184, maxDocs=44421)
                0.078125 = fieldNorm(doc=70)
          0.24040802 = weight(abstract_txt:measures in 70) [ClassicSimilarity], result of:
            0.24040802 = score(doc=70,freq=5.0), product of:
              0.25368118 = queryWeight, product of:
                2.6452272 = boost
                5.424824 = idf(docFreq=531, maxDocs=44421)
                0.017678265 = queryNorm
              0.9476778 = fieldWeight in 70, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.424824 = idf(docFreq=531, maxDocs=44421)
                0.078125 = fieldNorm(doc=70)
        0.2 = coord(5/25)
    
  4. Zhao, S.X.; Ye, F.Y.: Power-law link strength distribution in paper cocitation networks (2013) 0.13
    0.13397579 = sum of:
      0.13397579 = product of:
        0.66987896 = sum of:
          0.11023369 = weight(abstract_txt:nodes in 1973) [ClassicSimilarity], result of:
            0.11023369 = score(doc=1973,freq=2.0), product of:
              0.14195089 = queryWeight, product of:
                1.1424239 = boost
                7.028639 = idf(docFreq=106, maxDocs=44421)
                0.017678265 = queryNorm
              0.77656215 = fieldWeight in 1973, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.028639 = idf(docFreq=106, maxDocs=44421)
                0.078125 = fieldNorm(doc=1973)
          0.14359036 = weight(abstract_txt:node in 1973) [ClassicSimilarity], result of:
            0.14359036 = score(doc=1973,freq=2.0), product of:
              0.16930848 = queryWeight, product of:
                1.2476637 = boost
                7.676116 = idf(docFreq=55, maxDocs=44421)
                0.017678265 = queryNorm
              0.848099 = fieldWeight in 1973, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.676116 = idf(docFreq=55, maxDocs=44421)
                0.078125 = fieldNorm(doc=1973)
          0.10502118 = weight(abstract_txt:networks in 1973) [ClassicSimilarity], result of:
            0.10502118 = score(doc=1973,freq=3.0), product of:
              0.15127228 = queryWeight, product of:
                1.6678343 = boost
                5.1305847 = idf(docFreq=713, maxDocs=44421)
                0.017678265 = queryNorm
              0.6942526 = fieldWeight in 1973, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.1305847 = idf(docFreq=713, maxDocs=44421)
                0.078125 = fieldNorm(doc=1973)
          0.09416009 = weight(abstract_txt:network in 1973) [ClassicSimilarity], result of:
            0.09416009 = score(doc=1973,freq=2.0), product of:
              0.18430912 = queryWeight, product of:
                2.254718 = boost
                4.6239696 = idf(docFreq=1184, maxDocs=44421)
                0.017678265 = queryNorm
              0.5108813 = fieldWeight in 1973, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.6239696 = idf(docFreq=1184, maxDocs=44421)
                0.078125 = fieldNorm(doc=1973)
          0.21687363 = weight(abstract_txt:link in 1973) [ClassicSimilarity], result of:
            0.21687363 = score(doc=1973,freq=3.0), product of:
              0.28080815 = queryWeight, product of:
                2.7830672 = boost
                5.707506 = idf(docFreq=400, maxDocs=44421)
                0.017678265 = queryNorm
              0.77231956 = fieldWeight in 1973, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.707506 = idf(docFreq=400, maxDocs=44421)
                0.078125 = fieldNorm(doc=1973)
        0.2 = coord(5/25)
    
  5. Ma, Z.; Sun, A.; Cong, G.: On predicting the popularity of newly emerging hashtags in Twitter (2013) 0.13
    0.13140084 = sum of:
      0.13140084 = product of:
        0.4692887 = sum of:
          0.041822266 = weight(abstract_txt:extracted in 1967) [ClassicSimilarity], result of:
            0.041822266 = score(doc=1967,freq=1.0), product of:
              0.108763605 = queryWeight, product of:
                6.1523914 = idf(docFreq=256, maxDocs=44421)
                0.017678265 = queryNorm
              0.38452446 = fieldWeight in 1967, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1523914 = idf(docFreq=256, maxDocs=44421)
                0.0625 = fieldNorm(doc=1967)
          0.060686123 = weight(abstract_txt:near in 1967) [ClassicSimilarity], result of:
            0.060686123 = score(doc=1967,freq=1.0), product of:
              0.13940279 = queryWeight, product of:
                1.1321238 = boost
                6.965269 = idf(docFreq=113, maxDocs=44421)
                0.017678265 = queryNorm
              0.43532932 = fieldWeight in 1967, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.965269 = idf(docFreq=113, maxDocs=44421)
                0.0625 = fieldNorm(doc=1967)
          0.080666386 = weight(abstract_txt:outperform in 1967) [ClassicSimilarity], result of:
            0.080666386 = score(doc=1967,freq=1.0), product of:
              0.1685286 = queryWeight, product of:
                1.2447869 = boost
                7.6584163 = idf(docFreq=56, maxDocs=44421)
                0.017678265 = queryNorm
              0.47865102 = fieldWeight in 1967, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.6584163 = idf(docFreq=56, maxDocs=44421)
                0.0625 = fieldNorm(doc=1967)
          0.026670484 = weight(abstract_txt:social in 1967) [ClassicSimilarity], result of:
            0.026670484 = score(doc=1967,freq=1.0), product of:
              0.10152566 = queryWeight, product of:
                1.3663473 = boost
                4.2031517 = idf(docFreq=1804, maxDocs=44421)
                0.017678265 = queryNorm
              0.26269698 = fieldWeight in 1967, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2031517 = idf(docFreq=1804, maxDocs=44421)
                0.0625 = fieldNorm(doc=1967)
          0.027285995 = weight(abstract_txt:future in 1967) [ClassicSimilarity], result of:
            0.027285995 = score(doc=1967,freq=1.0), product of:
              0.10308174 = queryWeight, product of:
                1.3767785 = boost
                4.23524 = idf(docFreq=1747, maxDocs=44421)
                0.017678265 = queryNorm
              0.2647025 = fieldWeight in 1967, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.23524 = idf(docFreq=1747, maxDocs=44421)
                0.0625 = fieldNorm(doc=1967)
          0.031851556 = weight(abstract_txt:problem in 1967) [ClassicSimilarity], result of:
            0.031851556 = score(doc=1967,freq=1.0), product of:
              0.11428142 = queryWeight, product of:
                1.4496429 = boost
                4.4593854 = idf(docFreq=1396, maxDocs=44421)
                0.017678265 = queryNorm
              0.2787116 = fieldWeight in 1967, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4593854 = idf(docFreq=1396, maxDocs=44421)
                0.0625 = fieldNorm(doc=1967)
          0.20030591 = weight(abstract_txt:prediction in 1967) [ClassicSimilarity], result of:
            0.20030591 = score(doc=1967,freq=1.0), product of:
              0.44570565 = queryWeight, product of:
                3.5062501 = boost
                7.190608 = idf(docFreq=90, maxDocs=44421)
                0.017678265 = queryNorm
              0.449413 = fieldWeight in 1967, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.190608 = idf(docFreq=90, maxDocs=44421)
                0.0625 = fieldNorm(doc=1967)
        0.28 = coord(7/25)
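
The content scores above combine a per-term product, queryWeight × fieldWeight, with a coordination factor for the share of query terms matched. The sketch below recomputes hit 3 (doc=70) from the boost, docFreq, freq, fieldNorm, and queryNorm values in the listing. It assumes idf = 1 + ln(maxDocs / (docFreq + 1)) and tf = sqrt(freq), which agree with the printed values; the helper names are illustrative and are not Lucene API calls.

```python
import math

MAX_DOCS = 44421
QUERY_NORM = 0.017678265  # taken from the listing


def idf(doc_freq):
    # Assumed form that matches the idf values printed above.
    return 1.0 + math.log(MAX_DOCS / (doc_freq + 1))


def term_score(boost, doc_freq, freq, field_norm):
    term_idf = idf(doc_freq)
    query_weight = boost * term_idf * QUERY_NORM
    field_weight = math.sqrt(freq) * term_idf * field_norm
    return query_weight * field_weight


# (boost, docFreq, freq) per matched term for hit 3; fieldNorm is 0.078125.
terms = {
    "coauthorship": (1.2832092, 44, 3.0),
    "topology": (1.366238, 26, 1.0),
    "networks": (1.6678343, 713, 1.0),
    "network": (2.254718, 1184, 5.0),
    "measures": (2.6452272, 531, 5.0),
}

raw = sum(term_score(b, df, f, 0.078125) for b, df, f in terms.values())
score = raw * (5 / 25)  # coord: 5 of the 25 query terms matched
```

The coordination factor explains the ranking: hit 1 has a slightly lower raw sum than hit 3 (0.758 against 0.775) but matches 7 of 25 terms instead of 5, so it ends up with the higher final score.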