Document (#34710)

Author
Wallace, M.L.
Gingras, Y.
Duhon, R.
Title
¬A new approach for detecting scientific specialties from raw cocitation networks
Source
Journal of the American Society for Information Science and Technology. 60(2009) no.2, S.240-246
Year
2009
Abstract
We use a technique recently developed by V. Blondel, J.-L. Guillaume, R. Lambiotte, and E. Lefebvre (2008) to detect scientific specialties from author cocitation networks. This algorithm has distinct advantages over most previous methods used to obtain cocitation clusters since it avoids the use of similarity measures, relies entirely on the topology of the weighted network, and can be applied to relatively large networks. Most importantly, it requires no subjective interpretation of the cocitation data or of the communities found. Using two examples, we show that the resulting specialties are the smallest coherent groups of researchers (within a hierarchy of cluster sizes) and can thus be identified unambiguously. Furthermore, we confirm that these communities are indeed representative of what we know about the structure of a given scientific discipline and that as specialties, they can be accurately characterized by a few keywords (from the publication titles). We argue that this robust and efficient algorithm is particularly well-suited to cocitation networks and that the results generated can be of great use to researchers studying various facets of the structure and evolution of science.
Theme
Informetrie

Similar documents (author)

  1. Gingras, Y.: Bibliometrics and research evaluation : uses and abuses (2016) 2.01
    2.0149353 = sum of:
      2.0149353 = product of:
        4.0298705 = sum of:
          4.0298705 = weight(author_txt:gingras in 4805) [ClassicSimilarity], result of:
            4.0298705 = score(doc=4805,freq=1.0), product of:
              0.7169458 = queryWeight, product of:
                1.0141137 = boost
                8.993418 = idf(docFreq=14, maxDocs=44421)
                0.07860948 = queryNorm
              5.620886 = fieldWeight in 4805, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.993418 = idf(docFreq=14, maxDocs=44421)
                0.625 = fieldNorm(doc=4805)
        0.5 = coord(1/2)
    
  2. Wallace, P.M.: How do patrons search the online catalog when no one's looking? : transaction log analysis and implications for bibliographic instruction and system design (1993) 1.93
    1.9319739 = sum of:
      1.9319739 = product of:
        3.8639479 = sum of:
          3.8639479 = weight(author_txt:wallace in 6973) [ClassicSimilarity], result of:
            3.8639479 = score(doc=6973,freq=1.0), product of:
              0.6971289 = queryWeight, product of:
                8.868255 = idf(docFreq=16, maxDocs=44421)
                0.07860948 = queryNorm
              5.5426593 = fieldWeight in 6973, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.868255 = idf(docFreq=16, maxDocs=44421)
                0.625 = fieldNorm(doc=6973)
        0.5 = coord(1/2)
    
  3. Wallace, D.A.: ¬The World Wide Web and Mosaic (1994) 1.93
    1.9319739 = sum of:
      1.9319739 = product of:
        3.8639479 = sum of:
          3.8639479 = weight(author_txt:wallace in 1565) [ClassicSimilarity], result of:
            3.8639479 = score(doc=1565,freq=1.0), product of:
              0.6971289 = queryWeight, product of:
                8.868255 = idf(docFreq=16, maxDocs=44421)
                0.07860948 = queryNorm
              5.5426593 = fieldWeight in 1565, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.868255 = idf(docFreq=16, maxDocs=44421)
                0.625 = fieldNorm(doc=1565)
        0.5 = coord(1/2)
    
  4. Wallace, A.H.: Developing a slide/tape to teach end-user searching (1990) 1.93
    1.9319739 = sum of:
      1.9319739 = product of:
        3.8639479 = sum of:
          3.8639479 = weight(author_txt:wallace in 5773) [ClassicSimilarity], result of:
            3.8639479 = score(doc=5773,freq=1.0), product of:
              0.6971289 = queryWeight, product of:
                8.868255 = idf(docFreq=16, maxDocs=44421)
                0.07860948 = queryNorm
              5.5426593 = fieldWeight in 5773, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.868255 = idf(docFreq=16, maxDocs=44421)
                0.625 = fieldNorm(doc=5773)
        0.5 = coord(1/2)
    
  5. Wallace, D.A.: Archives and the information superhighway : current status and future challenges (1996) 1.93
    1.9319739 = sum of:
      1.9319739 = product of:
        3.8639479 = sum of:
          3.8639479 = weight(author_txt:wallace in 6444) [ClassicSimilarity], result of:
            3.8639479 = score(doc=6444,freq=1.0), product of:
              0.6971289 = queryWeight, product of:
                8.868255 = idf(docFreq=16, maxDocs=44421)
                0.07860948 = queryNorm
              5.5426593 = fieldWeight in 6444, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.868255 = idf(docFreq=16, maxDocs=44421)
                0.625 = fieldNorm(doc=6444)
        0.5 = coord(1/2)
    

Similar documents (content)

  1. White, H.D.: Pathfinder networks and author cocitation analysis : a remapping of paradigmatic information scientists (2003) 0.24
    0.24145894 = sum of:
      0.24145894 = product of:
        1.006079 = sum of:
          0.015520208 = weight(abstract_txt:most in 2459) [ClassicSimilarity], result of:
            0.015520208 = score(doc=2459,freq=1.0), product of:
              0.06298977 = queryWeight, product of:
                1.112641 = boost
                3.94228 = idf(docFreq=2342, maxDocs=44421)
                0.014360431 = queryNorm
              0.2463925 = fieldWeight in 2459, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.94228 = idf(docFreq=2342, maxDocs=44421)
                0.0625 = fieldNorm(doc=2459)
          0.011290282 = weight(abstract_txt:from in 2459) [ClassicSimilarity], result of:
            0.011290282 = score(doc=2459,freq=2.0), product of:
              0.046290863 = queryWeight, product of:
                1.1681895 = boost
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.014360431 = queryNorm
              0.2438987 = fieldWeight in 2459, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.0625 = fieldNorm(doc=2459)
          0.014508194 = weight(abstract_txt:that in 2459) [ClassicSimilarity], result of:
            0.014508194 = score(doc=2459,freq=3.0), product of:
              0.05667004 = queryWeight, product of:
                1.668656 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.014360431 = queryNorm
              0.25601172 = fieldWeight in 2459, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.0625 = fieldNorm(doc=2459)
          0.06842041 = weight(abstract_txt:networks in 2459) [ClassicSimilarity], result of:
            0.06842041 = score(doc=2459,freq=1.0), product of:
              0.21337266 = queryWeight, product of:
                2.8960392 = boost
                5.1305847 = idf(docFreq=713, maxDocs=44421)
                0.014360431 = queryNorm
              0.32066154 = fieldWeight in 2459, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1305847 = idf(docFreq=713, maxDocs=44421)
                0.0625 = fieldNorm(doc=2459)
          0.40022725 = weight(abstract_txt:specialties in 2459) [ClassicSimilarity], result of:
            0.40022725 = score(doc=2459,freq=2.0), product of:
              0.54980594 = queryWeight, product of:
                4.6487885 = boost
                8.235732 = idf(docFreq=31, maxDocs=44421)
                0.014360431 = queryNorm
              0.72794276 = fieldWeight in 2459, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.235732 = idf(docFreq=31, maxDocs=44421)
                0.0625 = fieldNorm(doc=2459)
          0.49611259 = weight(abstract_txt:cocitation in 2459) [ClassicSimilarity], result of:
            0.49611259 = score(doc=2459,freq=3.0), product of:
              0.59703267 = queryWeight, product of:
                5.4161305 = boost
                7.676116 = idf(docFreq=55, maxDocs=44421)
                0.014360431 = queryNorm
              0.8309639 = fieldWeight in 2459, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.676116 = idf(docFreq=55, maxDocs=44421)
                0.0625 = fieldNorm(doc=2459)
        0.24 = coord(6/25)
    
  2. Ding, W.; Chen, C.: Dynamic topic detection and tracking : a comparison of HDP, C-word, and cocitation methods (2014) 0.21
    0.21013433 = sum of:
      0.21013433 = product of:
        0.87555975 = sum of:
          0.056575775 = weight(abstract_txt:detect in 2502) [ClassicSimilarity], result of:
            0.056575775 = score(doc=2502,freq=1.0), product of:
              0.10204898 = queryWeight, product of:
                1.0014042 = boost
                7.0962973 = idf(docFreq=99, maxDocs=44421)
                0.014360431 = queryNorm
              0.55439824 = fieldWeight in 2502, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.0962973 = idf(docFreq=99, maxDocs=44421)
                0.078125 = fieldNorm(doc=2502)
          0.10198335 = weight(abstract_txt:detecting in 2502) [ClassicSimilarity], result of:
            0.10198335 = score(doc=2502,freq=2.0), product of:
              0.11996775 = queryWeight, product of:
                1.0857687 = boost
                7.694134 = idf(docFreq=54, maxDocs=44421)
                0.014360431 = queryNorm
              0.8500897 = fieldWeight in 2502, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.694134 = idf(docFreq=54, maxDocs=44421)
                0.078125 = fieldNorm(doc=2502)
          0.03483676 = weight(abstract_txt:researchers in 2502) [ClassicSimilarity], result of:
            0.03483676 = score(doc=2502,freq=1.0), product of:
              0.093058676 = queryWeight, product of:
                1.3523792 = boost
                4.791714 = idf(docFreq=1001, maxDocs=44421)
                0.014360431 = queryNorm
              0.37435266 = fieldWeight in 2502, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.791714 = idf(docFreq=1001, maxDocs=44421)
                0.078125 = fieldNorm(doc=2502)
          0.014807366 = weight(abstract_txt:that in 2502) [ClassicSimilarity], result of:
            0.014807366 = score(doc=2502,freq=2.0), product of:
              0.05667004 = queryWeight, product of:
                1.668656 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.014360431 = queryNorm
              0.2612909 = fieldWeight in 2502, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.078125 = fieldNorm(doc=2502)
          0.04721576 = weight(abstract_txt:scientific in 2502) [ClassicSimilarity], result of:
            0.04721576 = score(doc=2502,freq=1.0), product of:
              0.13046281 = queryWeight, product of:
                1.9611421 = boost
                4.6324444 = idf(docFreq=1174, maxDocs=44421)
                0.014360431 = queryNorm
              0.36190972 = fieldWeight in 2502, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6324444 = idf(docFreq=1174, maxDocs=44421)
                0.078125 = fieldNorm(doc=2502)
          0.62014073 = weight(abstract_txt:cocitation in 2502) [ClassicSimilarity], result of:
            0.62014073 = score(doc=2502,freq=3.0), product of:
              0.59703267 = queryWeight, product of:
                5.4161305 = boost
                7.676116 = idf(docFreq=55, maxDocs=44421)
                0.014360431 = queryNorm
              1.0387049 = fieldWeight in 2502, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.676116 = idf(docFreq=55, maxDocs=44421)
                0.078125 = fieldNorm(doc=2502)
        0.24 = coord(6/25)
    
  3. Chen, C.; Kuljis, J.: ¬The rising landscape : a visual exploration of superstring revolutions in physics (2003) 0.20
    0.19554083 = sum of:
      0.19554083 = product of:
        0.97770417 = sum of:
          0.0119751515 = weight(abstract_txt:from in 2469) [ClassicSimilarity], result of:
            0.0119751515 = score(doc=2469,freq=1.0), product of:
              0.046290863 = queryWeight, product of:
                1.1681895 = boost
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.014360431 = queryNorm
              0.25869364 = fieldWeight in 2469, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.09375 = fieldNorm(doc=2469)
          0.031449977 = weight(abstract_txt:structure in 2469) [ClassicSimilarity], result of:
            0.031449977 = score(doc=2469,freq=1.0), product of:
              0.076976426 = queryWeight, product of:
                1.2299825 = boost
                4.3580413 = idf(docFreq=1545, maxDocs=44421)
                0.014360431 = queryNorm
              0.40856636 = fieldWeight in 2469, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3580413 = idf(docFreq=1545, maxDocs=44421)
                0.09375 = fieldNorm(doc=2469)
          0.0801278 = weight(abstract_txt:scientific in 2469) [ClassicSimilarity], result of:
            0.0801278 = score(doc=2469,freq=2.0), product of:
              0.13046281 = queryWeight, product of:
                1.9611421 = boost
                4.6324444 = idf(docFreq=1174, maxDocs=44421)
                0.014360431 = queryNorm
              0.61418116 = fieldWeight in 2469, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.6324444 = idf(docFreq=1174, maxDocs=44421)
                0.09375 = fieldNorm(doc=2469)
          0.42450508 = weight(abstract_txt:specialties in 2469) [ClassicSimilarity], result of:
            0.42450508 = score(doc=2469,freq=1.0), product of:
              0.54980594 = queryWeight, product of:
                4.6487885 = boost
                8.235732 = idf(docFreq=31, maxDocs=44421)
                0.014360431 = queryNorm
              0.77209985 = fieldWeight in 2469, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.235732 = idf(docFreq=31, maxDocs=44421)
                0.09375 = fieldNorm(doc=2469)
          0.4296461 = weight(abstract_txt:cocitation in 2469) [ClassicSimilarity], result of:
            0.4296461 = score(doc=2469,freq=1.0), product of:
              0.59703267 = queryWeight, product of:
                5.4161305 = boost
                7.676116 = idf(docFreq=55, maxDocs=44421)
                0.014360431 = queryNorm
              0.71963584 = fieldWeight in 2469, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.676116 = idf(docFreq=55, maxDocs=44421)
                0.09375 = fieldNorm(doc=2469)
        0.2 = coord(5/25)
    
  4. Chen, C.; Ibekwe-SanJuan, F.; Hou, J.: ¬The structure and dynamics of cocitation clusters : a multiple-perspective cocitation analysis (2010) 0.19
    0.19159982 = sum of:
      0.19159982 = product of:
        1.1974989 = sum of:
          0.026208317 = weight(abstract_txt:structure in 578) [ClassicSimilarity], result of:
            0.026208317 = score(doc=578,freq=1.0), product of:
              0.076976426 = queryWeight, product of:
                1.2299825 = boost
                4.3580413 = idf(docFreq=1545, maxDocs=44421)
                0.014360431 = queryNorm
              0.34047198 = fieldWeight in 578, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3580413 = idf(docFreq=1545, maxDocs=44421)
                0.078125 = fieldNorm(doc=578)
          0.010470388 = weight(abstract_txt:that in 578) [ClassicSimilarity], result of:
            0.010470388 = score(doc=578,freq=1.0), product of:
              0.05667004 = queryWeight, product of:
                1.668656 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.014360431 = queryNorm
              0.18476056 = fieldWeight in 578, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.078125 = fieldNorm(doc=578)
          0.14813453 = weight(abstract_txt:networks in 578) [ClassicSimilarity], result of:
            0.14813453 = score(doc=578,freq=3.0), product of:
              0.21337266 = queryWeight, product of:
                2.8960392 = boost
                5.1305847 = idf(docFreq=713, maxDocs=44421)
                0.014360431 = queryNorm
              0.6942526 = fieldWeight in 578, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.1305847 = idf(docFreq=713, maxDocs=44421)
                0.078125 = fieldNorm(doc=578)
          1.0126857 = weight(abstract_txt:cocitation in 578) [ClassicSimilarity], result of:
            1.0126857 = score(doc=578,freq=8.0), product of:
              0.59703267 = queryWeight, product of:
                5.4161305 = boost
                7.676116 = idf(docFreq=55, maxDocs=44421)
                0.014360431 = queryNorm
              1.696198 = fieldWeight in 578, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                7.676116 = idf(docFreq=55, maxDocs=44421)
                0.078125 = fieldNorm(doc=578)
        0.16 = coord(4/25)
    
  5. Quirin, A.; Cordón, O.; Santamaría, J.; Vargas-Quesada, B.; Moya-Anegón, F.: ¬A new variant of the Pathfinder algorithm to generate large visual science maps in cubic time (2008) 0.19
    0.19134411 = sum of:
      0.19134411 = product of:
        0.68337184 = sum of:
          0.009878996 = weight(abstract_txt:from in 3112) [ClassicSimilarity], result of:
            0.009878996 = score(doc=3112,freq=2.0), product of:
              0.046290863 = queryWeight, product of:
                1.1681895 = boost
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.014360431 = queryNorm
              0.21341136 = fieldWeight in 3112, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.0546875 = fieldNorm(doc=3112)
          0.018345822 = weight(abstract_txt:structure in 3112) [ClassicSimilarity], result of:
            0.018345822 = score(doc=3112,freq=1.0), product of:
              0.076976426 = queryWeight, product of:
                1.2299825 = boost
                4.3580413 = idf(docFreq=1545, maxDocs=44421)
                0.014360431 = queryNorm
              0.23833038 = fieldWeight in 3112, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3580413 = idf(docFreq=1545, maxDocs=44421)
                0.0546875 = fieldNorm(doc=3112)
          0.082312085 = weight(abstract_txt:algorithm in 3112) [ClassicSimilarity], result of:
            0.082312085 = score(doc=3112,freq=4.0), product of:
              0.13191333 = queryWeight, product of:
                1.6101428 = boost
                5.7050157 = idf(docFreq=401, maxDocs=44421)
                0.014360431 = queryNorm
              0.62398607 = fieldWeight in 3112, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.7050157 = idf(docFreq=401, maxDocs=44421)
                0.0546875 = fieldNorm(doc=3112)
          0.007329272 = weight(abstract_txt:that in 3112) [ClassicSimilarity], result of:
            0.007329272 = score(doc=3112,freq=1.0), product of:
              0.05667004 = queryWeight, product of:
                1.668656 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.014360431 = queryNorm
              0.1293324 = fieldWeight in 3112, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.0546875 = fieldNorm(doc=3112)
          0.046741217 = weight(abstract_txt:scientific in 3112) [ClassicSimilarity], result of:
            0.046741217 = score(doc=3112,freq=2.0), product of:
              0.13046281 = queryWeight, product of:
                1.9611421 = boost
                4.6324444 = idf(docFreq=1174, maxDocs=44421)
                0.014360431 = queryNorm
              0.35827234 = fieldWeight in 3112, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.6324444 = idf(docFreq=1174, maxDocs=44421)
                0.0546875 = fieldNorm(doc=3112)
          0.08466594 = weight(abstract_txt:networks in 3112) [ClassicSimilarity], result of:
            0.08466594 = score(doc=3112,freq=2.0), product of:
              0.21337266 = queryWeight, product of:
                2.8960392 = boost
                5.1305847 = idf(docFreq=713, maxDocs=44421)
                0.014360431 = queryNorm
              0.39679843 = fieldWeight in 3112, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.1305847 = idf(docFreq=713, maxDocs=44421)
                0.0546875 = fieldNorm(doc=3112)
          0.4340985 = weight(abstract_txt:cocitation in 3112) [ClassicSimilarity], result of:
            0.4340985 = score(doc=3112,freq=3.0), product of:
              0.59703267 = queryWeight, product of:
                5.4161305 = boost
                7.676116 = idf(docFreq=55, maxDocs=44421)
                0.014360431 = queryNorm
              0.7270934 = fieldWeight in 3112, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.676116 = idf(docFreq=55, maxDocs=44421)
                0.0546875 = fieldNorm(doc=3112)
        0.28 = coord(7/25)