Document (#38134)

Author
Serpa, F.G.
Graves, A.M.
Javier, A.
Title
Statistical common author networks
Source
Journal of the American Society for Information Science and Technology. 64(2013) no.12, S.2507-2512
Year
2013
Abstract
A new method for visualizing the relatedness of scientific areas has been developed that is based on measuring the overlap of researchers between areas. It is found that closely related areas have a high propensity to share a larger number of common authors. A method for comparing areas of vastly different sizes and to handle name homonymy is constructed, allowing for the robust deployment of this method on real data sets. A statistical analysis of the probability distributions of the common author overlap that accounts for noise is carried out along with the production of network maps with weighted links proportional to the overlap strength. This is demonstrated on 2 case studies, complexity science and neutrino physics, where the level of relatedness of areas within each area is expected to vary greatly. It is found that the results returned by this method closely match the intuitive expectation that the broad, multidisciplinary area of complexity science possesses areas that are weakly related to each other, whereas the much narrower area of neutrino physics shows very strongly related areas.
Theme
Informetrie

Similar documents (content)

  1. Braun, T.; Glanzel, W.; Grupp, H.: ¬The scientometric weight of 50 nations in 27 scientific areas, 1989-1993 : Pt.1: All fields combined, mathematics, engineering, chemistry and physics (1995) 0.17
    0.17383808 = sum of:
      0.17383808 = product of:
        0.62085027 = sum of:
          0.045157433 = weight(abstract_txt:science in 829) [ClassicSimilarity], result of:
            0.045157433 = score(doc=829,freq=3.0), product of:
              0.07222219 = queryWeight, product of:
                3.850585 = idf(docFreq=2567, maxDocs=44421)
                0.01875616 = queryNorm
              0.6252571 = fieldWeight in 829, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.850585 = idf(docFreq=2567, maxDocs=44421)
                0.09375 = fieldNorm(doc=829)
          0.031882785 = weight(abstract_txt:each in 829) [ClassicSimilarity], result of:
            0.031882785 = score(doc=829,freq=1.0), product of:
              0.082590304 = queryWeight, product of:
                1.069373 = boost
                4.1177115 = idf(docFreq=1965, maxDocs=44421)
                0.01875616 = queryNorm
              0.38603544 = fieldWeight in 829, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1177115 = idf(docFreq=1965, maxDocs=44421)
                0.09375 = fieldNorm(doc=829)
          0.040830597 = weight(abstract_txt:found in 829) [ClassicSimilarity], result of:
            0.040830597 = score(doc=829,freq=1.0), product of:
              0.0973977 = queryWeight, product of:
                1.1612855 = boost
                4.4716287 = idf(docFreq=1379, maxDocs=44421)
                0.01875616 = queryNorm
              0.4192152 = fieldWeight in 829, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4716287 = idf(docFreq=1379, maxDocs=44421)
                0.09375 = fieldNorm(doc=829)
          0.1363749 = weight(abstract_txt:physics in 829) [ClassicSimilarity], result of:
            0.1363749 = score(doc=829,freq=1.0), product of:
              0.21762788 = queryWeight, product of:
                1.735889 = boost
                6.684188 = idf(docFreq=150, maxDocs=44421)
                0.01875616 = queryNorm
              0.6266426 = fieldWeight in 829, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.684188 = idf(docFreq=150, maxDocs=44421)
                0.09375 = fieldNorm(doc=829)
          0.01812039 = weight(abstract_txt:that in 829) [ClassicSimilarity], result of:
            0.01812039 = score(doc=829,freq=1.0), product of:
              0.08172915 = queryWeight, product of:
                1.8425267 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.01875616 = queryNorm
              0.22171268 = fieldWeight in 829, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.09375 = fieldNorm(doc=829)
          0.08714396 = weight(abstract_txt:area in 829) [ClassicSimilarity], result of:
            0.08714396 = score(doc=829,freq=1.0), product of:
              0.18481909 = queryWeight, product of:
                1.9592223 = boost
                5.0294347 = idf(docFreq=789, maxDocs=44421)
                0.01875616 = queryNorm
              0.47150952 = fieldWeight in 829, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.0294347 = idf(docFreq=789, maxDocs=44421)
                0.09375 = fieldNorm(doc=829)
          0.2613402 = weight(abstract_txt:areas in 829) [ClassicSimilarity], result of:
            0.2613402 = score(doc=829,freq=2.0), product of:
              0.40461478 = queryWeight, product of:
                4.428122 = boost
                4.871674 = idf(docFreq=924, maxDocs=44421)
                0.01875616 = queryNorm
              0.6458988 = fieldWeight in 829, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.871674 = idf(docFreq=924, maxDocs=44421)
                0.09375 = fieldNorm(doc=829)
        0.28 = coord(7/25)
    
  2. Wang, F.; Wolfram, D.: Assessment of journal similarity based on citing discipline analysis (2015) 0.16
    0.15882978 = sum of:
      0.15882978 = product of:
        0.49634308 = sum of:
          0.042574838 = weight(abstract_txt:science in 2849) [ClassicSimilarity], result of:
            0.042574838 = score(doc=2849,freq=6.0), product of:
              0.07222219 = queryWeight, product of:
                3.850585 = idf(docFreq=2567, maxDocs=44421)
                0.01875616 = queryNorm
              0.58949804 = fieldWeight in 2849, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                3.850585 = idf(docFreq=2567, maxDocs=44421)
                0.0625 = fieldNorm(doc=2849)
          0.021255191 = weight(abstract_txt:each in 2849) [ClassicSimilarity], result of:
            0.021255191 = score(doc=2849,freq=1.0), product of:
              0.082590304 = queryWeight, product of:
                1.069373 = boost
                4.1177115 = idf(docFreq=1965, maxDocs=44421)
                0.01875616 = queryNorm
              0.25735697 = fieldWeight in 2849, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1177115 = idf(docFreq=1965, maxDocs=44421)
                0.0625 = fieldNorm(doc=2849)
          0.047784384 = weight(abstract_txt:related in 2849) [ClassicSimilarity], result of:
            0.047784384 = score(doc=2849,freq=2.0), product of:
              0.12877458 = queryWeight, product of:
                1.6354053 = boost
                4.198178 = idf(docFreq=1813, maxDocs=44421)
                0.01875616 = queryNorm
              0.37107 = fieldWeight in 2849, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.198178 = idf(docFreq=1813, maxDocs=44421)
                0.0625 = fieldNorm(doc=2849)
          0.1270726 = weight(abstract_txt:closely in 2849) [ClassicSimilarity], result of:
            0.1270726 = score(doc=2849,freq=2.0), product of:
              0.21592869 = queryWeight, product of:
                1.7290989 = boost
                6.6580424 = idf(docFreq=154, maxDocs=44421)
                0.01875616 = queryNorm
              0.58849335 = fieldWeight in 2849, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.6580424 = idf(docFreq=154, maxDocs=44421)
                0.0625 = fieldNorm(doc=2849)
          0.02092362 = weight(abstract_txt:that in 2849) [ClassicSimilarity], result of:
            0.02092362 = score(doc=2849,freq=3.0), product of:
              0.08172915 = queryWeight, product of:
                1.8425267 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.01875616 = queryNorm
              0.25601172 = fieldWeight in 2849, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.0625 = fieldNorm(doc=2849)
          0.05809597 = weight(abstract_txt:area in 2849) [ClassicSimilarity], result of:
            0.05809597 = score(doc=2849,freq=1.0), product of:
              0.18481909 = queryWeight, product of:
                1.9592223 = boost
                5.0294347 = idf(docFreq=789, maxDocs=44421)
                0.01875616 = queryNorm
              0.31433967 = fieldWeight in 2849, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.0294347 = idf(docFreq=789, maxDocs=44421)
                0.0625 = fieldNorm(doc=2849)
          0.055439487 = weight(abstract_txt:method in 2849) [ClassicSimilarity], result of:
            0.055439487 = score(doc=2849,freq=1.0), product of:
              0.19717047 = queryWeight, product of:
                2.3366873 = boost
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.01875616 = queryNorm
              0.2811754 = fieldWeight in 2849, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.0625 = fieldNorm(doc=2849)
          0.12319696 = weight(abstract_txt:areas in 2849) [ClassicSimilarity], result of:
            0.12319696 = score(doc=2849,freq=1.0), product of:
              0.40461478 = queryWeight, product of:
                4.428122 = boost
                4.871674 = idf(docFreq=924, maxDocs=44421)
                0.01875616 = queryNorm
              0.30447963 = fieldWeight in 2849, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.871674 = idf(docFreq=924, maxDocs=44421)
                0.0625 = fieldNorm(doc=2849)
        0.32 = coord(8/25)
    
  3. Boyack, K.W.; Wylie, B.N.; Davidson, G.S.: Domain visualization using VxInsight®) [register mark] for science and technology management (2002) 0.15
    0.14860037 = sum of:
      0.14860037 = product of:
        0.46437615 = sum of:
          0.026341835 = weight(abstract_txt:science in 244) [ClassicSimilarity], result of:
            0.026341835 = score(doc=244,freq=3.0), product of:
              0.07222219 = queryWeight, product of:
                3.850585 = idf(docFreq=2567, maxDocs=44421)
                0.01875616 = queryNorm
              0.36473328 = fieldWeight in 244, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.850585 = idf(docFreq=2567, maxDocs=44421)
                0.0546875 = fieldNorm(doc=244)
          0.032213185 = weight(abstract_txt:each in 244) [ClassicSimilarity], result of:
            0.032213185 = score(doc=244,freq=3.0), product of:
              0.082590304 = queryWeight, product of:
                1.069373 = boost
                4.1177115 = idf(docFreq=1965, maxDocs=44421)
                0.01875616 = queryNorm
              0.39003593 = fieldWeight in 244, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.1177115 = idf(docFreq=1965, maxDocs=44421)
                0.0546875 = fieldNorm(doc=244)
          0.029565081 = weight(abstract_txt:related in 244) [ClassicSimilarity], result of:
            0.029565081 = score(doc=244,freq=1.0), product of:
              0.12877458 = queryWeight, product of:
                1.6354053 = boost
                4.198178 = idf(docFreq=1813, maxDocs=44421)
                0.01875616 = queryNorm
              0.22958785 = fieldWeight in 244, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.198178 = idf(docFreq=1813, maxDocs=44421)
                0.0546875 = fieldNorm(doc=244)
          0.079552025 = weight(abstract_txt:physics in 244) [ClassicSimilarity], result of:
            0.079552025 = score(doc=244,freq=1.0), product of:
              0.21762788 = queryWeight, product of:
                1.735889 = boost
                6.684188 = idf(docFreq=150, maxDocs=44421)
                0.01875616 = queryNorm
              0.36554152 = fieldWeight in 244, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.684188 = idf(docFreq=150, maxDocs=44421)
                0.0546875 = fieldNorm(doc=244)
          0.010570227 = weight(abstract_txt:that in 244) [ClassicSimilarity], result of:
            0.010570227 = score(doc=244,freq=1.0), product of:
              0.08172915 = queryWeight, product of:
                1.8425267 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.01875616 = queryNorm
              0.1293324 = fieldWeight in 244, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.0546875 = fieldNorm(doc=244)
          0.044321585 = weight(abstract_txt:common in 244) [ClassicSimilarity], result of:
            0.044321585 = score(doc=244,freq=1.0), product of:
              0.16867639 = queryWeight, product of:
                1.8717052 = boost
                4.8047733 = idf(docFreq=988, maxDocs=44421)
                0.01875616 = queryNorm
              0.26276106 = fieldWeight in 244, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8047733 = idf(docFreq=988, maxDocs=44421)
                0.0546875 = fieldNorm(doc=244)
          0.13401487 = weight(abstract_txt:overlap in 244) [ClassicSimilarity], result of:
            0.13401487 = score(doc=244,freq=1.0), product of:
              0.35270596 = queryWeight, product of:
                2.706554 = boost
                6.9478774 = idf(docFreq=115, maxDocs=44421)
                0.01875616 = queryNorm
              0.37996206 = fieldWeight in 244, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9478774 = idf(docFreq=115, maxDocs=44421)
                0.0546875 = fieldNorm(doc=244)
          0.10779734 = weight(abstract_txt:areas in 244) [ClassicSimilarity], result of:
            0.10779734 = score(doc=244,freq=1.0), product of:
              0.40461478 = queryWeight, product of:
                4.428122 = boost
                4.871674 = idf(docFreq=924, maxDocs=44421)
                0.01875616 = queryNorm
              0.26641968 = fieldWeight in 244, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.871674 = idf(docFreq=924, maxDocs=44421)
                0.0546875 = fieldNorm(doc=244)
        0.32 = coord(8/25)
    
  4. Talvensaari, T.; Laurikkala, J.; Järvelin, K.; Juhola, M.: ¬A study on automatic creation of a comparable document collection in cross-language information retrieval (2006) 0.14
    0.14142233 = sum of:
      0.14142233 = product of:
        0.50507975 = sum of:
          0.1041255 = weight(abstract_txt:weakly in 601) [ClassicSimilarity], result of:
            0.1041255 = score(doc=601,freq=1.0), product of:
              0.18908067 = queryWeight, product of:
                1.1441244 = boost
                8.811096 = idf(docFreq=17, maxDocs=44421)
                0.01875616 = queryNorm
              0.5506935 = fieldWeight in 601, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.811096 = idf(docFreq=17, maxDocs=44421)
                0.0625 = fieldNorm(doc=601)
          0.027220396 = weight(abstract_txt:found in 601) [ClassicSimilarity], result of:
            0.027220396 = score(doc=601,freq=1.0), product of:
              0.0973977 = queryWeight, product of:
                1.1612855 = boost
                4.4716287 = idf(docFreq=1379, maxDocs=44421)
                0.01875616 = queryNorm
              0.2794768 = fieldWeight in 601, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4716287 = idf(docFreq=1379, maxDocs=44421)
                0.0625 = fieldNorm(doc=601)
          0.033788662 = weight(abstract_txt:related in 601) [ClassicSimilarity], result of:
            0.033788662 = score(doc=601,freq=1.0), product of:
              0.12877458 = queryWeight, product of:
                1.6354053 = boost
                4.198178 = idf(docFreq=1813, maxDocs=44421)
                0.01875616 = queryNorm
              0.2623861 = fieldWeight in 601, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.198178 = idf(docFreq=1813, maxDocs=44421)
                0.0625 = fieldNorm(doc=601)
          0.012080259 = weight(abstract_txt:that in 601) [ClassicSimilarity], result of:
            0.012080259 = score(doc=601,freq=1.0), product of:
              0.08172915 = queryWeight, product of:
                1.8425267 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.01875616 = queryNorm
              0.14780845 = fieldWeight in 601, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.0625 = fieldNorm(doc=601)
          0.050653238 = weight(abstract_txt:common in 601) [ClassicSimilarity], result of:
            0.050653238 = score(doc=601,freq=1.0), product of:
              0.16867639 = queryWeight, product of:
                1.8717052 = boost
                4.8047733 = idf(docFreq=988, maxDocs=44421)
                0.01875616 = queryNorm
              0.30029833 = fieldWeight in 601, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8047733 = idf(docFreq=988, maxDocs=44421)
                0.0625 = fieldNorm(doc=601)
          0.16633269 = weight(abstract_txt:relatedness in 601) [ClassicSimilarity], result of:
            0.16633269 = score(doc=601,freq=1.0), product of:
              0.32553986 = queryWeight, product of:
                2.123082 = boost
                8.175107 = idf(docFreq=33, maxDocs=44421)
                0.01875616 = queryNorm
              0.5109442 = fieldWeight in 601, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.175107 = idf(docFreq=33, maxDocs=44421)
                0.0625 = fieldNorm(doc=601)
          0.110878974 = weight(abstract_txt:method in 601) [ClassicSimilarity], result of:
            0.110878974 = score(doc=601,freq=4.0), product of:
              0.19717047 = queryWeight, product of:
                2.3366873 = boost
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.01875616 = queryNorm
              0.5623508 = fieldWeight in 601, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.0625 = fieldNorm(doc=601)
        0.28 = coord(7/25)
    
  5. Shibata, N.; Kajikawa, Y.; Sakata, I.: Measuring relatedness between communities in a citation network (2011) 0.13
    0.1298224 = sum of:
      0.1298224 = product of:
        0.649112 = sum of:
          0.04223583 = weight(abstract_txt:related in 484) [ClassicSimilarity], result of:
            0.04223583 = score(doc=484,freq=1.0), product of:
              0.12877458 = queryWeight, product of:
                1.6354053 = boost
                4.198178 = idf(docFreq=1813, maxDocs=44421)
                0.01875616 = queryNorm
              0.32798263 = fieldWeight in 484, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.198178 = idf(docFreq=1813, maxDocs=44421)
                0.078125 = fieldNorm(doc=484)
          0.08954312 = weight(abstract_txt:common in 484) [ClassicSimilarity], result of:
            0.08954312 = score(doc=484,freq=2.0), product of:
              0.16867639 = queryWeight, product of:
                1.8717052 = boost
                4.8047733 = idf(docFreq=988, maxDocs=44421)
                0.01875616 = queryNorm
              0.53085744 = fieldWeight in 484, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.8047733 = idf(docFreq=988, maxDocs=44421)
                0.078125 = fieldNorm(doc=484)
          0.29403746 = weight(abstract_txt:relatedness in 484) [ClassicSimilarity], result of:
            0.29403746 = score(doc=484,freq=2.0), product of:
              0.32553986 = queryWeight, product of:
                2.123082 = boost
                8.175107 = idf(docFreq=33, maxDocs=44421)
                0.01875616 = queryNorm
              0.90323025 = fieldWeight in 484, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.175107 = idf(docFreq=33, maxDocs=44421)
                0.078125 = fieldNorm(doc=484)
          0.069299355 = weight(abstract_txt:method in 484) [ClassicSimilarity], result of:
            0.069299355 = score(doc=484,freq=1.0), product of:
              0.19717047 = queryWeight, product of:
                2.3366873 = boost
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.01875616 = queryNorm
              0.35146925 = fieldWeight in 484, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.078125 = fieldNorm(doc=484)
          0.1539962 = weight(abstract_txt:areas in 484) [ClassicSimilarity], result of:
            0.1539962 = score(doc=484,freq=1.0), product of:
              0.40461478 = queryWeight, product of:
                4.428122 = boost
                4.871674 = idf(docFreq=924, maxDocs=44421)
                0.01875616 = queryNorm
              0.38059953 = fieldWeight in 484, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.871674 = idf(docFreq=924, maxDocs=44421)
                0.078125 = fieldNorm(doc=484)
        0.2 = coord(5/25)