Document (#41215)

Author
Colavizza, G.
Boyack, K.W.
Eck, N.J. van
Waltman, L.
Title
¬The closer the better : similarity of publication pairs at different cocitation levels
Source
Journal of the Association for Information Science and Technology. 69(2018) no.4, S.600-609
Year
2018
Abstract
We investigated the similarities of pairs of articles that are cocited at the different cocitation levels of the journal, article, section, paragraph, sentence, and bracket. Our results indicate that textual similarity, intellectual overlap (shared references), author overlap (shared authors), proximity in publication time all rise monotonically as the cocitation level gets lower (from journal to bracket). While the main gain in similarity happens when moving from journal to article cocitation, all level changes entail an increase in similarity, especially section to paragraph and paragraph to sentence/bracket levels. We compared the results from four journals over the years 2010-2015: Cell, the European Journal of Operational Research, Physics Letters B, and Research Policy, with consistent general outcomes and some interesting differences. Our findings motivate the use of granular cocitation information as defined by meaningful units of text, with implications for, among others, the elaboration of maps of science and the retrieval of scholarly literature.
Content
Vgl.: https://onlinelibrary.wiley.com/doi/abs/10.1002/asi.23981.
Theme
Informetrie

Similar documents (author)

  1. Boyack; K.W.; Börner, K.: Indicator-assisted evaluation and funding of research : visualizing the influence of grants on the number and citation counts of research papers (2003) 1.65
    1.6518102 = sum of:
      1.6518102 = product of:
        3.3036203 = sum of:
          3.3036203 = weight(author_txt:boyack in 2471) [ClassicSimilarity], result of:
            3.3036203 = score(doc=2471,freq=1.0), product of:
              0.7231683 = queryWeight, product of:
                1.0232549 = boost
                9.1365185 = idf(docFreq=12, maxDocs=44421)
                0.07735259 = queryNorm
              4.5682592 = fieldWeight in 2471, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.1365185 = idf(docFreq=12, maxDocs=44421)
                0.5 = fieldNorm(doc=2471)
        0.5 = coord(1/2)
    
  2. Klavans, R.; Boyack, K.W.: Identifying a better measure of relatedness for mapping science (2006) 1.65
    1.6518102 = sum of:
      1.6518102 = product of:
        3.3036203 = sum of:
          3.3036203 = weight(author_txt:boyack in 252) [ClassicSimilarity], result of:
            3.3036203 = score(doc=252,freq=1.0), product of:
              0.7231683 = queryWeight, product of:
                1.0232549 = boost
                9.1365185 = idf(docFreq=12, maxDocs=44421)
                0.07735259 = queryNorm
              4.5682592 = fieldWeight in 252, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.1365185 = idf(docFreq=12, maxDocs=44421)
                0.5 = fieldNorm(doc=252)
        0.5 = coord(1/2)
    
  3. Klavans, R.; Boyack, K.W.: Toward a consensus map of science (2009) 1.65
    1.6518102 = sum of:
      1.6518102 = product of:
        3.3036203 = sum of:
          3.3036203 = weight(author_txt:boyack in 3736) [ClassicSimilarity], result of:
            3.3036203 = score(doc=3736,freq=1.0), product of:
              0.7231683 = queryWeight, product of:
                1.0232549 = boost
                9.1365185 = idf(docFreq=12, maxDocs=44421)
                0.07735259 = queryNorm
              4.5682592 = fieldWeight in 3736, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.1365185 = idf(docFreq=12, maxDocs=44421)
                0.5 = fieldNorm(doc=3736)
        0.5 = coord(1/2)
    
  4. Boyack, K.W.; Klavans, R.: Co-citation analysis, bibliographic coupling, and direct citation : which citation approach represents the research front most accurately? (2010) 1.65
    1.6518102 = sum of:
      1.6518102 = product of:
        3.3036203 = sum of:
          3.3036203 = weight(author_txt:boyack in 111) [ClassicSimilarity], result of:
            3.3036203 = score(doc=111,freq=1.0), product of:
              0.7231683 = queryWeight, product of:
                1.0232549 = boost
                9.1365185 = idf(docFreq=12, maxDocs=44421)
                0.07735259 = queryNorm
              4.5682592 = fieldWeight in 111, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.1365185 = idf(docFreq=12, maxDocs=44421)
                0.5 = fieldNorm(doc=111)
        0.5 = coord(1/2)
    
  5. Klavans, R.; Boyack, K.W.: Using global mapping to create more accurate document-level maps of research fields (2011) 1.65
    1.6518102 = sum of:
      1.6518102 = product of:
        3.3036203 = sum of:
          3.3036203 = weight(author_txt:boyack in 956) [ClassicSimilarity], result of:
            3.3036203 = score(doc=956,freq=1.0), product of:
              0.7231683 = queryWeight, product of:
                1.0232549 = boost
                9.1365185 = idf(docFreq=12, maxDocs=44421)
                0.07735259 = queryNorm
              4.5682592 = fieldWeight in 956, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.1365185 = idf(docFreq=12, maxDocs=44421)
                0.5 = fieldNorm(doc=956)
        0.5 = coord(1/2)
    

Similar documents (content)

  1. Wang, F.; Wolfram, D.: Assessment of journal similarity based on citing discipline analysis (2015) 0.16
    0.15673254 = sum of:
      0.15673254 = product of:
        0.7836627 = sum of:
          0.011646813 = weight(abstract_txt:different in 2849) [ClassicSimilarity], result of:
            0.011646813 = score(doc=2849,freq=1.0), product of:
              0.050918747 = queryWeight, product of:
                3.6597328 = idf(docFreq=3107, maxDocs=44421)
                0.013913241 = queryNorm
              0.2287333 = fieldWeight in 2849, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6597328 = idf(docFreq=3107, maxDocs=44421)
                0.0625 = fieldNorm(doc=2849)
          0.00748848 = weight(abstract_txt:from in 2849) [ClassicSimilarity], result of:
            0.00748848 = score(doc=2849,freq=1.0), product of:
              0.043420933 = queryWeight, product of:
                1.1309837 = boost
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.013913241 = queryNorm
              0.17246243 = fieldWeight in 2849, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.0625 = fieldNorm(doc=2849)
          0.11198697 = weight(abstract_txt:journal in 2849) [ClassicSimilarity], result of:
            0.11198697 = score(doc=2849,freq=3.0), product of:
              0.20113496 = queryWeight, product of:
                2.8107352 = boost
                5.14327 = idf(docFreq=704, maxDocs=44421)
                0.013913241 = queryNorm
              0.5567753 = fieldWeight in 2849, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.14327 = idf(docFreq=704, maxDocs=44421)
                0.0625 = fieldNorm(doc=2849)
          0.18718569 = weight(abstract_txt:similarity in 2849) [ClassicSimilarity], result of:
            0.18718569 = score(doc=2849,freq=4.0), product of:
              0.25738195 = queryWeight, product of:
                3.1795464 = boost
                5.8181453 = idf(docFreq=358, maxDocs=44421)
                0.013913241 = queryNorm
              0.72726816 = fieldWeight in 2849, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.8181453 = idf(docFreq=358, maxDocs=44421)
                0.0625 = fieldNorm(doc=2849)
          0.46535474 = weight(abstract_txt:cocitation in 2849) [ClassicSimilarity], result of:
            0.46535474 = score(doc=2849,freq=3.0), product of:
              0.560018 = queryWeight, product of:
                5.2436314 = boost
                7.676116 = idf(docFreq=55, maxDocs=44421)
                0.013913241 = queryNorm
              0.8309639 = fieldWeight in 2849, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.676116 = idf(docFreq=55, maxDocs=44421)
                0.0625 = fieldNorm(doc=2849)
        0.2 = coord(5/25)
    
  2. White, H.D.: Author cocitation analysis and pearson's r (2003) 0.14
    0.14273755 = sum of:
      0.14273755 = product of:
        0.7136878 = sum of:
          0.011402365 = weight(abstract_txt:article in 3119) [ClassicSimilarity], result of:
            0.011402365 = score(doc=3119,freq=1.0), product of:
              0.05487791 = queryWeight, product of:
                1.0381496 = boost
                3.79935 = idf(docFreq=2702, maxDocs=44421)
                0.013913241 = queryNorm
              0.20777695 = fieldWeight in 3119, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.79935 = idf(docFreq=2702, maxDocs=44421)
                0.0546875 = fieldNorm(doc=3119)
          0.0065524196 = weight(abstract_txt:from in 3119) [ClassicSimilarity], result of:
            0.0065524196 = score(doc=3119,freq=1.0), product of:
              0.043420933 = queryWeight, product of:
                1.1309837 = boost
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.013913241 = queryNorm
              0.15090463 = fieldWeight in 3119, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.0546875 = fieldNorm(doc=3119)
          0.08371171 = weight(abstract_txt:cocited in 3119) [ClassicSimilarity], result of:
            0.08371171 = score(doc=3119,freq=1.0), product of:
              0.16453126 = queryWeight, product of:
                1.2710726 = boost
                9.303573 = idf(docFreq=10, maxDocs=44421)
                0.013913241 = queryNorm
              0.5087891 = fieldWeight in 3119, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.303573 = idf(docFreq=10, maxDocs=44421)
                0.0546875 = fieldNorm(doc=3119)
          0.14184411 = weight(abstract_txt:similarity in 3119) [ClassicSimilarity], result of:
            0.14184411 = score(doc=3119,freq=3.0), product of:
              0.25738195 = queryWeight, product of:
                3.1795464 = boost
                5.8181453 = idf(docFreq=358, maxDocs=44421)
                0.013913241 = queryNorm
              0.5511036 = fieldWeight in 3119, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.8181453 = idf(docFreq=358, maxDocs=44421)
                0.0546875 = fieldNorm(doc=3119)
          0.4701772 = weight(abstract_txt:cocitation in 3119) [ClassicSimilarity], result of:
            0.4701772 = score(doc=3119,freq=4.0), product of:
              0.560018 = queryWeight, product of:
                5.2436314 = boost
                7.676116 = idf(docFreq=55, maxDocs=44421)
                0.013913241 = queryNorm
              0.8395752 = fieldWeight in 3119, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.676116 = idf(docFreq=55, maxDocs=44421)
                0.0546875 = fieldNorm(doc=3119)
        0.2 = coord(5/25)
    
  3. White, H.D.: Pathfinder networks and author cocitation analysis : a remapping of paradigmatic information scientists (2003) 0.13
    0.13287753 = sum of:
      0.13287753 = product of:
        0.66438764 = sum of:
          0.018429004 = weight(abstract_txt:article in 2459) [ClassicSimilarity], result of:
            0.018429004 = score(doc=2459,freq=2.0), product of:
              0.05487791 = queryWeight, product of:
                1.0381496 = boost
                3.79935 = idf(docFreq=2702, maxDocs=44421)
                0.013913241 = queryNorm
              0.33581826 = fieldWeight in 2459, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.79935 = idf(docFreq=2702, maxDocs=44421)
                0.0625 = fieldNorm(doc=2459)
          0.010590309 = weight(abstract_txt:from in 2459) [ClassicSimilarity], result of:
            0.010590309 = score(doc=2459,freq=2.0), product of:
              0.043420933 = queryWeight, product of:
                1.1309837 = boost
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.013913241 = queryNorm
              0.2438987 = fieldWeight in 2459, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.0625 = fieldNorm(doc=2459)
          0.095670536 = weight(abstract_txt:cocited in 2459) [ClassicSimilarity], result of:
            0.095670536 = score(doc=2459,freq=1.0), product of:
              0.16453126 = queryWeight, product of:
                1.2710726 = boost
                9.303573 = idf(docFreq=10, maxDocs=44421)
                0.013913241 = queryNorm
              0.5814733 = fieldWeight in 2459, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.303573 = idf(docFreq=10, maxDocs=44421)
                0.0625 = fieldNorm(doc=2459)
          0.074343055 = weight(abstract_txt:pairs in 2459) [ClassicSimilarity], result of:
            0.074343055 = score(doc=2459,freq=1.0), product of:
              0.1752131 = queryWeight, product of:
                1.8550023 = boost
                6.7888126 = idf(docFreq=135, maxDocs=44421)
                0.013913241 = queryNorm
              0.4243008 = fieldWeight in 2459, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.7888126 = idf(docFreq=135, maxDocs=44421)
                0.0625 = fieldNorm(doc=2459)
          0.46535474 = weight(abstract_txt:cocitation in 2459) [ClassicSimilarity], result of:
            0.46535474 = score(doc=2459,freq=3.0), product of:
              0.560018 = queryWeight, product of:
                5.2436314 = boost
                7.676116 = idf(docFreq=55, maxDocs=44421)
                0.013913241 = queryNorm
              0.8309639 = fieldWeight in 2459, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.676116 = idf(docFreq=55, maxDocs=44421)
                0.0625 = fieldNorm(doc=2459)
        0.2 = coord(5/25)
    
  4. Pirkola, A.; Jarvelin, K.: ¬The effect of anaphor and ellipsis resolution on proximity searching in a text database (1995) 0.12
    0.1225873 = sum of:
      0.1225873 = product of:
        0.6129365 = sum of:
          0.011646813 = weight(abstract_txt:different in 4156) [ClassicSimilarity], result of:
            0.011646813 = score(doc=4156,freq=1.0), product of:
              0.050918747 = queryWeight, product of:
                3.6597328 = idf(docFreq=3107, maxDocs=44421)
                0.013913241 = queryNorm
              0.2287333 = fieldWeight in 4156, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6597328 = idf(docFreq=3107, maxDocs=44421)
                0.0625 = fieldNorm(doc=4156)
          0.013031274 = weight(abstract_txt:article in 4156) [ClassicSimilarity], result of:
            0.013031274 = score(doc=4156,freq=1.0), product of:
              0.05487791 = queryWeight, product of:
                1.0381496 = boost
                3.79935 = idf(docFreq=2702, maxDocs=44421)
                0.013913241 = queryNorm
              0.23745938 = fieldWeight in 4156, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.79935 = idf(docFreq=2702, maxDocs=44421)
                0.0625 = fieldNorm(doc=4156)
          0.10513696 = weight(abstract_txt:pairs in 4156) [ClassicSimilarity], result of:
            0.10513696 = score(doc=4156,freq=2.0), product of:
              0.1752131 = queryWeight, product of:
                1.8550023 = boost
                6.7888126 = idf(docFreq=135, maxDocs=44421)
                0.013913241 = queryNorm
              0.60005194 = fieldWeight in 4156, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.7888126 = idf(docFreq=135, maxDocs=44421)
                0.0625 = fieldNorm(doc=4156)
          0.10797884 = weight(abstract_txt:sentence in 4156) [ClassicSimilarity], result of:
            0.10797884 = score(doc=4156,freq=2.0), product of:
              0.17835642 = queryWeight, product of:
                1.8715676 = boost
                6.849437 = idf(docFreq=127, maxDocs=44421)
                0.013913241 = queryNorm
              0.60541046 = fieldWeight in 4156, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.849437 = idf(docFreq=127, maxDocs=44421)
                0.0625 = fieldNorm(doc=4156)
          0.37514257 = weight(abstract_txt:paragraph in 4156) [ClassicSimilarity], result of:
            0.37514257 = score(doc=4156,freq=2.0), product of:
              0.4683361 = queryWeight, product of:
                3.7143738 = boost
                9.06241 = idf(docFreq=13, maxDocs=44421)
                0.013913241 = queryNorm
              0.80101144 = fieldWeight in 4156, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.06241 = idf(docFreq=13, maxDocs=44421)
                0.0625 = fieldNorm(doc=4156)
        0.2 = coord(5/25)
    
  5. Tang, X.; Yang, C.C.; Song, M.: Understanding the evolution of multiple scientific research domains using a content and network approach (2013) 0.12
    0.122404486 = sum of:
      0.122404486 = product of:
        0.5100187 = sum of:
          0.011646813 = weight(abstract_txt:different in 1744) [ClassicSimilarity], result of:
            0.011646813 = score(doc=1744,freq=1.0), product of:
              0.050918747 = queryWeight, product of:
                3.6597328 = idf(docFreq=3107, maxDocs=44421)
                0.013913241 = queryNorm
              0.2287333 = fieldWeight in 1744, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6597328 = idf(docFreq=3107, maxDocs=44421)
                0.0625 = fieldNorm(doc=1744)
          0.047071885 = weight(abstract_txt:closer in 1744) [ClassicSimilarity], result of:
            0.047071885 = score(doc=1744,freq=1.0), product of:
              0.10254253 = queryWeight, product of:
                1.0034556 = boost
                7.344759 = idf(docFreq=77, maxDocs=44421)
                0.013913241 = queryNorm
              0.45904744 = fieldWeight in 1744, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.344759 = idf(docFreq=77, maxDocs=44421)
                0.0625 = fieldNorm(doc=1744)
          0.013031274 = weight(abstract_txt:article in 1744) [ClassicSimilarity], result of:
            0.013031274 = score(doc=1744,freq=1.0), product of:
              0.05487791 = queryWeight, product of:
                1.0381496 = boost
                3.79935 = idf(docFreq=2702, maxDocs=44421)
                0.013913241 = queryNorm
              0.23745938 = fieldWeight in 1744, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.79935 = idf(docFreq=2702, maxDocs=44421)
                0.0625 = fieldNorm(doc=1744)
          0.00748848 = weight(abstract_txt:from in 1744) [ClassicSimilarity], result of:
            0.00748848 = score(doc=1744,freq=1.0), product of:
              0.043420933 = queryWeight, product of:
                1.1309837 = boost
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.013913241 = queryNorm
              0.17246243 = fieldWeight in 1744, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.0625 = fieldNorm(doc=1744)
          0.16210756 = weight(abstract_txt:similarity in 1744) [ClassicSimilarity], result of:
            0.16210756 = score(doc=1744,freq=3.0), product of:
              0.25738195 = queryWeight, product of:
                3.1795464 = boost
                5.8181453 = idf(docFreq=358, maxDocs=44421)
                0.013913241 = queryNorm
              0.6298327 = fieldWeight in 1744, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.8181453 = idf(docFreq=358, maxDocs=44421)
                0.0625 = fieldNorm(doc=1744)
          0.2686727 = weight(abstract_txt:cocitation in 1744) [ClassicSimilarity], result of:
            0.2686727 = score(doc=1744,freq=1.0), product of:
              0.560018 = queryWeight, product of:
                5.2436314 = boost
                7.676116 = idf(docFreq=55, maxDocs=44421)
                0.013913241 = queryNorm
              0.47975725 = fieldWeight in 1744, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.676116 = idf(docFreq=55, maxDocs=44421)
                0.0625 = fieldNorm(doc=1744)
        0.24 = coord(6/25)