Document (#38507)

Author
Koppel, M.
Schweitzer, N.
Title
Measuring direct and indirect authorial influence in historical corpora
Source
Journal of the Association for Information Science and Technology. 65(2014) no.10, S.2138-2144
Year
2014
Abstract
We show how automatically extracted citations in historical corpora can be used to measure the direct and indirect influence of authors on each other. These measures can in turn be used to determine an author's overall prominence in the corpus and to identify distinct schools of thought. We apply our methods to two major historical corpora. Using scholarly consensus as a gold standard, we demonstrate empirically the superiority of indirect influence over direct influence as a basis for various measures of authorial impact.

Similar documents (author)

  1. Koppel, T.P.: Public access catalogs through Internet (1990) 6.01
    6.010904 = sum of:
      6.010904 = weight(author_txt:koppel in 4070) [ClassicSimilarity], result of:
        6.010904 = fieldWeight in 4070, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.617446 = idf(docFreq=7, maxDocs=44218)
          0.625 = fieldNorm(doc=4070)
    
  2. Akiva, N.; Koppel, M.: ¬A generic unsupervised method for decomposing multi-author documents (2013) 4.81
    4.808723 = sum of:
      4.808723 = weight(author_txt:koppel in 1098) [ClassicSimilarity], result of:
        4.808723 = fieldWeight in 1098, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.617446 = idf(docFreq=7, maxDocs=44218)
          0.5 = fieldNorm(doc=1098)
    
  3. Koppel, M.; Winter, Y.: Determining if two documents are written by the same author (2014) 4.81
    4.808723 = sum of:
      4.808723 = weight(author_txt:koppel in 1602) [ClassicSimilarity], result of:
        4.808723 = fieldWeight in 1602, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.617446 = idf(docFreq=7, maxDocs=44218)
          0.5 = fieldNorm(doc=1602)
    
  4. Koppel, M.; Akiva, N.; Dagan, I.: Feature instability as a criterion for selecting potential style markers (2006) 3.61
    3.606542 = sum of:
      3.606542 = weight(author_txt:koppel in 6092) [ClassicSimilarity], result of:
        3.606542 = fieldWeight in 6092, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.617446 = idf(docFreq=7, maxDocs=44218)
          0.375 = fieldNorm(doc=6092)
    
  5. Koppel, M.; Schler, J.; Argamon, S.: Computational methods in authorship attribution (2009) 3.61
    3.606542 = sum of:
      3.606542 = weight(author_txt:koppel in 2683) [ClassicSimilarity], result of:
        3.606542 = fieldWeight in 2683, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.617446 = idf(docFreq=7, maxDocs=44218)
          0.375 = fieldNorm(doc=2683)
    

Similar documents (content)

  1. Akter, S.; D'Ambra, J.; Ray, P.: Trustworthiness in mHealth information services : an assessment of a hierarchical model with mediating and moderating effects using partial least squares (PLS) (2011) 0.14
    0.14410537 = sum of:
      0.14410537 = product of:
        0.6004391 = sum of:
          0.033120167 = weight(abstract_txt:overall in 4136) [ClassicSimilarity], result of:
            0.033120167 = score(doc=4136,freq=1.0), product of:
              0.07738516 = queryWeight, product of:
                1.0043309 = boost
                5.478287 = idf(docFreq=501, maxDocs=44218)
                0.01406488 = queryNorm
              0.42799118 = fieldWeight in 4136, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.478287 = idf(docFreq=501, maxDocs=44218)
                0.078125 = fieldNorm(doc=4136)
          0.058654625 = weight(abstract_txt:empirically in 4136) [ClassicSimilarity], result of:
            0.058654625 = score(doc=4136,freq=1.0), product of:
              0.11327416 = queryWeight, product of:
                1.2151039 = boost
                6.627983 = idf(docFreq=158, maxDocs=44218)
                0.01406488 = queryNorm
              0.5178112 = fieldWeight in 4136, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.627983 = idf(docFreq=158, maxDocs=44218)
                0.078125 = fieldNorm(doc=4136)
          0.015273343 = weight(abstract_txt:used in 4136) [ClassicSimilarity], result of:
            0.015273343 = score(doc=4136,freq=1.0), product of:
              0.058196303 = queryWeight, product of:
                1.2317163 = boost
                3.3592992 = idf(docFreq=4177, maxDocs=44218)
                0.01406488 = queryNorm
              0.26244524 = fieldWeight in 4136, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3592992 = idf(docFreq=4177, maxDocs=44218)
                0.078125 = fieldNorm(doc=4136)
          0.12266002 = weight(abstract_txt:direct in 4136) [ClassicSimilarity], result of:
            0.12266002 = score(doc=4136,freq=1.0), product of:
              0.2671602 = queryWeight, product of:
                3.2321723 = boost
                5.8768044 = idf(docFreq=336, maxDocs=44218)
                0.01406488 = queryNorm
              0.45912534 = fieldWeight in 4136, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8768044 = idf(docFreq=336, maxDocs=44218)
                0.078125 = fieldNorm(doc=4136)
          0.11657141 = weight(abstract_txt:influence in 4136) [ClassicSimilarity], result of:
            0.11657141 = score(doc=4136,freq=1.0), product of:
              0.28423488 = queryWeight, product of:
                3.8496094 = boost
                5.2495813 = idf(docFreq=630, maxDocs=44218)
                0.01406488 = queryNorm
              0.41012353 = fieldWeight in 4136, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2495813 = idf(docFreq=630, maxDocs=44218)
                0.078125 = fieldNorm(doc=4136)
          0.25415954 = weight(abstract_txt:indirect in 4136) [ClassicSimilarity], result of:
            0.25415954 = score(doc=4136,freq=1.0), product of:
              0.43421754 = queryWeight, product of:
                4.1206174 = boost
                7.4921947 = idf(docFreq=66, maxDocs=44218)
                0.01406488 = queryNorm
              0.5853277 = fieldWeight in 4136, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.4921947 = idf(docFreq=66, maxDocs=44218)
                0.078125 = fieldNorm(doc=4136)
        0.24 = coord(6/25)
    
  2. Cui, H.; Heidorn, P.B.: ¬The reusability of induced knowledge for the automatic semantic markup of taxonomic descriptions (2007) 0.14
    0.13544469 = sum of:
      0.13544469 = product of:
        0.67722344 = sum of:
          0.046869144 = weight(abstract_txt:automatically in 84) [ClassicSimilarity], result of:
            0.046869144 = score(doc=84,freq=3.0), product of:
              0.07847902 = queryWeight, product of:
                1.0114043 = boost
                5.5168705 = idf(docFreq=482, maxDocs=44218)
                0.01406488 = queryNorm
              0.59721875 = fieldWeight in 84, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.5168705 = idf(docFreq=482, maxDocs=44218)
                0.0625 = fieldNorm(doc=84)
          0.03655189 = weight(abstract_txt:corpus in 84) [ClassicSimilarity], result of:
            0.03655189 = score(doc=84,freq=1.0), product of:
              0.09589793 = queryWeight, product of:
                1.1180278 = boost
                6.0984654 = idf(docFreq=269, maxDocs=44218)
                0.01406488 = queryNorm
              0.3811541 = fieldWeight in 84, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.0984654 = idf(docFreq=269, maxDocs=44218)
                0.0625 = fieldNorm(doc=84)
          0.012218675 = weight(abstract_txt:used in 84) [ClassicSimilarity], result of:
            0.012218675 = score(doc=84,freq=1.0), product of:
              0.058196303 = queryWeight, product of:
                1.2317163 = boost
                3.3592992 = idf(docFreq=4177, maxDocs=44218)
                0.01406488 = queryNorm
              0.2099562 = fieldWeight in 84, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3592992 = idf(docFreq=4177, maxDocs=44218)
                0.0625 = fieldNorm(doc=84)
          0.0517573 = weight(abstract_txt:measures in 84) [ClassicSimilarity], result of:
            0.0517573 = score(doc=84,freq=1.0), product of:
              0.1523563 = queryWeight, product of:
                1.9929352 = boost
                5.4353957 = idf(docFreq=523, maxDocs=44218)
                0.01406488 = queryNorm
              0.33971223 = fieldWeight in 84, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4353957 = idf(docFreq=523, maxDocs=44218)
                0.0625 = fieldNorm(doc=84)
          0.52982646 = weight(abstract_txt:corpora in 84) [ClassicSimilarity], result of:
            0.52982646 = score(doc=84,freq=10.0), product of:
              0.38165024 = queryWeight, product of:
                3.8631482 = boost
                7.0240583 = idf(docFreq=106, maxDocs=44218)
                0.01406488 = queryNorm
              1.3882514 = fieldWeight in 84, product of:
                3.1622777 = tf(freq=10.0), with freq of:
                  10.0 = termFreq=10.0
                7.0240583 = idf(docFreq=106, maxDocs=44218)
                0.0625 = fieldNorm(doc=84)
        0.2 = coord(5/25)
    
  3. Akiva, N.; Koppel, M.: ¬A generic unsupervised method for decomposing multi-author documents (2013) 0.10
    0.10086689 = sum of:
      0.10086689 = product of:
        0.84055746 = sum of:
          0.054119825 = weight(abstract_txt:automatically in 1098) [ClassicSimilarity], result of:
            0.054119825 = score(doc=1098,freq=1.0), product of:
              0.07847902 = queryWeight, product of:
                1.0114043 = boost
                5.5168705 = idf(docFreq=482, maxDocs=44218)
                0.01406488 = queryNorm
              0.6896088 = fieldWeight in 1098, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5168705 = idf(docFreq=482, maxDocs=44218)
                0.125 = fieldNorm(doc=1098)
          0.07377803 = weight(abstract_txt:distinct in 1098) [ClassicSimilarity], result of:
            0.07377803 = score(doc=1098,freq=1.0), product of:
              0.09648669 = queryWeight, product of:
                1.1214546 = boost
                6.1171575 = idf(docFreq=264, maxDocs=44218)
                0.01406488 = queryNorm
              0.7646447 = fieldWeight in 1098, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1171575 = idf(docFreq=264, maxDocs=44218)
                0.125 = fieldNorm(doc=1098)
          0.7126596 = weight(abstract_txt:authorial in 1098) [ClassicSimilarity], result of:
            0.7126596 = score(doc=1098,freq=2.0), product of:
              0.437627 = queryWeight, product of:
                3.3776531 = boost
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.01406488 = queryNorm
              1.6284635 = fieldWeight in 1098, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.125 = fieldNorm(doc=1098)
        0.12 = coord(3/25)
    
  4. Herdagdelen, A.; Baroni, M.: Stereotypical gender actions can be extracted from web text (2011) 0.10
    0.10051545 = sum of:
      0.10051545 = product of:
        0.41881436 = sum of:
          0.027059913 = weight(abstract_txt:automatically in 4752) [ClassicSimilarity], result of:
            0.027059913 = score(doc=4752,freq=1.0), product of:
              0.07847902 = queryWeight, product of:
                1.0114043 = boost
                5.5168705 = idf(docFreq=482, maxDocs=44218)
                0.01406488 = queryNorm
              0.3448044 = fieldWeight in 4752, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5168705 = idf(docFreq=482, maxDocs=44218)
                0.0625 = fieldNorm(doc=4752)
          0.06330973 = weight(abstract_txt:corpus in 4752) [ClassicSimilarity], result of:
            0.06330973 = score(doc=4752,freq=3.0), product of:
              0.09589793 = queryWeight, product of:
                1.1180278 = boost
                6.0984654 = idf(docFreq=269, maxDocs=44218)
                0.01406488 = queryNorm
              0.66017824 = fieldWeight in 4752, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.0984654 = idf(docFreq=269, maxDocs=44218)
                0.0625 = fieldNorm(doc=4752)
          0.037661333 = weight(abstract_txt:extracted in 4752) [ClassicSimilarity], result of:
            0.037661333 = score(doc=4752,freq=1.0), product of:
              0.09782874 = queryWeight, product of:
                1.1292269 = boost
                6.159553 = idf(docFreq=253, maxDocs=44218)
                0.01406488 = queryNorm
              0.38497207 = fieldWeight in 4752, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.159553 = idf(docFreq=253, maxDocs=44218)
                0.0625 = fieldNorm(doc=4752)
          0.012218675 = weight(abstract_txt:used in 4752) [ClassicSimilarity], result of:
            0.012218675 = score(doc=4752,freq=1.0), product of:
              0.058196303 = queryWeight, product of:
                1.2317163 = boost
                3.3592992 = idf(docFreq=4177, maxDocs=44218)
                0.01406488 = queryNorm
              0.2099562 = fieldWeight in 4752, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3592992 = idf(docFreq=4177, maxDocs=44218)
                0.0625 = fieldNorm(doc=4752)
          0.11101887 = weight(abstract_txt:gold in 4752) [ClassicSimilarity], result of:
            0.11101887 = score(doc=4752,freq=2.0), product of:
              0.15963344 = queryWeight, product of:
                1.4424804 = boost
                7.8682456 = idf(docFreq=45, maxDocs=44218)
                0.01406488 = queryNorm
              0.6954612 = fieldWeight in 4752, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.8682456 = idf(docFreq=45, maxDocs=44218)
                0.0625 = fieldNorm(doc=4752)
          0.16754584 = weight(abstract_txt:corpora in 4752) [ClassicSimilarity], result of:
            0.16754584 = score(doc=4752,freq=1.0), product of:
              0.38165024 = queryWeight, product of:
                3.8631482 = boost
                7.0240583 = idf(docFreq=106, maxDocs=44218)
                0.01406488 = queryNorm
              0.43900365 = fieldWeight in 4752, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.0240583 = idf(docFreq=106, maxDocs=44218)
                0.0625 = fieldNorm(doc=4752)
        0.24 = coord(6/25)
    
  5. Clavier, V.; Paganelli, C.: Including authorial stance in the indexing of scientific documents (2012) 0.09
    0.09426043 = sum of:
      0.09426043 = product of:
        0.7855036 = sum of:
          0.015273343 = weight(abstract_txt:used in 320) [ClassicSimilarity], result of:
            0.015273343 = score(doc=320,freq=1.0), product of:
              0.058196303 = queryWeight, product of:
                1.2317163 = boost
                3.3592992 = idf(docFreq=4177, maxDocs=44218)
                0.01406488 = queryNorm
              0.26244524 = fieldWeight in 320, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3592992 = idf(docFreq=4177, maxDocs=44218)
                0.078125 = fieldNorm(doc=320)
          0.065971695 = weight(abstract_txt:author's in 320) [ClassicSimilarity], result of:
            0.065971695 = score(doc=320,freq=1.0), product of:
              0.12250893 = queryWeight, product of:
                1.2636647 = boost
                6.892866 = idf(docFreq=121, maxDocs=44218)
                0.01406488 = queryNorm
              0.5385052 = fieldWeight in 320, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.892866 = idf(docFreq=121, maxDocs=44218)
                0.078125 = fieldNorm(doc=320)
          0.7042586 = weight(abstract_txt:authorial in 320) [ClassicSimilarity], result of:
            0.7042586 = score(doc=320,freq=5.0), product of:
              0.437627 = queryWeight, product of:
                3.3776531 = boost
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.01406488 = queryNorm
              1.6092669 = fieldWeight in 320, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.078125 = fieldNorm(doc=320)
        0.12 = coord(3/25)