Document (#38507)

Author
Koppel, M.
Schweitzer, N.
Title
Measuring direct and indirect authorial influence in historical corpora
Source
Journal of the Association for Information Science and Technology. 65(2014) no.10, pp. 2138-2144
Year
2014
Abstract
We show how automatically extracted citations in historical corpora can be used to measure the direct and indirect influence of authors on each other. These measures can in turn be used to determine an author's overall prominence in the corpus and to identify distinct schools of thought. We apply our methods to two major historical corpora. Using scholarly consensus as a gold standard, we demonstrate empirically the superiority of indirect influence over direct influence as a basis for various measures of authorial impact.

Similar documents (author)

  1. Koppel, T.P.: Public access catalogs through Internet (1990) 6.01
    6.0137663 = sum of:
      6.0137663 = weight(author_txt:koppel in 4069) [ClassicSimilarity], result of:
        6.0137663 = fieldWeight in 4069, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.622026 = idf(docFreq=7, maxDocs=44421)
          0.625 = fieldNorm(doc=4069)
    
  2. Akiva, N.; Koppel, M.: A generic unsupervised method for decomposing multi-author documents (2013) 4.81
    4.811013 = sum of:
      4.811013 = weight(author_txt:koppel in 2098) [ClassicSimilarity], result of:
        4.811013 = fieldWeight in 2098, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.622026 = idf(docFreq=7, maxDocs=44421)
          0.5 = fieldNorm(doc=2098)
    
  3. Koppel, M.; Winter, Y.: Determining if two documents are written by the same author (2014) 4.81
    4.811013 = sum of:
      4.811013 = weight(author_txt:koppel in 2602) [ClassicSimilarity], result of:
        4.811013 = fieldWeight in 2602, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.622026 = idf(docFreq=7, maxDocs=44421)
          0.5 = fieldNorm(doc=2602)
    
  4. Koppel, M.; Akiva, N.; Dagan, I.: Feature instability as a criterion for selecting potential style markers (2006) 3.61
    3.60826 = sum of:
      3.60826 = weight(author_txt:koppel in 92) [ClassicSimilarity], result of:
        3.60826 = fieldWeight in 92, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.622026 = idf(docFreq=7, maxDocs=44421)
          0.375 = fieldNorm(doc=92)
    
  5. Koppel, M.; Schler, J.; Argamon, S.: Computational methods in authorship attribution (2009) 3.61
    3.60826 = sum of:
      3.60826 = weight(author_txt:koppel in 3683) [ClassicSimilarity], result of:
        3.60826 = fieldWeight in 3683, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.622026 = idf(docFreq=7, maxDocs=44421)
          0.375 = fieldNorm(doc=3683)
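
The author scores above each come from a single matching term, so they can be checked by hand. The following is a minimal sketch, assuming the classic TF-IDF form tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)), which is the form that reproduces the idf value shown in this listing (other Lucene versions compute idf slightly differently). The function names are hypothetical and not part of any library. The fieldNorm values are copied from the listing rather than derived.

```python
import math

# Hypothetical helper names; the formulas are the classic TF-IDF forms
# that reproduce the numbers shown in the explain output above.

def classic_tf(freq: float) -> float:
    """Term-frequency factor: square root of the raw frequency."""
    return math.sqrt(freq)

def classic_idf(doc_freq: int, max_docs: int) -> float:
    """Inverse document frequency: 1 + ln(maxDocs / (docFreq + 1))."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def field_weight(freq: float, doc_freq: int, max_docs: int, field_norm: float) -> float:
    """fieldWeight = tf * idf * fieldNorm, as in the explain tree."""
    return classic_tf(freq) * classic_idf(doc_freq, max_docs) * field_norm

MAX_DOCS = 44421   # maxDocs from the listing
DOC_FREQ = 7       # docFreq of "koppel" in author_txt

# (doc id, fieldNorm) pairs copied from the five author hits above.
hits = [(4069, 0.625), (2098, 0.5), (2602, 0.5), (92, 0.375), (3683, 0.375)]

for doc_id, norm in hits:
    score = field_weight(1.0, DOC_FREQ, MAX_DOCS, norm)
    print(f"doc={doc_id} fieldNorm={norm} score={score:.4f}")
```

Since tf and idf are identical for all five hits, the ranking among them is decided entirely by fieldNorm, which is larger for shorter author fields.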
    

Similar documents (content)

  1. Akter, S.; D'Ambra, J.; Ray, P.: Trustworthiness in mHealth information services : an assessment of a hierarchical model with mediating and moderating effects using partial least squares (PLS) (2011) 0.14
    0.1438486 = sum of:
      0.1438486 = product of:
        0.59936917 = sum of:
          0.03310896 = weight(abstract_txt:overall in 136) [ClassicSimilarity], result of:
            0.03310896 = score(doc=136,freq=1.0), product of:
              0.07740641 = queryWeight, product of:
                1.0057186 = boost
                5.474931 = idf(docFreq=505, maxDocs=44421)
                0.0140579445 = queryNorm
              0.42772895 = fieldWeight in 136, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.474931 = idf(docFreq=505, maxDocs=44421)
                0.078125 = fieldNorm(doc=136)
          0.058205515 = weight(abstract_txt:empirically in 136) [ClassicSimilarity], result of:
            0.058205515 = score(doc=136,freq=1.0), product of:
              0.11275157 = queryWeight, product of:
                1.213806 = boost
                6.6077175 = idf(docFreq=162, maxDocs=44421)
                0.0140579445 = queryNorm
              0.51622796 = fieldWeight in 136, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.6077175 = idf(docFreq=162, maxDocs=44421)
                0.078125 = fieldNorm(doc=136)
          0.015267623 = weight(abstract_txt:used in 136) [ClassicSimilarity], result of:
            0.015267623 = score(doc=136,freq=1.0), product of:
              0.058210876 = queryWeight, product of:
                1.2334032 = boost
                3.3572001 = idf(docFreq=4205, maxDocs=44421)
                0.0140579445 = queryNorm
              0.26228127 = fieldWeight in 136, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3572001 = idf(docFreq=4205, maxDocs=44421)
                0.078125 = fieldNorm(doc=136)
          0.12239203 = weight(abstract_txt:direct in 136) [ClassicSimilarity], result of:
            0.12239203 = score(doc=136,freq=1.0), product of:
              0.26690438 = queryWeight, product of:
                3.234644 = boost
                5.869585 = idf(docFreq=340, maxDocs=44421)
                0.0140579445 = queryNorm
              0.45856133 = fieldWeight in 136, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.869585 = idf(docFreq=340, maxDocs=44421)
                0.078125 = fieldNorm(doc=136)
          0.11538673 = weight(abstract_txt:influence in 136) [ClassicSimilarity], result of:
            0.11538673 = score(doc=136,freq=1.0), product of:
              0.2824471 = queryWeight, product of:
                3.8422582 = boost
                5.229121 = idf(docFreq=646, maxDocs=44421)
                0.0140579445 = queryNorm
              0.4085251 = fieldWeight in 136, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.229121 = idf(docFreq=646, maxDocs=44421)
                0.078125 = fieldNorm(doc=136)
          0.25500834 = weight(abstract_txt:indirect in 136) [ClassicSimilarity], result of:
            0.25500834 = score(doc=136,freq=1.0), product of:
              0.43540147 = queryWeight, product of:
                4.1313653 = boost
                7.496775 = idf(docFreq=66, maxDocs=44421)
                0.0140579445 = queryNorm
              0.58568555 = fieldWeight in 136, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.496775 = idf(docFreq=66, maxDocs=44421)
                0.078125 = fieldNorm(doc=136)
        0.24 = coord(6/25)
    
  2. Cui, H.; Heidorn, P.B.: The reusability of induced knowledge for the automatic semantic markup of taxonomic descriptions (2007) 0.13
    0.13495271 = sum of:
      0.13495271 = product of:
        0.67476356 = sum of:
          0.04705654 = weight(abstract_txt:automatically in 1084) [ClassicSimilarity], result of:
            0.04705654 = score(doc=1084,freq=3.0), product of:
              0.07872744 = queryWeight, product of:
                1.0142641 = boost
                5.521451 = idf(docFreq=482, maxDocs=44421)
                0.0140579445 = queryNorm
              0.5977146 = fieldWeight in 1084, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.521451 = idf(docFreq=482, maxDocs=44421)
                0.0625 = fieldNorm(doc=1084)
          0.036490384 = weight(abstract_txt:corpus in 1084) [ClassicSimilarity], result of:
            0.036490384 = score(doc=1084,freq=1.0), product of:
              0.09583824 = queryWeight, product of:
                1.1190704 = boost
                6.0919957 = idf(docFreq=272, maxDocs=44421)
                0.0140579445 = queryNorm
              0.38074973 = fieldWeight in 1084, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.0919957 = idf(docFreq=272, maxDocs=44421)
                0.0625 = fieldNorm(doc=1084)
          0.012214097 = weight(abstract_txt:used in 1084) [ClassicSimilarity], result of:
            0.012214097 = score(doc=1084,freq=1.0), product of:
              0.058210876 = queryWeight, product of:
                1.2334032 = boost
                3.3572001 = idf(docFreq=4205, maxDocs=44421)
                0.0140579445 = queryNorm
              0.20982501 = fieldWeight in 1084, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3572001 = idf(docFreq=4205, maxDocs=44421)
                0.0625 = fieldNorm(doc=1084)
          0.05153315 = weight(abstract_txt:measures in 1084) [ClassicSimilarity], result of:
            0.05153315 = score(doc=1084,freq=1.0), product of:
              0.1519921 = queryWeight, product of:
                1.9930285 = boost
                5.424824 = idf(docFreq=531, maxDocs=44421)
                0.0140579445 = queryNorm
              0.3390515 = fieldWeight in 1084, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.424824 = idf(docFreq=531, maxDocs=44421)
                0.0625 = fieldNorm(doc=1084)
          0.5274694 = weight(abstract_txt:corpora in 1084) [ClassicSimilarity], result of:
            0.5274694 = score(doc=1084,freq=10.0), product of:
              0.38070783 = queryWeight, product of:
                3.8631763 = boost
                7.01012 = idf(docFreq=108, maxDocs=44421)
                0.0140579445 = queryNorm
              1.3854966 = fieldWeight in 1084, product of:
                3.1622777 = tf(freq=10.0), with freq of:
                  10.0 = termFreq=10.0
                7.01012 = idf(docFreq=108, maxDocs=44421)
                0.0625 = fieldNorm(doc=1084)
        0.2 = coord(5/25)
    
  3. Akiva, N.; Koppel, M.: A generic unsupervised method for decomposing multi-author documents (2013) 0.10
    0.10110116 = sum of:
      0.10110116 = product of:
        0.8425097 = sum of:
          0.054336213 = weight(abstract_txt:automatically in 2098) [ClassicSimilarity], result of:
            0.054336213 = score(doc=2098,freq=1.0), product of:
              0.07872744 = queryWeight, product of:
                1.0142641 = boost
                5.521451 = idf(docFreq=482, maxDocs=44421)
                0.0140579445 = queryNorm
              0.6901814 = fieldWeight in 2098, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.521451 = idf(docFreq=482, maxDocs=44421)
                0.125 = fieldNorm(doc=2098)
          0.07337861 = weight(abstract_txt:distinct in 2098) [ClassicSimilarity], result of:
            0.07337861 = score(doc=2098,freq=1.0), product of:
              0.09618622 = queryWeight, product of:
                1.1211002 = boost
                6.1030455 = idf(docFreq=269, maxDocs=44421)
                0.0140579445 = queryNorm
              0.7628807 = fieldWeight in 2098, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1030455 = idf(docFreq=269, maxDocs=44421)
                0.125 = fieldNorm(doc=2098)
          0.7147949 = weight(abstract_txt:authorial in 2098) [ClassicSimilarity], result of:
            0.7147949 = score(doc=2098,freq=2.0), product of:
              0.43872008 = queryWeight, product of:
                3.3860765 = boost
                9.216561 = idf(docFreq=11, maxDocs=44421)
                0.0140579445 = queryNorm
              1.6292732 = fieldWeight in 2098, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.216561 = idf(docFreq=11, maxDocs=44421)
                0.125 = fieldNorm(doc=2098)
        0.12 = coord(3/25)
    
  4. Herdagdelen, A.; Baroni, M.: Stereotypical gender actions can be extracted from web text (2011) 0.10
    0.100186065 = sum of:
      0.100186065 = product of:
        0.41744193 = sum of:
          0.027168106 = weight(abstract_txt:automatically in 752) [ClassicSimilarity], result of:
            0.027168106 = score(doc=752,freq=1.0), product of:
              0.07872744 = queryWeight, product of:
                1.0142641 = boost
                5.521451 = idf(docFreq=482, maxDocs=44421)
                0.0140579445 = queryNorm
              0.3450907 = fieldWeight in 752, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.521451 = idf(docFreq=482, maxDocs=44421)
                0.0625 = fieldNorm(doc=752)
          0.0632032 = weight(abstract_txt:corpus in 752) [ClassicSimilarity], result of:
            0.0632032 = score(doc=752,freq=3.0), product of:
              0.09583824 = queryWeight, product of:
                1.1190704 = boost
                6.0919957 = idf(docFreq=272, maxDocs=44421)
                0.0140579445 = queryNorm
              0.6594779 = fieldWeight in 752, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.0919957 = idf(docFreq=272, maxDocs=44421)
                0.0625 = fieldNorm(doc=752)
          0.03758647 = weight(abstract_txt:extracted in 752) [ClassicSimilarity], result of:
            0.03758647 = score(doc=752,freq=1.0), product of:
              0.09774793 = queryWeight, product of:
                1.1301649 = boost
                6.1523914 = idf(docFreq=256, maxDocs=44421)
                0.0140579445 = queryNorm
              0.38452446 = fieldWeight in 752, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1523914 = idf(docFreq=256, maxDocs=44421)
                0.0625 = fieldNorm(doc=752)
          0.012214097 = weight(abstract_txt:used in 752) [ClassicSimilarity], result of:
            0.012214097 = score(doc=752,freq=1.0), product of:
              0.058210876 = queryWeight, product of:
                1.2334032 = boost
                3.3572001 = idf(docFreq=4205, maxDocs=44421)
                0.0140579445 = queryNorm
              0.20982501 = fieldWeight in 752, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3572001 = idf(docFreq=4205, maxDocs=44421)
                0.0625 = fieldNorm(doc=752)
          0.11046958 = weight(abstract_txt:gold in 752) [ClassicSimilarity], result of:
            0.11046958 = score(doc=752,freq=2.0), product of:
              0.15918605 = queryWeight, product of:
                1.4422499 = boost
                7.85132 = idf(docFreq=46, maxDocs=44421)
                0.0140579445 = queryNorm
              0.6939652 = fieldWeight in 752, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.85132 = idf(docFreq=46, maxDocs=44421)
                0.0625 = fieldNorm(doc=752)
          0.16680047 = weight(abstract_txt:corpora in 752) [ClassicSimilarity], result of:
            0.16680047 = score(doc=752,freq=1.0), product of:
              0.38070783 = queryWeight, product of:
                3.8631763 = boost
                7.01012 = idf(docFreq=108, maxDocs=44421)
                0.0140579445 = queryNorm
              0.4381325 = fieldWeight in 752, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.01012 = idf(docFreq=108, maxDocs=44421)
                0.0625 = fieldNorm(doc=752)
        0.24 = coord(6/25)
    
  5. Clavier, V.; Paganelli, C.: Including authorial stance in the indexing of scientific documents (2012) 0.09
    0.09454067 = sum of:
      0.09454067 = product of:
        0.78783894 = sum of:
          0.015267623 = weight(abstract_txt:used in 1320) [ClassicSimilarity], result of:
            0.015267623 = score(doc=1320,freq=1.0), product of:
              0.058210876 = queryWeight, product of:
                1.2334032 = boost
                3.3572001 = idf(docFreq=4205, maxDocs=44421)
                0.0140579445 = queryNorm
              0.26228127 = fieldWeight in 1320, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3572001 = idf(docFreq=4205, maxDocs=44421)
                0.078125 = fieldNorm(doc=1320)
          0.06620255 = weight(abstract_txt:author's in 1320) [ClassicSimilarity], result of:
            0.06620255 = score(doc=1320,freq=1.0), product of:
              0.122856 = queryWeight, product of:
                1.267028 = boost
                6.8974466 = idf(docFreq=121, maxDocs=44421)
                0.0140579445 = queryNorm
              0.538863 = fieldWeight in 1320, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.8974466 = idf(docFreq=121, maxDocs=44421)
                0.078125 = fieldNorm(doc=1320)
          0.70636874 = weight(abstract_txt:authorial in 1320) [ClassicSimilarity], result of:
            0.70636874 = score(doc=1320,freq=5.0), product of:
              0.43872008 = queryWeight, product of:
                3.3860765 = boost
                9.216561 = idf(docFreq=11, maxDocs=44421)
                0.0140579445 = queryNorm
              1.610067 = fieldWeight in 1320, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                9.216561 = idf(docFreq=11, maxDocs=44421)
                0.078125 = fieldNorm(doc=1320)
        0.12 = coord(3/25)
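
The content scores above combine several matching terms. Each term contributes queryWeight * fieldWeight, the contributions are summed, and the sum is scaled by coord (matched terms divided by the 25 query terms). As a minimal sketch under the same assumptions about tf and idf as the listing implies, the following recomputes the score of hit 3 (doc 2098). The function and variable names are hypothetical. The boost, queryNorm, and fieldNorm values are copied from the listing, not derived.

```python
import math

MAX_DOCS = 44421            # maxDocs from the listing
QUERY_NORM = 0.0140579445   # queryNorm from the listing
QUERY_TERMS = 25            # denominator of coord(n/25)

def idf(doc_freq: int) -> float:
    """Inverse document frequency: 1 + ln(maxDocs / (docFreq + 1))."""
    return 1.0 + math.log(MAX_DOCS / (doc_freq + 1))

def term_score(boost: float, doc_freq: int, freq: float, field_norm: float) -> float:
    """One term's contribution: queryWeight * fieldWeight."""
    query_weight = boost * idf(doc_freq) * QUERY_NORM
    field_weight = math.sqrt(freq) * idf(doc_freq) * field_norm
    return query_weight * field_weight

# (term, boost, docFreq, freq, fieldNorm) for doc 2098, copied from the listing.
matched_terms = [
    ("automatically", 1.0142641, 482, 1.0, 0.125),
    ("distinct",      1.1211002, 269, 1.0, 0.125),
    ("authorial",     3.3860765,  11, 2.0, 0.125),
]

raw_sum = sum(term_score(b, df, f, n) for _, b, df, f, n in matched_terms)
coord = len(matched_terms) / QUERY_TERMS   # 3/25 = 0.12
final_score = raw_sum * coord

print(f"sum={raw_sum:.6f} coord={coord:.2f} score={final_score:.6f}")
```

The coord factor explains why hit 3 ranks below hits 1 and 2 despite having the largest raw sum: it matches only 3 of the 25 query terms, while hits 1 and 2 match 6 and 5.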