Document (#43848)

Author
Corbara, S.
Moreo, A.
Sebastiani, F.
Title
Syllabic quantity patterns as rhythmic features for Latin authorship attribution
Source
Journal of the Association for Information Science and Technology. 74(2023) no.1, S.128-141
Year
2023
Abstract
It is well known that, within the Latin production of written text, peculiar metric schemes were followed not only in poetic compositions, but also in many prose works. Such metric patterns were based on so-called syllabic quantity, that is, on the length of the involved syllables, and there is substantial evidence suggesting that certain authors had a preference for certain metric patterns over others. In this research we investigate the possibility of employing syllabic quantity as a basis for deriving rhythmic features for the task of computational authorship attribution of Latin prose texts. We test the impact of these features on the authorship attribution task when combined with other topic-agnostic features. Our experiments, carried out on three different datasets using support vector machines (SVMs), show that rhythmic features based on syllabic quantity are beneficial in discriminating among Latin prose authors.
Content
Cf.: https://asistdl.onlinelibrary.wiley.com/doi/10.1002/asi.24660. https://doi.org/10.1002/asi.24660.
Theme
Computational linguistics
Descriptive cataloguing

Similar documents (author)

  1. Sebastiani, F.: On the role of logic in information retrieval (1998) 5.94
    5.9401517 = sum of:
      5.9401517 = weight(author_txt:sebastiani in 2140) [ClassicSimilarity], result of:
        5.9401517 = fieldWeight in 2140, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.504243 = idf(docFreq=8, maxDocs=44421)
          0.625 = fieldNorm(doc=2140)
    
  2. Sebastiani, F.: Machine learning in automated text categorization (2002) 5.94
    5.9401517 = sum of:
      5.9401517 = weight(author_txt:sebastiani in 4389) [ClassicSimilarity], result of:
        5.9401517 = fieldWeight in 4389, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.504243 = idf(docFreq=8, maxDocs=44421)
          0.625 = fieldNorm(doc=4389)
    
  3. Sebastiani, F.: ¬A tutorial on automated text categorisation (1999) 5.94
    5.9401517 = sum of:
      5.9401517 = weight(author_txt:sebastiani in 4390) [ClassicSimilarity], result of:
        5.9401517 = fieldWeight in 4390, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.504243 = idf(docFreq=8, maxDocs=44421)
          0.625 = fieldNorm(doc=4390)
    
  4. Sebastiani, F.: Classification of text, automatic (2006) 5.94
    5.9401517 = sum of:
      5.9401517 = weight(author_txt:sebastiani in 3) [ClassicSimilarity], result of:
        5.9401517 = fieldWeight in 3, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.504243 = idf(docFreq=8, maxDocs=44421)
          0.625 = fieldNorm(doc=3)
    
  5. Debole, F.; Sebastiani, F.: ¬An analysis of the relative hardness of Reuters-21578 subsets (2005) 4.75
    4.7521214 = sum of:
      4.7521214 = weight(author_txt:sebastiani in 4456) [ClassicSimilarity], result of:
        4.7521214 = fieldWeight in 4456, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.504243 = idf(docFreq=8, maxDocs=44421)
          0.5 = fieldNorm(doc=4456)
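
The author-match scores above can be reproduced by hand. The following is a minimal sketch, assuming the standard Lucene ClassicSimilarity formulas (tf = sqrt(freq), idf = 1 + ln(maxDocs / (docFreq + 1))); the function names are illustrative and not part of the catalog software. The fieldNorm values are copied from the listing rather than recomputed, since they depend on how field lengths were encoded at index time.

```python
import math


def classic_idf(doc_freq: int, max_docs: int) -> float:
    """Inverse document frequency as ClassicSimilarity is assumed to define it."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def classic_tf(freq: float) -> float:
    """Term frequency factor: square root of the raw frequency."""
    return math.sqrt(freq)


def field_weight(freq: float, doc_freq: int, max_docs: int, field_norm: float) -> float:
    """fieldWeight = tf * idf * fieldNorm, as in the explanation trees above."""
    return classic_tf(freq) * classic_idf(doc_freq, max_docs) * field_norm


# Values taken from the listing: author_txt:sebastiani, docFreq=8, maxDocs=44421.
single_author = field_weight(1.0, 8, 44421, 0.625)  # entries 1 to 4
two_authors = field_weight(1.0, 8, 44421, 0.5)      # entry 5 (Debole; Sebastiani)

print(round(single_author, 2), round(two_authors, 2))  # 5.94 4.75
```

The only difference between the first four hits and the fifth is the fieldNorm (0.625 versus 0.5): the fifth record has two authors, so its author field is longer and the match is weighted down.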
    

Similar documents (content)

  1. Stover, J.A.; Winter, Y.; Koppel, M.; Kestemont, M.: Computational authorship verification method attributes a new work to a major 2nd century African author (2016) 0.14
    0.14457709 = sum of:
      0.14457709 = product of:
        0.60240453 = sum of:
          0.038333222 = weight(abstract_txt:authors in 3503) [ClassicSimilarity], result of:
            0.038333222 = score(doc=3503,freq=3.0), product of:
              0.07622925 = queryWeight, product of:
                1.4111979 = boost
                4.6452923 = idf(docFreq=1159, maxDocs=44421)
                0.01162842 = queryNorm
              0.50286764 = fieldWeight in 3503, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.6452923 = idf(docFreq=1159, maxDocs=44421)
                0.0625 = fieldNorm(doc=3503)
          0.0058406787 = weight(abstract_txt:that in 3503) [ClassicSimilarity], result of:
            0.0058406787 = score(doc=3503,freq=1.0), product of:
              0.03951519 = queryWeight, product of:
                1.436892 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.01162842 = queryNorm
              0.14780845 = fieldWeight in 3503, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.0625 = fieldNorm(doc=3503)
          0.026067235 = weight(abstract_txt:task in 3503) [ClassicSimilarity], result of:
            0.026067235 = score(doc=3503,freq=1.0), product of:
              0.08501753 = queryWeight, product of:
                1.4903262 = boost
                4.9057617 = idf(docFreq=893, maxDocs=44421)
                0.01162842 = queryNorm
              0.3066101 = fieldWeight in 3503, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.9057617 = idf(docFreq=893, maxDocs=44421)
                0.0625 = fieldNorm(doc=3503)
          0.15887026 = weight(abstract_txt:authorship in 3503) [ClassicSimilarity], result of:
            0.15887026 = score(doc=3503,freq=2.0), product of:
              0.25772747 = queryWeight, product of:
                3.177994 = boost
                6.9740796 = idf(docFreq=112, maxDocs=44421)
                0.01162842 = queryNorm
              0.61642736 = fieldWeight in 3503, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.9740796 = idf(docFreq=112, maxDocs=44421)
                0.0625 = fieldNorm(doc=3503)
          0.16296378 = weight(abstract_txt:attribution in 3503) [ClassicSimilarity], result of:
            0.16296378 = score(doc=3503,freq=1.0), product of:
              0.3302704 = queryWeight, product of:
                3.5975559 = boost
                7.894805 = idf(docFreq=44, maxDocs=44421)
                0.01162842 = queryNorm
              0.4934253 = fieldWeight in 3503, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.894805 = idf(docFreq=44, maxDocs=44421)
                0.0625 = fieldNorm(doc=3503)
          0.21032934 = weight(abstract_txt:latin in 3503) [ClassicSimilarity], result of:
            0.21032934 = score(doc=3503,freq=1.0), product of:
              0.43091184 = queryWeight, product of:
                4.745001 = boost
                7.809647 = idf(docFreq=48, maxDocs=44421)
                0.01162842 = queryNorm
              0.48810294 = fieldWeight in 3503, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.809647 = idf(docFreq=48, maxDocs=44421)
                0.0625 = fieldNorm(doc=3503)
        0.24 = coord(6/25)
    
  2. Yuan, Q.; Xu, S.; Jian, L.: ¬A new method for retrieving batik shape patterns (2018) 0.12
    0.11874089 = sum of:
      0.11874089 = product of:
        0.49475372 = sum of:
          0.013477322 = weight(abstract_txt:were in 186) [ClassicSimilarity], result of:
            0.013477322 = score(doc=186,freq=2.0), product of:
              0.047515165 = queryWeight, product of:
                1.1141489 = boost
                3.6674848 = idf(docFreq=3083, maxDocs=44421)
                0.01162842 = queryNorm
              0.28364253 = fieldWeight in 186, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.6674848 = idf(docFreq=3083, maxDocs=44421)
                0.0546875 = fieldNorm(doc=186)
          0.07367089 = weight(abstract_txt:compositions in 186) [ClassicSimilarity], result of:
            0.07367089 = score(doc=186,freq=1.0), product of:
              0.147444 = queryWeight, product of:
                1.387796 = boost
                9.1365185 = idf(docFreq=12, maxDocs=44421)
                0.01162842 = queryNorm
              0.49965334 = fieldWeight in 186, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.1365185 = idf(docFreq=12, maxDocs=44421)
                0.0546875 = fieldNorm(doc=186)
          0.010221188 = weight(abstract_txt:that in 186) [ClassicSimilarity], result of:
            0.010221188 = score(doc=186,freq=4.0), product of:
              0.03951519 = queryWeight, product of:
                1.436892 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.01162842 = queryNorm
              0.2586648 = fieldWeight in 186, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.0546875 = fieldNorm(doc=186)
          0.112122215 = weight(abstract_txt:patterns in 186) [ClassicSimilarity], result of:
            0.112122215 = score(doc=186,freq=7.0), product of:
              0.14708397 = queryWeight, product of:
                2.4007967 = boost
                5.2685275 = idf(docFreq=621, maxDocs=44421)
                0.01162842 = queryNorm
              0.7623007 = fieldWeight in 186, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                5.2685275 = idf(docFreq=621, maxDocs=44421)
                0.0546875 = fieldNorm(doc=186)
          0.19483478 = weight(abstract_txt:metric in 186) [ClassicSimilarity], result of:
            0.19483478 = score(doc=186,freq=3.0), product of:
              0.28197297 = queryWeight, product of:
                3.3241181 = boost
                7.2947483 = idf(docFreq=81, maxDocs=44421)
                0.01162842 = queryNorm
              0.6909697 = fieldWeight in 186, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.2947483 = idf(docFreq=81, maxDocs=44421)
                0.0546875 = fieldNorm(doc=186)
          0.090427324 = weight(abstract_txt:features in 186) [ClassicSimilarity], result of:
            0.090427324 = score(doc=186,freq=4.0), product of:
              0.1820817 = queryWeight, product of:
                3.4485002 = boost
                4.5406218 = idf(docFreq=1287, maxDocs=44421)
                0.01162842 = queryNorm
              0.4966305 = fieldWeight in 186, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.5406218 = idf(docFreq=1287, maxDocs=44421)
                0.0546875 = fieldNorm(doc=186)
        0.24 = coord(6/25)
    
  3. Zheng, R.; Li, J.; Chen, H.; Huang, Z.: ¬A framework for authorship identification of online messages : writing-style features and classification techniques (2006) 0.11
    0.114295095 = sum of:
      0.114295095 = product of:
        0.47622958 = sum of:
          0.054647114 = weight(abstract_txt:machines in 276) [ClassicSimilarity], result of:
            0.054647114 = score(doc=276,freq=2.0), product of:
              0.08772769 = queryWeight, product of:
                1.0704846 = boost
                7.0475073 = idf(docFreq=104, maxDocs=44421)
                0.01162842 = queryNorm
              0.62291753 = fieldWeight in 276, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.0475073 = idf(docFreq=104, maxDocs=44421)
                0.0625 = fieldNorm(doc=276)
          0.12222706 = weight(abstract_txt:discriminating in 276) [ClassicSimilarity], result of:
            0.12222706 = score(doc=276,freq=2.0), product of:
              0.15003875 = queryWeight, product of:
                1.3999542 = boost
                9.216561 = idf(docFreq=11, maxDocs=44421)
                0.01162842 = queryNorm
              0.8146366 = fieldWeight in 276, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.216561 = idf(docFreq=11, maxDocs=44421)
                0.0625 = fieldNorm(doc=276)
          0.031298947 = weight(abstract_txt:authors in 276) [ClassicSimilarity], result of:
            0.031298947 = score(doc=276,freq=2.0), product of:
              0.07622925 = queryWeight, product of:
                1.4111979 = boost
                4.6452923 = idf(docFreq=1159, maxDocs=44421)
                0.01162842 = queryNorm
              0.4105897 = fieldWeight in 276, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.6452923 = idf(docFreq=1159, maxDocs=44421)
                0.0625 = fieldNorm(doc=276)
          0.0058406787 = weight(abstract_txt:that in 276) [ClassicSimilarity], result of:
            0.0058406787 = score(doc=276,freq=1.0), product of:
              0.03951519 = queryWeight, product of:
                1.436892 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.01162842 = queryNorm
              0.14780845 = fieldWeight in 276, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.0625 = fieldNorm(doc=276)
          0.15887026 = weight(abstract_txt:authorship in 276) [ClassicSimilarity], result of:
            0.15887026 = score(doc=276,freq=2.0), product of:
              0.25772747 = queryWeight, product of:
                3.177994 = boost
                6.9740796 = idf(docFreq=112, maxDocs=44421)
                0.01162842 = queryNorm
              0.61642736 = fieldWeight in 276, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.9740796 = idf(docFreq=112, maxDocs=44421)
                0.0625 = fieldNorm(doc=276)
          0.10334551 = weight(abstract_txt:features in 276) [ClassicSimilarity], result of:
            0.10334551 = score(doc=276,freq=4.0), product of:
              0.1820817 = queryWeight, product of:
                3.4485002 = boost
                4.5406218 = idf(docFreq=1287, maxDocs=44421)
                0.01162842 = queryNorm
              0.5675777 = fieldWeight in 276, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.5406218 = idf(docFreq=1287, maxDocs=44421)
                0.0625 = fieldNorm(doc=276)
        0.24 = coord(6/25)
    
  4. Stamatatos, E.: Masking topic-related information to enhance authorship attribution (2018) 0.10
    0.101627044 = sum of:
      0.101627044 = product of:
        0.635169 = sum of:
          0.031298947 = weight(abstract_txt:authors in 124) [ClassicSimilarity], result of:
            0.031298947 = score(doc=124,freq=2.0), product of:
              0.07622925 = queryWeight, product of:
                1.4111979 = boost
                4.6452923 = idf(docFreq=1159, maxDocs=44421)
                0.01162842 = queryNorm
              0.4105897 = fieldWeight in 124, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.6452923 = idf(docFreq=1159, maxDocs=44421)
                0.0625 = fieldNorm(doc=124)
          0.010116352 = weight(abstract_txt:that in 124) [ClassicSimilarity], result of:
            0.010116352 = score(doc=124,freq=3.0), product of:
              0.03951519 = queryWeight, product of:
                1.436892 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.01162842 = queryNorm
              0.25601172 = fieldWeight in 124, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.0625 = fieldNorm(doc=124)
          0.19457555 = weight(abstract_txt:authorship in 124) [ClassicSimilarity], result of:
            0.19457555 = score(doc=124,freq=3.0), product of:
              0.25772747 = queryWeight, product of:
                3.177994 = boost
                6.9740796 = idf(docFreq=112, maxDocs=44421)
                0.01162842 = queryNorm
              0.75496626 = fieldWeight in 124, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.9740796 = idf(docFreq=112, maxDocs=44421)
                0.0625 = fieldNorm(doc=124)
          0.39917815 = weight(abstract_txt:attribution in 124) [ClassicSimilarity], result of:
            0.39917815 = score(doc=124,freq=6.0), product of:
              0.3302704 = queryWeight, product of:
                3.5975559 = boost
                7.894805 = idf(docFreq=44, maxDocs=44421)
                0.01162842 = queryNorm
              1.2086403 = fieldWeight in 124, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                7.894805 = idf(docFreq=44, maxDocs=44421)
                0.0625 = fieldNorm(doc=124)
        0.16 = coord(4/25)
    
  5. Potha, N.; Stamatatos, E.: Improving author verification based on topic modeling (2019) 0.10
    0.09924877 = sum of:
      0.09924877 = product of:
        0.41353655 = sum of:
          0.022131696 = weight(abstract_txt:authors in 385) [ClassicSimilarity], result of:
            0.022131696 = score(doc=385,freq=1.0), product of:
              0.07622925 = queryWeight, product of:
                1.4111979 = boost
                4.6452923 = idf(docFreq=1159, maxDocs=44421)
                0.01162842 = queryNorm
              0.29033077 = fieldWeight in 385, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6452923 = idf(docFreq=1159, maxDocs=44421)
                0.0625 = fieldNorm(doc=385)
          0.011681357 = weight(abstract_txt:that in 385) [ClassicSimilarity], result of:
            0.011681357 = score(doc=385,freq=4.0), product of:
              0.03951519 = queryWeight, product of:
                1.436892 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.01162842 = queryNorm
              0.2956169 = fieldWeight in 385, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.0625 = fieldNorm(doc=385)
          0.10249481 = weight(abstract_txt:agnostic in 385) [ClassicSimilarity], result of:
            0.10249481 = score(doc=385,freq=1.0), product of:
              0.1681008 = queryWeight, product of:
                1.4818252 = boost
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.01162842 = queryNorm
              0.6097223 = fieldWeight in 385, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.0625 = fieldNorm(doc=385)
          0.045149773 = weight(abstract_txt:task in 385) [ClassicSimilarity], result of:
            0.045149773 = score(doc=385,freq=3.0), product of:
              0.08501753 = queryWeight, product of:
                1.4903262 = boost
                4.9057617 = idf(docFreq=893, maxDocs=44421)
                0.01162842 = queryNorm
              0.5310643 = fieldWeight in 385, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.9057617 = idf(docFreq=893, maxDocs=44421)
                0.0625 = fieldNorm(doc=385)
          0.03750338 = weight(abstract_txt:certain in 385) [ClassicSimilarity], result of:
            0.03750338 = score(doc=385,freq=1.0), product of:
              0.10834914 = queryWeight, product of:
                1.6824409 = boost
                5.5381527 = idf(docFreq=474, maxDocs=44421)
                0.01162842 = queryNorm
              0.34613454 = fieldWeight in 385, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5381527 = idf(docFreq=474, maxDocs=44421)
                0.0625 = fieldNorm(doc=385)
          0.19457555 = weight(abstract_txt:authorship in 385) [ClassicSimilarity], result of:
            0.19457555 = score(doc=385,freq=3.0), product of:
              0.25772747 = queryWeight, product of:
                3.177994 = boost
                6.9740796 = idf(docFreq=112, maxDocs=44421)
                0.01162842 = queryNorm
              0.75496626 = fieldWeight in 385, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.9740796 = idf(docFreq=112, maxDocs=44421)
                0.0625 = fieldNorm(doc=385)
        0.24 = coord(6/25)
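
The content-similarity scores combine several term matches. As a rough sketch under the same ClassicSimilarity assumptions (function and variable names are hypothetical), each term contributes queryWeight * fieldWeight, where queryWeight = boost * idf * queryNorm, and the sum is scaled by coord, the fraction of the 25 query terms that matched. The figures below are copied from the explanation for the first hit (doc=3503).

```python
import math

QUERY_NORM = 0.01162842  # queryNorm as reported in the listing
MAX_DOCS = 44421


def idf(doc_freq: int, max_docs: int = MAX_DOCS) -> float:
    """Assumed ClassicSimilarity idf: 1 + ln(maxDocs / (docFreq + 1))."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(boost: float, freq: float, doc_freq: int, field_norm: float) -> float:
    """Score of one matching term: queryWeight * fieldWeight."""
    term_idf = idf(doc_freq)
    query_weight = boost * term_idf * QUERY_NORM
    field_weight = math.sqrt(freq) * term_idf * field_norm
    return query_weight * field_weight


# (term, boost, termFreq, docFreq) for doc=3503; fieldNorm is 0.0625 throughout.
terms_3503 = [
    ("authors", 1.4111979, 3.0, 1159),
    ("that", 1.436892, 1.0, 11344),
    ("task", 1.4903262, 1.0, 893),
    ("authorship", 3.177994, 2.0, 112),
    ("attribution", 3.5975559, 1.0, 44),
    ("latin", 4.745001, 1.0, 48),
]

raw = sum(term_score(boost, freq, df, 0.0625) for _, boost, freq, df in terms_3503)
coord = len(terms_3503) / 25  # 6 of 25 query terms matched
total = coord * raw

print(round(total, 2))  # 0.14
```

The coord factor explains why the fourth hit (doc=124) ranks below the first three despite a larger raw sum (0.635169): only 4 of 25 query terms match, so its sum is multiplied by 0.16 instead of 0.24.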