Document (#22300)

Author
Kokol, P.
Podgorelec, V.
Zorman, M.
Kokol, T.
Njivar, T.
Title
Computer and natural language texts : a comparison based on long-range correlations
Source
Journal of the American Society for Information Science. 50(1999) no.14, pp.1295-1301
Year
1999
Abstract
'Long-range power law correlation' (LRC) is defined as the maximal propagation distance of the effect of some disturbance within a system. It is found in many systems that can be represented as strings of symbols, and LRC between characters has also been identified in natural language texts. The aim of this article is to show that long-range power law correlations can also be found in computer programs, meaning that some common laws hold for both natural language texts and computer programs. This fact enables one to draw parallels between these two different types of human writing, and also to measure the differences between them.
Theme
Computational linguistics (Computerlinguistik)

Similar documents (content)

  1. Altmann, E.G.; Cristadoro, G.; Esposti, M.D.: On the origin of long-range correlations in texts (2012) 0.26
    0.25788853 = sum of:
      0.25788853 = product of:
        0.92103046 = sum of:
          0.009749211 = weight(abstract_txt:that in 1330) [ClassicSimilarity], result of:
            0.009749211 = score(doc=1330,freq=1.0), product of:
              0.052766737 = queryWeight, product of:
                1.0358231 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.02154048 = queryNorm
              0.18476056 = fieldWeight in 1330, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.078125 = fieldNorm(doc=1330)
          0.053484567 = weight(abstract_txt:language in 1330) [ClassicSimilarity], result of:
            0.053484567 = score(doc=1330,freq=1.0), product of:
              0.16413437 = queryWeight, product of:
                1.82686 = boost
                4.1709876 = idf(docFreq=1863, maxDocs=44421)
                0.02154048 = queryNorm
              0.3258584 = fieldWeight in 1330, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1709876 = idf(docFreq=1863, maxDocs=44421)
                0.078125 = fieldNorm(doc=1330)
          0.34452352 = weight(abstract_txt:correlations in 1330) [ClassicSimilarity], result of:
            0.34452352 = score(doc=1330,freq=3.0), product of:
              0.34418264 = queryWeight, product of:
                2.160003 = boost
                7.3974023 = idf(docFreq=73, maxDocs=44421)
                0.02154048 = queryNorm
              1.0009904 = fieldWeight in 1330, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.3974023 = idf(docFreq=73, maxDocs=44421)
                0.078125 = fieldNorm(doc=1330)
          0.0952862 = weight(abstract_txt:range in 1330) [ClassicSimilarity], result of:
            0.0952862 = score(doc=1330,freq=1.0), product of:
              0.24121292 = queryWeight, product of:
                2.2146535 = boost
                5.0563765 = idf(docFreq=768, maxDocs=44421)
                0.02154048 = queryNorm
              0.39502943 = fieldWeight in 1330, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.0563765 = idf(docFreq=768, maxDocs=44421)
                0.078125 = fieldNorm(doc=1330)
          0.13580424 = weight(abstract_txt:natural in 1330) [ClassicSimilarity], result of:
            0.13580424 = score(doc=1330,freq=2.0), product of:
              0.24246335 = queryWeight, product of:
                2.2203863 = boost
                5.0694656 = idf(docFreq=758, maxDocs=44421)
                0.02154048 = queryNorm
              0.5601021 = fieldWeight in 1330, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.0694656 = idf(docFreq=758, maxDocs=44421)
                0.078125 = fieldNorm(doc=1330)
          0.15010987 = weight(abstract_txt:long in 1330) [ClassicSimilarity], result of:
            0.15010987 = score(doc=1330,freq=2.0), product of:
              0.259205 = queryWeight, product of:
                2.2957637 = boost
                5.2415633 = idf(docFreq=638, maxDocs=44421)
                0.02154048 = queryNorm
              0.5791164 = fieldWeight in 1330, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.2415633 = idf(docFreq=638, maxDocs=44421)
                0.078125 = fieldNorm(doc=1330)
          0.13207287 = weight(abstract_txt:texts in 1330) [ClassicSimilarity], result of:
            0.13207287 = score(doc=1330,freq=1.0), product of:
              0.29986307 = queryWeight, product of:
                2.469261 = boost
                5.6376824 = idf(docFreq=429, maxDocs=44421)
                0.02154048 = queryNorm
              0.44044393 = fieldWeight in 1330, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6376824 = idf(docFreq=429, maxDocs=44421)
                0.078125 = fieldNorm(doc=1330)
        0.28 = coord(7/25)
    
  2. Clark, M.; Kim, Y.; Kruschwitz, U.; Song, D.; Albakour, D.; Dignum, S.; Beresi, U.C.; Fasli, M.; Roeck, A De: Automatically structuring domain knowledge from text : an overview of current research (2012) 0.19
    0.19327927 = sum of:
      0.19327927 = product of:
        0.6039977 = sum of:
          0.011699054 = weight(abstract_txt:that in 3738) [ClassicSimilarity], result of:
            0.011699054 = score(doc=3738,freq=1.0), product of:
              0.052766737 = queryWeight, product of:
                1.0358231 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.02154048 = queryNorm
              0.22171268 = fieldWeight in 3738, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.09375 = fieldNorm(doc=3738)
          0.041510127 = weight(abstract_txt:some in 3738) [ClassicSimilarity], result of:
            0.041510127 = score(doc=3738,freq=2.0), product of:
              0.08511159 = queryWeight, product of:
                1.0741235 = boost
                3.6785707 = idf(docFreq=3049, maxDocs=44421)
                0.02154048 = queryNorm
              0.48771417 = fieldWeight in 3738, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.6785707 = idf(docFreq=3049, maxDocs=44421)
                0.09375 = fieldNorm(doc=3738)
          0.17283566 = weight(abstract_txt:propagation in 3738) [ClassicSimilarity], result of:
            0.17283566 = score(doc=3738,freq=1.0), product of:
              0.2202799 = queryWeight, product of:
                1.2218906 = boost
                8.369263 = idf(docFreq=27, maxDocs=44421)
                0.02154048 = queryNorm
              0.7846184 = fieldWeight in 3738, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.369263 = idf(docFreq=27, maxDocs=44421)
                0.09375 = fieldNorm(doc=3738)
          0.034610715 = weight(abstract_txt:also in 3738) [ClassicSimilarity], result of:
            0.034610715 = score(doc=3738,freq=1.0), product of:
              0.1087427 = queryWeight, product of:
                1.4869814 = boost
                3.3949955 = idf(docFreq=4049, maxDocs=44421)
                0.02154048 = queryNorm
              0.31828082 = fieldWeight in 3738, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3949955 = idf(docFreq=4049, maxDocs=44421)
                0.09375 = fieldNorm(doc=3738)
          0.03655447 = weight(abstract_txt:between in 3738) [ClassicSimilarity], result of:
            0.03655447 = score(doc=3738,freq=1.0), product of:
              0.112776875 = queryWeight, product of:
                1.5143125 = boost
                3.4573963 = idf(docFreq=3804, maxDocs=44421)
                0.02154048 = queryNorm
              0.3241309 = fieldWeight in 3738, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4573963 = idf(docFreq=3804, maxDocs=44421)
                0.09375 = fieldNorm(doc=3738)
          0.06418148 = weight(abstract_txt:language in 3738) [ClassicSimilarity], result of:
            0.06418148 = score(doc=3738,freq=1.0), product of:
              0.16413437 = queryWeight, product of:
                1.82686 = boost
                4.1709876 = idf(docFreq=1863, maxDocs=44421)
                0.02154048 = queryNorm
              0.39103007 = fieldWeight in 3738, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1709876 = idf(docFreq=1863, maxDocs=44421)
                0.09375 = fieldNorm(doc=3738)
          0.11523371 = weight(abstract_txt:natural in 3738) [ClassicSimilarity], result of:
            0.11523371 = score(doc=3738,freq=1.0), product of:
              0.24246335 = queryWeight, product of:
                2.2203863 = boost
                5.0694656 = idf(docFreq=758, maxDocs=44421)
                0.02154048 = queryNorm
              0.4752624 = fieldWeight in 3738, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.0694656 = idf(docFreq=758, maxDocs=44421)
                0.09375 = fieldNorm(doc=3738)
          0.12737244 = weight(abstract_txt:long in 3738) [ClassicSimilarity], result of:
            0.12737244 = score(doc=3738,freq=1.0), product of:
              0.259205 = queryWeight, product of:
                2.2957637 = boost
                5.2415633 = idf(docFreq=638, maxDocs=44421)
                0.02154048 = queryNorm
              0.49139655 = fieldWeight in 3738, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2415633 = idf(docFreq=638, maxDocs=44421)
                0.09375 = fieldNorm(doc=3738)
        0.32 = coord(8/25)
    
  3. Egghe, L.: The power of power laws and an interpretation of Lotkaian informetric systems as self-similar fractals (2005) 0.16
    0.16238523 = sum of:
      0.16238523 = product of:
        0.50745386 = sum of:
          0.011029975 = weight(abstract_txt:that in 4466) [ClassicSimilarity], result of:
            0.011029975 = score(doc=4466,freq=2.0), product of:
              0.052766737 = queryWeight, product of:
                1.0358231 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.02154048 = queryNorm
              0.20903271 = fieldWeight in 4466, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.0625 = fieldNorm(doc=4466)
          0.14291552 = weight(abstract_txt:laws in 4466) [ClassicSimilarity], result of:
            0.14291552 = score(doc=4466,freq=4.0), product of:
              0.16019407 = queryWeight, product of:
                1.0420009 = boost
                7.1371193 = idf(docFreq=95, maxDocs=44421)
                0.02154048 = queryNorm
              0.8921399 = fieldWeight in 4466, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.1371193 = idf(docFreq=95, maxDocs=44421)
                0.0625 = fieldNorm(doc=4466)
          0.019568063 = weight(abstract_txt:some in 4466) [ClassicSimilarity], result of:
            0.019568063 = score(doc=4466,freq=1.0), product of:
              0.08511159 = queryWeight, product of:
                1.0741235 = boost
                3.6785707 = idf(docFreq=3049, maxDocs=44421)
                0.02154048 = queryNorm
              0.22991067 = fieldWeight in 4466, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6785707 = idf(docFreq=3049, maxDocs=44421)
                0.0625 = fieldNorm(doc=4466)
          0.035148572 = weight(abstract_txt:found in 4466) [ClassicSimilarity], result of:
            0.035148572 = score(doc=4466,freq=1.0), product of:
              0.12576562 = queryWeight, product of:
                1.3056923 = boost
                4.4716287 = idf(docFreq=1379, maxDocs=44421)
                0.02154048 = queryNorm
              0.2794768 = fieldWeight in 4466, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4716287 = idf(docFreq=1379, maxDocs=44421)
                0.0625 = fieldNorm(doc=4466)
          0.03996501 = weight(abstract_txt:also in 4466) [ClassicSimilarity], result of:
            0.03996501 = score(doc=4466,freq=3.0), product of:
              0.1087427 = queryWeight, product of:
                1.4869814 = boost
                3.3949955 = idf(docFreq=4049, maxDocs=44421)
                0.02154048 = queryNorm
              0.36751902 = fieldWeight in 4466, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.3949955 = idf(docFreq=4049, maxDocs=44421)
                0.0625 = fieldNorm(doc=4466)
          0.024369646 = weight(abstract_txt:between in 4466) [ClassicSimilarity], result of:
            0.024369646 = score(doc=4466,freq=1.0), product of:
              0.112776875 = queryWeight, product of:
                1.5143125 = boost
                3.4573963 = idf(docFreq=3804, maxDocs=44421)
                0.02154048 = queryNorm
              0.21608727 = fieldWeight in 4466, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4573963 = idf(docFreq=3804, maxDocs=44421)
                0.0625 = fieldNorm(doc=4466)
          0.12879877 = weight(abstract_txt:power in 4466) [ClassicSimilarity], result of:
            0.12879877 = score(doc=4466,freq=3.0), product of:
              0.20726417 = queryWeight, product of:
                1.6761851 = boost
                5.7404623 = idf(docFreq=387, maxDocs=44421)
                0.02154048 = queryNorm
              0.62142324 = fieldWeight in 4466, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.7404623 = idf(docFreq=387, maxDocs=44421)
                0.0625 = fieldNorm(doc=4466)
          0.1056583 = weight(abstract_txt:texts in 4466) [ClassicSimilarity], result of:
            0.1056583 = score(doc=4466,freq=1.0), product of:
              0.29986307 = queryWeight, product of:
                2.469261 = boost
                5.6376824 = idf(docFreq=429, maxDocs=44421)
                0.02154048 = queryNorm
              0.35235515 = fieldWeight in 4466, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6376824 = idf(docFreq=429, maxDocs=44421)
                0.0625 = fieldNorm(doc=4466)
        0.32 = coord(8/25)
    
  4. Ucoluk, G.; Toroslu, I.H.: A genetic algorithm approach for verification of the syllable-based text compression technique (1997) 0.15
    0.1496571 = sum of:
      0.1496571 = product of:
        0.6235713 = sum of:
          0.021799902 = weight(abstract_txt:that in 3601) [ClassicSimilarity], result of:
            0.021799902 = score(doc=3601,freq=5.0), product of:
              0.052766737 = queryWeight, product of:
                1.0358231 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.02154048 = queryNorm
              0.4131372 = fieldWeight in 3601, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.078125 = fieldNorm(doc=3601)
          0.0963439 = weight(abstract_txt:symbols in 3601) [ClassicSimilarity], result of:
            0.0963439 = score(doc=3601,freq=1.0), product of:
              0.16848306 = queryWeight, product of:
                1.0686193 = boost
                7.319441 = idf(docFreq=79, maxDocs=44421)
                0.02154048 = queryNorm
              0.57183135 = fieldWeight in 3601, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.319441 = idf(docFreq=79, maxDocs=44421)
                0.078125 = fieldNorm(doc=3601)
          0.16773435 = weight(abstract_txt:strings in 3601) [ClassicSimilarity], result of:
            0.16773435 = score(doc=3601,freq=3.0), product of:
              0.16906267 = queryWeight, product of:
                1.0704558 = boost
                7.33202 = idf(docFreq=78, maxDocs=44421)
                0.02154048 = queryNorm
              0.99214303 = fieldWeight in 3601, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.33202 = idf(docFreq=78, maxDocs=44421)
                0.078125 = fieldNorm(doc=3601)
          0.15213573 = weight(abstract_txt:maximal in 3601) [ClassicSimilarity], result of:
            0.15213573 = score(doc=3601,freq=1.0), product of:
              0.22846918 = queryWeight, product of:
                1.2443962 = boost
                8.523414 = idf(docFreq=23, maxDocs=44421)
                0.02154048 = queryNorm
              0.6658917 = fieldWeight in 3601, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.523414 = idf(docFreq=23, maxDocs=44421)
                0.078125 = fieldNorm(doc=3601)
          0.053484567 = weight(abstract_txt:language in 3601) [ClassicSimilarity], result of:
            0.053484567 = score(doc=3601,freq=1.0), product of:
              0.16413437 = queryWeight, product of:
                1.82686 = boost
                4.1709876 = idf(docFreq=1863, maxDocs=44421)
                0.02154048 = queryNorm
              0.3258584 = fieldWeight in 3601, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1709876 = idf(docFreq=1863, maxDocs=44421)
                0.078125 = fieldNorm(doc=3601)
          0.13207287 = weight(abstract_txt:texts in 3601) [ClassicSimilarity], result of:
            0.13207287 = score(doc=3601,freq=1.0), product of:
              0.29986307 = queryWeight, product of:
                2.469261 = boost
                5.6376824 = idf(docFreq=429, maxDocs=44421)
                0.02154048 = queryNorm
              0.44044393 = fieldWeight in 3601, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6376824 = idf(docFreq=429, maxDocs=44421)
                0.078125 = fieldNorm(doc=3601)
        0.24 = coord(6/25)
    
  5. Agarwal, B.; Ramampiaro, H.; Langseth, H.; Ruocco, M.: A deep network model for paraphrase detection in short text messages (2018) 0.14
    0.13555089 = sum of:
      0.13555089 = product of:
        0.5647954 = sum of:
          0.015598739 = weight(abstract_txt:that in 43) [ClassicSimilarity], result of:
            0.015598739 = score(doc=43,freq=4.0), product of:
              0.052766737 = queryWeight, product of:
                1.0358231 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.02154048 = queryNorm
              0.2956169 = fieldWeight in 43, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.0625 = fieldNorm(doc=43)
          0.09068918 = weight(abstract_txt:enables in 43) [ClassicSimilarity], result of:
            0.09068918 = score(doc=43,freq=1.0), product of:
              0.2365886 = queryWeight, product of:
                1.7908399 = boost
                6.133123 = idf(docFreq=261, maxDocs=44421)
                0.02154048 = queryNorm
              0.38332018 = fieldWeight in 43, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.133123 = idf(docFreq=261, maxDocs=44421)
                0.0625 = fieldNorm(doc=43)
          0.060510878 = weight(abstract_txt:language in 43) [ClassicSimilarity], result of:
            0.060510878 = score(doc=43,freq=2.0), product of:
              0.16413437 = queryWeight, product of:
                1.82686 = boost
                4.1709876 = idf(docFreq=1863, maxDocs=44421)
                0.02154048 = queryNorm
              0.3686667 = fieldWeight in 43, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.1709876 = idf(docFreq=1863, maxDocs=44421)
                0.0625 = fieldNorm(doc=43)
          0.076822475 = weight(abstract_txt:natural in 43) [ClassicSimilarity], result of:
            0.076822475 = score(doc=43,freq=1.0), product of:
              0.24246335 = queryWeight, product of:
                2.2203863 = boost
                5.0694656 = idf(docFreq=758, maxDocs=44421)
                0.02154048 = queryNorm
              0.3168416 = fieldWeight in 43, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.0694656 = idf(docFreq=758, maxDocs=44421)
                0.0625 = fieldNorm(doc=43)
          0.08491497 = weight(abstract_txt:long in 43) [ClassicSimilarity], result of:
            0.08491497 = score(doc=43,freq=1.0), product of:
              0.259205 = queryWeight, product of:
                2.2957637 = boost
                5.2415633 = idf(docFreq=638, maxDocs=44421)
                0.02154048 = queryNorm
              0.3275977 = fieldWeight in 43, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2415633 = idf(docFreq=638, maxDocs=44421)
                0.0625 = fieldNorm(doc=43)
          0.23625913 = weight(abstract_txt:texts in 43) [ClassicSimilarity], result of:
            0.23625913 = score(doc=43,freq=5.0), product of:
              0.29986307 = queryWeight, product of:
                2.469261 = boost
                5.6376824 = idf(docFreq=429, maxDocs=44421)
                0.02154048 = queryNorm
              0.7878901 = fieldWeight in 43, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.6376824 = idf(docFreq=429, maxDocs=44421)
                0.0625 = fieldNorm(doc=43)
        0.24 = coord(6/25)
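
The score trees above all follow the same arithmetic, which can be sketched in a few lines of Python. This is a minimal illustration, not the search engine's own code: the function names (`classic_idf`, `classic_tf`, `term_score`, `document_score`) are hypothetical, and the formulas (tf as the square root of the term frequency, idf as 1 + ln(maxDocs / (docFreq + 1)), the final score as the sum of term scores times the coord factor) are assumptions chosen because they are consistent with the figures in the listing. The values for boost, queryNorm and fieldNorm are taken directly from the listing rather than derived.

```python
import math


def classic_idf(doc_freq, max_docs):
    # Assumed idf form; it matches the listed values,
    # e.g. idf(docFreq=73, maxDocs=44421) is about 7.3974.
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def classic_tf(freq):
    # Assumed tf form: tf(freq=3.0) is listed as 1.7320508, i.e. sqrt(3).
    return math.sqrt(freq)


def term_score(freq, doc_freq, max_docs, boost, query_norm, field_norm):
    # score = queryWeight * fieldWeight, as in each "product of" node.
    idf = classic_idf(doc_freq, max_docs)
    query_weight = boost * idf * query_norm          # boost * idf * queryNorm
    field_weight = classic_tf(freq) * idf * field_norm  # tf * idf * fieldNorm
    return query_weight * field_weight


def document_score(term_scores, matched_terms, query_terms):
    # Final score = sum of term scores * coord(matched/total).
    return sum(term_scores) * (matched_terms / query_terms)


# Term "correlations" in doc 1330, using the values from the listing.
correlations = term_score(
    freq=3.0, doc_freq=73, max_docs=44421,
    boost=2.160003, query_norm=0.02154048, field_norm=0.078125,
)

# The seven term scores listed for doc 1330, combined with coord(7/25).
doc_1330 = document_score(
    [0.009749211, 0.053484567, 0.34452352, 0.0952862,
     0.13580424, 0.15010987, 0.13207287],
    matched_terms=7, query_terms=25,
)

print(correlations, doc_1330)
```

Under these assumptions the two printed values agree with the listed 0.34452352 and 0.25788853 to about four decimal places; small differences come from the single-precision rounding in the original output.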