Document (#41544)

Author
Vani, K.
Gupta, D.
Title
Integrating syntax-semantic-based text analysis with structural and citation information for scientific plagiarism detection
Source
Journal of the Association for Information Science and Technology. 69(2018) no.11, pp.1330-1345
Year
2018
Abstract
The objective of the work is to explore the potency of integrating structural and citation information with effective syntax-semantic text-based analysis for scientific plagiarism detection. One of the major limitations in today's plagiarism checkers is their sole dependence on text-based detection, where they ignore the citation and structural information. Further, the text-based detection approaches that they employ usually fail to trace out intelligent manipulations. In the proposed work, a plagiarism detection system is presented that employs the effective coupling of various modules, namely, logical structure classifications and citation parsing, two-stage candidate document selections, syntax-semantic-based exhaustive passage level analysis with plagiarism analysis using structural and citation information. Further, a new plagiarism score, namely, the weighted overall similarity index, is proposed, as opposed to the general plagiarism scores. The proposed approach is evaluated on the data set created by Alzahrani et al. (2011), which contains scientific publications imposed with various plagiarism complexities. Comparison of the final system results is done against a potential baseline approach. The proposed approach exhibits considerable improvement over the comparative baseline, and hence reflects the potency of syntax-semantic text-based analysis with structural and citation information.
Content
Cf.: https://onlinelibrary.wiley.com/doi/10.1002/asi.24027.

Similar documents (author)

  1. Gupta, S.: Decimal Classification System : a bibliography for the period 1876-1994 (1997) 5.76
    5.7603507 = sum of:
      5.7603507 = weight(author_txt:gupta in 4935) [ClassicSimilarity], result of:
        5.7603507 = fieldWeight in 4935, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.216561 = idf(docFreq=11, maxDocs=44421)
          0.625 = fieldNorm(doc=4935)
    
  2. Gupta, S.: Cataloging Ethiopian personal names (1991) 5.76
    5.7603507 = sum of:
      5.7603507 = weight(author_txt:gupta in 652) [ClassicSimilarity], result of:
        5.7603507 = fieldWeight in 652, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.216561 = idf(docFreq=11, maxDocs=44421)
          0.625 = fieldNorm(doc=652)
    
  3. Gupta, S.: Communication clothing design (2009) 5.76
    5.7603507 = sum of:
      5.7603507 = weight(author_txt:gupta in 92) [ClassicSimilarity], result of:
        5.7603507 = fieldWeight in 92, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.216561 = idf(docFreq=11, maxDocs=44421)
          0.625 = fieldNorm(doc=92)
    
  4. Gupta, U.; Salisbury, L.: Is FirstSearch really attractive? (1992) 4.61
    4.6082807 = sum of:
      4.6082807 = weight(author_txt:gupta in 3862) [ClassicSimilarity], result of:
        4.6082807 = fieldWeight in 3862, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.216561 = idf(docFreq=11, maxDocs=44421)
          0.5 = fieldNorm(doc=3862)
    
  5. Berkley, B.J.; Gupta, A.: Improving service quality with information technology (1994) 4.61
    4.6082807 = sum of:
      4.6082807 = weight(author_txt:gupta in 8020) [ClassicSimilarity], result of:
        4.6082807 = fieldWeight in 8020, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.216561 = idf(docFreq=11, maxDocs=44421)
          0.5 = fieldNorm(doc=8020)
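
The fieldWeight values in the author matches above can be checked by hand. The following is a minimal sketch in Python, assuming the classic TF-IDF formulas that fit the numbers shown here: tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)). The function names are illustrative only and are not part of any Lucene API. The listing appears to use 32-bit floats, so results agree only up to rounding.

```python
import math


def classic_tf(freq: float) -> float:
    """Term-frequency factor: square root of the raw term frequency."""
    return math.sqrt(freq)


def classic_idf(doc_freq: int, max_docs: int) -> float:
    """Inverse document frequency, in the form that reproduces the listed values."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def field_weight(freq: float, doc_freq: int, max_docs: int, field_norm: float) -> float:
    """fieldWeight = tf * idf * fieldNorm, as in the explain output."""
    return classic_tf(freq) * classic_idf(doc_freq, max_docs) * field_norm


# Values copied from the listing: author_txt:gupta, docFreq=11, maxDocs=44421.
weight_entry_1 = field_weight(1.0, 11, 44421, 0.625)  # entries 1 to 3
weight_entry_4 = field_weight(1.0, 11, 44421, 0.5)    # entries 4 and 5

print(round(weight_entry_1, 4))
print(round(weight_entry_4, 4))
```

Under these assumptions the two groups of author matches differ only in fieldNorm (0.625 versus 0.5). That factor is a length normalization, which is consistent with entries 4 and 5 each listing two authors.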
    

Similar documents (content)

  1. Gipp, B.; Meuschke, N.; Breitinger, C.: Citation-based plagiarism detection : practicability on a large-scale scientific corpus (2014) 0.94
    0.9415224 = sum of:
      0.9415224 = product of:
        1.961505 = sum of:
          0.013276386 = weight(abstract_txt:various in 4332) [ClassicSimilarity], result of:
            0.013276386 = score(doc=4332,freq=1.0), product of:
              0.04841442 = queryWeight, product of:
                1.0654987 = boost
                4.387581 = idf(docFreq=1500, maxDocs=44421)
                0.0103561105 = queryNorm
              0.2742238 = fieldWeight in 4332, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.387581 = idf(docFreq=1500, maxDocs=44421)
                0.0625 = fieldNorm(doc=4332)
          0.017459184 = weight(abstract_txt:approach in 4332) [ClassicSimilarity], result of:
            0.017459184 = score(doc=4332,freq=2.0), product of:
              0.05279886 = queryWeight, product of:
                1.3627728 = boost
                3.741144 = idf(docFreq=2864, maxDocs=44421)
                0.0103561105 = queryNorm
              0.33067352 = fieldWeight in 4332, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.741144 = idf(docFreq=2864, maxDocs=44421)
                0.0625 = fieldNorm(doc=4332)
          0.007865258 = weight(abstract_txt:information in 4332) [ClassicSimilarity], result of:
            0.007865258 = score(doc=4332,freq=2.0), product of:
              0.036787488 = queryWeight, product of:
                1.4685395 = boost
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.0103561105 = queryNorm
              0.21380253 = fieldWeight in 4332, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.0625 = fieldNorm(doc=4332)
          0.008643033 = weight(abstract_txt:with in 4332) [ClassicSimilarity], result of:
            0.008643033 = score(doc=4332,freq=2.0), product of:
              0.039174393 = queryWeight, product of:
                1.5154328 = boost
                2.4961398 = idf(docFreq=9949, maxDocs=44421)
                0.0103561105 = queryNorm
              0.22062966 = fieldWeight in 4332, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4961398 = idf(docFreq=9949, maxDocs=44421)
                0.0625 = fieldNorm(doc=4332)
          0.028162887 = weight(abstract_txt:semantic in 4332) [ClassicSimilarity], result of:
            0.028162887 = score(doc=4332,freq=1.0), product of:
              0.10070466 = queryWeight, product of:
                2.1732283 = boost
                4.4745317 = idf(docFreq=1375, maxDocs=44421)
                0.0103561105 = queryNorm
              0.27965823 = fieldWeight in 4332, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4745317 = idf(docFreq=1375, maxDocs=44421)
                0.0625 = fieldNorm(doc=4332)
          0.030760244 = weight(abstract_txt:proposed in 4332) [ClassicSimilarity], result of:
            0.030760244 = score(doc=4332,freq=1.0), product of:
              0.10680494 = queryWeight, product of:
                2.2380831 = boost
                4.608063 = idf(docFreq=1203, maxDocs=44421)
                0.0103561105 = queryNorm
              0.28800395 = fieldWeight in 4332, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.608063 = idf(docFreq=1203, maxDocs=44421)
                0.0625 = fieldNorm(doc=4332)
          0.04023606 = weight(abstract_txt:based in 4332) [ClassicSimilarity], result of:
            0.04023606 = score(doc=4332,freq=7.0), product of:
              0.076443315 = queryWeight, product of:
                2.3189743 = boost
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.0103561105 = queryNorm
              0.5263516 = fieldWeight in 4332, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.0625 = fieldNorm(doc=4332)
          0.036668066 = weight(abstract_txt:text in 4332) [ClassicSimilarity], result of:
            0.036668066 = score(doc=4332,freq=2.0), product of:
              0.10266368 = queryWeight, product of:
                2.4532623 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.0103561105 = queryNorm
              0.3571669 = fieldWeight in 4332, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.0625 = fieldNorm(doc=4332)
          0.078295186 = weight(abstract_txt:structural in 4332) [ClassicSimilarity], result of:
            0.078295186 = score(doc=4332,freq=1.0), product of:
              0.21448232 = queryWeight, product of:
                3.5459394 = boost
                5.8406816 = idf(docFreq=350, maxDocs=44421)
                0.0103561105 = queryNorm
              0.3650426 = fieldWeight in 4332, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8406816 = idf(docFreq=350, maxDocs=44421)
                0.0625 = fieldNorm(doc=4332)
          0.11029143 = weight(abstract_txt:citation in 4332) [ClassicSimilarity], result of:
            0.11029143 = score(doc=4332,freq=4.0), product of:
              0.18042764 = queryWeight, product of:
                3.5626874 = boost
                4.890223 = idf(docFreq=907, maxDocs=44421)
                0.0103561105 = queryNorm
              0.6112779 = fieldWeight in 4332, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.890223 = idf(docFreq=907, maxDocs=44421)
                0.0625 = fieldNorm(doc=4332)
          0.32319915 = weight(abstract_txt:detection in 4332) [ClassicSimilarity], result of:
            0.32319915 = score(doc=4332,freq=7.0), product of:
              0.28852424 = queryWeight, product of:
                4.112697 = boost
                6.774214 = idf(docFreq=137, maxDocs=44421)
                0.0103561105 = queryNorm
              1.1201802 = fieldWeight in 4332, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                6.774214 = idf(docFreq=137, maxDocs=44421)
                0.0625 = fieldNorm(doc=4332)
          1.2666482 = weight(abstract_txt:plagiarism in 4332) [ClassicSimilarity], result of:
            1.2666482 = score(doc=4332,freq=9.0), product of:
              0.7714326 = queryWeight, product of:
                8.506375 = boost
                8.757029 = idf(docFreq=18, maxDocs=44421)
                0.0103561105 = queryNorm
              1.6419429 = fieldWeight in 4332, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                8.757029 = idf(docFreq=18, maxDocs=44421)
                0.0625 = fieldNorm(doc=4332)
        0.48 = coord(12/25)
    
  2. K., Vani; Gupta, D.: Unmasking text plagiarism using syntactic-semantic based natural language processing techniques : comparisons, analysis and challenges (2018) 0.76
    0.7586907 = sum of:
      0.7586907 = product of:
        1.4590206 = sum of:
          0.018775646 = weight(abstract_txt:various in 84) [ClassicSimilarity], result of:
            0.018775646 = score(doc=84,freq=2.0), product of:
              0.04841442 = queryWeight, product of:
                1.0654987 = boost
                4.387581 = idf(docFreq=1500, maxDocs=44421)
                0.0103561105 = queryNorm
              0.387811 = fieldWeight in 84, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.387581 = idf(docFreq=1500, maxDocs=44421)
                0.0625 = fieldNorm(doc=84)
          0.015897665 = weight(abstract_txt:further in 84) [ClassicSimilarity], result of:
            0.015897665 = score(doc=84,freq=1.0), product of:
              0.054593846 = queryWeight, product of:
                1.1314553 = boost
                4.6591816 = idf(docFreq=1143, maxDocs=44421)
                0.0103561105 = queryNorm
              0.29119885 = fieldWeight in 84, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6591816 = idf(docFreq=1143, maxDocs=44421)
                0.0625 = fieldNorm(doc=84)
          0.017795559 = weight(abstract_txt:effective in 84) [ClassicSimilarity], result of:
            0.017795559 = score(doc=84,freq=1.0), product of:
              0.0588567 = queryWeight, product of:
                1.1747988 = boost
                4.837664 = idf(docFreq=956, maxDocs=44421)
                0.0103561105 = queryNorm
              0.302354 = fieldWeight in 84, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.837664 = idf(docFreq=956, maxDocs=44421)
                0.0625 = fieldNorm(doc=84)
          0.021383047 = weight(abstract_txt:approach in 84) [ClassicSimilarity], result of:
            0.021383047 = score(doc=84,freq=3.0), product of:
              0.05279886 = queryWeight, product of:
                1.3627728 = boost
                3.741144 = idf(docFreq=2864, maxDocs=44421)
                0.0103561105 = queryNorm
              0.4049907 = fieldWeight in 84, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.741144 = idf(docFreq=2864, maxDocs=44421)
                0.0625 = fieldNorm(doc=84)
          0.005561577 = weight(abstract_txt:information in 84) [ClassicSimilarity], result of:
            0.005561577 = score(doc=84,freq=1.0), product of:
              0.036787488 = queryWeight, product of:
                1.4685395 = boost
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.0103561105 = queryNorm
              0.15118122 = fieldWeight in 84, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.0625 = fieldNorm(doc=84)
          0.008643033 = weight(abstract_txt:with in 84) [ClassicSimilarity], result of:
            0.008643033 = score(doc=84,freq=2.0), product of:
              0.039174393 = queryWeight, product of:
                1.5154328 = boost
                2.4961398 = idf(docFreq=9949, maxDocs=44421)
                0.0103561105 = queryNorm
              0.22062966 = fieldWeight in 84, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4961398 = idf(docFreq=9949, maxDocs=44421)
                0.0625 = fieldNorm(doc=84)
          0.056325775 = weight(abstract_txt:semantic in 84) [ClassicSimilarity], result of:
            0.056325775 = score(doc=84,freq=4.0), product of:
              0.10070466 = queryWeight, product of:
                2.1732283 = boost
                4.4745317 = idf(docFreq=1375, maxDocs=44421)
                0.0103561105 = queryNorm
              0.55931646 = fieldWeight in 84, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.4745317 = idf(docFreq=1375, maxDocs=44421)
                0.0625 = fieldNorm(doc=84)
          0.026927335 = weight(abstract_txt:analysis in 84) [ClassicSimilarity], result of:
            0.026927335 = score(doc=84,freq=2.0), product of:
              0.08356423 = queryWeight, product of:
                2.213328 = boost
                3.6456752 = idf(docFreq=3151, maxDocs=44421)
                0.0103561105 = queryNorm
              0.3222352 = fieldWeight in 84, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.6456752 = idf(docFreq=3151, maxDocs=44421)
                0.0625 = fieldNorm(doc=84)
          0.053278305 = weight(abstract_txt:proposed in 84) [ClassicSimilarity], result of:
            0.053278305 = score(doc=84,freq=3.0), product of:
              0.10680494 = queryWeight, product of:
                2.2380831 = boost
                4.608063 = idf(docFreq=1203, maxDocs=44421)
                0.0103561105 = queryNorm
              0.49883747 = fieldWeight in 84, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.608063 = idf(docFreq=1203, maxDocs=44421)
                0.0625 = fieldNorm(doc=84)
          0.015207801 = weight(abstract_txt:based in 84) [ClassicSimilarity], result of:
            0.015207801 = score(doc=84,freq=1.0), product of:
              0.076443315 = queryWeight, product of:
                2.3189743 = boost
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.0103561105 = queryNorm
              0.1989422 = fieldWeight in 84, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.0625 = fieldNorm(doc=84)
          0.15296322 = weight(abstract_txt:potency in 84) [ClassicSimilarity], result of:
            0.15296322 = score(doc=84,freq=1.0), product of:
              0.24697112 = queryWeight, product of:
                2.4065154 = boost
                9.909708 = idf(docFreq=5, maxDocs=44421)
                0.0103561105 = queryNorm
              0.61935675 = fieldWeight in 84, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.909708 = idf(docFreq=5, maxDocs=44421)
                0.0625 = fieldNorm(doc=84)
          0.122157805 = weight(abstract_txt:detection in 84) [ClassicSimilarity], result of:
            0.122157805 = score(doc=84,freq=1.0), product of:
              0.28852424 = queryWeight, product of:
                4.112697 = boost
                6.774214 = idf(docFreq=137, maxDocs=44421)
                0.0103561105 = queryNorm
              0.42338836 = fieldWeight in 84, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.774214 = idf(docFreq=137, maxDocs=44421)
                0.0625 = fieldNorm(doc=84)
          0.9441039 = weight(abstract_txt:plagiarism in 84) [ClassicSimilarity], result of:
            0.9441039 = score(doc=84,freq=5.0), product of:
              0.7714326 = queryWeight, product of:
                8.506375 = boost
                8.757029 = idf(docFreq=18, maxDocs=44421)
                0.0103561105 = queryNorm
              1.223832 = fieldWeight in 84, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                8.757029 = idf(docFreq=18, maxDocs=44421)
                0.0625 = fieldNorm(doc=84)
        0.52 = coord(13/25)
    
  3. Alzahrani, S.; Palade, V.; Salim, N.; Abraham, A.: Using structural information and citation evidence to detect significant plagiarism cases in scientific publications (2012) 0.74
    0.7400641 = sum of:
      0.7400641 = product of:
        1.6819638 = sum of:
          0.01080232 = weight(abstract_txt:approach in 982) [ClassicSimilarity], result of:
            0.01080232 = score(doc=982,freq=1.0), product of:
              0.05279886 = queryWeight, product of:
                1.3627728 = boost
                3.741144 = idf(docFreq=2864, maxDocs=44421)
                0.0103561105 = queryNorm
              0.20459381 = fieldWeight in 982, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.741144 = idf(docFreq=2864, maxDocs=44421)
                0.0546875 = fieldNorm(doc=982)
          0.0068821004 = weight(abstract_txt:information in 982) [ClassicSimilarity], result of:
            0.0068821004 = score(doc=982,freq=2.0), product of:
              0.036787488 = queryWeight, product of:
                1.4685395 = boost
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.0103561105 = queryNorm
              0.18707721 = fieldWeight in 982, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.0546875 = fieldNorm(doc=982)
          0.010695208 = weight(abstract_txt:with in 982) [ClassicSimilarity], result of:
            0.010695208 = score(doc=982,freq=4.0), product of:
              0.039174393 = queryWeight, product of:
                1.5154328 = boost
                2.4961398 = idf(docFreq=9949, maxDocs=44421)
                0.0103561105 = queryNorm
              0.2730153 = fieldWeight in 982, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                2.4961398 = idf(docFreq=9949, maxDocs=44421)
                0.0546875 = fieldNorm(doc=982)
          0.03459356 = weight(abstract_txt:namely in 982) [ClassicSimilarity], result of:
            0.03459356 = score(doc=982,freq=1.0), product of:
              0.100210436 = queryWeight, product of:
                1.532929 = boost
                6.312396 = idf(docFreq=218, maxDocs=44421)
                0.0103561105 = queryNorm
              0.34520915 = fieldWeight in 982, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.312396 = idf(docFreq=218, maxDocs=44421)
                0.0546875 = fieldNorm(doc=982)
          0.029003436 = weight(abstract_txt:scientific in 982) [ClassicSimilarity], result of:
            0.029003436 = score(doc=982,freq=2.0), product of:
              0.0809536 = queryWeight, product of:
                1.6874436 = boost
                4.6324444 = idf(docFreq=1174, maxDocs=44421)
                0.0103561105 = queryNorm
              0.35827234 = fieldWeight in 982, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.6324444 = idf(docFreq=1174, maxDocs=44421)
                0.0546875 = fieldNorm(doc=982)
          0.026915213 = weight(abstract_txt:proposed in 982) [ClassicSimilarity], result of:
            0.026915213 = score(doc=982,freq=1.0), product of:
              0.10680494 = queryWeight, product of:
                2.2380831 = boost
                4.608063 = idf(docFreq=1203, maxDocs=44421)
                0.0103561105 = queryNorm
              0.25200346 = fieldWeight in 982, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.608063 = idf(docFreq=1203, maxDocs=44421)
                0.0546875 = fieldNorm(doc=982)
          0.0230481 = weight(abstract_txt:based in 982) [ClassicSimilarity], result of:
            0.0230481 = score(doc=982,freq=3.0), product of:
              0.076443315 = queryWeight, product of:
                2.3189743 = boost
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.0103561105 = queryNorm
              0.30150574 = fieldWeight in 982, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.0546875 = fieldNorm(doc=982)
          0.13701656 = weight(abstract_txt:structural in 982) [ClassicSimilarity], result of:
            0.13701656 = score(doc=982,freq=4.0), product of:
              0.21448232 = queryWeight, product of:
                3.5459394 = boost
                5.8406816 = idf(docFreq=350, maxDocs=44421)
                0.0103561105 = queryNorm
              0.6388245 = fieldWeight in 982, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.8406816 = idf(docFreq=350, maxDocs=44421)
                0.0546875 = fieldNorm(doc=982)
          0.08357578 = weight(abstract_txt:citation in 982) [ClassicSimilarity], result of:
            0.08357578 = score(doc=982,freq=3.0), product of:
              0.18042764 = queryWeight, product of:
                3.5626874 = boost
                4.890223 = idf(docFreq=907, maxDocs=44421)
                0.0103561105 = queryNorm
              0.4632094 = fieldWeight in 982, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.890223 = idf(docFreq=907, maxDocs=44421)
                0.0546875 = fieldNorm(doc=982)
          0.15116256 = weight(abstract_txt:detection in 982) [ClassicSimilarity], result of:
            0.15116256 = score(doc=982,freq=2.0), product of:
              0.28852424 = queryWeight, product of:
                4.112697 = boost
                6.774214 = idf(docFreq=137, maxDocs=44421)
                0.0103561105 = queryNorm
              0.52391636 = fieldWeight in 982, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.774214 = idf(docFreq=137, maxDocs=44421)
                0.0546875 = fieldNorm(doc=982)
          1.1682689 = weight(abstract_txt:plagiarism in 982) [ClassicSimilarity], result of:
            1.1682689 = score(doc=982,freq=10.0), product of:
              0.7714326 = queryWeight, product of:
                8.506375 = boost
                8.757029 = idf(docFreq=18, maxDocs=44421)
                0.0103561105 = queryNorm
              1.5144148 = fieldWeight in 982, product of:
                3.1622777 = tf(freq=10.0), with freq of:
                  10.0 = termFreq=10.0
                8.757029 = idf(docFreq=18, maxDocs=44421)
                0.0546875 = fieldNorm(doc=982)
        0.44 = coord(11/25)
    
  4. Stamatatos, E.: Plagiarism detection using stopword n-grams (2011) 0.34
    0.34014606 = sum of:
      0.34014606 = product of:
        1.0629565 = sum of:
          0.054877095 = weight(abstract_txt:passage in 955) [ClassicSimilarity], result of:
            0.054877095 = score(doc=955,freq=1.0), product of:
              0.08529015 = queryWeight, product of:
                8.235732 = idf(docFreq=31, maxDocs=44421)
                0.0103561105 = queryNorm
              0.6434166 = fieldWeight in 955, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.235732 = idf(docFreq=31, maxDocs=44421)
                0.078125 = fieldNorm(doc=955)
          0.015431885 = weight(abstract_txt:approach in 955) [ClassicSimilarity], result of:
            0.015431885 = score(doc=955,freq=1.0), product of:
              0.05279886 = queryWeight, product of:
                1.3627728 = boost
                3.741144 = idf(docFreq=2864, maxDocs=44421)
                0.0103561105 = queryNorm
              0.29227686 = fieldWeight in 955, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.741144 = idf(docFreq=2864, maxDocs=44421)
                0.078125 = fieldNorm(doc=955)
          0.0069519714 = weight(abstract_txt:information in 955) [ClassicSimilarity], result of:
            0.0069519714 = score(doc=955,freq=1.0), product of:
              0.036787488 = queryWeight, product of:
                1.4685395 = boost
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.0103561105 = queryNorm
              0.18897653 = fieldWeight in 955, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.078125 = fieldNorm(doc=955)
          0.013231888 = weight(abstract_txt:with in 955) [ClassicSimilarity], result of:
            0.013231888 = score(doc=955,freq=3.0), product of:
              0.039174393 = queryWeight, product of:
                1.5154328 = boost
                2.4961398 = idf(docFreq=9949, maxDocs=44421)
                0.0103561105 = queryNorm
              0.33776882 = fieldWeight in 955, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.4961398 = idf(docFreq=9949, maxDocs=44421)
                0.078125 = fieldNorm(doc=955)
          0.05437694 = weight(abstract_txt:proposed in 955) [ClassicSimilarity], result of:
            0.05437694 = score(doc=955,freq=2.0), product of:
              0.10680494 = queryWeight, product of:
                2.2380831 = boost
                4.608063 = idf(docFreq=1203, maxDocs=44421)
                0.0103561105 = queryNorm
              0.50912386 = fieldWeight in 955, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.608063 = idf(docFreq=1203, maxDocs=44421)
                0.078125 = fieldNorm(doc=955)
          0.01900975 = weight(abstract_txt:based in 955) [ClassicSimilarity], result of:
            0.01900975 = score(doc=955,freq=1.0), product of:
              0.076443315 = queryWeight, product of:
                2.3189743 = boost
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.0103561105 = queryNorm
              0.24867775 = fieldWeight in 955, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.078125 = fieldNorm(doc=955)
          0.15269727 = weight(abstract_txt:detection in 955) [ClassicSimilarity], result of:
            0.15269727 = score(doc=955,freq=1.0), product of:
              0.28852424 = queryWeight, product of:
                4.112697 = boost
                6.774214 = idf(docFreq=137, maxDocs=44421)
                0.0103561105 = queryNorm
              0.5292355 = fieldWeight in 955, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.774214 = idf(docFreq=137, maxDocs=44421)
                0.078125 = fieldNorm(doc=955)
          0.7463796 = weight(abstract_txt:plagiarism in 955) [ClassicSimilarity], result of:
            0.7463796 = score(doc=955,freq=2.0), product of:
              0.7714326 = queryWeight, product of:
                8.506375 = boost
                8.757029 = idf(docFreq=18, maxDocs=44421)
                0.0103561105 = queryNorm
              0.9675241 = fieldWeight in 955, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.757029 = idf(docFreq=18, maxDocs=44421)
                0.078125 = fieldNorm(doc=955)
        0.32 = coord(8/25)
    
  5. Pertile, S. de L.; Moreira, V.P.: Comparing and combining content- and citation-based approaches for plagiarism detection (2016) 0.34
    0.33707702 = sum of:
      0.33707702 = product of:
        1.2038465 = sum of:
          0.008643033 = weight(abstract_txt:with in 4123) [ClassicSimilarity], result of:
            0.008643033 = score(doc=4123,freq=2.0), product of:
              0.039174393 = queryWeight, product of:
                1.5154328 = boost
                2.4961398 = idf(docFreq=9949, maxDocs=44421)
                0.0103561105 = queryNorm
              0.22062966 = fieldWeight in 4123, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4961398 = idf(docFreq=9949, maxDocs=44421)
                0.0625 = fieldNorm(doc=4123)
          0.040596355 = weight(abstract_txt:scientific in 4123) [ClassicSimilarity], result of:
            0.040596355 = score(doc=4123,freq=3.0), product of:
              0.0809536 = queryWeight, product of:
                1.6874436 = boost
                4.6324444 = idf(docFreq=1174, maxDocs=44421)
                0.0103561105 = queryNorm
              0.5014768 = fieldWeight in 4123, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.6324444 = idf(docFreq=1174, maxDocs=44421)
                0.0625 = fieldNorm(doc=4123)
          0.030415602 = weight(abstract_txt:based in 4123) [ClassicSimilarity], result of:
            0.030415602 = score(doc=4123,freq=4.0), product of:
              0.076443315 = queryWeight, product of:
                2.3189743 = boost
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.0103561105 = queryNorm
              0.3978844 = fieldWeight in 4123, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.0625 = fieldNorm(doc=4123)
          0.05185648 = weight(abstract_txt:text in 4123) [ClassicSimilarity], result of:
            0.05185648 = score(doc=4123,freq=4.0), product of:
              0.10266368 = queryWeight, product of:
                2.4532623 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.0103561105 = queryNorm
              0.50511026 = fieldWeight in 4123, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.0625 = fieldNorm(doc=4123)
          0.055145714 = weight(abstract_txt:citation in 4123) [ClassicSimilarity], result of:
            0.055145714 = score(doc=4123,freq=1.0), product of:
              0.18042764 = queryWeight, product of:
                3.5626874 = boost
                4.890223 = idf(docFreq=907, maxDocs=44421)
                0.0103561105 = queryNorm
              0.30563894 = fieldWeight in 4123, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.890223 = idf(docFreq=907, maxDocs=44421)
                0.0625 = fieldNorm(doc=4123)
          0.17275722 = weight(abstract_txt:detection in 4123) [ClassicSimilarity], result of:
            0.17275722 = score(doc=4123,freq=2.0), product of:
              0.28852424 = queryWeight, product of:
                4.112697 = boost
                6.774214 = idf(docFreq=137, maxDocs=44421)
                0.0103561105 = queryNorm
              0.59876156 = fieldWeight in 4123, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.774214 = idf(docFreq=137, maxDocs=44421)
                0.0625 = fieldNorm(doc=4123)
          0.8444321 = weight(abstract_txt:plagiarism in 4123) [ClassicSimilarity], result of:
            0.8444321 = score(doc=4123,freq=4.0), product of:
              0.7714326 = queryWeight, product of:
                8.506375 = boost
                8.757029 = idf(docFreq=18, maxDocs=44421)
                0.0103561105 = queryNorm
              1.0946286 = fieldWeight in 4123, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                8.757029 = idf(docFreq=18, maxDocs=44421)
                0.0625 = fieldNorm(doc=4123)
        0.28 = coord(7/25)
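
The content scores above combine per-term weights with a coordination factor. The sketch below recomputes the score of entry 5 (doc 4123) from the raw values in its listing. It assumes that each term contributes queryWeight * fieldWeight, where queryWeight = boost * idf * queryNorm and fieldWeight = tf * idf * fieldNorm, and that the sum is multiplied by coord = matched terms / query terms. Names such as `term_score` and `document_score` are illustrative, not Lucene API calls. The result should match the listed 0.33707702 up to floating-point rounding.

```python
import math

MAX_DOCS = 44421
QUERY_NORM = 0.0103561105
QUERY_TERMS = 25  # denominator of coord(7/25)

# (term, boost, docFreq, termFreq, fieldNorm), copied from the listing for doc 4123.
MATCHES = [
    ("with", 1.5154328, 9949, 2.0, 0.0625),
    ("scientific", 1.6874436, 1174, 3.0, 0.0625),
    ("based", 2.3189743, 5005, 4.0, 0.0625),
    ("text", 2.4532623, 2122, 4.0, 0.0625),
    ("citation", 3.5626874, 907, 1.0, 0.0625),
    ("detection", 4.112697, 137, 2.0, 0.0625),
    ("plagiarism", 8.506375, 18, 4.0, 0.0625),
]


def idf(doc_freq: int, max_docs: int = MAX_DOCS) -> float:
    """Inverse document frequency in the form that fits the listed values."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(boost: float, doc_freq: int, freq: float, field_norm: float) -> float:
    """Score of one matching term: queryWeight * fieldWeight."""
    query_weight = boost * idf(doc_freq) * QUERY_NORM
    field_weight = math.sqrt(freq) * idf(doc_freq) * field_norm
    return query_weight * field_weight


def document_score(matches, query_terms: int = QUERY_TERMS) -> float:
    """Sum of term scores, scaled by the fraction of query terms that matched."""
    raw = sum(term_score(b, df, f, n) for _, b, df, f, n in matches)
    coord = len(matches) / query_terms
    return raw * coord


score = document_score(MATCHES)
print(f"{score:.4f}")
```

In this breakdown the term "plagiarism" accounts for most of the score in every entry, because it has both the highest boost and the highest idf (docFreq=18) among the query terms that matched.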