Document (#42085)

Author
K., Vani
Gupta, D.
Title
Unmasking text plagiarism using syntactic-semantic based natural language processing techniques : comparisons, analysis and challenges
Source
Information processing and management. 54(2018) no.3, pp. 408-432
Year
2018
Abstract
The proposed work aims to explore and compare the potency of syntactic-semantic based linguistic structures in plagiarism detection using natural language processing techniques. The current work explores linguistic features, viz., part of speech tags, chunks and semantic roles in detecting plagiarized fragments and utilizes a combined syntactic-semantic similarity metric, which extracts the semantic concepts from the WordNet lexical database. The linguistic information is utilized for effective pre-processing and for making semantically relevant comparisons. Another major contribution is the analysis of the proposed approach on plagiarism cases of various complexity levels. The impact of plagiarism types and complexity levels upon the features extracted is analyzed and discussed. Further, unlike the existing systems, which were evaluated on some limited data sets, the proposed approach is evaluated on a larger scale using the plagiarism corpus provided by the PAN competition from 2009 to 2014. The approach presented considerable improvement in comparison with the top-ranked systems of the respective years. The evaluation and analysis with various cases of plagiarism also reflected the supremacy of deeper linguistic features for identifying manually plagiarized data.
Content
Cf.: https://doi.org/10.1016/j.ipm.2018.01.008.
Theme
Computerlinguistik

Similar documents (author)

  1. Gupta, S.: Decimal Classification System : a bibliography for the period 1876-1994 (1997) 5.76
    5.7603507 = sum of:
      5.7603507 = weight(author_txt:gupta in 4935) [ClassicSimilarity], result of:
        5.7603507 = fieldWeight in 4935, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.216561 = idf(docFreq=11, maxDocs=44421)
          0.625 = fieldNorm(doc=4935)
    
  2. Gupta, S.: Cataloging Ethiopian personal names (1991) 5.76
    5.7603507 = sum of:
      5.7603507 = weight(author_txt:gupta in 652) [ClassicSimilarity], result of:
        5.7603507 = fieldWeight in 652, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.216561 = idf(docFreq=11, maxDocs=44421)
          0.625 = fieldNorm(doc=652)
    
  3. Gupta, S.: Communication clothing design (2009) 5.76
    5.7603507 = sum of:
      5.7603507 = weight(author_txt:gupta in 92) [ClassicSimilarity], result of:
        5.7603507 = fieldWeight in 92, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.216561 = idf(docFreq=11, maxDocs=44421)
          0.625 = fieldNorm(doc=92)
    
  4. Gupta, U.; Salisbury, L.: Is FirstSearch really attractive? (1992) 4.61
    4.6082807 = sum of:
      4.6082807 = weight(author_txt:gupta in 3862) [ClassicSimilarity], result of:
        4.6082807 = fieldWeight in 3862, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.216561 = idf(docFreq=11, maxDocs=44421)
          0.5 = fieldNorm(doc=3862)
    
  5. Berkley, B.J.; Gupta, A.: Improving service quality with information technology (1994) 4.61
    4.6082807 = sum of:
      4.6082807 = weight(author_txt:gupta in 8020) [ClassicSimilarity], result of:
        4.6082807 = fieldWeight in 8020, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.216561 = idf(docFreq=11, maxDocs=44421)
          0.5 = fieldNorm(doc=8020)
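
The author-match scores above can be reproduced from the values printed in each trace. The following is a minimal sketch, assuming the classic TF-IDF formulas tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)); these formulas are inferred from the numbers shown here, not taken from the catalog's configuration, and other Lucene versions may compute idf slightly differently. The function names are hypothetical.

```python
import math


def classic_tf(freq: float) -> float:
    """Term-frequency factor: square root of the raw frequency (assumed)."""
    return math.sqrt(freq)


def classic_idf(doc_freq: int, max_docs: int) -> float:
    """Inverse document frequency as it appears to be used in the traces."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def field_weight(freq: float, doc_freq: int, max_docs: int, field_norm: float) -> float:
    """fieldWeight = tf * idf * fieldNorm, as listed in each trace."""
    return classic_tf(freq) * classic_idf(doc_freq, max_docs) * field_norm


# Values copied from the traces: author_txt:gupta, docFreq=11, maxDocs=44421.
idf_gupta = classic_idf(11, 44421)
score_single_author = field_weight(1.0, 11, 44421, 0.625)  # entries 1-3
score_two_authors = field_weight(1.0, 11, 44421, 0.5)      # entries 4-5

print(round(idf_gupta, 4), round(score_single_author, 4), round(score_two_authors, 4))
```

The only factor that differs between the two groups is fieldNorm (0.625 against 0.5), which is lower for the records that list two authors; that alone accounts for the drop from 5.76 to 4.61.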
    

Similar documents (content)

  1. Vani, K.; Gupta, D.: Integrating syntax-semantic-based text analysis with structural and citation information for scientific plagiarism detection (2018) 0.57
    0.56680244 = sum of:
      0.56680244 = product of:
        1.5744512 = sum of:
          0.025589563 = weight(abstract_txt:various in 543) [ClassicSimilarity], result of:
            0.025589563 = score(doc=543,freq=2.0), product of:
              0.06598462 = queryWeight, product of:
                1.1405002 = boost
                4.387581 = idf(docFreq=1500, maxDocs=44421)
                0.013186278 = queryNorm
              0.387811 = fieldWeight in 543, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.387581 = idf(docFreq=1500, maxDocs=44421)
                0.0625 = fieldNorm(doc=543)
          0.14741446 = weight(abstract_txt:potency in 543) [ClassicSimilarity], result of:
            0.14741446 = score(doc=543,freq=2.0), product of:
              0.16830003 = queryWeight, product of:
                1.2879562 = boost
                9.909708 = idf(docFreq=5, maxDocs=44421)
                0.013186278 = queryNorm
              0.8759027 = fieldWeight in 543, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.909708 = idf(docFreq=5, maxDocs=44421)
                0.0625 = fieldNorm(doc=543)
          0.0132743465 = weight(abstract_txt:using in 543) [ClassicSimilarity], result of:
            0.0132743465 = score(doc=543,freq=1.0), product of:
              0.061439827 = queryWeight, product of:
                1.3478595 = boost
                3.4568708 = idf(docFreq=3806, maxDocs=44421)
                0.013186278 = queryNorm
              0.21605442 = fieldWeight in 543, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4568708 = idf(docFreq=3806, maxDocs=44421)
                0.0625 = fieldNorm(doc=543)
          0.03114065 = weight(abstract_txt:analysis in 543) [ClassicSimilarity], result of:
            0.03114065 = score(doc=543,freq=4.0), product of:
              0.068334445 = queryWeight, product of:
                1.4214758 = boost
                3.6456752 = idf(docFreq=3151, maxDocs=44421)
                0.013186278 = queryNorm
              0.4557094 = fieldWeight in 543, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.6456752 = idf(docFreq=3151, maxDocs=44421)
                0.0625 = fieldNorm(doc=543)
          0.029143225 = weight(abstract_txt:approach in 543) [ClassicSimilarity], result of:
            0.029143225 = score(doc=543,freq=3.0), product of:
              0.07196023 = queryWeight, product of:
                1.4586997 = boost
                3.741144 = idf(docFreq=2864, maxDocs=44421)
                0.013186278 = queryNorm
              0.4049907 = fieldWeight in 543, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.741144 = idf(docFreq=2864, maxDocs=44421)
                0.0625 = fieldNorm(doc=543)
          0.040040374 = weight(abstract_txt:evaluated in 543) [ClassicSimilarity], result of:
            0.040040374 = score(doc=543,freq=1.0), product of:
              0.11204941 = queryWeight, product of:
                1.4862052 = boost
                5.717531 = idf(docFreq=396, maxDocs=44421)
                0.013186278 = queryNorm
              0.3573457 = fieldWeight in 543, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.717531 = idf(docFreq=396, maxDocs=44421)
                0.0625 = fieldNorm(doc=543)
          0.062885284 = weight(abstract_txt:proposed in 543) [ClassicSimilarity], result of:
            0.062885284 = score(doc=543,freq=4.0), product of:
              0.10917434 = queryWeight, product of:
                1.7967181 = boost
                4.608063 = idf(docFreq=1203, maxDocs=44421)
                0.013186278 = queryNorm
              0.5760079 = fieldWeight in 543, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.608063 = idf(docFreq=1203, maxDocs=44421)
                0.0625 = fieldNorm(doc=543)
          0.08310284 = weight(abstract_txt:semantic in 543) [ClassicSimilarity], result of:
            0.08310284 = score(doc=543,freq=3.0), product of:
              0.17156458 = queryWeight, product of:
                2.9077551 = boost
                4.4745317 = idf(docFreq=1375, maxDocs=44421)
                0.013186278 = queryNorm
              0.48438224 = fieldWeight in 543, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.4745317 = idf(docFreq=1375, maxDocs=44421)
                0.0625 = fieldNorm(doc=543)
          1.1418605 = weight(abstract_txt:plagiarism in 543) [ClassicSimilarity], result of:
            1.1418605 = score(doc=543,freq=7.0), product of:
              0.7885464 = queryWeight, product of:
                6.8288608 = boost
                8.757029 = idf(docFreq=18, maxDocs=44421)
                0.013186278 = queryNorm
              1.4480574 = fieldWeight in 543, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                8.757029 = idf(docFreq=18, maxDocs=44421)
                0.0625 = fieldNorm(doc=543)
        0.36 = coord(9/25)
    
  2. Gipp, B.; Meuschke, N.; Breitinger, C.: Citation-based plagiarism detection : practicability on a large-scale scientific corpus (2014) 0.48
    0.47797385 = sum of:
      0.47797385 = product of:
        1.4936683 = sum of:
          0.048788983 = weight(abstract_txt:detecting in 4332) [ClassicSimilarity], result of:
            0.048788983 = score(doc=4332,freq=1.0), product of:
              0.10145699 = queryWeight, product of:
                7.694134 = idf(docFreq=54, maxDocs=44421)
                0.013186278 = queryNorm
              0.4808834 = fieldWeight in 4332, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.694134 = idf(docFreq=54, maxDocs=44421)
                0.0625 = fieldNorm(doc=4332)
          0.015544944 = weight(abstract_txt:language in 4332) [ClassicSimilarity], result of:
            0.015544944 = score(doc=4332,freq=1.0), product of:
              0.059630748 = queryWeight, product of:
                1.0841993 = boost
                4.1709876 = idf(docFreq=1863, maxDocs=44421)
                0.013186278 = queryNorm
              0.26068673 = fieldWeight in 4332, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1709876 = idf(docFreq=1863, maxDocs=44421)
                0.0625 = fieldNorm(doc=4332)
          0.018094555 = weight(abstract_txt:various in 4332) [ClassicSimilarity], result of:
            0.018094555 = score(doc=4332,freq=1.0), product of:
              0.06598462 = queryWeight, product of:
                1.1405002 = boost
                4.387581 = idf(docFreq=1500, maxDocs=44421)
                0.013186278 = queryNorm
              0.2742238 = fieldWeight in 4332, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.387581 = idf(docFreq=1500, maxDocs=44421)
                0.0625 = fieldNorm(doc=4332)
          0.0132743465 = weight(abstract_txt:using in 4332) [ClassicSimilarity], result of:
            0.0132743465 = score(doc=4332,freq=1.0), product of:
              0.061439827 = queryWeight, product of:
                1.3478595 = boost
                3.4568708 = idf(docFreq=3806, maxDocs=44421)
                0.013186278 = queryNorm
              0.21605442 = fieldWeight in 4332, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4568708 = idf(docFreq=3806, maxDocs=44421)
                0.0625 = fieldNorm(doc=4332)
          0.023795344 = weight(abstract_txt:approach in 4332) [ClassicSimilarity], result of:
            0.023795344 = score(doc=4332,freq=2.0), product of:
              0.07196023 = queryWeight, product of:
                1.4586997 = boost
                3.741144 = idf(docFreq=2864, maxDocs=44421)
                0.013186278 = queryNorm
              0.33067352 = fieldWeight in 4332, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.741144 = idf(docFreq=2864, maxDocs=44421)
                0.0625 = fieldNorm(doc=4332)
          0.031442642 = weight(abstract_txt:proposed in 4332) [ClassicSimilarity], result of:
            0.031442642 = score(doc=4332,freq=1.0), product of:
              0.10917434 = queryWeight, product of:
                1.7967181 = boost
                4.608063 = idf(docFreq=1203, maxDocs=44421)
                0.013186278 = queryNorm
              0.28800395 = fieldWeight in 4332, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.608063 = idf(docFreq=1203, maxDocs=44421)
                0.0625 = fieldNorm(doc=4332)
          0.047979448 = weight(abstract_txt:semantic in 4332) [ClassicSimilarity], result of:
            0.047979448 = score(doc=4332,freq=1.0), product of:
              0.17156458 = queryWeight, product of:
                2.9077551 = boost
                4.4745317 = idf(docFreq=1375, maxDocs=44421)
                0.013186278 = queryNorm
              0.27965823 = fieldWeight in 4332, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4745317 = idf(docFreq=1375, maxDocs=44421)
                0.0625 = fieldNorm(doc=4332)
          1.2947481 = weight(abstract_txt:plagiarism in 4332) [ClassicSimilarity], result of:
            1.2947481 = score(doc=4332,freq=9.0), product of:
              0.7885464 = queryWeight, product of:
                6.8288608 = boost
                8.757029 = idf(docFreq=18, maxDocs=44421)
                0.013186278 = queryNorm
              1.6419429 = fieldWeight in 4332, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                8.757029 = idf(docFreq=18, maxDocs=44421)
                0.0625 = fieldNorm(doc=4332)
        0.32 = coord(8/25)
    
  3. Alzahrani, S.; Palade, V.; Salim, N.; Abraham, A.: Using structural information and citation evidence to detect significant plagiarism cases in scientific publications (2012) 0.40
    0.3962696 = sum of:
      0.3962696 = product of:
        1.4152485 = sum of:
          0.05014059 = weight(abstract_txt:fragments in 982) [ClassicSimilarity], result of:
            0.05014059 = score(doc=982,freq=1.0), product of:
              0.1129419 = queryWeight, product of:
                1.0550828 = boost
                8.117949 = idf(docFreq=35, maxDocs=44421)
                0.013186278 = queryNorm
              0.4439503 = fieldWeight in 982, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.117949 = idf(docFreq=35, maxDocs=44421)
                0.0546875 = fieldNorm(doc=982)
          0.02597205 = weight(abstract_txt:using in 982) [ClassicSimilarity], result of:
            0.02597205 = score(doc=982,freq=5.0), product of:
              0.061439827 = queryWeight, product of:
                1.3478595 = boost
                3.4568708 = idf(docFreq=3806, maxDocs=44421)
                0.013186278 = queryNorm
              0.42272335 = fieldWeight in 982, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.4568708 = idf(docFreq=3806, maxDocs=44421)
                0.0546875 = fieldNorm(doc=982)
          0.014722618 = weight(abstract_txt:approach in 982) [ClassicSimilarity], result of:
            0.014722618 = score(doc=982,freq=1.0), product of:
              0.07196023 = queryWeight, product of:
                1.4586997 = boost
                3.741144 = idf(docFreq=2864, maxDocs=44421)
                0.013186278 = queryNorm
              0.20459381 = fieldWeight in 982, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.741144 = idf(docFreq=2864, maxDocs=44421)
                0.0546875 = fieldNorm(doc=982)
          0.06767928 = weight(abstract_txt:cases in 982) [ClassicSimilarity], result of:
            0.06767928 = score(doc=982,freq=4.0), product of:
              0.10948533 = queryWeight, product of:
                1.4691021 = boost
                5.6517344 = idf(docFreq=423, maxDocs=44421)
                0.013186278 = queryNorm
              0.61815846 = fieldWeight in 982, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.6517344 = idf(docFreq=423, maxDocs=44421)
                0.0546875 = fieldNorm(doc=982)
          0.03503533 = weight(abstract_txt:evaluated in 982) [ClassicSimilarity], result of:
            0.03503533 = score(doc=982,freq=1.0), product of:
              0.11204941 = queryWeight, product of:
                1.4862052 = boost
                5.717531 = idf(docFreq=396, maxDocs=44421)
                0.013186278 = queryNorm
              0.3126775 = fieldWeight in 982, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.717531 = idf(docFreq=396, maxDocs=44421)
                0.0546875 = fieldNorm(doc=982)
          0.027512312 = weight(abstract_txt:proposed in 982) [ClassicSimilarity], result of:
            0.027512312 = score(doc=982,freq=1.0), product of:
              0.10917434 = queryWeight, product of:
                1.7967181 = boost
                4.608063 = idf(docFreq=1203, maxDocs=44421)
                0.013186278 = queryNorm
              0.25200346 = fieldWeight in 982, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.608063 = idf(docFreq=1203, maxDocs=44421)
                0.0546875 = fieldNorm(doc=982)
          1.1941863 = weight(abstract_txt:plagiarism in 982) [ClassicSimilarity], result of:
            1.1941863 = score(doc=982,freq=10.0), product of:
              0.7885464 = queryWeight, product of:
                6.8288608 = boost
                8.757029 = idf(docFreq=18, maxDocs=44421)
                0.013186278 = queryNorm
              1.5144148 = fieldWeight in 982, product of:
                3.1622777 = tf(freq=10.0), with freq of:
                  10.0 = termFreq=10.0
                8.757029 = idf(docFreq=18, maxDocs=44421)
                0.0546875 = fieldNorm(doc=982)
        0.28 = coord(7/25)
    
  4. Stamatatos, E.: Plagiarism detection using stopword n-grams (2011) 0.25
    0.25458634 = sum of:
      0.25458634 = product of:
        1.0607765 = sum of:
          0.060986232 = weight(abstract_txt:detecting in 955) [ClassicSimilarity], result of:
            0.060986232 = score(doc=955,freq=1.0), product of:
              0.10145699 = queryWeight, product of:
                7.694134 = idf(docFreq=54, maxDocs=44421)
                0.013186278 = queryNorm
              0.60110426 = fieldWeight in 955, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.694134 = idf(docFreq=54, maxDocs=44421)
                0.078125 = fieldNorm(doc=955)
          0.021032311 = weight(abstract_txt:approach in 955) [ClassicSimilarity], result of:
            0.021032311 = score(doc=955,freq=1.0), product of:
              0.07196023 = queryWeight, product of:
                1.4586997 = boost
                3.741144 = idf(docFreq=2864, maxDocs=44421)
                0.013186278 = queryNorm
              0.29227686 = fieldWeight in 955, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.741144 = idf(docFreq=2864, maxDocs=44421)
                0.078125 = fieldNorm(doc=955)
          0.04834234 = weight(abstract_txt:cases in 955) [ClassicSimilarity], result of:
            0.04834234 = score(doc=955,freq=1.0), product of:
              0.10948533 = queryWeight, product of:
                1.4691021 = boost
                5.6517344 = idf(docFreq=423, maxDocs=44421)
                0.013186278 = queryNorm
              0.44154173 = fieldWeight in 955, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6517344 = idf(docFreq=423, maxDocs=44421)
                0.078125 = fieldNorm(doc=955)
          0.05558326 = weight(abstract_txt:proposed in 955) [ClassicSimilarity], result of:
            0.05558326 = score(doc=955,freq=2.0), product of:
              0.10917434 = queryWeight, product of:
                1.7967181 = boost
                4.608063 = idf(docFreq=1203, maxDocs=44421)
                0.013186278 = queryNorm
              0.50912386 = fieldWeight in 955, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.608063 = idf(docFreq=1203, maxDocs=44421)
                0.078125 = fieldNorm(doc=955)
          0.111894704 = weight(abstract_txt:syntactic in 955) [ClassicSimilarity], result of:
            0.111894704 = score(doc=955,freq=1.0), product of:
              0.21930115 = queryWeight, product of:
                2.546479 = boost
                6.5309834 = idf(docFreq=175, maxDocs=44421)
                0.013186278 = queryNorm
              0.5102331 = fieldWeight in 955, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5309834 = idf(docFreq=175, maxDocs=44421)
                0.078125 = fieldNorm(doc=955)
          0.76293766 = weight(abstract_txt:plagiarism in 955) [ClassicSimilarity], result of:
            0.76293766 = score(doc=955,freq=2.0), product of:
              0.7885464 = queryWeight, product of:
                6.8288608 = boost
                8.757029 = idf(docFreq=18, maxDocs=44421)
                0.013186278 = queryNorm
              0.9675241 = fieldWeight in 955, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.757029 = idf(docFreq=18, maxDocs=44421)
                0.078125 = fieldNorm(doc=955)
        0.24 = coord(6/25)
    
  5. Agarwal, B.; Ramampiaro, H.; Langseth, H.; Ruocco, M.: A deep network model for paraphrase detection in short text messages (2018) 0.22
    0.2150814 = sum of:
      0.2150814 = product of:
        0.6721294 = sum of:
          0.048788983 = weight(abstract_txt:detecting in 43) [ClassicSimilarity], result of:
            0.048788983 = score(doc=43,freq=1.0), product of:
              0.10145699 = queryWeight, product of:
                7.694134 = idf(docFreq=54, maxDocs=44421)
                0.013186278 = queryNorm
              0.4808834 = fieldWeight in 43, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.694134 = idf(docFreq=54, maxDocs=44421)
                0.0625 = fieldNorm(doc=43)
          0.021983871 = weight(abstract_txt:language in 43) [ClassicSimilarity], result of:
            0.021983871 = score(doc=43,freq=2.0), product of:
              0.059630748 = queryWeight, product of:
                1.0841993 = boost
                4.1709876 = idf(docFreq=1863, maxDocs=44421)
                0.013186278 = queryNorm
              0.3686667 = fieldWeight in 43, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.1709876 = idf(docFreq=1863, maxDocs=44421)
                0.0625 = fieldNorm(doc=43)
          0.02790995 = weight(abstract_txt:natural in 43) [ClassicSimilarity], result of:
            0.02790995 = score(doc=43,freq=1.0), product of:
              0.08808802 = queryWeight, product of:
                1.3177482 = boost
                5.0694656 = idf(docFreq=758, maxDocs=44421)
                0.013186278 = queryNorm
              0.3168416 = fieldWeight in 43, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.0694656 = idf(docFreq=758, maxDocs=44421)
                0.0625 = fieldNorm(doc=43)
          0.01877276 = weight(abstract_txt:using in 43) [ClassicSimilarity], result of:
            0.01877276 = score(doc=43,freq=2.0), product of:
              0.061439827 = queryWeight, product of:
                1.3478595 = boost
                3.4568708 = idf(docFreq=3806, maxDocs=44421)
                0.013186278 = queryNorm
              0.3055471 = fieldWeight in 43, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4568708 = idf(docFreq=3806, maxDocs=44421)
                0.0625 = fieldNorm(doc=43)
          0.023795344 = weight(abstract_txt:approach in 43) [ClassicSimilarity], result of:
            0.023795344 = score(doc=43,freq=2.0), product of:
              0.07196023 = queryWeight, product of:
                1.4586997 = boost
                3.741144 = idf(docFreq=2864, maxDocs=44421)
                0.013186278 = queryNorm
              0.33067352 = fieldWeight in 43, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.741144 = idf(docFreq=2864, maxDocs=44421)
                0.0625 = fieldNorm(doc=43)
          0.031442642 = weight(abstract_txt:proposed in 43) [ClassicSimilarity], result of:
            0.031442642 = score(doc=43,freq=1.0), product of:
              0.10917434 = queryWeight, product of:
                1.7967181 = boost
                4.608063 = idf(docFreq=1203, maxDocs=44421)
                0.013186278 = queryNorm
              0.28800395 = fieldWeight in 43, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.608063 = idf(docFreq=1203, maxDocs=44421)
                0.0625 = fieldNorm(doc=43)
          0.06785318 = weight(abstract_txt:semantic in 43) [ClassicSimilarity], result of:
            0.06785318 = score(doc=43,freq=2.0), product of:
              0.17156458 = queryWeight, product of:
                2.9077551 = boost
                4.4745317 = idf(docFreq=1375, maxDocs=44421)
                0.013186278 = queryNorm
              0.39549646 = fieldWeight in 43, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.4745317 = idf(docFreq=1375, maxDocs=44421)
                0.0625 = fieldNorm(doc=43)
          0.4315827 = weight(abstract_txt:plagiarism in 43) [ClassicSimilarity], result of:
            0.4315827 = score(doc=43,freq=1.0), product of:
              0.7885464 = queryWeight, product of:
                6.8288608 = boost
                8.757029 = idf(docFreq=18, maxDocs=44421)
                0.013186278 = queryNorm
              0.5473143 = fieldWeight in 43, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.757029 = idf(docFreq=18, maxDocs=44421)
                0.0625 = fieldNorm(doc=43)
        0.32 = coord(8/25)
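
The content-similarity scores combine three ingredients visible in the traces: a per-term queryWeight (boost * idf * queryNorm), a per-term fieldWeight (tf * idf * fieldNorm), and a coord factor (matched terms / query terms). The sketch below recomputes the score of entry 4 (doc=955) from the values printed in its trace. It assumes the same tf and idf formulas that reproduce the numbers above, and a boost of 1.0 where the trace lists none; the helper names are hypothetical.

```python
import math

MAX_DOCS = 44421
QUERY_NORM = 0.013186278   # copied from the traces
QUERY_TERMS = 25           # denominator of every coord(...) line


def idf(doc_freq: int, max_docs: int = MAX_DOCS) -> float:
    """Inverse document frequency, assumed form: 1 + ln(maxDocs / (docFreq + 1))."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(freq: float, doc_freq: int, boost: float, field_norm: float) -> float:
    """Score of one matching term: queryWeight * fieldWeight."""
    term_idf = idf(doc_freq)
    query_weight = boost * term_idf * QUERY_NORM
    field_weight = math.sqrt(freq) * term_idf * field_norm
    return query_weight * field_weight


# (term, freq, docFreq, boost) copied from the trace for doc=955.
# "detecting" has no boost line in the trace, so 1.0 is assumed.
TERMS_955 = [
    ("detecting", 1.0, 54, 1.0),
    ("approach", 1.0, 2864, 1.4586997),
    ("cases", 1.0, 423, 1.4691021),
    ("proposed", 2.0, 1203, 1.7967181),
    ("syntactic", 1.0, 175, 2.546479),
    ("plagiarism", 2.0, 18, 6.8288608),
]
FIELD_NORM_955 = 0.078125

term_sum = sum(term_score(f, df, b, FIELD_NORM_955) for _, f, df, b in TERMS_955)
coord = len(TERMS_955) / QUERY_TERMS   # coord(6/25) = 0.24
score_955 = coord * term_sum

print(round(term_sum, 4), round(score_955, 4))
```

In every trace the term "plagiarism" contributes most of the sum, because it carries both the largest boost (6.8288608) and a high idf (docFreq=18); the coord factor then scales the sum down according to how many of the 25 query terms the record matches.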