Document (#37690)

Author
Wicaksana, I.W.S.
Wahyudi, B.
Title
Comparison Latent Semantic and WordNet approach for semantic similarity calculation
Source
http://arxiv.org/find/all/1/all:+EXACT+semantic_interoperability/0/1/0/all/0/1. [arXiv:1105.1406]
Year
2011
Abstract
Information exchange among many sources in Internet is more autonomous, dynamic and free. The situation drive difference view of concepts among sources. For example, word 'bank' has meaning as economic institution for economy domain, but for ecology domain it will be defined as slope of river or lake. In this paper, we will evaluate latent semantic and WordNet approach to calculate semantic similarity. The evaluation will be run for some concepts from different domain with reference by expert or human. Result of the evaluation can provide a contribution for mapping of concept, query rewriting, interoperability, etc.
Theme
Semantische Interoperabilität
Object
Latent semantic indexing
WordNet

Similar documents (content)

  1. Kiren, T.; Shoaib, M.: ¬A novel ontology matching approach using key concepts (2016) 0.18
    0.17533974 = sum of:
      0.17533974 = product of:
        0.6262134 = sum of:
          0.029436663 = weight(abstract_txt:approach in 3589) [ClassicSimilarity], result of:
            0.029436663 = score(doc=3589,freq=2.0), product of:
              0.08902032 = queryWeight, product of:
                1.2454424 = boost
                3.741144 = idf(docFreq=2864, maxDocs=44421)
                0.019105617 = queryNorm
              0.33067352 = fieldWeight in 3589, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.741144 = idf(docFreq=2864, maxDocs=44421)
                0.0625 = fieldNorm(doc=3589)
          0.035734467 = weight(abstract_txt:evaluation in 3589) [ClassicSimilarity], result of:
            0.035734467 = score(doc=3589,freq=1.0), product of:
              0.12763359 = queryWeight, product of:
                1.4912881 = boost
                4.479632 = idf(docFreq=1368, maxDocs=44421)
                0.019105617 = queryNorm
              0.279977 = fieldWeight in 3589, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.479632 = idf(docFreq=1368, maxDocs=44421)
                0.0625 = fieldNorm(doc=3589)
          0.0837287 = weight(abstract_txt:concepts in 3589) [ClassicSimilarity], result of:
            0.0837287 = score(doc=3589,freq=5.0), product of:
              0.13167389 = queryWeight, product of:
                1.5147079 = boost
                4.549982 = idf(docFreq=1275, maxDocs=44421)
                0.019105617 = queryNorm
              0.63587934 = fieldWeight in 3589, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.549982 = idf(docFreq=1275, maxDocs=44421)
                0.0625 = fieldNorm(doc=3589)
          0.15658279 = weight(abstract_txt:similarity in 3589) [ClassicSimilarity], result of:
            0.15658279 = score(doc=3589,freq=4.0), product of:
              0.21530268 = queryWeight, product of:
                1.9368848 = boost
                5.8181453 = idf(docFreq=358, maxDocs=44421)
                0.019105617 = queryNorm
              0.72726816 = fieldWeight in 3589, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.8181453 = idf(docFreq=358, maxDocs=44421)
                0.0625 = fieldNorm(doc=3589)
          0.06305526 = weight(abstract_txt:domain in 3589) [ClassicSimilarity], result of:
            0.06305526 = score(doc=3589,freq=1.0), product of:
              0.21334612 = queryWeight, product of:
                2.3613865 = boost
                4.7288613 = idf(docFreq=1066, maxDocs=44421)
                0.019105617 = queryNorm
              0.29555383 = fieldWeight in 3589, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7288613 = idf(docFreq=1066, maxDocs=44421)
                0.0625 = fieldNorm(doc=3589)
          0.18645039 = weight(abstract_txt:wordnet in 3589) [ClassicSimilarity], result of:
            0.18645039 = score(doc=3589,freq=1.0), product of:
              0.38395673 = queryWeight, product of:
                2.586546 = boost
                7.769642 = idf(docFreq=50, maxDocs=44421)
                0.019105617 = queryNorm
              0.48560262 = fieldWeight in 3589, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.769642 = idf(docFreq=50, maxDocs=44421)
                0.0625 = fieldNorm(doc=3589)
          0.07122509 = weight(abstract_txt:semantic in 3589) [ClassicSimilarity], result of:
            0.07122509 = score(doc=3589,freq=1.0), product of:
              0.2546862 = queryWeight, product of:
                2.9791803 = boost
                4.4745317 = idf(docFreq=1375, maxDocs=44421)
                0.019105617 = queryNorm
              0.27965823 = fieldWeight in 3589, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4745317 = idf(docFreq=1375, maxDocs=44421)
                0.0625 = fieldNorm(doc=3589)
        0.28 = coord(7/25)
    
  2. Kim, H.H.; Kim, Y.H.: Generic speech summarization of transcribed lecture videos : using tags and their semantic relations (2016) 0.15
    0.1478152 = sum of:
      0.1478152 = product of:
        0.61589664 = sum of:
          0.04477618 = weight(abstract_txt:difference in 3640) [ClassicSimilarity], result of:
            0.04477618 = score(doc=3640,freq=1.0), product of:
              0.11774111 = queryWeight, product of:
                1.0128103 = boost
                6.0846963 = idf(docFreq=274, maxDocs=44421)
                0.019105617 = queryNorm
              0.38029352 = fieldWeight in 3640, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.0846963 = idf(docFreq=274, maxDocs=44421)
                0.0625 = fieldNorm(doc=3640)
          0.050536167 = weight(abstract_txt:evaluation in 3640) [ClassicSimilarity], result of:
            0.050536167 = score(doc=3640,freq=2.0), product of:
              0.12763359 = queryWeight, product of:
                1.4912881 = boost
                4.479632 = idf(docFreq=1368, maxDocs=44421)
                0.019105617 = queryNorm
              0.39594725 = fieldWeight in 3640, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.479632 = idf(docFreq=1368, maxDocs=44421)
                0.0625 = fieldNorm(doc=3640)
          0.07488923 = weight(abstract_txt:concepts in 3640) [ClassicSimilarity], result of:
            0.07488923 = score(doc=3640,freq=4.0), product of:
              0.13167389 = queryWeight, product of:
                1.5147079 = boost
                4.549982 = idf(docFreq=1275, maxDocs=44421)
                0.019105617 = queryNorm
              0.56874776 = fieldWeight in 3640, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.549982 = idf(docFreq=1275, maxDocs=44421)
                0.0625 = fieldNorm(doc=3640)
          0.13587919 = weight(abstract_txt:latent in 3640) [ClassicSimilarity], result of:
            0.13587919 = score(doc=3640,freq=1.0), product of:
              0.31093913 = queryWeight, product of:
                2.327645 = boost
                6.9919376 = idf(docFreq=110, maxDocs=44421)
                0.019105617 = queryNorm
              0.4369961 = fieldWeight in 3640, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9919376 = idf(docFreq=110, maxDocs=44421)
                0.0625 = fieldNorm(doc=3640)
          0.18645039 = weight(abstract_txt:wordnet in 3640) [ClassicSimilarity], result of:
            0.18645039 = score(doc=3640,freq=1.0), product of:
              0.38395673 = queryWeight, product of:
                2.586546 = boost
                7.769642 = idf(docFreq=50, maxDocs=44421)
                0.019105617 = queryNorm
              0.48560262 = fieldWeight in 3640, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.769642 = idf(docFreq=50, maxDocs=44421)
                0.0625 = fieldNorm(doc=3640)
          0.12336548 = weight(abstract_txt:semantic in 3640) [ClassicSimilarity], result of:
            0.12336548 = score(doc=3640,freq=3.0), product of:
              0.2546862 = queryWeight, product of:
                2.9791803 = boost
                4.4745317 = idf(docFreq=1375, maxDocs=44421)
                0.019105617 = queryNorm
              0.48438224 = fieldWeight in 3640, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.4745317 = idf(docFreq=1375, maxDocs=44421)
                0.0625 = fieldNorm(doc=3640)
        0.24 = coord(6/25)
    
  3. Burke, R.D.: Question answering from frequently asked question files : experiences with the FAQ Finder System (1997) 0.15
    0.14770651 = sum of:
      0.14770651 = product of:
        0.73853254 = sum of:
          0.03642601 = weight(abstract_txt:approach in 2191) [ClassicSimilarity], result of:
            0.03642601 = score(doc=2191,freq=1.0), product of:
              0.08902032 = queryWeight, product of:
                1.2454424 = boost
                3.741144 = idf(docFreq=2864, maxDocs=44421)
                0.019105617 = queryNorm
              0.40918761 = fieldWeight in 2191, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.741144 = idf(docFreq=2864, maxDocs=44421)
                0.109375 = fieldNorm(doc=2191)
          0.06253532 = weight(abstract_txt:evaluation in 2191) [ClassicSimilarity], result of:
            0.06253532 = score(doc=2191,freq=1.0), product of:
              0.12763359 = queryWeight, product of:
                1.4912881 = boost
                4.479632 = idf(docFreq=1368, maxDocs=44421)
                0.019105617 = queryNorm
              0.48995975 = fieldWeight in 2191, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.479632 = idf(docFreq=1368, maxDocs=44421)
                0.109375 = fieldNorm(doc=2191)
          0.13700993 = weight(abstract_txt:similarity in 2191) [ClassicSimilarity], result of:
            0.13700993 = score(doc=2191,freq=1.0), product of:
              0.21530268 = queryWeight, product of:
                1.9368848 = boost
                5.8181453 = idf(docFreq=358, maxDocs=44421)
                0.019105617 = queryNorm
              0.63635963 = fieldWeight in 2191, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8181453 = idf(docFreq=358, maxDocs=44421)
                0.109375 = fieldNorm(doc=2191)
          0.3262882 = weight(abstract_txt:wordnet in 2191) [ClassicSimilarity], result of:
            0.3262882 = score(doc=2191,freq=1.0), product of:
              0.38395673 = queryWeight, product of:
                2.586546 = boost
                7.769642 = idf(docFreq=50, maxDocs=44421)
                0.019105617 = queryNorm
              0.8498046 = fieldWeight in 2191, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.769642 = idf(docFreq=50, maxDocs=44421)
                0.109375 = fieldNorm(doc=2191)
          0.17627312 = weight(abstract_txt:semantic in 2191) [ClassicSimilarity], result of:
            0.17627312 = score(doc=2191,freq=2.0), product of:
              0.2546862 = queryWeight, product of:
                2.9791803 = boost
                4.4745317 = idf(docFreq=1375, maxDocs=44421)
                0.019105617 = queryNorm
              0.6921188 = fieldWeight in 2191, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.4745317 = idf(docFreq=1375, maxDocs=44421)
                0.109375 = fieldNorm(doc=2191)
        0.2 = coord(5/25)
    
  4. Green, R.: WordNet (2009) 0.13
    0.12945722 = sum of:
      0.12945722 = product of:
        0.8091076 = sum of:
          0.05616692 = weight(abstract_txt:concepts in 696) [ClassicSimilarity], result of:
            0.05616692 = score(doc=696,freq=1.0), product of:
              0.13167389 = queryWeight, product of:
                1.5147079 = boost
                4.549982 = idf(docFreq=1275, maxDocs=44421)
                0.019105617 = queryNorm
              0.42656082 = fieldWeight in 696, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.549982 = idf(docFreq=1275, maxDocs=44421)
                0.09375 = fieldNorm(doc=696)
          0.11743708 = weight(abstract_txt:similarity in 696) [ClassicSimilarity], result of:
            0.11743708 = score(doc=696,freq=1.0), product of:
              0.21530268 = queryWeight, product of:
                1.9368848 = boost
                5.8181453 = idf(docFreq=358, maxDocs=44421)
                0.019105617 = queryNorm
              0.5454511 = fieldWeight in 696, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8181453 = idf(docFreq=358, maxDocs=44421)
                0.09375 = fieldNorm(doc=696)
          0.48441237 = weight(abstract_txt:wordnet in 696) [ClassicSimilarity], result of:
            0.48441237 = score(doc=696,freq=3.0), product of:
              0.38395673 = queryWeight, product of:
                2.586546 = boost
                7.769642 = idf(docFreq=50, maxDocs=44421)
                0.019105617 = queryNorm
              1.2616327 = fieldWeight in 696, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.769642 = idf(docFreq=50, maxDocs=44421)
                0.09375 = fieldNorm(doc=696)
          0.15109123 = weight(abstract_txt:semantic in 696) [ClassicSimilarity], result of:
            0.15109123 = score(doc=696,freq=2.0), product of:
              0.2546862 = queryWeight, product of:
                2.9791803 = boost
                4.4745317 = idf(docFreq=1375, maxDocs=44421)
                0.019105617 = queryNorm
              0.5932447 = fieldWeight in 696, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.4745317 = idf(docFreq=1375, maxDocs=44421)
                0.09375 = fieldNorm(doc=696)
        0.16 = coord(4/25)
    
  5. K., Vani; Gupta, D.: Unmasking text plagiarism using syntactic-semantic based natural language processing techniques : comparisons, analysis and challenges (2018) 0.12
    0.12394163 = sum of:
      0.12394163 = product of:
        0.51642346 = sum of:
          0.036052402 = weight(abstract_txt:approach in 84) [ClassicSimilarity], result of:
            0.036052402 = score(doc=84,freq=3.0), product of:
              0.08902032 = queryWeight, product of:
                1.2454424 = boost
                3.741144 = idf(docFreq=2864, maxDocs=44421)
                0.019105617 = queryNorm
              0.4049907 = fieldWeight in 84, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.741144 = idf(docFreq=2864, maxDocs=44421)
                0.0625 = fieldNorm(doc=84)
          0.035734467 = weight(abstract_txt:evaluation in 84) [ClassicSimilarity], result of:
            0.035734467 = score(doc=84,freq=1.0), product of:
              0.12763359 = queryWeight, product of:
                1.4912881 = boost
                4.479632 = idf(docFreq=1368, maxDocs=44421)
                0.019105617 = queryNorm
              0.279977 = fieldWeight in 84, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.479632 = idf(docFreq=1368, maxDocs=44421)
                0.0625 = fieldNorm(doc=84)
          0.037444614 = weight(abstract_txt:concepts in 84) [ClassicSimilarity], result of:
            0.037444614 = score(doc=84,freq=1.0), product of:
              0.13167389 = queryWeight, product of:
                1.5147079 = boost
                4.549982 = idf(docFreq=1275, maxDocs=44421)
                0.019105617 = queryNorm
              0.28437388 = fieldWeight in 84, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.549982 = idf(docFreq=1275, maxDocs=44421)
                0.0625 = fieldNorm(doc=84)
          0.078291394 = weight(abstract_txt:similarity in 84) [ClassicSimilarity], result of:
            0.078291394 = score(doc=84,freq=1.0), product of:
              0.21530268 = queryWeight, product of:
                1.9368848 = boost
                5.8181453 = idf(docFreq=358, maxDocs=44421)
                0.019105617 = queryNorm
              0.36363408 = fieldWeight in 84, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8181453 = idf(docFreq=358, maxDocs=44421)
                0.0625 = fieldNorm(doc=84)
          0.18645039 = weight(abstract_txt:wordnet in 84) [ClassicSimilarity], result of:
            0.18645039 = score(doc=84,freq=1.0), product of:
              0.38395673 = queryWeight, product of:
                2.586546 = boost
                7.769642 = idf(docFreq=50, maxDocs=44421)
                0.019105617 = queryNorm
              0.48560262 = fieldWeight in 84, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.769642 = idf(docFreq=50, maxDocs=44421)
                0.0625 = fieldNorm(doc=84)
          0.14245018 = weight(abstract_txt:semantic in 84) [ClassicSimilarity], result of:
            0.14245018 = score(doc=84,freq=4.0), product of:
              0.2546862 = queryWeight, product of:
                2.9791803 = boost
                4.4745317 = idf(docFreq=1375, maxDocs=44421)
                0.019105617 = queryNorm
              0.55931646 = fieldWeight in 84, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.4745317 = idf(docFreq=1375, maxDocs=44421)
                0.0625 = fieldNorm(doc=84)
        0.24 = coord(6/25)