Document (#37379)

Author
Makris, C.
Plegas, Y.
Stamou, S.
Title
Web query disambiguation using PageRank
Source
Journal of the American Society for Information Science and Technology. 63(2012) no.8, S.1581-1592
Year
2012
Abstract
In this article, we propose new word sense disambiguation strategies for resolving the senses of polysemous query terms issued to Web search engines, and we explore the application of those strategies when used in a query expansion framework. The novelty of our approach lies in the exploitation of the Web page PageRank values as indicators of the significance the different senses of a term carry when employed in search queries. We also aim at scalable query sense resolution techniques that can be applied without loss of efficiency to large data sets such as those on the Web. Our experimental findings validate that the proposed techniques perform more accurately than do the traditional disambiguation strategies and improve the quality of the search results, when involved in query expansion.
Theme
Suchmaschinen
Aid
PageRank

Similar documents (content)

  1. Krovetz, R.; Croft, W.B.: Lexical ambiguity and information retrieval (1992) 0.22
    0.21965908 = sum of:
      0.21965908 = product of:
        1.0982953 = sum of:
          0.1441124 = weight(abstract_txt:resolving in 4027) [ClassicSimilarity], result of:
            0.1441124 = score(doc=4027,freq=1.0), product of:
              0.18367197 = queryWeight, product of:
                1.3662946 = boost
                8.369263 = idf(docFreq=27, maxDocs=44421)
                0.016062433 = queryNorm
              0.7846184 = fieldWeight in 4027, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.369263 = idf(docFreq=27, maxDocs=44421)
                0.09375 = fieldNorm(doc=4027)
          0.09711221 = weight(abstract_txt:sense in 4027) [ClassicSimilarity], result of:
            0.09711221 = score(doc=4027,freq=1.0), product of:
              0.17786938 = queryWeight, product of:
                1.9014658 = boost
                5.823732 = idf(docFreq=356, maxDocs=44421)
                0.016062433 = queryNorm
              0.54597485 = fieldWeight in 4027, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.823732 = idf(docFreq=356, maxDocs=44421)
                0.09375 = fieldNorm(doc=4027)
          0.31386545 = weight(abstract_txt:senses in 4027) [ClassicSimilarity], result of:
            0.31386545 = score(doc=4027,freq=1.0), product of:
              0.38881916 = queryWeight, product of:
                2.8113294 = boost
                8.610425 = idf(docFreq=21, maxDocs=44421)
                0.016062433 = queryNorm
              0.8072274 = fieldWeight in 4027, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.610425 = idf(docFreq=21, maxDocs=44421)
                0.09375 = fieldNorm(doc=4027)
          0.4110994 = weight(abstract_txt:disambiguation in 4027) [ClassicSimilarity], result of:
            0.4110994 = score(doc=4027,freq=2.0), product of:
              0.4228993 = queryWeight, product of:
                3.5908895 = boost
                7.33202 = idf(docFreq=78, maxDocs=44421)
                0.016062433 = queryNorm
              0.97209764 = fieldWeight in 4027, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.33202 = idf(docFreq=78, maxDocs=44421)
                0.09375 = fieldNorm(doc=4027)
          0.13210586 = weight(abstract_txt:query in 4027) [ClassicSimilarity], result of:
            0.13210586 = score(doc=4027,freq=1.0), product of:
              0.2963785 = queryWeight, product of:
                3.8808894 = boost
                4.754492 = idf(docFreq=1039, maxDocs=44421)
                0.016062433 = queryNorm
              0.4457336 = fieldWeight in 4027, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.754492 = idf(docFreq=1039, maxDocs=44421)
                0.09375 = fieldNorm(doc=4027)
        0.2 = coord(5/25)
    
  2. Montejo-Ráez, A.; Martínez-Cámara, E.; Martín-Valdivia, M.T.; Ureña-López, L.A.: ¬A knowledge-based approach for polarity classification in Twitter (2014) 0.14
    0.13660182 = sum of:
      0.13660182 = product of:
        0.8537614 = sum of:
          0.15834941 = weight(abstract_txt:expansion in 2204) [ClassicSimilarity], result of:
            0.15834941 = score(doc=2204,freq=2.0), product of:
              0.19557783 = queryWeight, product of:
                1.9938741 = boost
                6.106756 = idf(docFreq=268, maxDocs=44421)
                0.016062433 = queryNorm
              0.8096491 = fieldWeight in 2204, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.106756 = idf(docFreq=268, maxDocs=44421)
                0.09375 = fieldNorm(doc=2204)
          0.3060878 = weight(abstract_txt:pagerank in 2204) [ClassicSimilarity], result of:
            0.3060878 = score(doc=2204,freq=2.0), product of:
              0.30348647 = queryWeight, product of:
                2.4837482 = boost
                7.607123 = idf(docFreq=59, maxDocs=44421)
                0.016062433 = queryNorm
              1.0085715 = fieldWeight in 2204, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.607123 = idf(docFreq=59, maxDocs=44421)
                0.09375 = fieldNorm(doc=2204)
          0.098633 = weight(abstract_txt:strategies in 2204) [ClassicSimilarity], result of:
            0.098633 = score(doc=2204,freq=1.0), product of:
              0.20572981 = queryWeight, product of:
                2.504564 = boost
                5.113918 = idf(docFreq=725, maxDocs=44421)
                0.016062433 = queryNorm
              0.47942978 = fieldWeight in 2204, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.113918 = idf(docFreq=725, maxDocs=44421)
                0.09375 = fieldNorm(doc=2204)
          0.2906912 = weight(abstract_txt:disambiguation in 2204) [ClassicSimilarity], result of:
            0.2906912 = score(doc=2204,freq=1.0), product of:
              0.4228993 = queryWeight, product of:
                3.5908895 = boost
                7.33202 = idf(docFreq=78, maxDocs=44421)
                0.016062433 = queryNorm
              0.68737686 = fieldWeight in 2204, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.33202 = idf(docFreq=78, maxDocs=44421)
                0.09375 = fieldNorm(doc=2204)
        0.16 = coord(4/25)
    
  3. Bando, L.L.; Scholer, F.; Turpin, A.: Query-biased summary generation assisted by query expansion : temporality (2015) 0.12
    0.123750925 = sum of:
      0.123750925 = product of:
        0.6187546 = sum of:
          0.07464965 = weight(abstract_txt:novelty in 2820) [ClassicSimilarity], result of:
            0.07464965 = score(doc=2820,freq=1.0), product of:
              0.15523441 = queryWeight, product of:
                1.2560788 = boost
                7.694134 = idf(docFreq=54, maxDocs=44421)
                0.016062433 = queryNorm
              0.4808834 = fieldWeight in 2820, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.694134 = idf(docFreq=54, maxDocs=44421)
                0.0625 = fieldNorm(doc=2820)
          0.030512791 = weight(abstract_txt:techniques in 2820) [ClassicSimilarity], result of:
            0.030512791 = score(doc=2820,freq=1.0), product of:
              0.1077211 = queryWeight, product of:
                1.4797498 = boost
                4.5321174 = idf(docFreq=1298, maxDocs=44421)
                0.016062433 = queryNorm
              0.28325734 = fieldWeight in 2820, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5321174 = idf(docFreq=1298, maxDocs=44421)
                0.0625 = fieldNorm(doc=2820)
          0.023998749 = weight(abstract_txt:search in 2820) [ClassicSimilarity], result of:
            0.023998749 = score(doc=2820,freq=1.0), product of:
              0.10506763 = queryWeight, product of:
                1.7898557 = boost
                3.654598 = idf(docFreq=3123, maxDocs=44421)
                0.016062433 = queryNorm
              0.22841237 = fieldWeight in 2820, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.654598 = idf(docFreq=3123, maxDocs=44421)
                0.0625 = fieldNorm(doc=2820)
          0.19749641 = weight(abstract_txt:expansion in 2820) [ClassicSimilarity], result of:
            0.19749641 = score(doc=2820,freq=7.0), product of:
              0.19557783 = queryWeight, product of:
                1.9938741 = boost
                6.106756 = idf(docFreq=268, maxDocs=44421)
                0.016062433 = queryNorm
              1.0098099 = fieldWeight in 2820, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                6.106756 = idf(docFreq=268, maxDocs=44421)
                0.0625 = fieldNorm(doc=2820)
          0.29209703 = weight(abstract_txt:query in 2820) [ClassicSimilarity], result of:
            0.29209703 = score(doc=2820,freq=11.0), product of:
              0.2963785 = queryWeight, product of:
                3.8808894 = boost
                4.754492 = idf(docFreq=1039, maxDocs=44421)
                0.016062433 = queryNorm
              0.9855541 = fieldWeight in 2820, product of:
                3.3166249 = tf(freq=11.0), with freq of:
                  11.0 = termFreq=11.0
                4.754492 = idf(docFreq=1039, maxDocs=44421)
                0.0625 = fieldNorm(doc=2820)
        0.2 = coord(5/25)
    
  4. Kelley, D.: Relevance feedback : getting to know your user (2008) 0.12
    0.12020612 = sum of:
      0.12020612 = product of:
        0.50085884 = sum of:
          0.05932317 = weight(abstract_txt:accurately in 2924) [ClassicSimilarity], result of:
            0.05932317 = score(doc=2924,freq=1.0), product of:
              0.1331841 = queryWeight, product of:
                1.1634537 = boost
                7.1267567 = idf(docFreq=96, maxDocs=44421)
                0.016062433 = queryNorm
              0.4454223 = fieldWeight in 2924, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.1267567 = idf(docFreq=96, maxDocs=44421)
                0.0625 = fieldNorm(doc=2924)
          0.061025582 = weight(abstract_txt:techniques in 2924) [ClassicSimilarity], result of:
            0.061025582 = score(doc=2924,freq=4.0), product of:
              0.1077211 = queryWeight, product of:
                1.4797498 = boost
                4.5321174 = idf(docFreq=1298, maxDocs=44421)
                0.016062433 = queryNorm
              0.5665147 = fieldWeight in 2924, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.5321174 = idf(docFreq=1298, maxDocs=44421)
                0.0625 = fieldNorm(doc=2924)
          0.023998749 = weight(abstract_txt:search in 2924) [ClassicSimilarity], result of:
            0.023998749 = score(doc=2924,freq=1.0), product of:
              0.10506763 = queryWeight, product of:
                1.7898557 = boost
                3.654598 = idf(docFreq=3123, maxDocs=44421)
                0.016062433 = queryNorm
              0.22841237 = fieldWeight in 2924, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.654598 = idf(docFreq=3123, maxDocs=44421)
                0.0625 = fieldNorm(doc=2924)
          0.07464663 = weight(abstract_txt:expansion in 2924) [ClassicSimilarity], result of:
            0.07464663 = score(doc=2924,freq=1.0), product of:
              0.19557783 = queryWeight, product of:
                1.9938741 = boost
                6.106756 = idf(docFreq=268, maxDocs=44421)
                0.016062433 = queryNorm
              0.38167226 = fieldWeight in 2924, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.106756 = idf(docFreq=268, maxDocs=44421)
                0.0625 = fieldNorm(doc=2924)
          0.19379413 = weight(abstract_txt:disambiguation in 2924) [ClassicSimilarity], result of:
            0.19379413 = score(doc=2924,freq=1.0), product of:
              0.4228993 = queryWeight, product of:
                3.5908895 = boost
                7.33202 = idf(docFreq=78, maxDocs=44421)
                0.016062433 = queryNorm
              0.45825124 = fieldWeight in 2924, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.33202 = idf(docFreq=78, maxDocs=44421)
                0.0625 = fieldNorm(doc=2924)
          0.08807057 = weight(abstract_txt:query in 2924) [ClassicSimilarity], result of:
            0.08807057 = score(doc=2924,freq=1.0), product of:
              0.2963785 = queryWeight, product of:
                3.8808894 = boost
                4.754492 = idf(docFreq=1039, maxDocs=44421)
                0.016062433 = queryNorm
              0.29715574 = fieldWeight in 2924, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.754492 = idf(docFreq=1039, maxDocs=44421)
                0.0625 = fieldNorm(doc=2924)
        0.24 = coord(6/25)
    
  5. Fidel, R.; Efthimiadis, E.N.: Terminological knowledge structure for intermediary expert systems (1995) 0.12
    0.1187791 = sum of:
      0.1187791 = product of:
        0.5938955 = sum of:
          0.06606213 = weight(abstract_txt:techniques in 6695) [ClassicSimilarity], result of:
            0.06606213 = score(doc=6695,freq=3.0), product of:
              0.1077211 = queryWeight, product of:
                1.4797498 = boost
                4.5321174 = idf(docFreq=1298, maxDocs=44421)
                0.016062433 = queryNorm
              0.6132701 = fieldWeight in 6695, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.5321174 = idf(docFreq=1298, maxDocs=44421)
                0.078125 = fieldNorm(doc=6695)
          0.09330829 = weight(abstract_txt:expansion in 6695) [ClassicSimilarity], result of:
            0.09330829 = score(doc=6695,freq=1.0), product of:
              0.19557783 = queryWeight, product of:
                1.9938741 = boost
                6.106756 = idf(docFreq=268, maxDocs=44421)
                0.016062433 = queryNorm
              0.47709033 = fieldWeight in 6695, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.106756 = idf(docFreq=268, maxDocs=44421)
                0.078125 = fieldNorm(doc=6695)
          0.08219417 = weight(abstract_txt:strategies in 6695) [ClassicSimilarity], result of:
            0.08219417 = score(doc=6695,freq=1.0), product of:
              0.20572981 = queryWeight, product of:
                2.504564 = boost
                5.113918 = idf(docFreq=725, maxDocs=44421)
                0.016062433 = queryNorm
              0.39952484 = fieldWeight in 6695, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.113918 = idf(docFreq=725, maxDocs=44421)
                0.078125 = fieldNorm(doc=6695)
          0.24224266 = weight(abstract_txt:disambiguation in 6695) [ClassicSimilarity], result of:
            0.24224266 = score(doc=6695,freq=1.0), product of:
              0.4228993 = queryWeight, product of:
                3.5908895 = boost
                7.33202 = idf(docFreq=78, maxDocs=44421)
                0.016062433 = queryNorm
              0.57281405 = fieldWeight in 6695, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.33202 = idf(docFreq=78, maxDocs=44421)
                0.078125 = fieldNorm(doc=6695)
          0.110088214 = weight(abstract_txt:query in 6695) [ClassicSimilarity], result of:
            0.110088214 = score(doc=6695,freq=1.0), product of:
              0.2963785 = queryWeight, product of:
                3.8808894 = boost
                4.754492 = idf(docFreq=1039, maxDocs=44421)
                0.016062433 = queryNorm
              0.37144467 = fieldWeight in 6695, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.754492 = idf(docFreq=1039, maxDocs=44421)
                0.078125 = fieldNorm(doc=6695)
        0.2 = coord(5/25)