Document (#38281)

Author
Koumenides, C.L.
Shadbolt, N.R.
Title
Ranking methods for entity-oriented semantic web search
Source
Journal of the Association for Information Science and Technology. 65(2014) no.6, S.1091-1106
Year
2014
Series
Advances in information science
Abstract
This article provides a technical review of semantic search methods used to support text-based search over formal Semantic Web knowledge bases. Our focus is on ranking methods and auxiliary processes explored by existing semantic search systems, outlined within broad areas of classification. We present reflective examples from the literature in some detail, which should appeal to readers interested in a deeper perspective on the various methods and systems implemented in the outlined literature. The presentation covers graph exploration and propagation methods, adaptations of classic probabilistic retrieval models, and query-independent link analysis via flexible extensions to the PageRank algorithm. Future research directions are discussed, including development of more cohesive retrieval models to unlock further potentials and uses, data indexing schemes, integration with user interfaces, and building community consensus for more systematic evaluation and gradual development.
Content
Verfügbar unter: http://onlinelibrary.wiley.com/doi/10.1002/asi.23018/pdf.
Theme
Retrievalalgorithmen

Similar documents (content)

  1. Mayr, P.; Mutschke, P.; Petras, V.: Reducing semantic complexity in distributed digital libraries : Treatment of term vagueness and document re-ranking (2008) 0.14
    0.14092983 = sum of:
      0.14092983 = product of:
        0.5033208 = sum of:
          0.013753217 = weight(abstract_txt:more in 2909) [ClassicSimilarity], result of:
            0.013753217 = score(doc=2909,freq=1.0), product of:
              0.07404897 = queryWeight, product of:
                3.3962307 = idf(docFreq=4044, maxDocs=44421)
                0.021803282 = queryNorm
              0.18573137 = fieldWeight in 2909, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3962307 = idf(docFreq=4044, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2909)
          0.014751611 = weight(abstract_txt:retrieval in 2909) [ClassicSimilarity], result of:
            0.014751611 = score(doc=2909,freq=1.0), product of:
              0.07759061 = queryWeight, product of:
                1.0236348 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.021803282 = queryNorm
              0.1901211 = fieldWeight in 2909, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2909)
          0.03469125 = weight(abstract_txt:models in 2909) [ClassicSimilarity], result of:
            0.03469125 = score(doc=2909,freq=1.0), product of:
              0.13721329 = queryWeight, product of:
                1.3612521 = boost
                4.623126 = idf(docFreq=1185, maxDocs=44421)
                0.021803282 = queryNorm
              0.2528272 = fieldWeight in 2909, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.623126 = idf(docFreq=1185, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2909)
          0.13762201 = weight(abstract_txt:ranking in 2909) [ClassicSimilarity], result of:
            0.13762201 = score(doc=2909,freq=5.0), product of:
              0.20108736 = queryWeight, product of:
                1.6479076 = boost
                5.5966744 = idf(docFreq=447, maxDocs=44421)
                0.021803282 = queryNorm
              0.6843892 = fieldWeight in 2909, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.5966744 = idf(docFreq=447, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2909)
          0.068547495 = weight(abstract_txt:search in 2909) [ClassicSimilarity], result of:
            0.068547495 = score(doc=2909,freq=4.0), product of:
              0.17148808 = queryWeight, product of:
                2.1521494 = boost
                3.654598 = idf(docFreq=3123, maxDocs=44421)
                0.021803282 = queryNorm
              0.39972165 = fieldWeight in 2909, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.654598 = idf(docFreq=3123, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2909)
          0.1258101 = weight(abstract_txt:semantic in 2909) [ClassicSimilarity], result of:
            0.1258101 = score(doc=2909,freq=4.0), product of:
              0.25706908 = queryWeight, product of:
                2.6349986 = boost
                4.4745317 = idf(docFreq=1375, maxDocs=44421)
                0.021803282 = queryNorm
              0.4894019 = fieldWeight in 2909, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.4745317 = idf(docFreq=1375, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2909)
          0.108145095 = weight(abstract_txt:methods in 2909) [ClassicSimilarity], result of:
            0.108145095 = score(doc=2909,freq=3.0), product of:
              0.27554572 = queryWeight, product of:
                3.0500524 = boost
                4.1434727 = idf(docFreq=1915, maxDocs=44421)
                0.021803282 = queryNorm
              0.39247605 = fieldWeight in 2909, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.1434727 = idf(docFreq=1915, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2909)
        0.28 = coord(7/25)
    
  2. Hubrich, J.: Intersystem relations : Characteristics and functionalities (2011) 0.14
    0.13680583 = sum of:
      0.13680583 = product of:
        0.68402916 = sum of:
          0.033717968 = weight(abstract_txt:retrieval in 780) [ClassicSimilarity], result of:
            0.033717968 = score(doc=780,freq=1.0), product of:
              0.07759061 = queryWeight, product of:
                1.0236348 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.021803282 = queryNorm
              0.4345625 = fieldWeight in 780, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.125 = fieldNorm(doc=780)
          0.22591689 = weight(abstract_txt:outlined in 780) [ClassicSimilarity], result of:
            0.22591689 = score(doc=780,freq=1.0), product of:
              0.27576175 = queryWeight, product of:
                1.9297786 = boost
                6.553973 = idf(docFreq=171, maxDocs=44421)
                0.021803282 = queryNorm
              0.81924665 = fieldWeight in 780, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.553973 = idf(docFreq=171, maxDocs=44421)
                0.125 = fieldNorm(doc=780)
          0.078339994 = weight(abstract_txt:search in 780) [ClassicSimilarity], result of:
            0.078339994 = score(doc=780,freq=1.0), product of:
              0.17148808 = queryWeight, product of:
                2.1521494 = boost
                3.654598 = idf(docFreq=3123, maxDocs=44421)
                0.021803282 = queryNorm
              0.45682475 = fieldWeight in 780, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.654598 = idf(docFreq=3123, maxDocs=44421)
                0.125 = fieldNorm(doc=780)
          0.20333982 = weight(abstract_txt:semantic in 780) [ClassicSimilarity], result of:
            0.20333982 = score(doc=780,freq=2.0), product of:
              0.25706908 = queryWeight, product of:
                2.6349986 = boost
                4.4745317 = idf(docFreq=1375, maxDocs=44421)
                0.021803282 = queryNorm
              0.7909929 = fieldWeight in 780, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.4745317 = idf(docFreq=1375, maxDocs=44421)
                0.125 = fieldNorm(doc=780)
          0.14271452 = weight(abstract_txt:methods in 780) [ClassicSimilarity], result of:
            0.14271452 = score(doc=780,freq=1.0), product of:
              0.27554572 = queryWeight, product of:
                3.0500524 = boost
                4.1434727 = idf(docFreq=1915, maxDocs=44421)
                0.021803282 = queryNorm
              0.5179341 = fieldWeight in 780, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1434727 = idf(docFreq=1915, maxDocs=44421)
                0.125 = fieldNorm(doc=780)
        0.2 = coord(5/25)
    
  3. Biagetti, M.T.: Pertinence perspective and OPAC enhancement 0.13
    0.13235438 = sum of:
      0.13235438 = product of:
        0.5514766 = sum of:
          0.03422681 = weight(abstract_txt:development in 536) [ClassicSimilarity], result of:
            0.03422681 = score(doc=536,freq=1.0), product of:
              0.094937615 = queryWeight, product of:
                1.1322951 = boost
                3.8455355 = idf(docFreq=2580, maxDocs=44421)
                0.021803282 = queryNorm
              0.36051896 = fieldWeight in 536, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.8455355 = idf(docFreq=2580, maxDocs=44421)
                0.09375 = fieldNorm(doc=536)
          0.051374618 = weight(abstract_txt:literature in 536) [ClassicSimilarity], result of:
            0.051374618 = score(doc=536,freq=1.0), product of:
              0.12445904 = queryWeight, product of:
                1.2964438 = boost
                4.4030223 = idf(docFreq=1477, maxDocs=44421)
                0.021803282 = queryNorm
              0.41278332 = fieldWeight in 536, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4030223 = idf(docFreq=1477, maxDocs=44421)
                0.09375 = fieldNorm(doc=536)
          0.10550817 = weight(abstract_txt:ranking in 536) [ClassicSimilarity], result of:
            0.10550817 = score(doc=536,freq=1.0), product of:
              0.20108736 = queryWeight, product of:
                1.6479076 = boost
                5.5966744 = idf(docFreq=447, maxDocs=44421)
                0.021803282 = queryNorm
              0.52468824 = fieldWeight in 536, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5966744 = idf(docFreq=447, maxDocs=44421)
                0.09375 = fieldNorm(doc=536)
          0.16943766 = weight(abstract_txt:outlined in 536) [ClassicSimilarity], result of:
            0.16943766 = score(doc=536,freq=1.0), product of:
              0.27576175 = queryWeight, product of:
                1.9297786 = boost
                6.553973 = idf(docFreq=171, maxDocs=44421)
                0.021803282 = queryNorm
              0.61443496 = fieldWeight in 536, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.553973 = idf(docFreq=171, maxDocs=44421)
                0.09375 = fieldNorm(doc=536)
          0.083092116 = weight(abstract_txt:search in 536) [ClassicSimilarity], result of:
            0.083092116 = score(doc=536,freq=2.0), product of:
              0.17148808 = queryWeight, product of:
                2.1521494 = boost
                3.654598 = idf(docFreq=3123, maxDocs=44421)
                0.021803282 = queryNorm
              0.4845358 = fieldWeight in 536, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.654598 = idf(docFreq=3123, maxDocs=44421)
                0.09375 = fieldNorm(doc=536)
          0.10783723 = weight(abstract_txt:semantic in 536) [ClassicSimilarity], result of:
            0.10783723 = score(doc=536,freq=1.0), product of:
              0.25706908 = queryWeight, product of:
                2.6349986 = boost
                4.4745317 = idf(docFreq=1375, maxDocs=44421)
                0.021803282 = queryNorm
              0.41948736 = fieldWeight in 536, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4745317 = idf(docFreq=1375, maxDocs=44421)
                0.09375 = fieldNorm(doc=536)
        0.24 = coord(6/25)
    
  4. Ning, X.; Jin, H.; Wu, H.: RSS: a framework enabling ranked search on the semantic web (2008) 0.12
    0.123033546 = sum of:
      0.123033546 = product of:
        0.61516774 = sum of:
          0.08831531 = weight(abstract_txt:pagerank in 3069) [ClassicSimilarity], result of:
            0.08831531 = score(doc=3069,freq=1.0), product of:
              0.18575287 = queryWeight, product of:
                1.1199361 = boost
                7.607123 = idf(docFreq=59, maxDocs=44421)
                0.021803282 = queryNorm
              0.47544518 = fieldWeight in 3069, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.607123 = idf(docFreq=59, maxDocs=44421)
                0.0625 = fieldNorm(doc=3069)
          0.09947405 = weight(abstract_txt:ranking in 3069) [ClassicSimilarity], result of:
            0.09947405 = score(doc=3069,freq=2.0), product of:
              0.20108736 = queryWeight, product of:
                1.6479076 = boost
                5.5966744 = idf(docFreq=447, maxDocs=44421)
                0.021803282 = queryNorm
              0.4946808 = fieldWeight in 3069, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.5966744 = idf(docFreq=447, maxDocs=44421)
                0.0625 = fieldNorm(doc=3069)
          0.11078949 = weight(abstract_txt:search in 3069) [ClassicSimilarity], result of:
            0.11078949 = score(doc=3069,freq=8.0), product of:
              0.17148808 = queryWeight, product of:
                2.1521494 = boost
                3.654598 = idf(docFreq=3123, maxDocs=44421)
                0.021803282 = queryNorm
              0.6460478 = fieldWeight in 3069, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                3.654598 = idf(docFreq=3123, maxDocs=44421)
                0.0625 = fieldNorm(doc=3069)
          0.21567446 = weight(abstract_txt:semantic in 3069) [ClassicSimilarity], result of:
            0.21567446 = score(doc=3069,freq=9.0), product of:
              0.25706908 = queryWeight, product of:
                2.6349986 = boost
                4.4745317 = idf(docFreq=1375, maxDocs=44421)
                0.021803282 = queryNorm
              0.8389747 = fieldWeight in 3069, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                4.4745317 = idf(docFreq=1375, maxDocs=44421)
                0.0625 = fieldNorm(doc=3069)
          0.1009144 = weight(abstract_txt:methods in 3069) [ClassicSimilarity], result of:
            0.1009144 = score(doc=3069,freq=2.0), product of:
              0.27554572 = queryWeight, product of:
                3.0500524 = boost
                4.1434727 = idf(docFreq=1915, maxDocs=44421)
                0.021803282 = queryNorm
              0.3662347 = fieldWeight in 3069, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.1434727 = idf(docFreq=1915, maxDocs=44421)
                0.0625 = fieldNorm(doc=3069)
        0.2 = coord(5/25)
    
  5. Urbain, J.; Goharian, N.; Frieder, O.: Probabilistic passage models for semantic search of genomics literature (2008) 0.12
    0.118505664 = sum of:
      0.118505664 = product of:
        0.42323452 = sum of:
          0.050576955 = weight(abstract_txt:retrieval in 3380) [ClassicSimilarity], result of:
            0.050576955 = score(doc=3380,freq=9.0), product of:
              0.07759061 = queryWeight, product of:
                1.0236348 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.021803282 = queryNorm
              0.6518438 = fieldWeight in 3380, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.0625 = fieldNorm(doc=3380)
          0.034249745 = weight(abstract_txt:literature in 3380) [ClassicSimilarity], result of:
            0.034249745 = score(doc=3380,freq=1.0), product of:
              0.12445904 = queryWeight, product of:
                1.2964438 = boost
                4.4030223 = idf(docFreq=1477, maxDocs=44421)
                0.021803282 = queryNorm
              0.2751889 = fieldWeight in 3380, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4030223 = idf(docFreq=1477, maxDocs=44421)
                0.0625 = fieldNorm(doc=3380)
          0.039647147 = weight(abstract_txt:models in 3380) [ClassicSimilarity], result of:
            0.039647147 = score(doc=3380,freq=1.0), product of:
              0.13721329 = queryWeight, product of:
                1.3612521 = boost
                4.623126 = idf(docFreq=1185, maxDocs=44421)
                0.021803282 = queryNorm
              0.28894538 = fieldWeight in 3380, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.623126 = idf(docFreq=1185, maxDocs=44421)
                0.0625 = fieldNorm(doc=3380)
          0.07033878 = weight(abstract_txt:ranking in 3380) [ClassicSimilarity], result of:
            0.07033878 = score(doc=3380,freq=1.0), product of:
              0.20108736 = queryWeight, product of:
                1.6479076 = boost
                5.5966744 = idf(docFreq=447, maxDocs=44421)
                0.021803282 = queryNorm
              0.34979215 = fieldWeight in 3380, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5966744 = idf(docFreq=447, maxDocs=44421)
                0.0625 = fieldNorm(doc=3380)
          0.055394746 = weight(abstract_txt:search in 3380) [ClassicSimilarity], result of:
            0.055394746 = score(doc=3380,freq=2.0), product of:
              0.17148808 = queryWeight, product of:
                2.1521494 = boost
                3.654598 = idf(docFreq=3123, maxDocs=44421)
                0.021803282 = queryNorm
              0.3230239 = fieldWeight in 3380, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.654598 = idf(docFreq=3123, maxDocs=44421)
                0.0625 = fieldNorm(doc=3380)
          0.10166991 = weight(abstract_txt:semantic in 3380) [ClassicSimilarity], result of:
            0.10166991 = score(doc=3380,freq=2.0), product of:
              0.25706908 = queryWeight, product of:
                2.6349986 = boost
                4.4745317 = idf(docFreq=1375, maxDocs=44421)
                0.021803282 = queryNorm
              0.39549646 = fieldWeight in 3380, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.4745317 = idf(docFreq=1375, maxDocs=44421)
                0.0625 = fieldNorm(doc=3380)
          0.07135726 = weight(abstract_txt:methods in 3380) [ClassicSimilarity], result of:
            0.07135726 = score(doc=3380,freq=1.0), product of:
              0.27554572 = queryWeight, product of:
                3.0500524 = boost
                4.1434727 = idf(docFreq=1915, maxDocs=44421)
                0.021803282 = queryNorm
              0.25896704 = fieldWeight in 3380, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1434727 = idf(docFreq=1915, maxDocs=44421)
                0.0625 = fieldNorm(doc=3380)
        0.28 = coord(7/25)