Document (#39208)

Author
Arenas, M.
Cuenca Grau, B.
Kharlamov, E.
Marciuska, S.
Zheleznyakov, D.
Title
Faceted search over ontology-enhanced RDF data
Source
https://www.cs.ox.ac.uk%2Ffiles%2F7357%2Fmain.pdf
Year
2014
Abstract
An increasing number of applications rely on RDF, OWL2, and SPARQL for storing and querying data. SPARQL, however, is not targeted towards end-users, and suitable query interfaces are needed. Faceted search is a prominent approach for end-user data access, and several RDF-based faceted search systems have been developed. There is, however, a lack of rigorous theoretical underpinning for faceted search in the context of RDF and OWL2. In this paper, we provide such solid foundations. We formalise faceted interfaces for this context, identify a fragment of first-order logic capturing the underlying queries, and study the complexity of answering such queries for RDF and OWL2 profiles. We then study interface generation and update, and devise efficiently implementable algorithms. Finally, we have implemented and tested our faceted search algorithms for scalability, with encouraging results.
Theme
Wissensrepräsentation
Semantisches Umfeld in Indexierung u. Retrieval
Object
RDF
OWL2

Similar documents (author)

  1. Grau, O.: Infos lokal gewoben : die WWW-Sprache HTML und die passende Software (1994) 6.01
    6.0137663 = sum of:
      6.0137663 = weight(author_txt:grau in 7565) [ClassicSimilarity], result of:
        6.0137663 = fieldWeight in 7565, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.622026 = idf(docFreq=7, maxDocs=44421)
          0.625 = fieldNorm(doc=7565)
    
  2. Grau, O.: Alles integriert : Informationssurfen im World Wide Web (1994) 6.01
    6.0137663 = sum of:
      6.0137663 = weight(author_txt:grau in 7612) [ClassicSimilarity], result of:
        6.0137663 = fieldWeight in 7612, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.622026 = idf(docFreq=7, maxDocs=44421)
          0.625 = fieldNorm(doc=7612)
    
  3. Grau, B.: Finding answers to questions, in text collections or Web, in open domain or specialty domains (2012) 6.01
    6.0137663 = sum of:
      6.0137663 = weight(author_txt:grau in 1107) [ClassicSimilarity], result of:
        6.0137663 = fieldWeight in 1107, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.622026 = idf(docFreq=7, maxDocs=44421)
          0.625 = fieldNorm(doc=1107)
    
  4. Grau, J.E.; Mehrotra, R.: Similar shape retrieval using a structural feature index (1993) 4.81
    4.811013 = sum of:
      4.811013 = weight(author_txt:grau in 7331) [ClassicSimilarity], result of:
        4.811013 = fieldWeight in 7331, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.622026 = idf(docFreq=7, maxDocs=44421)
          0.5 = fieldNorm(doc=7331)
    
  5. Ferret, O.; Grau, B.; Masson, N.: Utilisation d'un réseau de cooccurences lexikales pour a méliorer une analyse thématique fondée sur la distribution des mots (1999) 3.61
    3.60826 = sum of:
      3.60826 = weight(author_txt:grau in 295) [ClassicSimilarity], result of:
        3.60826 = fieldWeight in 295, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.622026 = idf(docFreq=7, maxDocs=44421)
          0.375 = fieldNorm(doc=295)
    

Similar documents (content)

  1. Materska, K.: Faceted navigation in search and discovery tools (2014) 0.27
    0.27200112 = sum of:
      0.27200112 = product of:
        0.97143257 = sum of:
          0.014044208 = weight(abstract_txt:such in 2435) [ClassicSimilarity], result of:
            0.014044208 = score(doc=2435,freq=2.0), product of:
              0.061927944 = queryWeight, product of:
                3.42101 = idf(docFreq=3945, maxDocs=44421)
                0.01810224 = queryNorm
              0.22678305 = fieldWeight in 2435, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.42101 = idf(docFreq=3945, maxDocs=44421)
                0.046875 = fieldNorm(doc=2435)
          0.04255661 = weight(abstract_txt:querying in 2435) [ClassicSimilarity], result of:
            0.04255661 = score(doc=2435,freq=1.0), product of:
              0.12967806 = queryWeight, product of:
                1.0232339 = boost
                7.000987 = idf(docFreq=109, maxDocs=44421)
                0.01810224 = queryNorm
              0.32817125 = fieldWeight in 2435, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.000987 = idf(docFreq=109, maxDocs=44421)
                0.046875 = fieldNorm(doc=2435)
          0.043410603 = weight(abstract_txt:prominent in 2435) [ClassicSimilarity], result of:
            0.043410603 = score(doc=2435,freq=1.0), product of:
              0.13140716 = queryWeight, product of:
                1.0300331 = boost
                7.0475073 = idf(docFreq=104, maxDocs=44421)
                0.01810224 = queryNorm
              0.3303519 = fieldWeight in 2435, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.0475073 = idf(docFreq=104, maxDocs=44421)
                0.046875 = fieldNorm(doc=2435)
          0.032933492 = weight(abstract_txt:queries in 2435) [ClassicSimilarity], result of:
            0.032933492 = score(doc=2435,freq=1.0), product of:
              0.13771787 = queryWeight, product of:
                1.4912547 = boost
                5.1015973 = idf(docFreq=734, maxDocs=44421)
                0.01810224 = queryNorm
              0.23913738 = fieldWeight in 2435, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1015973 = idf(docFreq=734, maxDocs=44421)
                0.046875 = fieldNorm(doc=2435)
          0.06967675 = weight(abstract_txt:interfaces in 2435) [ClassicSimilarity], result of:
            0.06967675 = score(doc=2435,freq=3.0), product of:
              0.15736808 = queryWeight, product of:
                1.594098 = boost
                5.453425 = idf(docFreq=516, maxDocs=44421)
                0.01810224 = queryNorm
              0.4427629 = fieldWeight in 2435, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.453425 = idf(docFreq=516, maxDocs=44421)
                0.046875 = fieldNorm(doc=2435)
          0.13193345 = weight(abstract_txt:search in 2435) [ClassicSimilarity], result of:
            0.13193345 = score(doc=2435,freq=19.0), product of:
              0.17668399 = queryWeight, product of:
                2.670701 = boost
                3.654598 = idf(docFreq=3123, maxDocs=44421)
                0.01810224 = queryNorm
              0.7467199 = fieldWeight in 2435, product of:
                4.358899 = tf(freq=19.0), with freq of:
                  19.0 = termFreq=19.0
                3.654598 = idf(docFreq=3123, maxDocs=44421)
                0.046875 = fieldNorm(doc=2435)
          0.6368774 = weight(abstract_txt:faceted in 2435) [ClassicSimilarity], result of:
            0.6368774 = score(doc=2435,freq=16.0), product of:
              0.56789684 = queryWeight, product of:
                5.245079 = boost
                5.981156 = idf(docFreq=304, maxDocs=44421)
                0.01810224 = queryNorm
              1.1214668 = fieldWeight in 2435, product of:
                4.0 = tf(freq=16.0), with freq of:
                  16.0 = termFreq=16.0
                5.981156 = idf(docFreq=304, maxDocs=44421)
                0.046875 = fieldNorm(doc=2435)
        0.28 = coord(7/25)
    
  2. Wang, H.; Liu, Q.; Penin, T.; Fu, L.; Zhang, L.; Tran, T.; Yu, Y.; Pan, Y.: Semplore: a scalable IR approach to search the Web of Data (2009) 0.18
    0.18493386 = sum of:
      0.18493386 = product of:
        0.66047806 = sum of:
          0.06758652 = weight(abstract_txt:update in 2638) [ClassicSimilarity], result of:
            0.06758652 = score(doc=2638,freq=1.0), product of:
              0.12557293 = queryWeight, product of:
                1.0069078 = boost
                6.889283 = idf(docFreq=122, maxDocs=44421)
                0.01810224 = queryNorm
              0.53822523 = fieldWeight in 2638, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.889283 = idf(docFreq=122, maxDocs=44421)
                0.078125 = fieldNorm(doc=2638)
          0.098453745 = weight(abstract_txt:scalability in 2638) [ClassicSimilarity], result of:
            0.098453745 = score(doc=2638,freq=1.0), product of:
              0.16136554 = queryWeight, product of:
                1.1414242 = boost
                7.809647 = idf(docFreq=48, maxDocs=44421)
                0.01810224 = queryNorm
              0.6101287 = fieldWeight in 2638, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.809647 = idf(docFreq=48, maxDocs=44421)
                0.078125 = fieldNorm(doc=2638)
          0.030708946 = weight(abstract_txt:however in 2638) [ClassicSimilarity], result of:
            0.030708946 = score(doc=2638,freq=1.0), product of:
              0.09350667 = queryWeight, product of:
                1.2287909 = boost
                4.203706 = idf(docFreq=1803, maxDocs=44421)
                0.01810224 = queryNorm
              0.3284145 = fieldWeight in 2638, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.203706 = idf(docFreq=1803, maxDocs=44421)
                0.078125 = fieldNorm(doc=2638)
          0.05609909 = weight(abstract_txt:data in 2638) [ClassicSimilarity], result of:
            0.05609909 = score(doc=2638,freq=6.0), product of:
              0.08802713 = queryWeight, product of:
                1.460194 = boost
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.01810224 = queryNorm
              0.6372932 = fieldWeight in 2638, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.078125 = fieldNorm(doc=2638)
          0.05488915 = weight(abstract_txt:queries in 2638) [ClassicSimilarity], result of:
            0.05488915 = score(doc=2638,freq=1.0), product of:
              0.13771787 = queryWeight, product of:
                1.4912547 = boost
                5.1015973 = idf(docFreq=734, maxDocs=44421)
                0.01810224 = queryNorm
              0.39856228 = fieldWeight in 2638, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1015973 = idf(docFreq=734, maxDocs=44421)
                0.078125 = fieldNorm(doc=2638)
          0.08737505 = weight(abstract_txt:search in 2638) [ClassicSimilarity], result of:
            0.08737505 = score(doc=2638,freq=3.0), product of:
              0.17668399 = queryWeight, product of:
                2.670701 = boost
                3.654598 = idf(docFreq=3123, maxDocs=44421)
                0.01810224 = queryNorm
              0.49452728 = fieldWeight in 2638, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.654598 = idf(docFreq=3123, maxDocs=44421)
                0.078125 = fieldNorm(doc=2638)
          0.26536557 = weight(abstract_txt:faceted in 2638) [ClassicSimilarity], result of:
            0.26536557 = score(doc=2638,freq=1.0), product of:
              0.56789684 = queryWeight, product of:
                5.245079 = boost
                5.981156 = idf(docFreq=304, maxDocs=44421)
                0.01810224 = queryNorm
              0.4672778 = fieldWeight in 2638, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.981156 = idf(docFreq=304, maxDocs=44421)
                0.078125 = fieldNorm(doc=2638)
        0.28 = coord(7/25)
    
  3. Ruotsalo, T.; Jacucci, G.; Kaski, S.: Interactive faceted query suggestion for exploratory search : whole-session effectiveness and interaction engagement (2020) 0.14
    0.1426859 = sum of:
      0.1426859 = product of:
        0.7134295 = sum of:
          0.06806667 = weight(abstract_txt:targeted in 915) [ClassicSimilarity], result of:
            0.06806667 = score(doc=915,freq=1.0), product of:
              0.14640379 = queryWeight, product of:
                1.087221 = boost
                7.438788 = idf(docFreq=70, maxDocs=44421)
                0.01810224 = queryNorm
              0.46492425 = fieldWeight in 915, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.438788 = idf(docFreq=70, maxDocs=44421)
                0.0625 = fieldNorm(doc=915)
          0.018321887 = weight(abstract_txt:data in 915) [ClassicSimilarity], result of:
            0.018321887 = score(doc=915,freq=1.0), product of:
              0.08802713 = queryWeight, product of:
                1.460194 = boost
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.01810224 = queryNorm
              0.20813909 = fieldWeight in 915, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.0625 = fieldNorm(doc=915)
          0.062099982 = weight(abstract_txt:queries in 915) [ClassicSimilarity], result of:
            0.062099982 = score(doc=915,freq=2.0), product of:
              0.13771787 = queryWeight, product of:
                1.4912547 = boost
                5.1015973 = idf(docFreq=734, maxDocs=44421)
                0.01810224 = queryNorm
              0.45092174 = fieldWeight in 915, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.1015973 = idf(docFreq=734, maxDocs=44421)
                0.0625 = fieldNorm(doc=915)
          0.09024057 = weight(abstract_txt:search in 915) [ClassicSimilarity], result of:
            0.09024057 = score(doc=915,freq=5.0), product of:
              0.17668399 = queryWeight, product of:
                2.670701 = boost
                3.654598 = idf(docFreq=3123, maxDocs=44421)
                0.01810224 = queryNorm
              0.5107456 = fieldWeight in 915, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.654598 = idf(docFreq=3123, maxDocs=44421)
                0.0625 = fieldNorm(doc=915)
          0.47470042 = weight(abstract_txt:faceted in 915) [ClassicSimilarity], result of:
            0.47470042 = score(doc=915,freq=5.0), product of:
              0.56789684 = queryWeight, product of:
                5.245079 = boost
                5.981156 = idf(docFreq=304, maxDocs=44421)
                0.01810224 = queryNorm
              0.83589196 = fieldWeight in 915, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.981156 = idf(docFreq=304, maxDocs=44421)
                0.0625 = fieldNorm(doc=915)
        0.2 = coord(5/25)
    
  4. Binding, C.; Gnoli, C.; Tudhope, D.: Migrating a complex classification scheme to the semantic web : expressing the Integrative Levels Classification using SKOS RDF (2021) 0.14
    0.14197497 = sum of:
      0.14197497 = product of:
        0.70987487 = sum of:
          0.013241007 = weight(abstract_txt:such in 1601) [ClassicSimilarity], result of:
            0.013241007 = score(doc=1601,freq=1.0), product of:
              0.061927944 = queryWeight, product of:
                3.42101 = idf(docFreq=3945, maxDocs=44421)
                0.01810224 = queryNorm
              0.21381313 = fieldWeight in 1601, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.42101 = idf(docFreq=3945, maxDocs=44421)
                0.0625 = fieldNorm(doc=1601)
          0.026906831 = weight(abstract_txt:context in 1601) [ClassicSimilarity], result of:
            0.026906831 = score(doc=1601,freq=1.0), product of:
              0.09935301 = queryWeight, product of:
                1.2666224 = boost
                4.333128 = idf(docFreq=1584, maxDocs=44421)
                0.01810224 = queryNorm
              0.2708205 = fieldWeight in 1601, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.333128 = idf(docFreq=1584, maxDocs=44421)
                0.0625 = fieldNorm(doc=1601)
          0.20478526 = weight(abstract_txt:sparql in 1601) [ClassicSimilarity], result of:
            0.20478526 = score(doc=1601,freq=1.0), product of:
              0.38441923 = queryWeight, product of:
                2.4914904 = boost
                8.523414 = idf(docFreq=23, maxDocs=44421)
                0.01810224 = queryNorm
              0.53271335 = fieldWeight in 1601, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.523414 = idf(docFreq=23, maxDocs=44421)
                0.0625 = fieldNorm(doc=1601)
          0.04035681 = weight(abstract_txt:search in 1601) [ClassicSimilarity], result of:
            0.04035681 = score(doc=1601,freq=1.0), product of:
              0.17668399 = queryWeight, product of:
                2.670701 = boost
                3.654598 = idf(docFreq=3123, maxDocs=44421)
                0.01810224 = queryNorm
              0.22841237 = fieldWeight in 1601, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.654598 = idf(docFreq=3123, maxDocs=44421)
                0.0625 = fieldNorm(doc=1601)
          0.42458495 = weight(abstract_txt:faceted in 1601) [ClassicSimilarity], result of:
            0.42458495 = score(doc=1601,freq=4.0), product of:
              0.56789684 = queryWeight, product of:
                5.245079 = boost
                5.981156 = idf(docFreq=304, maxDocs=44421)
                0.01810224 = queryNorm
              0.7476445 = fieldWeight in 1601, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.981156 = idf(docFreq=304, maxDocs=44421)
                0.0625 = fieldNorm(doc=1601)
        0.2 = coord(5/25)
    
  5. Devadason, F.J.; Intaraksa, N.; Patamawongjariya, P.; Desai, K.: Faceted indexing application for organizing and accessing internet resources (2003) 0.14
    0.13557981 = sum of:
      0.13557981 = product of:
        0.5649159 = sum of:
          0.009930756 = weight(abstract_txt:such in 4966) [ClassicSimilarity], result of:
            0.009930756 = score(doc=4966,freq=1.0), product of:
              0.061927944 = queryWeight, product of:
                3.42101 = idf(docFreq=3945, maxDocs=44421)
                0.01810224 = queryNorm
              0.16035984 = fieldWeight in 4966, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.42101 = idf(docFreq=3945, maxDocs=44421)
                0.046875 = fieldNorm(doc=4966)
          0.0491385 = weight(abstract_txt:rigorous in 4966) [ClassicSimilarity], result of:
            0.0491385 = score(doc=4966,freq=1.0), product of:
              0.14272599 = queryWeight, product of:
                1.0734781 = boost
                7.344759 = idf(docFreq=77, maxDocs=44421)
                0.01810224 = queryNorm
              0.34428558 = fieldWeight in 4966, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.344759 = idf(docFreq=77, maxDocs=44421)
                0.046875 = fieldNorm(doc=4966)
          0.01842537 = weight(abstract_txt:however in 4966) [ClassicSimilarity], result of:
            0.01842537 = score(doc=4966,freq=1.0), product of:
              0.09350667 = queryWeight, product of:
                1.2287909 = boost
                4.203706 = idf(docFreq=1803, maxDocs=44421)
                0.01810224 = queryNorm
              0.19704871 = fieldWeight in 4966, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.203706 = idf(docFreq=1803, maxDocs=44421)
                0.046875 = fieldNorm(doc=4966)
          0.013741415 = weight(abstract_txt:data in 4966) [ClassicSimilarity], result of:
            0.013741415 = score(doc=4966,freq=1.0), product of:
              0.08802713 = queryWeight, product of:
                1.460194 = boost
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.01810224 = queryNorm
              0.15610433 = fieldWeight in 4966, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.046875 = fieldNorm(doc=4966)
          0.052425038 = weight(abstract_txt:search in 4966) [ClassicSimilarity], result of:
            0.052425038 = score(doc=4966,freq=3.0), product of:
              0.17668399 = queryWeight, product of:
                2.670701 = boost
                3.654598 = idf(docFreq=3123, maxDocs=44421)
                0.01810224 = queryNorm
              0.2967164 = fieldWeight in 4966, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.654598 = idf(docFreq=3123, maxDocs=44421)
                0.046875 = fieldNorm(doc=4966)
          0.4212548 = weight(abstract_txt:faceted in 4966) [ClassicSimilarity], result of:
            0.4212548 = score(doc=4966,freq=7.0), product of:
              0.56789684 = queryWeight, product of:
                5.245079 = boost
                5.981156 = idf(docFreq=304, maxDocs=44421)
                0.01810224 = queryNorm
              0.7417805 = fieldWeight in 4966, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                5.981156 = idf(docFreq=304, maxDocs=44421)
                0.046875 = fieldNorm(doc=4966)
        0.24 = coord(6/25)