Document (#28393)

Author
Abiteboul, S.
Cluet, S.
Christophides, V.
Milo, T.
Moerkotte, G.
Siméon, J.
Title
Querying documents in object databases
Source
International journal of digital libraries. 1(1997) no.1, S.5-19
Year
1997
Abstract
We consider the problem of storing and accessing documents (SGML and HTML, in particular) using database technology. To specify the database image of documents, we use structuring schemas that consist in grammars annotated with database programs. To query documents, we introduce an extension of OQL, the ODMG standard query language for object databases. Our extension (named OQL-doc) allows us to query documents without a precise knowledge of their structure using in particular generalzed path expressions and pattern matching. This allows us to introduce in a declarative langugae (in the style of SQL or OQL), navigational and information retrieval styles of accessing data. We also consider the interaction of full-text indexes (e.g. inverted files) with standard database collection indexes (e.g, B-trees) that provide important speed-up
Object
ODMG
OQL

Similar documents (content)

  1. Falquet, G.; Guyot, J.; Nerima, L.: Languages and tools to specify hypertext views on databases (1999) 0.14
    0.14355975 = sum of:
      0.14355975 = product of:
        0.59816563 = sum of:
          0.10366484 = weight(abstract_txt:specify in 4968) [ClassicSimilarity], result of:
            0.10366484 = score(doc=4968,freq=1.0), product of:
              0.17592245 = queryWeight, product of:
                1.0745438 = boost
                7.5425844 = idf(docFreq=63, maxDocs=44421)
                0.021705858 = queryNorm
              0.5892644 = fieldWeight in 4968, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5425844 = idf(docFreq=63, maxDocs=44421)
                0.078125 = fieldNorm(doc=4968)
          0.15673463 = weight(abstract_txt:declarative in 4968) [ClassicSimilarity], result of:
            0.15673463 = score(doc=4968,freq=1.0), product of:
              0.23174495 = queryWeight, product of:
                1.2332997 = boost
                8.656945 = idf(docFreq=20, maxDocs=44421)
                0.021705858 = queryNorm
              0.67632383 = fieldWeight in 4968, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.656945 = idf(docFreq=20, maxDocs=44421)
                0.078125 = fieldNorm(doc=4968)
          0.04118637 = weight(abstract_txt:particular in 4968) [ClassicSimilarity], result of:
            0.04118637 = score(doc=4968,freq=1.0), product of:
              0.11978781 = queryWeight, product of:
                1.2539632 = boost
                4.400995 = idf(docFreq=1480, maxDocs=44421)
                0.021705858 = queryNorm
              0.34382772 = fieldWeight in 4968, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.400995 = idf(docFreq=1480, maxDocs=44421)
                0.078125 = fieldNorm(doc=4968)
          0.041549943 = weight(abstract_txt:databases in 4968) [ClassicSimilarity], result of:
            0.041549943 = score(doc=4968,freq=1.0), product of:
              0.120491736 = queryWeight, product of:
                1.2576423 = boost
                4.413907 = idf(docFreq=1461, maxDocs=44421)
                0.021705858 = queryNorm
              0.34483647 = fieldWeight in 4968, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.413907 = idf(docFreq=1461, maxDocs=44421)
                0.078125 = fieldNorm(doc=4968)
          0.12366608 = weight(abstract_txt:object in 4968) [ClassicSimilarity], result of:
            0.12366608 = score(doc=4968,freq=2.0), product of:
              0.19787945 = queryWeight, product of:
                1.61168 = boost
                5.656462 = idf(docFreq=421, maxDocs=44421)
                0.021705858 = queryNorm
              0.62495667 = fieldWeight in 4968, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.656462 = idf(docFreq=421, maxDocs=44421)
                0.078125 = fieldNorm(doc=4968)
          0.13136376 = weight(abstract_txt:database in 4968) [ClassicSimilarity], result of:
            0.13136376 = score(doc=4968,freq=3.0), product of:
              0.22674109 = queryWeight, product of:
                2.4398246 = boost
                4.2814875 = idf(docFreq=1668, maxDocs=44421)
                0.021705858 = queryNorm
              0.5793558 = fieldWeight in 4968, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.2814875 = idf(docFreq=1668, maxDocs=44421)
                0.078125 = fieldNorm(doc=4968)
        0.24 = coord(6/25)
    
  2. Castelli, V.: Progressive search and retrieval from image databases (2002) 0.12
    0.12482474 = sum of:
      0.12482474 = product of:
        0.39007732 = sum of:
          0.08796254 = weight(abstract_txt:specify in 5253) [ClassicSimilarity], result of:
            0.08796254 = score(doc=5253,freq=2.0), product of:
              0.17592245 = queryWeight, product of:
                1.0745438 = boost
                7.5425844 = idf(docFreq=63, maxDocs=44421)
                0.021705858 = queryNorm
              0.50000745 = fieldWeight in 5253, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.5425844 = idf(docFreq=63, maxDocs=44421)
                0.046875 = fieldNorm(doc=5253)
          0.024711821 = weight(abstract_txt:particular in 5253) [ClassicSimilarity], result of:
            0.024711821 = score(doc=5253,freq=1.0), product of:
              0.11978781 = queryWeight, product of:
                1.2539632 = boost
                4.400995 = idf(docFreq=1480, maxDocs=44421)
                0.021705858 = queryNorm
              0.20629662 = fieldWeight in 5253, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.400995 = idf(docFreq=1480, maxDocs=44421)
                0.046875 = fieldNorm(doc=5253)
          0.0352563 = weight(abstract_txt:databases in 5253) [ClassicSimilarity], result of:
            0.0352563 = score(doc=5253,freq=2.0), product of:
              0.120491736 = queryWeight, product of:
                1.2576423 = boost
                4.413907 = idf(docFreq=1461, maxDocs=44421)
                0.021705858 = queryNorm
              0.29260346 = fieldWeight in 5253, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.413907 = idf(docFreq=1461, maxDocs=44421)
                0.046875 = fieldNorm(doc=5253)
          0.030656459 = weight(abstract_txt:standard in 5253) [ClassicSimilarity], result of:
            0.030656459 = score(doc=5253,freq=1.0), product of:
              0.13830061 = queryWeight, product of:
                1.3473814 = boost
                4.7288613 = idf(docFreq=1066, maxDocs=44421)
                0.021705858 = queryNorm
              0.22166538 = fieldWeight in 5253, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7288613 = idf(docFreq=1066, maxDocs=44421)
                0.046875 = fieldNorm(doc=5253)
          0.047422078 = weight(abstract_txt:allows in 5253) [ClassicSimilarity], result of:
            0.047422078 = score(doc=5253,freq=1.0), product of:
              0.18498215 = queryWeight, product of:
                1.5582725 = boost
                5.4690194 = idf(docFreq=508, maxDocs=44421)
                0.021705858 = queryNorm
              0.2563603 = fieldWeight in 5253, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4690194 = idf(docFreq=508, maxDocs=44421)
                0.046875 = fieldNorm(doc=5253)
          0.052467078 = weight(abstract_txt:object in 5253) [ClassicSimilarity], result of:
            0.052467078 = score(doc=5253,freq=1.0), product of:
              0.19787945 = queryWeight, product of:
                1.61168 = boost
                5.656462 = idf(docFreq=421, maxDocs=44421)
                0.021705858 = queryNorm
              0.26514667 = fieldWeight in 5253, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.656462 = idf(docFreq=421, maxDocs=44421)
                0.046875 = fieldNorm(doc=5253)
          0.06609533 = weight(abstract_txt:query in 5253) [ClassicSimilarity], result of:
            0.06609533 = score(doc=5253,freq=2.0), product of:
              0.20970577 = queryWeight, product of:
                2.0320263 = boost
                4.754492 = idf(docFreq=1039, maxDocs=44421)
                0.021705858 = queryNorm
              0.31518126 = fieldWeight in 5253, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.754492 = idf(docFreq=1039, maxDocs=44421)
                0.046875 = fieldNorm(doc=5253)
          0.04550574 = weight(abstract_txt:database in 5253) [ClassicSimilarity], result of:
            0.04550574 = score(doc=5253,freq=1.0), product of:
              0.22674109 = queryWeight, product of:
                2.4398246 = boost
                4.2814875 = idf(docFreq=1668, maxDocs=44421)
                0.021705858 = queryNorm
              0.20069472 = fieldWeight in 5253, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2814875 = idf(docFreq=1668, maxDocs=44421)
                0.046875 = fieldNorm(doc=5253)
        0.32 = coord(8/25)
    
  3. Ozkarahan, E.: Multimedia document retrieval (1995) 0.12
    0.12434143 = sum of:
      0.12434143 = product of:
        0.62170714 = sum of:
          0.20740706 = weight(abstract_txt:langugae in 1560) [ClassicSimilarity], result of:
            0.20740706 = score(doc=1560,freq=1.0), product of:
              0.27932897 = queryWeight, product of:
                1.3540088 = boost
                9.504243 = idf(docFreq=8, maxDocs=44421)
                0.021705858 = queryNorm
              0.74251896 = fieldWeight in 1560, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.504243 = idf(docFreq=8, maxDocs=44421)
                0.078125 = fieldNorm(doc=1560)
          0.087445125 = weight(abstract_txt:object in 1560) [ClassicSimilarity], result of:
            0.087445125 = score(doc=1560,freq=1.0), product of:
              0.19787945 = queryWeight, product of:
                1.61168 = boost
                5.656462 = idf(docFreq=421, maxDocs=44421)
                0.021705858 = queryNorm
              0.4419111 = fieldWeight in 1560, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.656462 = idf(docFreq=421, maxDocs=44421)
                0.078125 = fieldNorm(doc=1560)
          0.13491653 = weight(abstract_txt:query in 1560) [ClassicSimilarity], result of:
            0.13491653 = score(doc=1560,freq=3.0), product of:
              0.20970577 = queryWeight, product of:
                2.0320263 = boost
                4.754492 = idf(docFreq=1039, maxDocs=44421)
                0.021705858 = queryNorm
              0.6433611 = fieldWeight in 1560, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.754492 = idf(docFreq=1039, maxDocs=44421)
                0.078125 = fieldNorm(doc=1560)
          0.10725805 = weight(abstract_txt:database in 1560) [ClassicSimilarity], result of:
            0.10725805 = score(doc=1560,freq=2.0), product of:
              0.22674109 = queryWeight, product of:
                2.4398246 = boost
                4.2814875 = idf(docFreq=1668, maxDocs=44421)
                0.021705858 = queryNorm
              0.47304198 = fieldWeight in 1560, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.2814875 = idf(docFreq=1668, maxDocs=44421)
                0.078125 = fieldNorm(doc=1560)
          0.08468036 = weight(abstract_txt:documents in 1560) [ClassicSimilarity], result of:
            0.08468036 = score(doc=1560,freq=1.0), product of:
              0.26287267 = queryWeight, product of:
                2.9371166 = boost
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.021705858 = queryNorm
              0.32213452 = fieldWeight in 1560, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.078125 = fieldNorm(doc=1560)
        0.2 = coord(5/25)
    
  4. Niemi, T.; Jämsen , J.: ¬A query language for discovering semantic associations, part I : approach and formal definition of query primitives (2007) 0.12
    0.12000679 = sum of:
      0.12000679 = product of:
        0.5000283 = sum of:
          0.12538771 = weight(abstract_txt:declarative in 1591) [ClassicSimilarity], result of:
            0.12538771 = score(doc=1591,freq=1.0), product of:
              0.23174495 = queryWeight, product of:
                1.2332997 = boost
                8.656945 = idf(docFreq=20, maxDocs=44421)
                0.021705858 = queryNorm
              0.5410591 = fieldWeight in 1591, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.656945 = idf(docFreq=20, maxDocs=44421)
                0.0625 = fieldNorm(doc=1591)
          0.033239957 = weight(abstract_txt:databases in 1591) [ClassicSimilarity], result of:
            0.033239957 = score(doc=1591,freq=1.0), product of:
              0.120491736 = queryWeight, product of:
                1.2576423 = boost
                4.413907 = idf(docFreq=1461, maxDocs=44421)
                0.021705858 = queryNorm
              0.2758692 = fieldWeight in 1591, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.413907 = idf(docFreq=1461, maxDocs=44421)
                0.0625 = fieldNorm(doc=1591)
          0.08835151 = weight(abstract_txt:introduce in 1591) [ClassicSimilarity], result of:
            0.08835151 = score(doc=1591,freq=1.0), product of:
              0.23120272 = queryWeight, product of:
                1.7421075 = boost
                6.114219 = idf(docFreq=266, maxDocs=44421)
                0.021705858 = queryNorm
              0.3821387 = fieldWeight in 1591, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.114219 = idf(docFreq=266, maxDocs=44421)
                0.0625 = fieldNorm(doc=1591)
          0.12463055 = weight(abstract_txt:query in 1591) [ClassicSimilarity], result of:
            0.12463055 = score(doc=1591,freq=4.0), product of:
              0.20970577 = queryWeight, product of:
                2.0320263 = boost
                4.754492 = idf(docFreq=1039, maxDocs=44421)
                0.021705858 = queryNorm
              0.5943115 = fieldWeight in 1591, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.754492 = idf(docFreq=1039, maxDocs=44421)
                0.0625 = fieldNorm(doc=1591)
          0.06067432 = weight(abstract_txt:database in 1591) [ClassicSimilarity], result of:
            0.06067432 = score(doc=1591,freq=1.0), product of:
              0.22674109 = queryWeight, product of:
                2.4398246 = boost
                4.2814875 = idf(docFreq=1668, maxDocs=44421)
                0.021705858 = queryNorm
              0.26759297 = fieldWeight in 1591, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2814875 = idf(docFreq=1668, maxDocs=44421)
                0.0625 = fieldNorm(doc=1591)
          0.06774429 = weight(abstract_txt:documents in 1591) [ClassicSimilarity], result of:
            0.06774429 = score(doc=1591,freq=1.0), product of:
              0.26287267 = queryWeight, product of:
                2.9371166 = boost
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.021705858 = queryNorm
              0.25770763 = fieldWeight in 1591, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.0625 = fieldNorm(doc=1591)
        0.24 = coord(6/25)
    
  5. Aldana, J.F.; Gómez, A.C.; Moreno, N.; Nebro, A.J.; Roldán, M.M.: Metadata functionality for semantic Web integration (2003) 0.12
    0.11520722 = sum of:
      0.11520722 = product of:
        0.41145435 = sum of:
          0.062198907 = weight(abstract_txt:specify in 3731) [ClassicSimilarity], result of:
            0.062198907 = score(doc=3731,freq=1.0), product of:
              0.17592245 = queryWeight, product of:
                1.0745438 = boost
                7.5425844 = idf(docFreq=63, maxDocs=44421)
                0.021705858 = queryNorm
              0.35355866 = fieldWeight in 3731, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5425844 = idf(docFreq=63, maxDocs=44421)
                0.046875 = fieldNorm(doc=3731)
          0.024929969 = weight(abstract_txt:databases in 3731) [ClassicSimilarity], result of:
            0.024929969 = score(doc=3731,freq=1.0), product of:
              0.120491736 = queryWeight, product of:
                1.2576423 = boost
                4.413907 = idf(docFreq=1461, maxDocs=44421)
                0.021705858 = queryNorm
              0.2069019 = fieldWeight in 3731, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.413907 = idf(docFreq=1461, maxDocs=44421)
                0.046875 = fieldNorm(doc=3731)
          0.030656459 = weight(abstract_txt:standard in 3731) [ClassicSimilarity], result of:
            0.030656459 = score(doc=3731,freq=1.0), product of:
              0.13830061 = queryWeight, product of:
                1.3473814 = boost
                4.7288613 = idf(docFreq=1066, maxDocs=44421)
                0.021705858 = queryNorm
              0.22166538 = fieldWeight in 3731, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7288613 = idf(docFreq=1066, maxDocs=44421)
                0.046875 = fieldNorm(doc=3731)
          0.11730609 = weight(abstract_txt:extension in 3731) [ClassicSimilarity], result of:
            0.11730609 = score(doc=3731,freq=2.0), product of:
              0.2685426 = queryWeight, product of:
                1.8775222 = boost
                6.58948 = idf(docFreq=165, maxDocs=44421)
                0.021705858 = queryNorm
              0.43682492 = fieldWeight in 3731, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.58948 = idf(docFreq=165, maxDocs=44421)
                0.046875 = fieldNorm(doc=3731)
          0.046736456 = weight(abstract_txt:query in 3731) [ClassicSimilarity], result of:
            0.046736456 = score(doc=3731,freq=1.0), product of:
              0.20970577 = queryWeight, product of:
                2.0320263 = boost
                4.754492 = idf(docFreq=1039, maxDocs=44421)
                0.021705858 = queryNorm
              0.2228668 = fieldWeight in 3731, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.754492 = idf(docFreq=1039, maxDocs=44421)
                0.046875 = fieldNorm(doc=3731)
          0.078818254 = weight(abstract_txt:database in 3731) [ClassicSimilarity], result of:
            0.078818254 = score(doc=3731,freq=3.0), product of:
              0.22674109 = queryWeight, product of:
                2.4398246 = boost
                4.2814875 = idf(docFreq=1668, maxDocs=44421)
                0.021705858 = queryNorm
              0.34761345 = fieldWeight in 3731, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.2814875 = idf(docFreq=1668, maxDocs=44421)
                0.046875 = fieldNorm(doc=3731)
          0.05080822 = weight(abstract_txt:documents in 3731) [ClassicSimilarity], result of:
            0.05080822 = score(doc=3731,freq=1.0), product of:
              0.26287267 = queryWeight, product of:
                2.9371166 = boost
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.021705858 = queryNorm
              0.19328073 = fieldWeight in 3731, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.046875 = fieldNorm(doc=3731)
        0.28 = coord(7/25)