Document (#18596)

Author
Scheuermann, P.
Li, W.-S.
Clifton, C.
Title
Multidatabase query processing with uncertainty in global keys and attribute values
Source
Journal of the American Society for Information Science. 49(1998) no.3, S.283-301
Year
1998
Abstract
Semantic integration and data integration are 2 main processes that multidatabase systems need to employ in order to support interoperability. Both these processes involve uncertainty when attribute correspondences and global IDs are unknown or imprecise. The role-set approach is a new conceptual framework for data integration in multidatabase systems that maintains the materialization autonomy of local database systems by presenting the answer to a query as a set of sets representing the ddistinct intersections between the relations corresponding to the various roles played by an entity. In this article, we present an approach for dynamic database integration and query processing in the absence of information about attribute correspondences and global IDs. We define different types of equivalence conditions for the construction of global IDs. We propose a strategy based on ranked role-sets that makes use of an automated semantic integration procedure based on neural networks to determine candidate global IDs. The data integration and query processing stepts then produce a number a role-sets, ranked by the similarity of the candidate IDs

Similar documents (content)

  1. Gyseghem, N. van; Caluwe, R. de: Imprecision and uncertainty in the UFO database model (1998) 0.18
    0.18469842 = sum of:
      0.18469842 = product of:
        0.5771826 = sum of:
          0.118003935 = weight(abstract_txt:imprecise in 1591) [ClassicSimilarity], result of:
            0.118003935 = score(doc=1591,freq=2.0), product of:
              0.15951969 = queryWeight, product of:
                1.1639159 = boost
                8.369263 = idf(docFreq=27, maxDocs=44421)
                0.016375912 = queryNorm
              0.73974526 = fieldWeight in 1591, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.369263 = idf(docFreq=27, maxDocs=44421)
                0.0625 = fieldNorm(doc=1591)
          0.059112996 = weight(abstract_txt:database in 1591) [ClassicSimilarity], result of:
            0.059112996 = score(doc=1591,freq=7.0), product of:
              0.08349477 = queryWeight, product of:
                1.1908555 = boost
                4.2814875 = idf(docFreq=1668, maxDocs=44421)
                0.016375912 = queryNorm
              0.70798445 = fieldWeight in 1591, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                4.2814875 = idf(docFreq=1668, maxDocs=44421)
                0.0625 = fieldNorm(doc=1591)
          0.025503082 = weight(abstract_txt:semantic in 1591) [ClassicSimilarity], result of:
            0.025503082 = score(doc=1591,freq=1.0), product of:
              0.09119375 = queryWeight, product of:
                1.2445489 = boost
                4.4745317 = idf(docFreq=1375, maxDocs=44421)
                0.016375912 = queryNorm
              0.27965823 = fieldWeight in 1591, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4745317 = idf(docFreq=1375, maxDocs=44421)
                0.0625 = fieldNorm(doc=1591)
          0.015771104 = weight(abstract_txt:data in 1591) [ClassicSimilarity], result of:
            0.015771104 = score(doc=1591,freq=1.0), product of:
              0.07577194 = queryWeight, product of:
                1.3894064 = boost
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.016375912 = queryNorm
              0.20813909 = fieldWeight in 1591, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.0625 = fieldNorm(doc=1591)
          0.016949354 = weight(abstract_txt:systems in 1591) [ClassicSimilarity], result of:
            0.016949354 = score(doc=1591,freq=1.0), product of:
              0.07950037 = queryWeight, product of:
                1.4231794 = boost
                3.411175 = idf(docFreq=3984, maxDocs=44421)
                0.016375912 = queryNorm
              0.21319844 = fieldWeight in 1591, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.411175 = idf(docFreq=3984, maxDocs=44421)
                0.0625 = fieldNorm(doc=1591)
          0.052566808 = weight(abstract_txt:role in 1591) [ClassicSimilarity], result of:
            0.052566808 = score(doc=1591,freq=2.0), product of:
              0.1341935 = queryWeight, product of:
                1.8490165 = boost
                4.431851 = idf(docFreq=1435, maxDocs=44421)
                0.016375912 = queryNorm
              0.39172396 = fieldWeight in 1591, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.431851 = idf(docFreq=1435, maxDocs=44421)
                0.0625 = fieldNorm(doc=1591)
          0.13403302 = weight(abstract_txt:uncertainty in 1591) [ClassicSimilarity], result of:
            0.13403302 = score(doc=1591,freq=2.0), product of:
              0.21879354 = queryWeight, product of:
                1.9277321 = boost
                6.930783 = idf(docFreq=117, maxDocs=44421)
                0.016375912 = queryNorm
              0.61260045 = fieldWeight in 1591, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.930783 = idf(docFreq=117, maxDocs=44421)
                0.0625 = fieldNorm(doc=1591)
          0.1552423 = weight(abstract_txt:attribute in 1591) [ClassicSimilarity], result of:
            0.1552423 = score(doc=1591,freq=1.0), product of:
              0.3480223 = queryWeight, product of:
                2.977684 = boost
                7.1371193 = idf(docFreq=95, maxDocs=44421)
                0.016375912 = queryNorm
              0.44606996 = fieldWeight in 1591, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.1371193 = idf(docFreq=95, maxDocs=44421)
                0.0625 = fieldNorm(doc=1591)
        0.32 = coord(8/25)
    
  2. Boßmeyer, C.: OSI-Anwendungen in Bibliotheken oder Was ein Bibliothekar von OSI wissen sollte (1995) 0.18
    0.17597543 = sum of:
      0.17597543 = product of:
        0.73323095 = sum of:
          0.031542208 = weight(abstract_txt:data in 5150) [ClassicSimilarity], result of:
            0.031542208 = score(doc=5150,freq=1.0), product of:
              0.07577194 = queryWeight, product of:
                1.3894064 = boost
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.016375912 = queryNorm
              0.41627818 = fieldWeight in 5150, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.125 = fieldNorm(doc=5150)
          0.047940016 = weight(abstract_txt:systems in 5150) [ClassicSimilarity], result of:
            0.047940016 = score(doc=5150,freq=2.0), product of:
              0.07950037 = queryWeight, product of:
                1.4231794 = boost
                3.411175 = idf(docFreq=3984, maxDocs=44421)
                0.016375912 = queryNorm
              0.60301626 = fieldWeight in 5150, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.411175 = idf(docFreq=3984, maxDocs=44421)
                0.125 = fieldNorm(doc=5150)
          0.10201862 = weight(abstract_txt:processing in 5150) [ClassicSimilarity], result of:
            0.10201862 = score(doc=5150,freq=1.0), product of:
              0.16571684 = queryWeight, product of:
                2.054747 = boost
                4.9249606 = idf(docFreq=876, maxDocs=44421)
                0.016375912 = queryNorm
              0.6156201 = fieldWeight in 5150, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.9249606 = idf(docFreq=876, maxDocs=44421)
                0.125 = fieldNorm(doc=5150)
          0.118862234 = weight(abstract_txt:sets in 5150) [ClassicSimilarity], result of:
            0.118862234 = score(doc=5150,freq=1.0), product of:
              0.18348883 = queryWeight, product of:
                2.1621206 = boost
                5.18232 = idf(docFreq=677, maxDocs=44421)
                0.016375912 = queryNorm
              0.64779 = fieldWeight in 5150, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.18232 = idf(docFreq=677, maxDocs=44421)
                0.125 = fieldNorm(doc=5150)
          0.1223833 = weight(abstract_txt:query in 5150) [ClassicSimilarity], result of:
            0.1223833 = score(doc=5150,freq=1.0), product of:
              0.20592451 = queryWeight, product of:
                2.6448343 = boost
                4.754492 = idf(docFreq=1039, maxDocs=44421)
                0.016375912 = queryNorm
              0.5943115 = fieldWeight in 5150, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.754492 = idf(docFreq=1039, maxDocs=44421)
                0.125 = fieldNorm(doc=5150)
          0.3104846 = weight(abstract_txt:attribute in 5150) [ClassicSimilarity], result of:
            0.3104846 = score(doc=5150,freq=1.0), product of:
              0.3480223 = queryWeight, product of:
                2.977684 = boost
                7.1371193 = idf(docFreq=95, maxDocs=44421)
                0.016375912 = queryNorm
              0.8921399 = fieldWeight in 5150, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.1371193 = idf(docFreq=95, maxDocs=44421)
                0.125 = fieldNorm(doc=5150)
        0.24 = coord(6/25)
    
  3. Quast, D.: Rationaliy, neural sets and deterministic chaos : knowledge organisation for the human mind; the user of libraries and information centres (1996) 0.15
    0.15394193 = sum of:
      0.15394193 = product of:
        0.5497926 = sum of:
          0.12516208 = weight(abstract_txt:imprecise in 1002) [ClassicSimilarity], result of:
            0.12516208 = score(doc=1002,freq=1.0), product of:
              0.15951969 = queryWeight, product of:
                1.1639159 = boost
                8.369263 = idf(docFreq=27, maxDocs=44421)
                0.016375912 = queryNorm
              0.7846184 = fieldWeight in 1002, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.369263 = idf(docFreq=27, maxDocs=44421)
                0.09375 = fieldNorm(doc=1002)
          0.047395837 = weight(abstract_txt:database in 1002) [ClassicSimilarity], result of:
            0.047395837 = score(doc=1002,freq=2.0), product of:
              0.08349477 = queryWeight, product of:
                1.1908555 = boost
                4.2814875 = idf(docFreq=1668, maxDocs=44421)
                0.016375912 = queryNorm
              0.5676504 = fieldWeight in 1002, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.2814875 = idf(docFreq=1668, maxDocs=44421)
                0.09375 = fieldNorm(doc=1002)
          0.03345556 = weight(abstract_txt:data in 1002) [ClassicSimilarity], result of:
            0.03345556 = score(doc=1002,freq=2.0), product of:
              0.07577194 = queryWeight, product of:
                1.3894064 = boost
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.016375912 = queryNorm
              0.4415297 = fieldWeight in 1002, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.09375 = fieldNorm(doc=1002)
          0.03595501 = weight(abstract_txt:systems in 1002) [ClassicSimilarity], result of:
            0.03595501 = score(doc=1002,freq=2.0), product of:
              0.07950037 = queryWeight, product of:
                1.4231794 = boost
                3.411175 = idf(docFreq=3984, maxDocs=44421)
                0.016375912 = queryNorm
              0.4522622 = fieldWeight in 1002, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.411175 = idf(docFreq=3984, maxDocs=44421)
                0.09375 = fieldNorm(doc=1002)
          0.14216349 = weight(abstract_txt:uncertainty in 1002) [ClassicSimilarity], result of:
            0.14216349 = score(doc=1002,freq=1.0), product of:
              0.21879354 = queryWeight, product of:
                1.9277321 = boost
                6.930783 = idf(docFreq=117, maxDocs=44421)
                0.016375912 = queryNorm
              0.6497609 = fieldWeight in 1002, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.930783 = idf(docFreq=117, maxDocs=44421)
                0.09375 = fieldNorm(doc=1002)
          0.07651396 = weight(abstract_txt:processing in 1002) [ClassicSimilarity], result of:
            0.07651396 = score(doc=1002,freq=1.0), product of:
              0.16571684 = queryWeight, product of:
                2.054747 = boost
                4.9249606 = idf(docFreq=876, maxDocs=44421)
                0.016375912 = queryNorm
              0.46171504 = fieldWeight in 1002, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.9249606 = idf(docFreq=876, maxDocs=44421)
                0.09375 = fieldNorm(doc=1002)
          0.08914667 = weight(abstract_txt:sets in 1002) [ClassicSimilarity], result of:
            0.08914667 = score(doc=1002,freq=1.0), product of:
              0.18348883 = queryWeight, product of:
                2.1621206 = boost
                5.18232 = idf(docFreq=677, maxDocs=44421)
                0.016375912 = queryNorm
              0.48584253 = fieldWeight in 1002, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.18232 = idf(docFreq=677, maxDocs=44421)
                0.09375 = fieldNorm(doc=1002)
        0.28 = coord(7/25)
    
  4. Euzenat, J.; Shvaiko, P.: Ontology matching (2010) 0.14
    0.14314047 = sum of:
      0.14314047 = product of:
        0.5964186 = sum of:
          0.053113833 = weight(abstract_txt:equivalence in 1168) [ClassicSimilarity], result of:
            0.053113833 = score(doc=1168,freq=1.0), product of:
              0.12903069 = queryWeight, product of:
                1.0467933 = boost
                7.5270805 = idf(docFreq=64, maxDocs=44421)
                0.016375912 = queryNorm
              0.41163722 = fieldWeight in 1168, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5270805 = idf(docFreq=64, maxDocs=44421)
                0.0546875 = fieldNorm(doc=1168)
          0.033861224 = weight(abstract_txt:database in 1168) [ClassicSimilarity], result of:
            0.033861224 = score(doc=1168,freq=3.0), product of:
              0.08349477 = queryWeight, product of:
                1.1908555 = boost
                4.2814875 = idf(docFreq=1668, maxDocs=44421)
                0.016375912 = queryNorm
              0.40554905 = fieldWeight in 1168, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.2814875 = idf(docFreq=1668, maxDocs=44421)
                0.0546875 = fieldNorm(doc=1168)
          0.044630397 = weight(abstract_txt:semantic in 1168) [ClassicSimilarity], result of:
            0.044630397 = score(doc=1168,freq=4.0), product of:
              0.09119375 = queryWeight, product of:
                1.2445489 = boost
                4.4745317 = idf(docFreq=1375, maxDocs=44421)
                0.016375912 = queryNorm
              0.4894019 = fieldWeight in 1168, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.4745317 = idf(docFreq=1375, maxDocs=44421)
                0.0546875 = fieldNorm(doc=1168)
          0.036327615 = weight(abstract_txt:systems in 1168) [ClassicSimilarity], result of:
            0.036327615 = score(doc=1168,freq=6.0), product of:
              0.07950037 = queryWeight, product of:
                1.4231794 = boost
                3.411175 = idf(docFreq=3984, maxDocs=44421)
                0.016375912 = queryNorm
              0.456949 = fieldWeight in 1168, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                3.411175 = idf(docFreq=3984, maxDocs=44421)
                0.0546875 = fieldNorm(doc=1168)
          0.23656107 = weight(abstract_txt:correspondences in 1168) [ClassicSimilarity], result of:
            0.23656107 = score(doc=1168,freq=2.0), product of:
              0.3492878 = queryWeight, product of:
                2.4356852 = boost
                8.757029 = idf(docFreq=18, maxDocs=44421)
                0.016375912 = queryNorm
              0.6772669 = fieldWeight in 1168, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.757029 = idf(docFreq=18, maxDocs=44421)
                0.0546875 = fieldNorm(doc=1168)
          0.1919245 = weight(abstract_txt:integration in 1168) [ClassicSimilarity], result of:
            0.1919245 = score(doc=1168,freq=3.0), product of:
              0.3828114 = queryWeight, product of:
                4.4165435 = boost
                5.2929387 = idf(docFreq=606, maxDocs=44421)
                0.016375912 = queryNorm
              0.50135523 = fieldWeight in 1168, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.2929387 = idf(docFreq=606, maxDocs=44421)
                0.0546875 = fieldNorm(doc=1168)
        0.24 = coord(6/25)
    
  5. Cheung, W.; Hsu, C.: ¬The model-assisted global query system for multiple databases in distributed enterprises (1996) 0.13
    0.13426138 = sum of:
      0.13426138 = product of:
        0.6713069 = sum of:
          0.029962512 = weight(abstract_txt:systems in 348) [ClassicSimilarity], result of:
            0.029962512 = score(doc=348,freq=2.0), product of:
              0.07950037 = queryWeight, product of:
                1.4231794 = boost
                3.411175 = idf(docFreq=3984, maxDocs=44421)
                0.016375912 = queryNorm
              0.37688518 = fieldWeight in 348, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.411175 = idf(docFreq=3984, maxDocs=44421)
                0.078125 = fieldNorm(doc=348)
          0.06376164 = weight(abstract_txt:processing in 348) [ClassicSimilarity], result of:
            0.06376164 = score(doc=348,freq=1.0), product of:
              0.16571684 = queryWeight, product of:
                2.054747 = boost
                4.9249606 = idf(docFreq=876, maxDocs=44421)
                0.016375912 = queryNorm
              0.38476256 = fieldWeight in 348, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.9249606 = idf(docFreq=876, maxDocs=44421)
                0.078125 = fieldNorm(doc=348)
          0.15297912 = weight(abstract_txt:query in 348) [ClassicSimilarity], result of:
            0.15297912 = score(doc=348,freq=4.0), product of:
              0.20592451 = queryWeight, product of:
                2.6448343 = boost
                4.754492 = idf(docFreq=1039, maxDocs=44421)
                0.016375912 = queryNorm
              0.74288934 = fieldWeight in 348, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.754492 = idf(docFreq=1039, maxDocs=44421)
                0.078125 = fieldNorm(doc=348)
          0.26630697 = weight(abstract_txt:global in 348) [ClassicSimilarity], result of:
            0.26630697 = score(doc=348,freq=3.0), product of:
              0.35331157 = queryWeight, product of:
                3.8732753 = boost
                5.570241 = idf(docFreq=459, maxDocs=44421)
                0.016375912 = queryNorm
              0.7537454 = fieldWeight in 348, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.570241 = idf(docFreq=459, maxDocs=44421)
                0.078125 = fieldNorm(doc=348)
          0.15829666 = weight(abstract_txt:integration in 348) [ClassicSimilarity], result of:
            0.15829666 = score(doc=348,freq=1.0), product of:
              0.3828114 = queryWeight, product of:
                4.4165435 = boost
                5.2929387 = idf(docFreq=606, maxDocs=44421)
                0.016375912 = queryNorm
              0.41351083 = fieldWeight in 348, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2929387 = idf(docFreq=606, maxDocs=44421)
                0.078125 = fieldNorm(doc=348)
        0.2 = coord(5/25)