Document (#20914)

Author
Becker, S.
Title
¬A practical perspective on data quality issues
Source
Journal of database management. 9(1998) no.1, S.35-37
Year
1998
Abstract
Explains why data quality is important. Problems that impact data quality include: data corruption due to incorrect conversion, historical and current data have different meanings, the same data has more than 1 data definition, missing data, hidden data, missing granularity, and violation of integrity rules. Suggests an improvement strategy to establish organizational commitment to cahnge what has been done in promoting data quality. Misconceptions that impact data quality are: data quality improves with the introduction of new technology; old data quality will not have an impact on new database development; and data quality is a database administration problem

Similar documents (author)

  1. Becker, J.: Zentrallager : Data Warehouse - zentrale Sammelstelle für Informationen (1997) 4.69
    4.6854844 = sum of:
      4.6854844 = weight(author_txt:becker in 4479) [ClassicSimilarity], result of:
        4.6854844 = fieldWeight in 4479, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.496775 = idf(docFreq=66, maxDocs=44421)
          0.625 = fieldNorm(doc=4479)
    
  2. Becker, C.A.: Community information service (1974) 4.69
    4.6854844 = sum of:
      4.6854844 = weight(author_txt:becker in 5736) [ClassicSimilarity], result of:
        4.6854844 = fieldWeight in 5736, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.496775 = idf(docFreq=66, maxDocs=44421)
          0.625 = fieldNorm(doc=5736)
    
  3. Becker, J.: Strategische Ausrichtung der Informations- und Organisationsstruktur des Unternehmens (1994) 4.69
    4.6854844 = sum of:
      4.6854844 = weight(author_txt:becker in 8382) [ClassicSimilarity], result of:
        4.6854844 = fieldWeight in 8382, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.496775 = idf(docFreq=66, maxDocs=44421)
          0.625 = fieldNorm(doc=8382)
    
  4. Becker, J.: Probleme des grenzüberschreitenden Datenflusses (1988) 4.69
    4.6854844 = sum of:
      4.6854844 = weight(author_txt:becker in 580) [ClassicSimilarity], result of:
        4.6854844 = fieldWeight in 580, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.496775 = idf(docFreq=66, maxDocs=44421)
          0.625 = fieldNorm(doc=580)
    
  5. Becker, J.: ¬Die Postmoderne und ihr Verhältnis zur Informationstheorie (1995) 4.69
    4.6854844 = sum of:
      4.6854844 = weight(author_txt:becker in 1108) [ClassicSimilarity], result of:
        4.6854844 = fieldWeight in 1108, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.496775 = idf(docFreq=66, maxDocs=44421)
          0.625 = fieldNorm(doc=1108)
    

Similar documents (content)

  1. Beamsley, T.G.: Securing digital image assets in museums and libraries : a risk management approach (1999) 0.14
    0.13888852 = sum of:
      0.13888852 = product of:
        0.5787022 = sum of:
          0.043082215 = weight(abstract_txt:establish in 967) [ClassicSimilarity], result of:
            0.043082215 = score(doc=967,freq=1.0), product of:
              0.10990147 = queryWeight, product of:
                1.1079113 = boost
                6.272122 = idf(docFreq=227, maxDocs=44421)
                0.01581554 = queryNorm
              0.39200762 = fieldWeight in 967, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.272122 = idf(docFreq=227, maxDocs=44421)
                0.0625 = fieldNorm(doc=967)
          0.011436293 = weight(abstract_txt:have in 967) [ClassicSimilarity], result of:
            0.011436293 = score(doc=967,freq=1.0), product of:
              0.0571924 = queryWeight, product of:
                1.1302836 = boost
                3.199388 = idf(docFreq=4924, maxDocs=44421)
                0.01581554 = queryNorm
              0.19996175 = fieldWeight in 967, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.199388 = idf(docFreq=4924, maxDocs=44421)
                0.0625 = fieldNorm(doc=967)
          0.13141605 = weight(abstract_txt:integrity in 967) [ClassicSimilarity], result of:
            0.13141605 = score(doc=967,freq=3.0), product of:
              0.1602741 = queryWeight, product of:
                1.3379347 = boost
                7.574333 = idf(docFreq=61, maxDocs=44421)
                0.01581554 = queryNorm
              0.81994563 = fieldWeight in 967, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.574333 = idf(docFreq=61, maxDocs=44421)
                0.0625 = fieldNorm(doc=967)
          0.16211034 = weight(abstract_txt:corruption in 967) [ClassicSimilarity], result of:
            0.16211034 = score(doc=967,freq=1.0), product of:
              0.2658757 = queryWeight, product of:
                1.7232274 = boost
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.01581554 = queryNorm
              0.6097223 = fieldWeight in 967, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.0625 = fieldNorm(doc=967)
          0.050092347 = weight(abstract_txt:impact in 967) [ClassicSimilarity], result of:
            0.050092347 = score(doc=967,freq=1.0), product of:
              0.17526405 = queryWeight, product of:
                2.4233174 = boost
                4.572972 = idf(docFreq=1246, maxDocs=44421)
                0.01581554 = queryNorm
              0.28581074 = fieldWeight in 967, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.572972 = idf(docFreq=1246, maxDocs=44421)
                0.0625 = fieldNorm(doc=967)
          0.18056497 = weight(abstract_txt:data in 967) [ClassicSimilarity], result of:
            0.18056497 = score(doc=967,freq=4.0), product of:
              0.43376034 = queryWeight, product of:
                8.235542 = boost
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.01581554 = queryNorm
              0.41627818 = fieldWeight in 967, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.0625 = fieldNorm(doc=967)
        0.24 = coord(6/25)
    
  2. Jiang, Z.; Liu, X.; Chen, Y.: Recovering uncaptured citations in a scholarly network : a two-step citation analysis to estimate publication importance (2016) 0.13
    0.13192515 = sum of:
      0.13192515 = product of:
        0.65962577 = sum of:
          0.011436293 = weight(abstract_txt:have in 4018) [ClassicSimilarity], result of:
            0.011436293 = score(doc=4018,freq=1.0), product of:
              0.0571924 = queryWeight, product of:
                1.1302836 = boost
                3.199388 = idf(docFreq=4924, maxDocs=44421)
                0.01581554 = queryNorm
              0.19996175 = fieldWeight in 4018, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.199388 = idf(docFreq=4924, maxDocs=44421)
                0.0625 = fieldNorm(doc=4018)
          0.050092347 = weight(abstract_txt:impact in 4018) [ClassicSimilarity], result of:
            0.050092347 = score(doc=4018,freq=1.0), product of:
              0.17526405 = queryWeight, product of:
                2.4233174 = boost
                4.572972 = idf(docFreq=1246, maxDocs=44421)
                0.01581554 = queryNorm
              0.28581074 = fieldWeight in 4018, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.572972 = idf(docFreq=1246, maxDocs=44421)
                0.0625 = fieldNorm(doc=4018)
          0.22803521 = weight(abstract_txt:missing in 4018) [ClassicSimilarity], result of:
            0.22803521 = score(doc=4018,freq=3.0), product of:
              0.29159206 = queryWeight, product of:
                2.5521495 = boost
                7.2241306 = idf(docFreq=87, maxDocs=44421)
                0.01581554 = queryNorm
              0.78203505 = fieldWeight in 4018, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.2241306 = idf(docFreq=87, maxDocs=44421)
                0.0625 = fieldNorm(doc=4018)
          0.24238326 = weight(abstract_txt:quality in 4018) [ClassicSimilarity], result of:
            0.24238326 = score(doc=4018,freq=3.0), product of:
              0.48209152 = queryWeight, product of:
                6.56316 = boost
                4.6444306 = idf(docFreq=1160, maxDocs=44421)
                0.01581554 = queryNorm
              0.50277436 = fieldWeight in 4018, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.6444306 = idf(docFreq=1160, maxDocs=44421)
                0.0625 = fieldNorm(doc=4018)
          0.1276787 = weight(abstract_txt:data in 4018) [ClassicSimilarity], result of:
            0.1276787 = score(doc=4018,freq=2.0), product of:
              0.43376034 = queryWeight, product of:
                8.235542 = boost
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.01581554 = queryNorm
              0.29435313 = fieldWeight in 4018, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.0625 = fieldNorm(doc=4018)
        0.2 = coord(5/25)
    
  3. Lunati, G.: On line union catalogue (OLUC) compie 25 anni (1996) 0.13
    0.13098255 = sum of:
      0.13098255 = product of:
        0.6549127 = sum of:
          0.047519762 = weight(abstract_txt:explains in 1105) [ClassicSimilarity], result of:
            0.047519762 = score(doc=1105,freq=1.0), product of:
              0.08953513 = queryWeight, product of:
                5.661213 = idf(docFreq=419, maxDocs=44421)
                0.01581554 = queryNorm
              0.5307387 = fieldWeight in 1105, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.661213 = idf(docFreq=419, maxDocs=44421)
                0.09375 = fieldNorm(doc=1105)
          0.060877252 = weight(abstract_txt:improvement in 1105) [ClassicSimilarity], result of:
            0.060877252 = score(doc=1105,freq=1.0), product of:
              0.10561218 = queryWeight, product of:
                1.0860761 = boost
                6.148508 = idf(docFreq=257, maxDocs=44421)
                0.01581554 = queryNorm
              0.57642263 = fieldWeight in 1105, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.148508 = idf(docFreq=257, maxDocs=44421)
                0.09375 = fieldNorm(doc=1105)
          0.058139972 = weight(abstract_txt:database in 1105) [ClassicSimilarity], result of:
            0.058139972 = score(doc=1105,freq=2.0), product of:
              0.10242215 = queryWeight, product of:
                1.512569 = boost
                4.2814875 = idf(docFreq=1668, maxDocs=44421)
                0.01581554 = queryNorm
              0.5676504 = fieldWeight in 1105, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.2814875 = idf(docFreq=1668, maxDocs=44421)
                0.09375 = fieldNorm(doc=1105)
          0.29685766 = weight(abstract_txt:quality in 1105) [ClassicSimilarity], result of:
            0.29685766 = score(doc=1105,freq=2.0), product of:
              0.48209152 = queryWeight, product of:
                6.56316 = boost
                4.6444306 = idf(docFreq=1160, maxDocs=44421)
                0.01581554 = queryNorm
              0.61577034 = fieldWeight in 1105, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.6444306 = idf(docFreq=1160, maxDocs=44421)
                0.09375 = fieldNorm(doc=1105)
          0.19151807 = weight(abstract_txt:data in 1105) [ClassicSimilarity], result of:
            0.19151807 = score(doc=1105,freq=2.0), product of:
              0.43376034 = queryWeight, product of:
                8.235542 = boost
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.01581554 = queryNorm
              0.4415297 = fieldWeight in 1105, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.09375 = fieldNorm(doc=1105)
        0.2 = coord(5/25)
    
  4. Baroncini, S.; Sartini, B.; Erp, M. Van; Tomasi, F.; Gangemi, A.: Is dc:subject enough? : A landscape on iconography and iconology statements of knowledge graphs in the semantic web (2023) 0.13
    0.1272615 = sum of:
      0.1272615 = product of:
        0.6363075 = sum of:
          0.046895612 = weight(abstract_txt:meanings in 2032) [ClassicSimilarity], result of:
            0.046895612 = score(doc=2032,freq=1.0), product of:
              0.12712206 = queryWeight, product of:
                1.191554 = boost
                6.7456408 = idf(docFreq=141, maxDocs=44421)
                0.01581554 = queryNorm
              0.36890224 = fieldWeight in 2032, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.7456408 = idf(docFreq=141, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2032)
          0.06862454 = weight(abstract_txt:granularity in 2032) [ClassicSimilarity], result of:
            0.06862454 = score(doc=2032,freq=1.0), product of:
              0.16385226 = queryWeight, product of:
                1.3527871 = boost
                7.6584163 = idf(docFreq=56, maxDocs=44421)
                0.01581554 = queryNorm
              0.41881964 = fieldWeight in 2032, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.6584163 = idf(docFreq=56, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2032)
          0.11519918 = weight(abstract_txt:missing in 2032) [ClassicSimilarity], result of:
            0.11519918 = score(doc=2032,freq=1.0), product of:
              0.29159206 = queryWeight, product of:
                2.5521495 = boost
                7.2241306 = idf(docFreq=87, maxDocs=44421)
                0.01581554 = queryNorm
              0.39506966 = fieldWeight in 2032, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.2241306 = idf(docFreq=87, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2032)
          0.21208535 = weight(abstract_txt:quality in 2032) [ClassicSimilarity], result of:
            0.21208535 = score(doc=2032,freq=3.0), product of:
              0.48209152 = queryWeight, product of:
                6.56316 = boost
                4.6444306 = idf(docFreq=1160, maxDocs=44421)
                0.01581554 = queryNorm
              0.43992758 = fieldWeight in 2032, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.6444306 = idf(docFreq=1160, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2032)
          0.19350278 = weight(abstract_txt:data in 2032) [ClassicSimilarity], result of:
            0.19350278 = score(doc=2032,freq=6.0), product of:
              0.43376034 = queryWeight, product of:
                8.235542 = boost
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.01581554 = queryNorm
              0.44610527 = fieldWeight in 2032, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2032)
        0.2 = coord(5/25)
    
  5. Barnett, J.: OCLC cataloging peer committees : an overview (1993) 0.11
    0.11473647 = sum of:
      0.11473647 = product of:
        0.71710294 = sum of:
          0.028590731 = weight(abstract_txt:have in 5987) [ClassicSimilarity], result of:
            0.028590731 = score(doc=5987,freq=1.0), product of:
              0.0571924 = queryWeight, product of:
                1.1302836 = boost
                3.199388 = idf(docFreq=4924, maxDocs=44421)
                0.01581554 = queryNorm
              0.4999044 = fieldWeight in 5987, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.199388 = idf(docFreq=4924, maxDocs=44421)
                0.15625 = fieldNorm(doc=5987)
          0.068518616 = weight(abstract_txt:database in 5987) [ClassicSimilarity], result of:
            0.068518616 = score(doc=5987,freq=1.0), product of:
              0.10242215 = queryWeight, product of:
                1.512569 = boost
                4.2814875 = idf(docFreq=1668, maxDocs=44421)
                0.01581554 = queryNorm
              0.6689824 = fieldWeight in 5987, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2814875 = idf(docFreq=1668, maxDocs=44421)
                0.15625 = fieldNorm(doc=5987)
          0.12523086 = weight(abstract_txt:impact in 5987) [ClassicSimilarity], result of:
            0.12523086 = score(doc=5987,freq=1.0), product of:
              0.17526405 = queryWeight, product of:
                2.4233174 = boost
                4.572972 = idf(docFreq=1246, maxDocs=44421)
                0.01581554 = queryNorm
              0.71452683 = fieldWeight in 5987, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.572972 = idf(docFreq=1246, maxDocs=44421)
                0.15625 = fieldNorm(doc=5987)
          0.49476275 = weight(abstract_txt:quality in 5987) [ClassicSimilarity], result of:
            0.49476275 = score(doc=5987,freq=2.0), product of:
              0.48209152 = queryWeight, product of:
                6.56316 = boost
                4.6444306 = idf(docFreq=1160, maxDocs=44421)
                0.01581554 = queryNorm
              1.0262839 = fieldWeight in 5987, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.6444306 = idf(docFreq=1160, maxDocs=44421)
                0.15625 = fieldNorm(doc=5987)
        0.16 = coord(4/25)