Document (#37470)

Author
Rauber, A.
Title
Digital preservation in data-driven science : on the importance of process capture, preservation and validation
Source
Proceedings of the 2nd International Workshop on Semantic Digital Archives held in conjunction with the 16th Int. Conference on Theory and Practice of Digital Libraries (TPDL) on September 27, 2012 in Paphos, Cyprus [http://ceur-ws.org/Vol-912/proceedings.pdf]. Eds.: A. Mitschik et al.
Year
2012
Pages
pp. 7-17
Abstract
Current digital preservation is strongly biased towards data objects: digital files of document-style objects, or encapsulated and largely self-contained objects. To provide authenticity and provenance information, comprehensive metadata models are deployed to document information on an object's context. Yet, we claim that simply documenting an object's context may not be sufficient to ensure proper provenance and to fulfill the stated preservation goals. Specifically in e-Science and business settings, capturing, documenting and preserving entire processes may be necessary to meet the preservation goals. We thus present an approach for capturing, documenting and preserving processes, and means to assess their authenticity upon re-execution. We will discuss options as well as limitations and open challenges to achieve sound preservation, specifically within scientific processes.
Content
See also: http://sda2012.dke-research.de.

Similar documents (author)

  1. Rauch, C.; Rauber, A.: Anwendung der Nutzwertanalyse zur Bewertung von Strategien zur langfristigen Erhaltung digitaler Objekte (2005) 4.88
    4.8777785 = sum of:
      4.8777785 = weight(author_txt:rauber in 4859) [ClassicSimilarity], result of:
        4.8777785 = fieldWeight in 4859, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.755557 = idf(docFreq=6, maxDocs=44421)
          0.5 = fieldNorm(doc=4859)
    
  2. Becker, C.; Rauber, A.: Decision criteria in digital preservation : what to measure and how (2011) 4.88
    4.8777785 = sum of:
      4.8777785 = weight(author_txt:rauber in 456) [ClassicSimilarity], result of:
        4.8777785 = fieldWeight in 456, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.755557 = idf(docFreq=6, maxDocs=44421)
          0.5 = fieldNorm(doc=456)
    
  3. Rauber, K.; Nilges, A.: Was hieß noch mal schnell "Unterbegriff" auf Englisch? : Finden Sie die Antwort im Glossary to Terms of Information Literacy (2011) 4.88
    4.8777785 = sum of:
      4.8777785 = weight(author_txt:rauber in 518) [ClassicSimilarity], result of:
        4.8777785 = fieldWeight in 518, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.755557 = idf(docFreq=6, maxDocs=44421)
          0.5 = fieldNorm(doc=518)
    
  4. Bashir, S.; Rauber, A.: On the relationship between query characteristics and IR functions retrieval bias (2011) 4.88
    4.8777785 = sum of:
      4.8777785 = weight(author_txt:rauber in 628) [ClassicSimilarity], result of:
        4.8777785 = fieldWeight in 628, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.755557 = idf(docFreq=6, maxDocs=44421)
          0.5 = fieldNorm(doc=628)
    
  5. Klein, A.; Mitschang, J.; Nilges, A.; Oberhausen, B.; Rauber, K.; Weiß, A.: "Aus der Praxis für die Praxis" : ein Glossar zu Begriffen der Informationskompetenz (2008) 2.44
    2.4388893 = sum of:
      2.4388893 = weight(author_txt:rauber in 2282) [ClassicSimilarity], result of:
        2.4388893 = fieldWeight in 2282, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.755557 = idf(docFreq=6, maxDocs=44421)
          0.25 = fieldNorm(doc=2282)
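
Note on the author scores: each value above is a fieldWeight, the product of tf, idf and fieldNorm. The following is a minimal sketch of that arithmetic, assuming the usual ClassicSimilarity definitions tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)). The function names are hypothetical and not part of any library; Lucene itself computes in 32-bit floats, so the last digits may differ slightly.

```python
import math


def tf(freq: float) -> float:
    # Term-frequency factor: square root of the raw term frequency.
    return math.sqrt(freq)


def idf(doc_freq: int, max_docs: int) -> float:
    # Inverse document frequency: 1 + ln(maxDocs / (docFreq + 1)).
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def field_weight(freq: float, doc_freq: int, max_docs: int, field_norm: float) -> float:
    # fieldWeight = tf * idf * fieldNorm, as in the explanation trees above.
    return tf(freq) * idf(doc_freq, max_docs) * field_norm


# Entries 1-4 use fieldNorm 0.5; entry 5 uses fieldNorm 0.25.
weight_half = field_weight(1.0, 6, 44421, 0.5)
weight_quarter = field_weight(1.0, 6, 44421, 0.25)
print(round(weight_half, 4), round(weight_quarter, 4))
```

Only fieldNorm differs between entries: the first four use 0.5 and the fifth uses 0.25, which is why the fifth entry scores exactly half as much as the others.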
    

Similar documents (content)

  1. Tognoli, N.; Chaves-Guimarães, J.A.: Provenance as a knowledge organization principle (2019) 0.28
    0.28067645 = sum of:
      0.28067645 = product of:
        0.87711394 = sum of:
          0.029769847 = weight(abstract_txt:science in 489) [ClassicSimilarity], result of:
            0.029769847 = score(doc=489,freq=4.0), product of:
              0.061850026 = queryWeight, product of:
                1.15442 = boost
                3.850585 = idf(docFreq=2567, maxDocs=44421)
                0.013913915 = queryNorm
              0.48132312 = fieldWeight in 489, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.850585 = idf(docFreq=2567, maxDocs=44421)
                0.0625 = fieldNorm(doc=489)
          0.020644201 = weight(abstract_txt:document in 489) [ClassicSimilarity], result of:
            0.020644201 = score(doc=489,freq=1.0), product of:
              0.07692028 = queryWeight, product of:
                1.2874024 = boost
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.013913915 = queryNorm
              0.26838437 = fieldWeight in 489, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.0625 = fieldNorm(doc=489)
          0.021211484 = weight(abstract_txt:context in 489) [ClassicSimilarity], result of:
            0.021211484 = score(doc=489,freq=1.0), product of:
              0.07832304 = queryWeight, product of:
                1.2990882 = boost
                4.333128 = idf(docFreq=1584, maxDocs=44421)
                0.013913915 = queryNorm
              0.2708205 = fieldWeight in 489, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.333128 = idf(docFreq=1584, maxDocs=44421)
                0.0625 = fieldNorm(doc=489)
          0.06338531 = weight(abstract_txt:digital in 489) [ClassicSimilarity], result of:
            0.06338531 = score(doc=489,freq=4.0), product of:
              0.117177695 = queryWeight, product of:
                1.9460859 = boost
                4.3274655 = idf(docFreq=1593, maxDocs=44421)
                0.013913915 = queryNorm
              0.5409332 = fieldWeight in 489, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.3274655 = idf(docFreq=1593, maxDocs=44421)
                0.0625 = fieldNorm(doc=489)
          0.2995333 = weight(abstract_txt:provenance in 489) [ClassicSimilarity], result of:
            0.2995333 = score(doc=489,freq=6.0), product of:
              0.25181898 = queryWeight, product of:
                2.329368 = boost
                7.769642 = idf(docFreq=50, maxDocs=44421)
                0.013913915 = queryNorm
              1.1894786 = fieldWeight in 489, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                7.769642 = idf(docFreq=50, maxDocs=44421)
                0.0625 = fieldNorm(doc=489)
          0.14732827 = weight(abstract_txt:authenticity in 489) [ClassicSimilarity], result of:
            0.14732827 = score(doc=489,freq=1.0), product of:
              0.2851234 = queryWeight, product of:
                2.4786222 = boost
                8.267481 = idf(docFreq=30, maxDocs=44421)
                0.013913915 = queryNorm
              0.51671755 = fieldWeight in 489, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.267481 = idf(docFreq=30, maxDocs=44421)
                0.0625 = fieldNorm(doc=489)
          0.084929526 = weight(abstract_txt:objects in 489) [ClassicSimilarity], result of:
            0.084929526 = score(doc=489,freq=1.0), product of:
              0.24882343 = queryWeight, product of:
                3.274572 = boost
                5.4611917 = idf(docFreq=512, maxDocs=44421)
                0.013913915 = queryNorm
              0.34132448 = fieldWeight in 489, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4611917 = idf(docFreq=512, maxDocs=44421)
                0.0625 = fieldNorm(doc=489)
          0.21031204 = weight(abstract_txt:preservation in 489) [ClassicSimilarity], result of:
            0.21031204 = score(doc=489,freq=1.0), product of:
              0.52134514 = queryWeight, product of:
                5.8052 = boost
                6.4544435 = idf(docFreq=189, maxDocs=44421)
                0.013913915 = queryNorm
              0.40340272 = fieldWeight in 489, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.4544435 = idf(docFreq=189, maxDocs=44421)
                0.0625 = fieldNorm(doc=489)
        0.32 = coord(8/25)
    
  2. Dobratz, S.; Neuroth, H.: nestor: Network of Expertise in long-term STOrage of digital Resources : a digital preservation initiative for Germany (2004) 0.19
    0.19021781 = sum of:
      0.19021781 = product of:
        0.6793493 = sum of:
          0.007442462 = weight(abstract_txt:science in 2195) [ClassicSimilarity], result of:
            0.007442462 = score(doc=2195,freq=1.0), product of:
              0.061850026 = queryWeight, product of:
                1.15442 = boost
                3.850585 = idf(docFreq=2567, maxDocs=44421)
                0.013913915 = queryNorm
              0.12033078 = fieldWeight in 2195, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.850585 = idf(docFreq=2567, maxDocs=44421)
                0.03125 = fieldNorm(doc=2195)
          0.014597654 = weight(abstract_txt:document in 2195) [ClassicSimilarity], result of:
            0.014597654 = score(doc=2195,freq=2.0), product of:
              0.07692028 = queryWeight, product of:
                1.2874024 = boost
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.013913915 = queryNorm
              0.1897764 = fieldWeight in 2195, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.03125 = fieldNorm(doc=2195)
          0.010605742 = weight(abstract_txt:context in 2195) [ClassicSimilarity], result of:
            0.010605742 = score(doc=2195,freq=1.0), product of:
              0.07832304 = queryWeight, product of:
                1.2990882 = boost
                4.333128 = idf(docFreq=1584, maxDocs=44421)
                0.013913915 = queryNorm
              0.13541025 = fieldWeight in 2195, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.333128 = idf(docFreq=1584, maxDocs=44421)
                0.03125 = fieldNorm(doc=2195)
          0.06723026 = weight(abstract_txt:digital in 2195) [ClassicSimilarity], result of:
            0.06723026 = score(doc=2195,freq=18.0), product of:
              0.117177695 = queryWeight, product of:
                1.9460859 = boost
                4.3274655 = idf(docFreq=1593, maxDocs=44421)
                0.013913915 = queryNorm
              0.57374626 = fieldWeight in 2195, product of:
                4.2426405 = tf(freq=18.0), with freq of:
                  18.0 = termFreq=18.0
                4.3274655 = idf(docFreq=1593, maxDocs=44421)
                0.03125 = fieldNorm(doc=2195)
          0.073664136 = weight(abstract_txt:authenticity in 2195) [ClassicSimilarity], result of:
            0.073664136 = score(doc=2195,freq=1.0), product of:
              0.2851234 = queryWeight, product of:
                2.4786222 = boost
                8.267481 = idf(docFreq=30, maxDocs=44421)
                0.013913915 = queryNorm
              0.25835878 = fieldWeight in 2195, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.267481 = idf(docFreq=30, maxDocs=44421)
                0.03125 = fieldNorm(doc=2195)
          0.1123512 = weight(abstract_txt:objects in 2195) [ClassicSimilarity], result of:
            0.1123512 = score(doc=2195,freq=7.0), product of:
              0.24882343 = queryWeight, product of:
                3.274572 = boost
                5.4611917 = idf(docFreq=512, maxDocs=44421)
                0.013913915 = queryNorm
              0.45152983 = fieldWeight in 2195, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                5.4611917 = idf(docFreq=512, maxDocs=44421)
                0.03125 = fieldNorm(doc=2195)
          0.3934578 = weight(abstract_txt:preservation in 2195) [ClassicSimilarity], result of:
            0.3934578 = score(doc=2195,freq=14.0), product of:
              0.52134514 = queryWeight, product of:
                5.8052 = boost
                6.4544435 = idf(docFreq=189, maxDocs=44421)
                0.013913915 = queryNorm
              0.7546974 = fieldWeight in 2195, product of:
                3.7416575 = tf(freq=14.0), with freq of:
                  14.0 = termFreq=14.0
                6.4544435 = idf(docFreq=189, maxDocs=44421)
                0.03125 = fieldNorm(doc=2195)
        0.28 = coord(7/25)
    
  3. Hendley, T.: ¬The preservation of digital material (1996) 0.19
    0.18600893 = sum of:
      0.18600893 = product of:
        0.93004465 = sum of:
          0.08481474 = weight(abstract_txt:stated in 5153) [ClassicSimilarity], result of:
            0.08481474 = score(doc=5153,freq=1.0), product of:
              0.10784193 = queryWeight, product of:
                1.0778857 = boost
                7.190608 = idf(docFreq=90, maxDocs=44421)
                0.013913915 = queryNorm
              0.78647274 = fieldWeight in 5153, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.190608 = idf(docFreq=90, maxDocs=44421)
                0.109375 = fieldNorm(doc=5153)
          0.037120096 = weight(abstract_txt:context in 5153) [ClassicSimilarity], result of:
            0.037120096 = score(doc=5153,freq=1.0), product of:
              0.07832304 = queryWeight, product of:
                1.2990882 = boost
                4.333128 = idf(docFreq=1584, maxDocs=44421)
                0.013913915 = queryNorm
              0.47393587 = fieldWeight in 5153, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.333128 = idf(docFreq=1584, maxDocs=44421)
                0.109375 = fieldNorm(doc=5153)
          0.07843531 = weight(abstract_txt:digital in 5153) [ClassicSimilarity], result of:
            0.07843531 = score(doc=5153,freq=2.0), product of:
              0.117177695 = queryWeight, product of:
                1.9460859 = boost
                4.3274655 = idf(docFreq=1593, maxDocs=44421)
                0.013913915 = queryNorm
              0.66937065 = fieldWeight in 5153, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.3274655 = idf(docFreq=1593, maxDocs=44421)
                0.109375 = fieldNorm(doc=5153)
          0.092200056 = weight(abstract_txt:processes in 5153) [ClassicSimilarity], result of:
            0.092200056 = score(doc=5153,freq=1.0), product of:
              0.16443767 = queryWeight, product of:
                2.3053675 = boost
                5.126392 = idf(docFreq=716, maxDocs=44421)
                0.013913915 = queryNorm
              0.5606991 = fieldWeight in 5153, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.126392 = idf(docFreq=716, maxDocs=44421)
                0.109375 = fieldNorm(doc=5153)
          0.6374745 = weight(abstract_txt:preservation in 5153) [ClassicSimilarity], result of:
            0.6374745 = score(doc=5153,freq=3.0), product of:
              0.52134514 = queryWeight, product of:
                5.8052 = boost
                6.4544435 = idf(docFreq=189, maxDocs=44421)
                0.013913915 = queryNorm
              1.2227495 = fieldWeight in 5153, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.4544435 = idf(docFreq=189, maxDocs=44421)
                0.109375 = fieldNorm(doc=5153)
        0.2 = coord(5/25)
    
  4. Dalkir, K.: Knowledge management (2009) 0.17
    0.1697729 = sum of:
      0.1697729 = product of:
        0.7073871 = sum of:
          0.025781445 = weight(abstract_txt:science in 819) [ClassicSimilarity], result of:
            0.025781445 = score(doc=819,freq=3.0), product of:
              0.061850026 = queryWeight, product of:
                1.15442 = boost
                3.850585 = idf(docFreq=2567, maxDocs=44421)
                0.013913915 = queryNorm
              0.41683805 = fieldWeight in 819, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.850585 = idf(docFreq=2567, maxDocs=44421)
                0.0625 = fieldNorm(doc=819)
          0.10823437 = weight(abstract_txt:encapsulated in 819) [ClassicSimilarity], result of:
            0.10823437 = score(doc=819,freq=1.0), product of:
              0.18425061 = queryWeight, product of:
                1.4089104 = boost
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.013913915 = queryNorm
              0.5874302 = fieldWeight in 819, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.0625 = fieldNorm(doc=819)
          0.109197475 = weight(abstract_txt:capturing in 819) [ClassicSimilarity], result of:
            0.109197475 = score(doc=819,freq=1.0), product of:
              0.2335163 = queryWeight, product of:
                2.24312 = boost
                7.48196 = idf(docFreq=67, maxDocs=44421)
                0.013913915 = queryNorm
              0.4676225 = fieldWeight in 819, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.48196 = idf(docFreq=67, maxDocs=44421)
                0.0625 = fieldNorm(doc=819)
          0.052685745 = weight(abstract_txt:processes in 819) [ClassicSimilarity], result of:
            0.052685745 = score(doc=819,freq=1.0), product of:
              0.16443767 = queryWeight, product of:
                2.3053675 = boost
                5.126392 = idf(docFreq=716, maxDocs=44421)
                0.013913915 = queryNorm
              0.3203995 = fieldWeight in 819, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.126392 = idf(docFreq=716, maxDocs=44421)
                0.0625 = fieldNorm(doc=819)
          0.20117605 = weight(abstract_txt:documenting in 819) [ClassicSimilarity], result of:
            0.20117605 = score(doc=819,freq=1.0), product of:
              0.40171996 = queryWeight, product of:
                3.6033068 = boost
                8.0125885 = idf(docFreq=39, maxDocs=44421)
                0.013913915 = queryNorm
              0.5007868 = fieldWeight in 819, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.0125885 = idf(docFreq=39, maxDocs=44421)
                0.0625 = fieldNorm(doc=819)
          0.21031204 = weight(abstract_txt:preservation in 819) [ClassicSimilarity], result of:
            0.21031204 = score(doc=819,freq=1.0), product of:
              0.52134514 = queryWeight, product of:
                5.8052 = boost
                6.4544435 = idf(docFreq=189, maxDocs=44421)
                0.013913915 = queryNorm
              0.40340272 = fieldWeight in 819, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.4544435 = idf(docFreq=189, maxDocs=44421)
                0.0625 = fieldNorm(doc=819)
        0.24 = coord(6/25)
    
  5. Maemura, E.; Moles, N.; Becker, C.: Organizational assessment frameworks for digital preservation : a literature review and mapping (2017) 0.13
    0.13266192 = sum of:
      0.13266192 = product of:
        0.6633096 = sum of:
          0.06950375 = weight(abstract_txt:validation in 4743) [ClassicSimilarity], result of:
            0.06950375 = score(doc=4743,freq=2.0), product of:
              0.1088498 = queryWeight, product of:
                1.0829109 = boost
                7.2241306 = idf(docFreq=87, maxDocs=44421)
                0.013913915 = queryNorm
              0.63852894 = fieldWeight in 4743, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.2241306 = idf(docFreq=87, maxDocs=44421)
                0.0625 = fieldNorm(doc=4743)
          0.057110697 = weight(abstract_txt:goals in 4743) [ClassicSimilarity], result of:
            0.057110697 = score(doc=4743,freq=1.0), product of:
              0.15158415 = queryWeight, product of:
                1.8072606 = boost
                6.0281444 = idf(docFreq=290, maxDocs=44421)
                0.013913915 = queryNorm
              0.37675902 = fieldWeight in 4743, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.0281444 = idf(docFreq=290, maxDocs=44421)
                0.0625 = fieldNorm(doc=4743)
          0.06338531 = weight(abstract_txt:digital in 4743) [ClassicSimilarity], result of:
            0.06338531 = score(doc=4743,freq=4.0), product of:
              0.117177695 = queryWeight, product of:
                1.9460859 = boost
                4.3274655 = idf(docFreq=1593, maxDocs=44421)
                0.013913915 = queryNorm
              0.5409332 = fieldWeight in 4743, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.3274655 = idf(docFreq=1593, maxDocs=44421)
                0.0625 = fieldNorm(doc=4743)
          0.052685745 = weight(abstract_txt:processes in 4743) [ClassicSimilarity], result of:
            0.052685745 = score(doc=4743,freq=1.0), product of:
              0.16443767 = queryWeight, product of:
                2.3053675 = boost
                5.126392 = idf(docFreq=716, maxDocs=44421)
                0.013913915 = queryNorm
              0.3203995 = fieldWeight in 4743, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.126392 = idf(docFreq=716, maxDocs=44421)
                0.0625 = fieldNorm(doc=4743)
          0.42062408 = weight(abstract_txt:preservation in 4743) [ClassicSimilarity], result of:
            0.42062408 = score(doc=4743,freq=4.0), product of:
              0.52134514 = queryWeight, product of:
                5.8052 = boost
                6.4544435 = idf(docFreq=189, maxDocs=44421)
                0.013913915 = queryNorm
              0.80680543 = fieldWeight in 4743, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.4544435 = idf(docFreq=189, maxDocs=44421)
                0.0625 = fieldNorm(doc=4743)
        0.2 = coord(5/25)
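
Note on the content scores: each document score above is coord multiplied by the sum of the per-term scores, where a term score is queryWeight (boost * idf * queryNorm) times fieldWeight (tf * idf * fieldNorm). The sketch below recomputes entry 5 from the boost, docFreq and freq values listed in its tree, assuming the usual ClassicSimilarity definitions of tf and idf. The function names are hypothetical, and small deviations in the last digits are expected because Lucene uses 32-bit floats.

```python
import math

MAX_DOCS = 44421
QUERY_NORM = 0.013913915


def idf(doc_freq: int, max_docs: int = MAX_DOCS) -> float:
    # Inverse document frequency: 1 + ln(maxDocs / (docFreq + 1)).
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(boost: float, doc_freq: int, freq: float, field_norm: float) -> float:
    # queryWeight = boost * idf * queryNorm
    query_weight = boost * idf(doc_freq) * QUERY_NORM
    # fieldWeight = sqrt(freq) * idf * fieldNorm
    field_weight = math.sqrt(freq) * idf(doc_freq) * field_norm
    return query_weight * field_weight


# (boost, docFreq, freq) for the five matching terms of entry 5 (doc=4743).
terms = {
    "validation": (1.0829109, 87, 2.0),
    "goals": (1.8072606, 290, 1.0),
    "digital": (1.9460859, 1593, 4.0),
    "processes": (2.3053675, 716, 1.0),
    "preservation": (5.8052, 189, 4.0),
}
FIELD_NORM = 0.0625
QUERY_TERMS = 25  # denominator of coord(5/25)

term_sum = sum(term_score(b, df, f, FIELD_NORM) for b, df, f in terms.values())
coord = len(terms) / QUERY_TERMS
doc_score = coord * term_sum
print(round(term_sum, 4), round(doc_score, 4))
```

The coord factor scales the sum by the fraction of query terms matched (5 of 25 here), so a document that matches few terms ranks lower even when its individual term weights are high.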