Document (#41466)

Author
Maemura, E.
Worby, N.
Milligan, I.
Becker, C.
Title
If these crawls could talk : studying and documenting web archives provenance
Source
Journal of the Association for Information Science and Technology. 69(2018) no.10, S.1223-1233
Year
2018
Abstract
The increasing use and prominence of web archives raises the urgency of establishing mechanisms for transparency in the making of web archives to facilitate the process of evaluating a web archive's provenance, scoping, and absences. Some choices and process events are captured automatically, but their interactions are not currently well understood or documented. This study examined the decision space of web archives and its role in shaping what is and what is not captured in the web archiving process. By comparing how three different web archives collections were created and documented, we investigate how curatorial decisions interact with technical and external factors and we compare commonalities and differences. The findings reveal the need to understand both the social and technical context that shapes those decisions and the ways in which these individual decisions interact. Based on the study, we propose a framework for documenting key dimensions of a collection that addresses the situated nature of the organizational context, technical specificities, and unique characteristics of web materials that are the focus of a collection. The framework enables future researchers to undertake empirical work studying the process of creating web archives collections in different contexts.
Content
Vgl.: https://onlinelibrary.wiley.com/doi/10.1002/asi.24048.
Theme
Internet

Similar documents (author)

  1. Becker, J.: Zentrallager : Data Warehouse - zentrale Sammelstelle für Informationen (1997) 4.68
    4.682621 = sum of:
      4.682621 = weight(author_txt:becker in 4480) [ClassicSimilarity], result of:
        4.682621 = score(doc=4480,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            7.4921947 = idf(docFreq=66, maxDocs=44218)
            0.13347223 = queryNorm
          4.6826215 = fieldWeight in 4480, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            7.4921947 = idf(docFreq=66, maxDocs=44218)
            0.625 = fieldNorm(doc=4480)
    
  2. Becker, C.A.: Community information service (1974) 4.68
    4.682621 = sum of:
      4.682621 = weight(author_txt:becker in 5737) [ClassicSimilarity], result of:
        4.682621 = score(doc=5737,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            7.4921947 = idf(docFreq=66, maxDocs=44218)
            0.13347223 = queryNorm
          4.6826215 = fieldWeight in 5737, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            7.4921947 = idf(docFreq=66, maxDocs=44218)
            0.625 = fieldNorm(doc=5737)
    
  3. Becker, J.: Strategische Ausrichtung der Informations- und Organisationsstruktur des Unternehmens (1994) 4.68
    4.682621 = sum of:
      4.682621 = weight(author_txt:becker in 8383) [ClassicSimilarity], result of:
        4.682621 = score(doc=8383,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            7.4921947 = idf(docFreq=66, maxDocs=44218)
            0.13347223 = queryNorm
          4.6826215 = fieldWeight in 8383, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            7.4921947 = idf(docFreq=66, maxDocs=44218)
            0.625 = fieldNorm(doc=8383)
    
  4. Becker, J.: Probleme des grenzüberschreitenden Datenflusses (1988) 4.68
    4.682621 = sum of:
      4.682621 = weight(author_txt:becker in 512) [ClassicSimilarity], result of:
        4.682621 = score(doc=512,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            7.4921947 = idf(docFreq=66, maxDocs=44218)
            0.13347223 = queryNorm
          4.6826215 = fieldWeight in 512, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            7.4921947 = idf(docFreq=66, maxDocs=44218)
            0.625 = fieldNorm(doc=512)
    
  5. Becker, J.: ¬Die Postmoderne und ihr Verhältnis zur Informationstheorie (1995) 4.68
    4.682621 = sum of:
      4.682621 = weight(author_txt:becker in 1040) [ClassicSimilarity], result of:
        4.682621 = score(doc=1040,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            7.4921947 = idf(docFreq=66, maxDocs=44218)
            0.13347223 = queryNorm
          4.6826215 = fieldWeight in 1040, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            7.4921947 = idf(docFreq=66, maxDocs=44218)
            0.625 = fieldNorm(doc=1040)
    

Similar documents (content)

  1. Ogden, J.; Summers, E.; Walker, S.: Know(ing) Infrastructure : the wayback machine as object and instrument of digital research (2023) 0.20
    0.2046937 = sum of:
      0.2046937 = product of:
        0.73104894 = sum of:
          0.060313758 = weight(abstract_txt:situated in 1084) [ClassicSimilarity], result of:
            0.060313758 = score(doc=1084,freq=1.0), product of:
              0.12854539 = queryWeight, product of:
                7.5072327 = idf(docFreq=65, maxDocs=44218)
                0.017122872 = queryNorm
              0.46920204 = fieldWeight in 1084, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5072327 = idf(docFreq=65, maxDocs=44218)
                0.0625 = fieldNorm(doc=1084)
          0.06436113 = weight(abstract_txt:shaping in 1084) [ClassicSimilarity], result of:
            0.06436113 = score(doc=1084,freq=1.0), product of:
              0.13423364 = queryWeight, product of:
                1.021886 = boost
                7.6715355 = idf(docFreq=55, maxDocs=44218)
                0.017122872 = queryNorm
              0.47947097 = fieldWeight in 1084, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.6715355 = idf(docFreq=55, maxDocs=44218)
                0.0625 = fieldNorm(doc=1084)
          0.118187174 = weight(abstract_txt:archive's in 1084) [ClassicSimilarity], result of:
            0.118187174 = score(doc=1084,freq=1.0), product of:
              0.20129167 = queryWeight, product of:
                1.2513669 = boost
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.017122872 = queryNorm
              0.5871439 = fieldWeight in 1084, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.0625 = fieldNorm(doc=1084)
          0.07706715 = weight(abstract_txt:studying in 1084) [ClassicSimilarity], result of:
            0.07706715 = score(doc=1084,freq=1.0), product of:
              0.19070779 = queryWeight, product of:
                1.7225466 = boost
                6.465779 = idf(docFreq=186, maxDocs=44218)
                0.017122872 = queryNorm
              0.40411118 = fieldWeight in 1084, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.465779 = idf(docFreq=186, maxDocs=44218)
                0.0625 = fieldNorm(doc=1084)
          0.20705953 = weight(abstract_txt:documenting in 1084) [ClassicSimilarity], result of:
            0.20705953 = score(doc=1084,freq=2.0), product of:
              0.29253358 = queryWeight, product of:
                2.1334114 = boost
                8.008008 = idf(docFreq=39, maxDocs=44218)
                0.017122872 = queryNorm
              0.7078146 = fieldWeight in 1084, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.008008 = idf(docFreq=39, maxDocs=44218)
                0.0625 = fieldNorm(doc=1084)
          0.03790768 = weight(abstract_txt:process in 1084) [ClassicSimilarity], result of:
            0.03790768 = score(doc=1084,freq=1.0), product of:
              0.14972134 = queryWeight, product of:
                2.158458 = boost
                4.0510116 = idf(docFreq=2091, maxDocs=44218)
                0.017122872 = queryNorm
              0.25318822 = fieldWeight in 1084, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0510116 = idf(docFreq=2091, maxDocs=44218)
                0.0625 = fieldNorm(doc=1084)
          0.1661525 = weight(abstract_txt:archives in 1084) [ClassicSimilarity], result of:
            0.1661525 = score(doc=1084,freq=1.0), product of:
              0.45902243 = queryWeight, product of:
                4.6287565 = boost
                5.7915254 = idf(docFreq=366, maxDocs=44218)
                0.017122872 = queryNorm
              0.36197034 = fieldWeight in 1084, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7915254 = idf(docFreq=366, maxDocs=44218)
                0.0625 = fieldNorm(doc=1084)
        0.28 = coord(7/25)
    
  2. Pitti, D.V.: Encoded Archival Description (EAD) (2009) 0.11
    0.11407725 = sum of:
      0.11407725 = product of:
        0.57038623 = sum of:
          0.026871964 = weight(abstract_txt:framework in 3777) [ClassicSimilarity], result of:
            0.026871964 = score(doc=3777,freq=1.0), product of:
              0.094476074 = queryWeight, product of:
                1.2124048 = boost
                4.550903 = idf(docFreq=1268, maxDocs=44218)
                0.017122872 = queryNorm
              0.28443143 = fieldWeight in 3777, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.550903 = idf(docFreq=1268, maxDocs=44218)
                0.0625 = fieldNorm(doc=3777)
          0.028638186 = weight(abstract_txt:collection in 3777) [ClassicSimilarity], result of:
            0.028638186 = score(doc=3777,freq=1.0), product of:
              0.09857177 = queryWeight, product of:
                1.238406 = boost
                4.648501 = idf(docFreq=1150, maxDocs=44218)
                0.017122872 = queryNorm
              0.2905313 = fieldWeight in 3777, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.648501 = idf(docFreq=1150, maxDocs=44218)
                0.0625 = fieldNorm(doc=3777)
          0.13348776 = weight(abstract_txt:provenance in 3777) [ClassicSimilarity], result of:
            0.13348776 = score(doc=3777,freq=1.0), product of:
              0.27505308 = queryWeight, product of:
                2.0686882 = boost
                7.7650614 = idf(docFreq=50, maxDocs=44218)
                0.017122872 = queryNorm
              0.48531634 = fieldWeight in 3777, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.7650614 = idf(docFreq=50, maxDocs=44218)
                0.0625 = fieldNorm(doc=3777)
          0.1464132 = weight(abstract_txt:documenting in 3777) [ClassicSimilarity], result of:
            0.1464132 = score(doc=3777,freq=1.0), product of:
              0.29253358 = queryWeight, product of:
                2.1334114 = boost
                8.008008 = idf(docFreq=39, maxDocs=44218)
                0.017122872 = queryNorm
              0.5005005 = fieldWeight in 3777, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.008008 = idf(docFreq=39, maxDocs=44218)
                0.0625 = fieldNorm(doc=3777)
          0.23497511 = weight(abstract_txt:archives in 3777) [ClassicSimilarity], result of:
            0.23497511 = score(doc=3777,freq=2.0), product of:
              0.45902243 = queryWeight, product of:
                4.6287565 = boost
                5.7915254 = idf(docFreq=366, maxDocs=44218)
                0.017122872 = queryNorm
              0.51190335 = fieldWeight in 3777, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.7915254 = idf(docFreq=366, maxDocs=44218)
                0.0625 = fieldNorm(doc=3777)
        0.2 = coord(5/25)
    
  3. Tognoli, N.; Chaves-Guimarães, J.A.: Provenance as a knowledge organization principle (2019) 0.10
    0.09835871 = sum of:
      0.09835871 = product of:
        0.614742 = sum of:
          0.023305943 = weight(abstract_txt:context in 5489) [ClassicSimilarity], result of:
            0.023305943 = score(doc=5489,freq=1.0), product of:
              0.08592114 = queryWeight, product of:
                1.1562101 = boost
                4.339969 = idf(docFreq=1566, maxDocs=44218)
                0.017122872 = queryNorm
              0.27124807 = fieldWeight in 5489, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.339969 = idf(docFreq=1566, maxDocs=44218)
                0.0625 = fieldNorm(doc=5489)
          0.029484011 = weight(abstract_txt:collections in 5489) [ClassicSimilarity], result of:
            0.029484011 = score(doc=5489,freq=1.0), product of:
              0.10050321 = queryWeight, product of:
                1.2504799 = boost
                4.693822 = idf(docFreq=1099, maxDocs=44218)
                0.017122872 = queryNorm
              0.29336387 = fieldWeight in 5489, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.693822 = idf(docFreq=1099, maxDocs=44218)
                0.0625 = fieldNorm(doc=5489)
          0.3269769 = weight(abstract_txt:provenance in 5489) [ClassicSimilarity], result of:
            0.3269769 = score(doc=5489,freq=6.0), product of:
              0.27505308 = queryWeight, product of:
                2.0686882 = boost
                7.7650614 = idf(docFreq=50, maxDocs=44218)
                0.017122872 = queryNorm
              1.1887774 = fieldWeight in 5489, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                7.7650614 = idf(docFreq=50, maxDocs=44218)
                0.0625 = fieldNorm(doc=5489)
          0.23497511 = weight(abstract_txt:archives in 5489) [ClassicSimilarity], result of:
            0.23497511 = score(doc=5489,freq=2.0), product of:
              0.45902243 = queryWeight, product of:
                4.6287565 = boost
                5.7915254 = idf(docFreq=366, maxDocs=44218)
                0.017122872 = queryNorm
              0.51190335 = fieldWeight in 5489, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.7915254 = idf(docFreq=366, maxDocs=44218)
                0.0625 = fieldNorm(doc=5489)
        0.16 = coord(4/25)
    
  4. Hardesty, J.L.; Young, J.B.: ¬The semantics of metadata : Avalon Media System and the move to RDF (2017) 0.10
    0.0953519 = sum of:
      0.0953519 = product of:
        0.3972996 = sum of:
          0.026871964 = weight(abstract_txt:framework in 3896) [ClassicSimilarity], result of:
            0.026871964 = score(doc=3896,freq=1.0), product of:
              0.094476074 = queryWeight, product of:
                1.2124048 = boost
                4.550903 = idf(docFreq=1268, maxDocs=44218)
                0.017122872 = queryNorm
              0.28443143 = fieldWeight in 3896, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.550903 = idf(docFreq=1268, maxDocs=44218)
                0.0625 = fieldNorm(doc=3896)
          0.029484011 = weight(abstract_txt:collections in 3896) [ClassicSimilarity], result of:
            0.029484011 = score(doc=3896,freq=1.0), product of:
              0.10050321 = queryWeight, product of:
                1.2504799 = boost
                4.693822 = idf(docFreq=1099, maxDocs=44218)
                0.017122872 = queryNorm
              0.29336387 = fieldWeight in 3896, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.693822 = idf(docFreq=1099, maxDocs=44218)
                0.0625 = fieldNorm(doc=3896)
          0.0535718 = weight(abstract_txt:technical in 3896) [ClassicSimilarity], result of:
            0.0535718 = score(doc=3896,freq=1.0), product of:
              0.17130768 = queryWeight, product of:
                1.9994972 = boost
                5.0035634 = idf(docFreq=806, maxDocs=44218)
                0.017122872 = queryNorm
              0.3127227 = fieldWeight in 3896, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.0035634 = idf(docFreq=806, maxDocs=44218)
                0.0625 = fieldNorm(doc=3896)
          0.03790768 = weight(abstract_txt:process in 3896) [ClassicSimilarity], result of:
            0.03790768 = score(doc=3896,freq=1.0), product of:
              0.14972134 = queryWeight, product of:
                2.158458 = boost
                4.0510116 = idf(docFreq=2091, maxDocs=44218)
                0.017122872 = queryNorm
              0.25318822 = fieldWeight in 3896, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0510116 = idf(docFreq=2091, maxDocs=44218)
                0.0625 = fieldNorm(doc=3896)
          0.083311625 = weight(abstract_txt:decisions in 3896) [ClassicSimilarity], result of:
            0.083311625 = score(doc=3896,freq=1.0), product of:
              0.22994451 = queryWeight, product of:
                2.316562 = boost
                5.79699 = idf(docFreq=364, maxDocs=44218)
                0.017122872 = queryNorm
              0.36231187 = fieldWeight in 3896, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.79699 = idf(docFreq=364, maxDocs=44218)
                0.0625 = fieldNorm(doc=3896)
          0.1661525 = weight(abstract_txt:archives in 3896) [ClassicSimilarity], result of:
            0.1661525 = score(doc=3896,freq=1.0), product of:
              0.45902243 = queryWeight, product of:
                4.6287565 = boost
                5.7915254 = idf(docFreq=366, maxDocs=44218)
                0.017122872 = queryNorm
              0.36197034 = fieldWeight in 3896, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7915254 = idf(docFreq=366, maxDocs=44218)
                0.0625 = fieldNorm(doc=3896)
        0.24 = coord(6/25)
    
  5. Trace, C.B.; Francisco-Revilla, L.: ¬The value and complexity of collection arrangement for evidentiary work (2015) 0.09
    0.09043875 = sum of:
      0.09043875 = product of:
        0.45219377 = sum of:
          0.022902224 = weight(abstract_txt:what in 2164) [ClassicSimilarity], result of:
            0.022902224 = score(doc=2164,freq=1.0), product of:
              0.084926 = queryWeight, product of:
                1.149495 = boost
                4.314763 = idf(docFreq=1606, maxDocs=44218)
                0.017122872 = queryNorm
              0.2696727 = fieldWeight in 2164, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.314763 = idf(docFreq=1606, maxDocs=44218)
                0.0625 = fieldNorm(doc=2164)
          0.051067807 = weight(abstract_txt:collections in 2164) [ClassicSimilarity], result of:
            0.051067807 = score(doc=2164,freq=3.0), product of:
              0.10050321 = queryWeight, product of:
                1.2504799 = boost
                4.693822 = idf(docFreq=1099, maxDocs=44218)
                0.017122872 = queryNorm
              0.50812113 = fieldWeight in 2164, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.693822 = idf(docFreq=1099, maxDocs=44218)
                0.0625 = fieldNorm(doc=2164)
          0.1464132 = weight(abstract_txt:documenting in 2164) [ClassicSimilarity], result of:
            0.1464132 = score(doc=2164,freq=1.0), product of:
              0.29253358 = queryWeight, product of:
                2.1334114 = boost
                8.008008 = idf(docFreq=39, maxDocs=44218)
                0.017122872 = queryNorm
              0.5005005 = fieldWeight in 2164, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.008008 = idf(docFreq=39, maxDocs=44218)
                0.0625 = fieldNorm(doc=2164)
          0.065658025 = weight(abstract_txt:process in 2164) [ClassicSimilarity], result of:
            0.065658025 = score(doc=2164,freq=3.0), product of:
              0.14972134 = queryWeight, product of:
                2.158458 = boost
                4.0510116 = idf(docFreq=2091, maxDocs=44218)
                0.017122872 = queryNorm
              0.43853486 = fieldWeight in 2164, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.0510116 = idf(docFreq=2091, maxDocs=44218)
                0.0625 = fieldNorm(doc=2164)
          0.1661525 = weight(abstract_txt:archives in 2164) [ClassicSimilarity], result of:
            0.1661525 = score(doc=2164,freq=1.0), product of:
              0.45902243 = queryWeight, product of:
                4.6287565 = boost
                5.7915254 = idf(docFreq=366, maxDocs=44218)
                0.017122872 = queryNorm
              0.36197034 = fieldWeight in 2164, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7915254 = idf(docFreq=366, maxDocs=44218)
                0.0625 = fieldNorm(doc=2164)
        0.2 = coord(5/25)