Document (#41466)

Author
Maemura, E.
Worby, N.
Milligan, I.
Becker, C.
Title
If these crawls could talk : studying and documenting web archives provenance
Source
Journal of the Association for Information Science and Technology. 69(2018) no.10, pp. 1223-1233
Year
2018
Abstract
The increasing use and prominence of web archives raises the urgency of establishing mechanisms for transparency in the making of web archives to facilitate the process of evaluating a web archive's provenance, scoping, and absences. Some choices and process events are captured automatically, but their interactions are not currently well understood or documented. This study examined the decision space of web archives and its role in shaping what is and what is not captured in the web archiving process. By comparing how three different web archives collections were created and documented, we investigate how curatorial decisions interact with technical and external factors and we compare commonalities and differences. The findings reveal the need to understand both the social and technical context that shapes those decisions and the ways in which these individual decisions interact. Based on the study, we propose a framework for documenting key dimensions of a collection that addresses the situated nature of the organizational context, technical specificities, and unique characteristics of web materials that are the focus of a collection. The framework enables future researchers to undertake empirical work studying the process of creating web archives collections in different contexts.
Content
Cf.: https://onlinelibrary.wiley.com/doi/10.1002/asi.24048.
Theme
Internet

Similar documents (author)

  1. Becker, J.: Zentrallager : Data Warehouse - zentrale Sammelstelle für Informationen (1997) 4.69
    4.6854844 = sum of:
      4.6854844 = weight(author_txt:becker in 4479) [ClassicSimilarity], result of:
        4.6854844 = fieldWeight in 4479, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.496775 = idf(docFreq=66, maxDocs=44421)
          0.625 = fieldNorm(doc=4479)
    
  2. Becker, C.A.: Community information service (1974) 4.69
    4.6854844 = sum of:
      4.6854844 = weight(author_txt:becker in 5736) [ClassicSimilarity], result of:
        4.6854844 = fieldWeight in 5736, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.496775 = idf(docFreq=66, maxDocs=44421)
          0.625 = fieldNorm(doc=5736)
    
  3. Becker, J.: Strategische Ausrichtung der Informations- und Organisationsstruktur des Unternehmens (1994) 4.69
    4.6854844 = sum of:
      4.6854844 = weight(author_txt:becker in 8382) [ClassicSimilarity], result of:
        4.6854844 = fieldWeight in 8382, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.496775 = idf(docFreq=66, maxDocs=44421)
          0.625 = fieldNorm(doc=8382)
    
  4. Becker, J.: Probleme des grenzüberschreitenden Datenflusses (1988) 4.69
    4.6854844 = sum of:
      4.6854844 = weight(author_txt:becker in 580) [ClassicSimilarity], result of:
        4.6854844 = fieldWeight in 580, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.496775 = idf(docFreq=66, maxDocs=44421)
          0.625 = fieldNorm(doc=580)
    
  5. Becker, J.: Die Postmoderne und ihr Verhältnis zur Informationstheorie (1995) 4.69
    4.6854844 = sum of:
      4.6854844 = weight(author_txt:becker in 1108) [ClassicSimilarity], result of:
        4.6854844 = fieldWeight in 1108, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.496775 = idf(docFreq=66, maxDocs=44421)
          0.625 = fieldNorm(doc=1108)
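
The author-match scores above can be reproduced by hand. The following is a minimal sketch, assuming the standard Lucene ClassicSimilarity formulas (tf = sqrt(freq), idf = 1 + ln(maxDocs / (docFreq + 1))); the function names are illustrative and not part of any library. Lucene computes in 32-bit floats, so the last digits may differ slightly.

```python
import math

def classic_tf(freq):
    # ClassicSimilarity term frequency: square root of the raw count
    return math.sqrt(freq)

def classic_idf(doc_freq, max_docs):
    # ClassicSimilarity inverse document frequency
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def field_weight(freq, doc_freq, max_docs, field_norm):
    # fieldWeight = tf * idf * fieldNorm, as in the explain tree
    return classic_tf(freq) * classic_idf(doc_freq, max_docs) * field_norm

# Values copied from the explain output for author_txt:becker
idf = classic_idf(doc_freq=66, max_docs=44421)
score = field_weight(freq=1.0, doc_freq=66, max_docs=44421, field_norm=0.625)
print(round(idf, 4), round(score, 4))
```

Because all five hits match the single term "becker" once, with the same fieldNorm, they receive the same score of about 4.69.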
    

Similar documents (content)

  1. Ogden, J.; Summers, E.; Walker, S.: Know(ing) infrastructure : the Wayback Machine as object and instrument of digital research (2023) 0.16
    0.15836842 = sum of:
      0.15836842 = product of:
        0.6598684 = sum of:
          0.061903216 = weight(abstract_txt:shaping in 2166) [ClassicSimilarity], result of:
            0.061903216 = score(doc=2166,freq=1.0), product of:
              0.13020054 = queryWeight, product of:
                7.607123 = idf(docFreq=59, maxDocs=44421)
                0.017115608 = queryNorm
              0.47544518 = fieldWeight in 2166, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.607123 = idf(docFreq=59, maxDocs=44421)
                0.0625 = fieldNorm(doc=2166)
          0.11675638 = weight(abstract_txt:archive's in 2166) [ClassicSimilarity], result of:
            0.11675638 = score(doc=2166,freq=1.0), product of:
              0.19875789 = queryWeight, product of:
                1.2355372 = boost
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.017115608 = queryNorm
              0.5874302 = fieldWeight in 2166, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.0625 = fieldNorm(doc=2166)
          0.0754394 = weight(abstract_txt:studying in 2166) [ClassicSimilarity], result of:
            0.0754394 = score(doc=2166,freq=1.0), product of:
              0.18715988 = queryWeight, product of:
                1.6955671 = boost
                6.449194 = idf(docFreq=190, maxDocs=44421)
                0.017115608 = queryNorm
              0.40307462 = fieldWeight in 2166, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.449194 = idf(docFreq=190, maxDocs=44421)
                0.0625 = fieldNorm(doc=2166)
          0.20460461 = weight(abstract_txt:documenting in 2166) [ClassicSimilarity], result of:
            0.20460461 = score(doc=2166,freq=2.0), product of:
              0.28890002 = queryWeight, product of:
                2.1066015 = boost
                8.0125885 = idf(docFreq=39, maxDocs=44421)
                0.017115608 = queryNorm
              0.70821947 = fieldWeight in 2166, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.0125885 = idf(docFreq=39, maxDocs=44421)
                0.0625 = fieldNorm(doc=2166)
          0.0373362 = weight(abstract_txt:process in 2166) [ClassicSimilarity], result of:
            0.0373362 = score(doc=2166,freq=1.0), product of:
              0.1475403 = queryWeight, product of:
                2.1290162 = boost
                4.048922 = idf(docFreq=2105, maxDocs=44421)
                0.017115608 = queryNorm
              0.25305763 = fieldWeight in 2166, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.048922 = idf(docFreq=2105, maxDocs=44421)
                0.0625 = fieldNorm(doc=2166)
          0.16382863 = weight(abstract_txt:archives in 2166) [ClassicSimilarity], result of:
            0.16382863 = score(doc=2166,freq=1.0), product of:
              0.45266914 = queryWeight, product of:
                4.5673018 = boost
                5.790671 = idf(docFreq=368, maxDocs=44421)
                0.017115608 = queryNorm
              0.36191693 = fieldWeight in 2166, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.790671 = idf(docFreq=368, maxDocs=44421)
                0.0625 = fieldNorm(doc=2166)
        0.24 = coord(6/25)
    
  2. Pitti, D.V.: Encoded Archival Description (EAD) (2009) 0.11
    0.11258898 = sum of:
      0.11258898 = product of:
        0.5629449 = sum of:
          0.026396325 = weight(abstract_txt:framework in 764) [ClassicSimilarity], result of:
            0.026396325 = score(doc=764,freq=1.0), product of:
              0.09293435 = queryWeight, product of:
                1.1948042 = boost
                4.5445113 = idf(docFreq=1282, maxDocs=44421)
                0.017115608 = queryNorm
              0.28403196 = fieldWeight in 764, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5445113 = idf(docFreq=1282, maxDocs=44421)
                0.0625 = fieldNorm(doc=764)
          0.028270407 = weight(abstract_txt:collection in 764) [ClassicSimilarity], result of:
            0.028270407 = score(doc=764,freq=1.0), product of:
              0.09728263 = queryWeight, product of:
                1.2224364 = boost
                4.649612 = idf(docFreq=1154, maxDocs=44421)
                0.017115608 = queryNorm
              0.29060075 = fieldWeight in 764, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.649612 = idf(docFreq=1154, maxDocs=44421)
                0.0625 = fieldNorm(doc=764)
          0.13191219 = weight(abstract_txt:provenance in 764) [ClassicSimilarity], result of:
            0.13191219 = score(doc=764,freq=1.0), product of:
              0.27164638 = queryWeight, product of:
                2.0427282 = boost
                7.769642 = idf(docFreq=50, maxDocs=44421)
                0.017115608 = queryNorm
              0.48560262 = fieldWeight in 764, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.769642 = idf(docFreq=50, maxDocs=44421)
                0.0625 = fieldNorm(doc=764)
          0.14467731 = weight(abstract_txt:documenting in 764) [ClassicSimilarity], result of:
            0.14467731 = score(doc=764,freq=1.0), product of:
              0.28890002 = queryWeight, product of:
                2.1066015 = boost
                8.0125885 = idf(docFreq=39, maxDocs=44421)
                0.017115608 = queryNorm
              0.5007868 = fieldWeight in 764, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.0125885 = idf(docFreq=39, maxDocs=44421)
                0.0625 = fieldNorm(doc=764)
          0.23168866 = weight(abstract_txt:archives in 764) [ClassicSimilarity], result of:
            0.23168866 = score(doc=764,freq=2.0), product of:
              0.45266914 = queryWeight, product of:
                4.5673018 = boost
                5.790671 = idf(docFreq=368, maxDocs=44421)
                0.017115608 = queryNorm
              0.5118278 = fieldWeight in 764, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.790671 = idf(docFreq=368, maxDocs=44421)
                0.0625 = fieldNorm(doc=764)
        0.2 = coord(5/25)
    
  3. Tognoli, N.; Chaves-Guimarães, J.A.: Provenance as a knowledge organization principle (2019) 0.10
    0.0970945 = sum of:
      0.0970945 = product of:
        0.6068406 = sum of:
          0.022881605 = weight(abstract_txt:context in 489) [ClassicSimilarity], result of:
            0.022881605 = score(doc=489,freq=1.0), product of:
              0.08448993 = queryWeight, product of:
                1.1392292 = boost
                4.333128 = idf(docFreq=1584, maxDocs=44421)
                0.017115608 = queryNorm
              0.2708205 = fieldWeight in 489, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.333128 = idf(docFreq=1584, maxDocs=44421)
                0.0625 = fieldNorm(doc=489)
          0.029152814 = weight(abstract_txt:collections in 489) [ClassicSimilarity], result of:
            0.029152814 = score(doc=489,freq=1.0), product of:
              0.09929658 = queryWeight, product of:
                1.235025 = boost
                4.6974936 = idf(docFreq=1100, maxDocs=44421)
                0.017115608 = queryNorm
              0.29359335 = fieldWeight in 489, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6974936 = idf(docFreq=1100, maxDocs=44421)
                0.0625 = fieldNorm(doc=489)
          0.32311755 = weight(abstract_txt:provenance in 489) [ClassicSimilarity], result of:
            0.32311755 = score(doc=489,freq=6.0), product of:
              0.27164638 = queryWeight, product of:
                2.0427282 = boost
                7.769642 = idf(docFreq=50, maxDocs=44421)
                0.017115608 = queryNorm
              1.1894786 = fieldWeight in 489, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                7.769642 = idf(docFreq=50, maxDocs=44421)
                0.0625 = fieldNorm(doc=489)
          0.23168866 = weight(abstract_txt:archives in 489) [ClassicSimilarity], result of:
            0.23168866 = score(doc=489,freq=2.0), product of:
              0.45266914 = queryWeight, product of:
                4.5673018 = boost
                5.790671 = idf(docFreq=368, maxDocs=44421)
                0.017115608 = queryNorm
              0.5118278 = fieldWeight in 489, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.790671 = idf(docFreq=368, maxDocs=44421)
                0.0625 = fieldNorm(doc=489)
        0.16 = coord(4/25)
    
  4. Hardesty, J.L.; Young, J.B.: The semantics of metadata : Avalon Media System and the move to RDF (2017) 0.09
    0.09395997 = sum of:
      0.09395997 = product of:
        0.3914999 = sum of:
          0.026396325 = weight(abstract_txt:framework in 4896) [ClassicSimilarity], result of:
            0.026396325 = score(doc=4896,freq=1.0), product of:
              0.09293435 = queryWeight, product of:
                1.1948042 = boost
                4.5445113 = idf(docFreq=1282, maxDocs=44421)
                0.017115608 = queryNorm
              0.28403196 = fieldWeight in 4896, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5445113 = idf(docFreq=1282, maxDocs=44421)
                0.0625 = fieldNorm(doc=4896)
          0.029152814 = weight(abstract_txt:collections in 4896) [ClassicSimilarity], result of:
            0.029152814 = score(doc=4896,freq=1.0), product of:
              0.09929658 = queryWeight, product of:
                1.235025 = boost
                4.6974936 = idf(docFreq=1100, maxDocs=44421)
                0.017115608 = queryNorm
              0.29359335 = fieldWeight in 4896, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6974936 = idf(docFreq=1100, maxDocs=44421)
                0.0625 = fieldNorm(doc=4896)
          0.0527564 = weight(abstract_txt:technical in 4896) [ClassicSimilarity], result of:
            0.0527564 = score(doc=4896,freq=1.0), product of:
              0.1687956 = queryWeight, product of:
                1.9721267 = boost
                5.0007367 = idf(docFreq=812, maxDocs=44421)
                0.017115608 = queryNorm
              0.31254604 = fieldWeight in 4896, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.0007367 = idf(docFreq=812, maxDocs=44421)
                0.0625 = fieldNorm(doc=4896)
          0.0373362 = weight(abstract_txt:process in 4896) [ClassicSimilarity], result of:
            0.0373362 = score(doc=4896,freq=1.0), product of:
              0.1475403 = queryWeight, product of:
                2.1290162 = boost
                4.048922 = idf(docFreq=2105, maxDocs=44421)
                0.017115608 = queryNorm
              0.25305763 = fieldWeight in 4896, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.048922 = idf(docFreq=2105, maxDocs=44421)
                0.0625 = fieldNorm(doc=4896)
          0.08202954 = weight(abstract_txt:decisions in 4896) [ClassicSimilarity], result of:
            0.08202954 = score(doc=4896,freq=1.0), product of:
              0.22654678 = queryWeight, product of:
                2.2847211 = boost
                5.7933846 = idf(docFreq=367, maxDocs=44421)
                0.017115608 = queryNorm
              0.36208653 = fieldWeight in 4896, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7933846 = idf(docFreq=367, maxDocs=44421)
                0.0625 = fieldNorm(doc=4896)
          0.16382863 = weight(abstract_txt:archives in 4896) [ClassicSimilarity], result of:
            0.16382863 = score(doc=4896,freq=1.0), product of:
              0.45266914 = queryWeight, product of:
                4.5673018 = boost
                5.790671 = idf(docFreq=368, maxDocs=44421)
                0.017115608 = queryNorm
              0.36191693 = fieldWeight in 4896, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.790671 = idf(docFreq=368, maxDocs=44421)
                0.0625 = fieldNorm(doc=4896)
        0.24 = coord(6/25)
    
  5. Trace, C.B.; Francisco-Revilla, L.: The value and complexity of collection arrangement for evidentiary work (2015) 0.09
    0.08922954 = sum of:
      0.08922954 = product of:
        0.44614768 = sum of:
          0.022479406 = weight(abstract_txt:what in 3164) [ClassicSimilarity], result of:
            0.022479406 = score(doc=3164,freq=1.0), product of:
              0.08349693 = queryWeight, product of:
                1.1325147 = boost
                4.3075895 = idf(docFreq=1625, maxDocs=44421)
                0.017115608 = queryNorm
              0.26922435 = fieldWeight in 3164, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3075895 = idf(docFreq=1625, maxDocs=44421)
                0.0625 = fieldNorm(doc=3164)
          0.050494153 = weight(abstract_txt:collections in 3164) [ClassicSimilarity], result of:
            0.050494153 = score(doc=3164,freq=3.0), product of:
              0.09929658 = queryWeight, product of:
                1.235025 = boost
                4.6974936 = idf(docFreq=1100, maxDocs=44421)
                0.017115608 = queryNorm
              0.5085186 = fieldWeight in 3164, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.6974936 = idf(docFreq=1100, maxDocs=44421)
                0.0625 = fieldNorm(doc=3164)
          0.14467731 = weight(abstract_txt:documenting in 3164) [ClassicSimilarity], result of:
            0.14467731 = score(doc=3164,freq=1.0), product of:
              0.28890002 = queryWeight, product of:
                2.1066015 = boost
                8.0125885 = idf(docFreq=39, maxDocs=44421)
                0.017115608 = queryNorm
              0.5007868 = fieldWeight in 3164, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.0125885 = idf(docFreq=39, maxDocs=44421)
                0.0625 = fieldNorm(doc=3164)
          0.06466819 = weight(abstract_txt:process in 3164) [ClassicSimilarity], result of:
            0.06466819 = score(doc=3164,freq=3.0), product of:
              0.1475403 = queryWeight, product of:
                2.1290162 = boost
                4.048922 = idf(docFreq=2105, maxDocs=44421)
                0.017115608 = queryNorm
              0.43830866 = fieldWeight in 3164, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.048922 = idf(docFreq=2105, maxDocs=44421)
                0.0625 = fieldNorm(doc=3164)
          0.16382863 = weight(abstract_txt:archives in 3164) [ClassicSimilarity], result of:
            0.16382863 = score(doc=3164,freq=1.0), product of:
              0.45266914 = queryWeight, product of:
                4.5673018 = boost
                5.790671 = idf(docFreq=368, maxDocs=44421)
                0.017115608 = queryNorm
              0.36191693 = fieldWeight in 3164, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.790671 = idf(docFreq=368, maxDocs=44421)
                0.0625 = fieldNorm(doc=3164)
        0.2 = coord(5/25)
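
The content-similarity scores combine several matching terms. As a rough sketch, assuming the same ClassicSimilarity formulas, the score of the first hit (doc 2166) can be recomputed as the sum over matching terms of queryWeight * fieldWeight, multiplied by the coord factor. The boosts, frequencies, docFreq values, queryNorm, and fieldNorm are copied from the explain output above; a boost of 1.0 is assumed for "shaping", which shows no boost line. The helper names are hypothetical.

```python
import math

MAX_DOCS = 44421
QUERY_NORM = 0.017115608   # taken from the explain output; depends on the whole query
FIELD_NORM = 0.0625        # fieldNorm reported for doc 2166

def classic_idf(doc_freq, max_docs=MAX_DOCS):
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def term_score(boost, freq, doc_freq):
    idf = classic_idf(doc_freq)
    query_weight = boost * idf * QUERY_NORM           # queryWeight in the explain tree
    field_weight = math.sqrt(freq) * idf * FIELD_NORM  # fieldWeight in the explain tree
    return query_weight * field_weight

# (term, boost, freq in doc 2166, docFreq)
matches = [
    ("shaping",     1.0,       1.0, 59),
    ("archive's",   1.2355372, 1.0, 9),
    ("studying",    1.6955671, 1.0, 190),
    ("documenting", 2.1066015, 2.0, 39),
    ("process",     2.1290162, 1.0, 2105),
    ("archives",    4.5673018, 1.0, 368),
]

term_sum = sum(term_score(b, f, df) for _, b, f, df in matches)
coord = 6 / 25             # 6 of the 25 query terms matched
total = term_sum * coord
print(round(term_sum, 4), round(total, 4))
```

The coord factor explains why a hit matching six query terms (0.24) can outrank one with a larger per-term sum but fewer matched terms.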