Document (#38523)

Author
Dougherty, M.
Meyer, E.T.
Title
Community, tools, and practices in web archiving : the state-of-the-art in relation to social science and humanities research needs
Source
Journal of the Association for Information Science and Technology. 65(2014) no.11, S.2195-2209
Year
2014
Abstract
The web encourages the constant creation and distribution of large amounts of information; it is also a valuable resource for understanding human behavior and communication. To take full advantage of the web as a research resource that extends beyond the consideration of snapshots of the present, however, it is necessary to begin to take web archiving much more seriously as an important element of any research program involving web resources. The ephemeral character of the web requires that researchers take proactive steps in the present to enable future analysis. Efforts to archive the web or portions thereof have been developed around the world, but these efforts have not yet provided reliable and scalable solutions. This article summarizes the current state of web archiving in relation to researchers and research needs. Interviews with researchers, archivists, and technologists identify the differences in purpose, scope, and scale of current web archiving practice, and the professional tensions that arise given these differences. Findings outline the challenges that still face researchers who wish to engage seriously with web content as an object of research, and archivists who must strike a balance reflecting a range of user needs.

Similar documents (author)

  1. Dougherty, R.M.: Realities of reclassification (1967) 2.63
    2.6277409 = sum of:
      2.6277409 = product of:
        5.2554817 = sum of:
          5.2554817 = weight(author_txt:dougherty in 1711) [ClassicSimilarity], result of:
            5.2554817 = score(doc=1711,freq=1.0), product of:
              0.8619467 = queryWeight, product of:
                1.3038772 = boost
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.067762844 = queryNorm
              6.0972233 = fieldWeight in 1711, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.625 = fieldNorm(doc=1711)
        0.5 = coord(1/2)
    
  2. Dougherty, R.M.: Pathways to our future (1992) 2.63
    2.6277409 = sum of:
      2.6277409 = product of:
        5.2554817 = sum of:
          5.2554817 = weight(author_txt:dougherty in 5869) [ClassicSimilarity], result of:
            5.2554817 = score(doc=5869,freq=1.0), product of:
              0.8619467 = queryWeight, product of:
                1.3038772 = boost
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.067762844 = queryNorm
              6.0972233 = fieldWeight in 5869, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.625 = fieldNorm(doc=5869)
        0.5 = coord(1/2)
    
  3. Dougherty, R.M.: ¬The new campus information environment (1993) 2.63
    2.6277409 = sum of:
      2.6277409 = product of:
        5.2554817 = sum of:
          5.2554817 = weight(author_txt:dougherty in 7002) [ClassicSimilarity], result of:
            5.2554817 = score(doc=7002,freq=1.0), product of:
              0.8619467 = queryWeight, product of:
                1.3038772 = boost
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.067762844 = queryNorm
              6.0972233 = fieldWeight in 7002, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.625 = fieldNorm(doc=7002)
        0.5 = coord(1/2)
    
  4. Dougherty, R.M.: ¬The realities of reclassification (1967) 2.63
    2.6277409 = sum of:
      2.6277409 = product of:
        5.2554817 = sum of:
          5.2554817 = weight(author_txt:dougherty in 6321) [ClassicSimilarity], result of:
            5.2554817 = score(doc=6321,freq=1.0), product of:
              0.8619467 = queryWeight, product of:
                1.3038772 = boost
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.067762844 = queryNorm
              6.0972233 = fieldWeight in 6321, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.625 = fieldNorm(doc=6321)
        0.5 = coord(1/2)
    
  5. Dougherty, N.E.; Youngkin, M.E.; Carleton, M.O.; Cheves, C.G.; MacCloskey, K.M.: Evaluation of CORE MEDLINE/EBSCO CD-ROM, SilverPlatter and KnowledgeFinder at University of Utah, Spencer S. Eccles Health Sciences Library (1989) 1.31
    1.3138704 = sum of:
      1.3138704 = product of:
        2.6277409 = sum of:
          2.6277409 = weight(author_txt:dougherty in 4056) [ClassicSimilarity], result of:
            2.6277409 = score(doc=4056,freq=1.0), product of:
              0.8619467 = queryWeight, product of:
                1.3038772 = boost
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.067762844 = queryNorm
              3.0486116 = fieldWeight in 4056, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.3125 = fieldNorm(doc=4056)
        0.5 = coord(1/2)
    

Similar documents (content)

  1. Gracy, K.F.: Enriching and enhancing moving images with Linked Data : an exploration in the alignment of metadata models (2018) 0.16
    0.15596065 = sum of:
      0.15596065 = product of:
        0.48737702 = sum of:
          0.036032937 = weight(abstract_txt:current in 200) [ClassicSimilarity], result of:
            0.036032937 = score(doc=200,freq=4.0), product of:
              0.08946784 = queryWeight, product of:
                1.1058352 = boost
                4.295972 = idf(docFreq=1644, maxDocs=44421)
                0.018832808 = queryNorm
              0.40274736 = fieldWeight in 200, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.295972 = idf(docFreq=1644, maxDocs=44421)
                0.046875 = fieldNorm(doc=200)
          0.006011365 = weight(abstract_txt:that in 200) [ClassicSimilarity], result of:
            0.006011365 = score(doc=200,freq=1.0), product of:
              0.054226622 = queryWeight, product of:
                1.217526 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.018832808 = queryNorm
              0.11085634 = fieldWeight in 200, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.046875 = fieldNorm(doc=200)
          0.025545292 = weight(abstract_txt:state in 200) [ClassicSimilarity], result of:
            0.025545292 = score(doc=200,freq=1.0), product of:
              0.11291745 = queryWeight, product of:
                1.2423315 = boost
                4.8262353 = idf(docFreq=967, maxDocs=44421)
                0.018832808 = queryNorm
              0.22622979 = fieldWeight in 200, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8262353 = idf(docFreq=967, maxDocs=44421)
                0.046875 = fieldNorm(doc=200)
          0.02818909 = weight(abstract_txt:resource in 200) [ClassicSimilarity], result of:
            0.02818909 = score(doc=200,freq=1.0), product of:
              0.120579794 = queryWeight, product of:
                1.2837907 = boost
                4.987297 = idf(docFreq=823, maxDocs=44421)
                0.018832808 = queryNorm
              0.23377955 = fieldWeight in 200, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.987297 = idf(docFreq=823, maxDocs=44421)
                0.046875 = fieldNorm(doc=200)
          0.029587328 = weight(abstract_txt:needs in 200) [ClassicSimilarity], result of:
            0.029587328 = score(doc=200,freq=1.0), product of:
              0.14255685 = queryWeight, product of:
                1.7096083 = boost
                4.4276814 = idf(docFreq=1441, maxDocs=44421)
                0.018832808 = queryNorm
              0.20754758 = fieldWeight in 200, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4276814 = idf(docFreq=1441, maxDocs=44421)
                0.046875 = fieldNorm(doc=200)
          0.08865555 = weight(abstract_txt:archivists in 200) [ClassicSimilarity], result of:
            0.08865555 = score(doc=200,freq=1.0), product of:
              0.25883585 = queryWeight, product of:
                1.880915 = boost
                7.3070183 = idf(docFreq=80, maxDocs=44421)
                0.018832808 = queryNorm
              0.34251648 = fieldWeight in 200, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.3070183 = idf(docFreq=80, maxDocs=44421)
                0.046875 = fieldNorm(doc=200)
          0.031036649 = weight(abstract_txt:research in 200) [ClassicSimilarity], result of:
            0.031036649 = score(doc=200,freq=3.0), product of:
              0.120988294 = queryWeight, product of:
                2.0332868 = boost
                3.159582 = idf(docFreq=5124, maxDocs=44421)
                0.018832808 = queryNorm
              0.25652605 = fieldWeight in 200, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.159582 = idf(docFreq=5124, maxDocs=44421)
                0.046875 = fieldNorm(doc=200)
          0.24231881 = weight(abstract_txt:archiving in 200) [ClassicSimilarity], result of:
            0.24231881 = score(doc=200,freq=2.0), product of:
              0.5059939 = queryWeight, product of:
                3.7191577 = boost
                7.2241306 = idf(docFreq=87, maxDocs=44421)
                0.018832808 = queryNorm
              0.4788967 = fieldWeight in 200, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.2241306 = idf(docFreq=87, maxDocs=44421)
                0.046875 = fieldNorm(doc=200)
        0.32 = coord(8/25)
    
  2. Sköld, O.: Understanding the "expanded notion" of videogames as archival objects : a review of priorities, methods, and conceptions (2018) 0.13
    0.13352124 = sum of:
      0.13352124 = product of:
        0.55633855 = sum of:
          0.024021957 = weight(abstract_txt:current in 17) [ClassicSimilarity], result of:
            0.024021957 = score(doc=17,freq=1.0), product of:
              0.08946784 = queryWeight, product of:
                1.1058352 = boost
                4.295972 = idf(docFreq=1644, maxDocs=44421)
                0.018832808 = queryNorm
              0.26849824 = fieldWeight in 17, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.295972 = idf(docFreq=1644, maxDocs=44421)
                0.0625 = fieldNorm(doc=17)
          0.008015153 = weight(abstract_txt:that in 17) [ClassicSimilarity], result of:
            0.008015153 = score(doc=17,freq=1.0), product of:
              0.054226622 = queryWeight, product of:
                1.217526 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.018832808 = queryNorm
              0.14780845 = fieldWeight in 17, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.0625 = fieldNorm(doc=17)
          0.03406039 = weight(abstract_txt:state in 17) [ClassicSimilarity], result of:
            0.03406039 = score(doc=17,freq=1.0), product of:
              0.11291745 = queryWeight, product of:
                1.2423315 = boost
                4.8262353 = idf(docFreq=967, maxDocs=44421)
                0.018832808 = queryNorm
              0.3016397 = fieldWeight in 17, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8262353 = idf(docFreq=967, maxDocs=44421)
                0.0625 = fieldNorm(doc=17)
          0.053153858 = weight(abstract_txt:resource in 17) [ClassicSimilarity], result of:
            0.053153858 = score(doc=17,freq=2.0), product of:
              0.120579794 = queryWeight, product of:
                1.2837907 = boost
                4.987297 = idf(docFreq=823, maxDocs=44421)
                0.018832808 = queryNorm
              0.44081894 = fieldWeight in 17, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.987297 = idf(docFreq=823, maxDocs=44421)
                0.0625 = fieldNorm(doc=17)
          0.0413822 = weight(abstract_txt:research in 17) [ClassicSimilarity], result of:
            0.0413822 = score(doc=17,freq=3.0), product of:
              0.120988294 = queryWeight, product of:
                2.0332868 = boost
                3.159582 = idf(docFreq=5124, maxDocs=44421)
                0.018832808 = queryNorm
              0.34203476 = fieldWeight in 17, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.159582 = idf(docFreq=5124, maxDocs=44421)
                0.0625 = fieldNorm(doc=17)
          0.39570495 = weight(abstract_txt:archiving in 17) [ClassicSimilarity], result of:
            0.39570495 = score(doc=17,freq=3.0), product of:
              0.5059939 = queryWeight, product of:
                3.7191577 = boost
                7.2241306 = idf(docFreq=87, maxDocs=44421)
                0.018832808 = queryNorm
              0.78203505 = fieldWeight in 17, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.2241306 = idf(docFreq=87, maxDocs=44421)
                0.0625 = fieldNorm(doc=17)
        0.24 = coord(6/25)
    
  3. Liu, H.; Hu, G.; Li, Y.: ¬The enhanced research impact of self-archiving platforms : evidence from bioRxiv (2024) 0.13
    0.12656622 = sum of:
      0.12656622 = product of:
        0.6328311 = sum of:
          0.013882651 = weight(abstract_txt:that in 2338) [ClassicSimilarity], result of:
            0.013882651 = score(doc=2338,freq=3.0), product of:
              0.054226622 = queryWeight, product of:
                1.217526 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.018832808 = queryNorm
              0.25601172 = fieldWeight in 2338, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.0625 = fieldNorm(doc=2338)
          0.036835015 = weight(abstract_txt:differences in 2338) [ClassicSimilarity], result of:
            0.036835015 = score(doc=2338,freq=1.0), product of:
              0.118969396 = queryWeight, product of:
                1.275189 = boost
                4.9538813 = idf(docFreq=851, maxDocs=44421)
                0.018832808 = queryNorm
              0.30961758 = fieldWeight in 2338, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.9538813 = idf(docFreq=851, maxDocs=44421)
                0.0625 = fieldNorm(doc=2338)
          0.058523275 = weight(abstract_txt:research in 2338) [ClassicSimilarity], result of:
            0.058523275 = score(doc=2338,freq=6.0), product of:
              0.120988294 = queryWeight, product of:
                2.0332868 = boost
                3.159582 = idf(docFreq=5124, maxDocs=44421)
                0.018832808 = queryNorm
              0.48371023 = fieldWeight in 2338, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                3.159582 = idf(docFreq=5124, maxDocs=44421)
                0.0625 = fieldNorm(doc=2338)
          0.06666944 = weight(abstract_txt:researchers in 2338) [ClassicSimilarity], result of:
            0.06666944 = score(doc=2338,freq=1.0), product of:
              0.22261575 = queryWeight, product of:
                2.4668906 = boost
                4.791714 = idf(docFreq=1001, maxDocs=44421)
                0.018832808 = queryNorm
              0.29948214 = fieldWeight in 2338, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.791714 = idf(docFreq=1001, maxDocs=44421)
                0.0625 = fieldNorm(doc=2338)
          0.45692074 = weight(abstract_txt:archiving in 2338) [ClassicSimilarity], result of:
            0.45692074 = score(doc=2338,freq=4.0), product of:
              0.5059939 = queryWeight, product of:
                3.7191577 = boost
                7.2241306 = idf(docFreq=87, maxDocs=44421)
                0.018832808 = queryNorm
              0.9030163 = fieldWeight in 2338, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.2241306 = idf(docFreq=87, maxDocs=44421)
                0.0625 = fieldNorm(doc=2338)
        0.2 = coord(5/25)
    
  4. Hemphill, L.H.; Hedstrom, M.L.; Leonard, S.H.: Saving social media data : understanding data management practices among social media researchers and their implications for archives (2021) 0.12
    0.11980915 = sum of:
      0.11980915 = product of:
        0.4992048 = sum of:
          0.008015153 = weight(abstract_txt:that in 1065) [ClassicSimilarity], result of:
            0.008015153 = score(doc=1065,freq=1.0), product of:
              0.054226622 = queryWeight, product of:
                1.217526 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.018832808 = queryNorm
              0.14780845 = fieldWeight in 1065, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.0625 = fieldNorm(doc=1065)
          0.036835015 = weight(abstract_txt:differences in 1065) [ClassicSimilarity], result of:
            0.036835015 = score(doc=1065,freq=1.0), product of:
              0.118969396 = queryWeight, product of:
                1.275189 = boost
                4.9538813 = idf(docFreq=851, maxDocs=44421)
                0.018832808 = queryNorm
              0.30961758 = fieldWeight in 1065, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.9538813 = idf(docFreq=851, maxDocs=44421)
                0.0625 = fieldNorm(doc=1065)
          0.05292484 = weight(abstract_txt:efforts in 1065) [ClassicSimilarity], result of:
            0.05292484 = score(doc=1065,freq=1.0), product of:
              0.15148434 = queryWeight, product of:
                1.4389338 = boost
                5.59 = idf(docFreq=450, maxDocs=44421)
                0.018832808 = queryNorm
              0.349375 = fieldWeight in 1065, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.59 = idf(docFreq=450, maxDocs=44421)
                0.0625 = fieldNorm(doc=1065)
          0.023892026 = weight(abstract_txt:research in 1065) [ClassicSimilarity], result of:
            0.023892026 = score(doc=1065,freq=1.0), product of:
              0.120988294 = queryWeight, product of:
                2.0332868 = boost
                3.159582 = idf(docFreq=5124, maxDocs=44421)
                0.018832808 = queryNorm
              0.19747387 = fieldWeight in 1065, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.159582 = idf(docFreq=5124, maxDocs=44421)
                0.0625 = fieldNorm(doc=1065)
          0.1490774 = weight(abstract_txt:researchers in 1065) [ClassicSimilarity], result of:
            0.1490774 = score(doc=1065,freq=5.0), product of:
              0.22261575 = queryWeight, product of:
                2.4668906 = boost
                4.791714 = idf(docFreq=1001, maxDocs=44421)
                0.018832808 = queryNorm
              0.6696624 = fieldWeight in 1065, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.791714 = idf(docFreq=1001, maxDocs=44421)
                0.0625 = fieldNorm(doc=1065)
          0.22846037 = weight(abstract_txt:archiving in 1065) [ClassicSimilarity], result of:
            0.22846037 = score(doc=1065,freq=1.0), product of:
              0.5059939 = queryWeight, product of:
                3.7191577 = boost
                7.2241306 = idf(docFreq=87, maxDocs=44421)
                0.018832808 = queryNorm
              0.45150816 = fieldWeight in 1065, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.2241306 = idf(docFreq=87, maxDocs=44421)
                0.0625 = fieldNorm(doc=1065)
        0.24 = coord(6/25)
    
  5. Kodua-Ntim, K.: Narrative review on open access institutional repositories and knowledge sharing in South Africa (2023) 0.10
    0.09667572 = sum of:
      0.09667572 = product of:
        0.48337856 = sum of:
          0.07518336 = weight(abstract_txt:encourages in 2052) [ClassicSimilarity], result of:
            0.07518336 = score(doc=2052,freq=1.0), product of:
              0.1519378 = queryWeight, product of:
                1.0190016 = boost
                7.917278 = idf(docFreq=43, maxDocs=44421)
                0.018832808 = queryNorm
              0.49482986 = fieldWeight in 2052, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.917278 = idf(docFreq=43, maxDocs=44421)
                0.0625 = fieldNorm(doc=2052)
          0.011335138 = weight(abstract_txt:that in 2052) [ClassicSimilarity], result of:
            0.011335138 = score(doc=2052,freq=2.0), product of:
              0.054226622 = queryWeight, product of:
                1.217526 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.018832808 = queryNorm
              0.20903271 = fieldWeight in 2052, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.0625 = fieldNorm(doc=2052)
          0.05292484 = weight(abstract_txt:efforts in 2052) [ClassicSimilarity], result of:
            0.05292484 = score(doc=2052,freq=1.0), product of:
              0.15148434 = queryWeight, product of:
                1.4389338 = boost
                5.59 = idf(docFreq=450, maxDocs=44421)
                0.018832808 = queryNorm
              0.349375 = fieldWeight in 2052, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.59 = idf(docFreq=450, maxDocs=44421)
                0.0625 = fieldNorm(doc=2052)
          0.11547485 = weight(abstract_txt:researchers in 2052) [ClassicSimilarity], result of:
            0.11547485 = score(doc=2052,freq=3.0), product of:
              0.22261575 = queryWeight, product of:
                2.4668906 = boost
                4.791714 = idf(docFreq=1001, maxDocs=44421)
                0.018832808 = queryNorm
              0.51871824 = fieldWeight in 2052, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.791714 = idf(docFreq=1001, maxDocs=44421)
                0.0625 = fieldNorm(doc=2052)
          0.22846037 = weight(abstract_txt:archiving in 2052) [ClassicSimilarity], result of:
            0.22846037 = score(doc=2052,freq=1.0), product of:
              0.5059939 = queryWeight, product of:
                3.7191577 = boost
                7.2241306 = idf(docFreq=87, maxDocs=44421)
                0.018832808 = queryNorm
              0.45150816 = fieldWeight in 2052, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.2241306 = idf(docFreq=87, maxDocs=44421)
                0.0625 = fieldNorm(doc=2052)
        0.2 = coord(5/25)