Document (#25204)

Author
Koehler, W.
Title
Web page change and persistence : a four-year longitudinal study
Source
Journal of the American Society for Information Science and technology. 53(2002) no.2, S.162-171
Year
2002
Abstract
Changes in the topography of the Web can be expressed in at least four ways: (1) more sites on more servers in more places, (2) more pages and objects added to existing sites and pages, (3) changes in traffic, and (4) modifications to existing text, graphic, and other Web objects. This article does not address the first three factors (more sites, more pages, more traffic) in the growth of the Web. It focuses instead on changes to an existing set of Web documents. The article documents changes to an aging set of Web pages, first identified and "collected" in December 1996 and followed weekly thereafter. Results are reported through February 2001. The article addresses two related phenomena: (1) the life cycle of Web objects, and (2) changes to Web objects. These data reaffirm that the half-life of a Web page is approximately 2 years. There is variation among Web pages by top-level domain and by page type (navigation, content). Web page content appears to stabilize over time; aging pages change less often than once they did
Theme
Internet
Informetrie
Object
WWW

Similar documents (author)

  1. Koehler, W.C.: Internet search note : specialized retrieval and Web search engines (1997) 6.10
    6.0972233 = sum of:
      6.0972233 = weight(author_txt:koehler in 1769) [ClassicSimilarity], result of:
        6.0972233 = fieldWeight in 1769, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.755557 = idf(docFreq=6, maxDocs=44421)
          0.625 = fieldNorm(doc=1769)
    
  2. Koehler, W.: ¬An analysis of Web page and Web site constancy and performance (1999) 6.10
    6.0972233 = sum of:
      6.0972233 = weight(author_txt:koehler in 3945) [ClassicSimilarity], result of:
        6.0972233 = fieldWeight in 3945, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.755557 = idf(docFreq=6, maxDocs=44421)
          0.625 = fieldNorm(doc=3945)
    
  3. Koehler, W.; Mincey, D.: FirstSearch and NetFirst - Web and dial-up access : plus ça change, plus c'est la même chose? (1996) 4.88
    4.8777785 = sum of:
      4.8777785 = weight(author_txt:koehler in 6600) [ClassicSimilarity], result of:
        4.8777785 = fieldWeight in 6600, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.755557 = idf(docFreq=6, maxDocs=44421)
          0.5 = fieldNorm(doc=6600)
    
  4. Oguz, F.; Koehler, W.: URL decay at year 20 : a research note (2016) 4.88
    4.8777785 = sum of:
      4.8777785 = weight(author_txt:koehler in 3651) [ClassicSimilarity], result of:
        4.8777785 = fieldWeight in 3651, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.755557 = idf(docFreq=6, maxDocs=44421)
          0.5 = fieldNorm(doc=3651)
    
  5. McDonnell, J.P.; Koehler Jr., W.C.; Carroll, B.C.: Cataloging challenges in an area studies virtual library catalog (ASVLC) : results of a case study (1999) 3.66
    3.6583338 = sum of:
      3.6583338 = weight(author_txt:koehler in 101) [ClassicSimilarity], result of:
        3.6583338 = fieldWeight in 101, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.755557 = idf(docFreq=6, maxDocs=44421)
          0.375 = fieldNorm(doc=101)
    

Similar documents (content)

  1. Spink, A.; Wolfram, D.; Jansen, B.J.; Saracevic, T.: Searching the Web : the public and their queries (2001) 0.22
    0.2209467 = sum of:
      0.2209467 = product of:
        0.6137408 = sum of:
          0.05706957 = weight(abstract_txt:december in 980) [ClassicSimilarity], result of:
            0.05706957 = score(doc=980,freq=1.0), product of:
              0.14997436 = queryWeight, product of:
                1.1340811 = boost
                8.117949 = idf(docFreq=35, maxDocs=44421)
                0.016290206 = queryNorm
              0.38052884 = fieldWeight in 980, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.117949 = idf(docFreq=35, maxDocs=44421)
                0.046875 = fieldNorm(doc=980)
          0.015577667 = weight(abstract_txt:content in 980) [ClassicSimilarity], result of:
            0.015577667 = score(doc=980,freq=1.0), product of:
              0.079510696 = queryWeight, product of:
                1.1677864 = boost
                4.1796083 = idf(docFreq=1847, maxDocs=44421)
                0.016290206 = queryNorm
              0.19591914 = fieldWeight in 980, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1796083 = idf(docFreq=1847, maxDocs=44421)
                0.046875 = fieldNorm(doc=980)
          0.054744642 = weight(abstract_txt:change in 980) [ClassicSimilarity], result of:
            0.054744642 = score(doc=980,freq=3.0), product of:
              0.12743184 = queryWeight, product of:
                1.478392 = boost
                5.2912927 = idf(docFreq=607, maxDocs=44421)
                0.016290206 = queryNorm
              0.4295994 = fieldWeight in 980, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.2912927 = idf(docFreq=607, maxDocs=44421)
                0.046875 = fieldNorm(doc=980)
          0.017551526 = weight(abstract_txt:article in 980) [ClassicSimilarity], result of:
            0.017551526 = score(doc=980,freq=1.0), product of:
              0.098551735 = queryWeight, product of:
                1.5923128 = boost
                3.79935 = idf(docFreq=2702, maxDocs=44421)
                0.016290206 = queryNorm
              0.17809454 = fieldWeight in 980, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.79935 = idf(docFreq=2702, maxDocs=44421)
                0.046875 = fieldNorm(doc=980)
          0.07143884 = weight(abstract_txt:sites in 980) [ClassicSimilarity], result of:
            0.07143884 = score(doc=980,freq=2.0), product of:
              0.19940406 = queryWeight, product of:
                2.264974 = boost
                5.4043584 = idf(docFreq=542, maxDocs=44421)
                0.016290206 = queryNorm
              0.3582617 = fieldWeight in 980, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.4043584 = idf(docFreq=542, maxDocs=44421)
                0.046875 = fieldNorm(doc=980)
          0.04136854 = weight(abstract_txt:more in 980) [ClassicSimilarity], result of:
            0.04136854 = score(doc=980,freq=2.0), product of:
              0.18374553 = queryWeight, product of:
                3.3211849 = boost
                3.3962307 = idf(docFreq=4044, maxDocs=44421)
                0.016290206 = queryNorm
              0.2251404 = fieldWeight in 980, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.3962307 = idf(docFreq=4044, maxDocs=44421)
                0.046875 = fieldNorm(doc=980)
          0.1295473 = weight(abstract_txt:page in 980) [ClassicSimilarity], result of:
            0.1295473 = score(doc=980,freq=2.0), product of:
              0.32636946 = queryWeight, product of:
                3.3459573 = boost
                5.987735 = idf(docFreq=302, maxDocs=44421)
                0.016290206 = queryNorm
              0.39693448 = fieldWeight in 980, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.987735 = idf(docFreq=302, maxDocs=44421)
                0.046875 = fieldNorm(doc=980)
          0.06699866 = weight(abstract_txt:changes in 980) [ClassicSimilarity], result of:
            0.06699866 = score(doc=980,freq=1.0), product of:
              0.2853961 = queryWeight, product of:
                3.4982002 = boost
                5.008144 = idf(docFreq=806, maxDocs=44421)
                0.016290206 = queryNorm
              0.23475674 = fieldWeight in 980, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.008144 = idf(docFreq=806, maxDocs=44421)
                0.046875 = fieldNorm(doc=980)
          0.15944403 = weight(abstract_txt:pages in 980) [ClassicSimilarity], result of:
            0.15944403 = score(doc=980,freq=2.0), product of:
              0.42906842 = queryWeight, product of:
                4.698665 = boost
                5.6056433 = idf(docFreq=443, maxDocs=44421)
                0.016290206 = queryNorm
              0.37160516 = fieldWeight in 980, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.6056433 = idf(docFreq=443, maxDocs=44421)
                0.046875 = fieldNorm(doc=980)
        0.36 = coord(9/25)
    
  2. Craven, T.: Changes in metatag descriptions over time (2001) 0.18
    0.1846077 = sum of:
      0.1846077 = product of:
        0.9230385 = sum of:
          0.05948959 = weight(abstract_txt:four in 601) [ClassicSimilarity], result of:
            0.05948959 = score(doc=601,freq=1.0), product of:
              0.12237647 = queryWeight, product of:
                1.4487705 = boost
                5.1852746 = idf(docFreq=675, maxDocs=44421)
                0.016290206 = queryNorm
              0.4861195 = fieldWeight in 601, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1852746 = idf(docFreq=675, maxDocs=44421)
                0.09375 = fieldNorm(doc=601)
          0.063213676 = weight(abstract_txt:change in 601) [ClassicSimilarity], result of:
            0.063213676 = score(doc=601,freq=1.0), product of:
              0.12743184 = queryWeight, product of:
                1.478392 = boost
                5.2912927 = idf(docFreq=607, maxDocs=44421)
                0.016290206 = queryNorm
              0.4960587 = fieldWeight in 601, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2912927 = idf(docFreq=607, maxDocs=44421)
                0.09375 = fieldNorm(doc=601)
          0.05850396 = weight(abstract_txt:more in 601) [ClassicSimilarity], result of:
            0.05850396 = score(doc=601,freq=1.0), product of:
              0.18374553 = queryWeight, product of:
                3.3211849 = boost
                3.3962307 = idf(docFreq=4044, maxDocs=44421)
                0.016290206 = queryNorm
              0.31839663 = fieldWeight in 601, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3962307 = idf(docFreq=4044, maxDocs=44421)
                0.09375 = fieldNorm(doc=601)
          0.18950082 = weight(abstract_txt:changes in 601) [ClassicSimilarity], result of:
            0.18950082 = score(doc=601,freq=2.0), product of:
              0.2853961 = queryWeight, product of:
                3.4982002 = boost
                5.008144 = idf(docFreq=806, maxDocs=44421)
                0.016290206 = queryNorm
              0.66399235 = fieldWeight in 601, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.008144 = idf(docFreq=806, maxDocs=44421)
                0.09375 = fieldNorm(doc=601)
          0.55233043 = weight(abstract_txt:pages in 601) [ClassicSimilarity], result of:
            0.55233043 = score(doc=601,freq=6.0), product of:
              0.42906842 = queryWeight, product of:
                4.698665 = boost
                5.6056433 = idf(docFreq=443, maxDocs=44421)
                0.016290206 = queryNorm
              1.2872782 = fieldWeight in 601, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.6056433 = idf(docFreq=443, maxDocs=44421)
                0.09375 = fieldNorm(doc=601)
        0.2 = coord(5/25)
    
  3. Barsky, E.; Bar-Ilan, J.: ¬The impact of task phrasing on the choice of search keywords and on the search process and success (2012) 0.18
    0.17565934 = sum of:
      0.17565934 = product of:
        0.7319139 = sum of:
          0.052168813 = weight(abstract_txt:modifications in 1455) [ClassicSimilarity], result of:
            0.052168813 = score(doc=1455,freq=1.0), product of:
              0.11660811 = queryWeight, product of:
                7.1581726 = idf(docFreq=93, maxDocs=44421)
                0.016290206 = queryNorm
              0.4473858 = fieldWeight in 1455, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.1581726 = idf(docFreq=93, maxDocs=44421)
                0.0625 = fieldNorm(doc=1455)
          0.08561543 = weight(abstract_txt:persistence in 1455) [ClassicSimilarity], result of:
            0.08561543 = score(doc=1455,freq=1.0), product of:
              0.16223933 = queryWeight, product of:
                1.1795428 = boost
                8.443371 = idf(docFreq=25, maxDocs=44421)
                0.016290206 = queryNorm
              0.5277107 = fieldWeight in 1455, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.443371 = idf(docFreq=25, maxDocs=44421)
                0.0625 = fieldNorm(doc=1455)
          0.039659727 = weight(abstract_txt:four in 1455) [ClassicSimilarity], result of:
            0.039659727 = score(doc=1455,freq=1.0), product of:
              0.12237647 = queryWeight, product of:
                1.4487705 = boost
                5.1852746 = idf(docFreq=675, maxDocs=44421)
                0.016290206 = queryNorm
              0.32407966 = fieldWeight in 1455, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1852746 = idf(docFreq=675, maxDocs=44421)
                0.0625 = fieldNorm(doc=1455)
          0.042701203 = weight(abstract_txt:existing in 1455) [ClassicSimilarity], result of:
            0.042701203 = score(doc=1455,freq=1.0), product of:
              0.1471596 = queryWeight, product of:
                1.9457657 = boost
                4.6427093 = idf(docFreq=1162, maxDocs=44421)
                0.016290206 = queryNorm
              0.29016933 = fieldWeight in 1455, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6427093 = idf(docFreq=1162, maxDocs=44421)
                0.0625 = fieldNorm(doc=1455)
          0.2991767 = weight(abstract_txt:page in 1455) [ClassicSimilarity], result of:
            0.2991767 = score(doc=1455,freq=6.0), product of:
              0.32636946 = queryWeight, product of:
                3.3459573 = boost
                5.987735 = idf(docFreq=302, maxDocs=44421)
                0.016290206 = queryNorm
              0.916681 = fieldWeight in 1455, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.987735 = idf(docFreq=302, maxDocs=44421)
                0.0625 = fieldNorm(doc=1455)
          0.21259205 = weight(abstract_txt:pages in 1455) [ClassicSimilarity], result of:
            0.21259205 = score(doc=1455,freq=2.0), product of:
              0.42906842 = queryWeight, product of:
                4.698665 = boost
                5.6056433 = idf(docFreq=443, maxDocs=44421)
                0.016290206 = queryNorm
              0.49547353 = fieldWeight in 1455, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.6056433 = idf(docFreq=443, maxDocs=44421)
                0.0625 = fieldNorm(doc=1455)
        0.24 = coord(6/25)
    
  4. Lawrence, S.; Giles, C.L.: Accessibility and distribution of information on the Web (1999) 0.17
    0.16986442 = sum of:
      0.16986442 = product of:
        0.70776844 = sum of:
          0.074582495 = weight(abstract_txt:february in 5952) [ClassicSimilarity], result of:
            0.074582495 = score(doc=5952,freq=1.0), product of:
              0.14798331 = queryWeight, product of:
                1.126528 = boost
                8.063882 = idf(docFreq=37, maxDocs=44421)
                0.016290206 = queryNorm
              0.5039926 = fieldWeight in 5952, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.063882 = idf(docFreq=37, maxDocs=44421)
                0.0625 = fieldNorm(doc=5952)
          0.07609276 = weight(abstract_txt:december in 5952) [ClassicSimilarity], result of:
            0.07609276 = score(doc=5952,freq=1.0), product of:
              0.14997436 = queryWeight, product of:
                1.1340811 = boost
                8.117949 = idf(docFreq=35, maxDocs=44421)
                0.016290206 = queryNorm
              0.5073718 = fieldWeight in 5952, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.117949 = idf(docFreq=35, maxDocs=44421)
                0.0625 = fieldNorm(doc=5952)
          0.035975084 = weight(abstract_txt:content in 5952) [ClassicSimilarity], result of:
            0.035975084 = score(doc=5952,freq=3.0), product of:
              0.079510696 = queryWeight, product of:
                1.1677864 = boost
                4.1796083 = idf(docFreq=1847, maxDocs=44421)
                0.016290206 = queryNorm
              0.45245588 = fieldWeight in 5952, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.1796083 = idf(docFreq=1847, maxDocs=44421)
                0.0625 = fieldNorm(doc=5952)
          0.21298948 = weight(abstract_txt:sites in 5952) [ClassicSimilarity], result of:
            0.21298948 = score(doc=5952,freq=10.0), product of:
              0.19940406 = queryWeight, product of:
                2.264974 = boost
                5.4043584 = idf(docFreq=542, maxDocs=44421)
                0.016290206 = queryNorm
              1.0681301 = fieldWeight in 5952, product of:
                3.1622777 = tf(freq=10.0), with freq of:
                  10.0 = termFreq=10.0
                5.4043584 = idf(docFreq=542, maxDocs=44421)
                0.0625 = fieldNorm(doc=5952)
          0.09553657 = weight(abstract_txt:more in 5952) [ClassicSimilarity], result of:
            0.09553657 = score(doc=5952,freq=6.0), product of:
              0.18374553 = queryWeight, product of:
                3.3211849 = boost
                3.3962307 = idf(docFreq=4044, maxDocs=44421)
                0.016290206 = queryNorm
              0.51993954 = fieldWeight in 5952, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                3.3962307 = idf(docFreq=4044, maxDocs=44421)
                0.0625 = fieldNorm(doc=5952)
          0.21259205 = weight(abstract_txt:pages in 5952) [ClassicSimilarity], result of:
            0.21259205 = score(doc=5952,freq=2.0), product of:
              0.42906842 = queryWeight, product of:
                4.698665 = boost
                5.6056433 = idf(docFreq=443, maxDocs=44421)
                0.016290206 = queryNorm
              0.49547353 = fieldWeight in 5952, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.6056433 = idf(docFreq=443, maxDocs=44421)
                0.0625 = fieldNorm(doc=5952)
        0.24 = coord(6/25)
    
  5. Bhavnani, S.K.; Peck, F.A.: Scatter matters : regularities and implications for the scatter of healthcare information on the Web (2010) 0.15
    0.14765425 = sum of:
      0.14765425 = product of:
        0.61522603 = sum of:
          0.020770224 = weight(abstract_txt:content in 420) [ClassicSimilarity], result of:
            0.020770224 = score(doc=420,freq=1.0), product of:
              0.079510696 = queryWeight, product of:
                1.1677864 = boost
                4.1796083 = idf(docFreq=1847, maxDocs=44421)
                0.016290206 = queryNorm
              0.26122552 = fieldWeight in 420, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1796083 = idf(docFreq=1847, maxDocs=44421)
                0.0625 = fieldNorm(doc=420)
          0.023402033 = weight(abstract_txt:article in 420) [ClassicSimilarity], result of:
            0.023402033 = score(doc=420,freq=1.0), product of:
              0.098551735 = queryWeight, product of:
                1.5923128 = boost
                3.79935 = idf(docFreq=2702, maxDocs=44421)
                0.016290206 = queryNorm
              0.23745938 = fieldWeight in 420, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.79935 = idf(docFreq=2702, maxDocs=44421)
                0.0625 = fieldNorm(doc=420)
          0.042701203 = weight(abstract_txt:existing in 420) [ClassicSimilarity], result of:
            0.042701203 = score(doc=420,freq=1.0), product of:
              0.1471596 = queryWeight, product of:
                1.9457657 = boost
                4.6427093 = idf(docFreq=1162, maxDocs=44421)
                0.016290206 = queryNorm
              0.29016933 = fieldWeight in 420, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6427093 = idf(docFreq=1162, maxDocs=44421)
                0.0625 = fieldNorm(doc=420)
          0.09525179 = weight(abstract_txt:sites in 420) [ClassicSimilarity], result of:
            0.09525179 = score(doc=420,freq=2.0), product of:
              0.19940406 = queryWeight, product of:
                2.264974 = boost
                5.4043584 = idf(docFreq=542, maxDocs=44421)
                0.016290206 = queryNorm
              0.4776823 = fieldWeight in 420, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.4043584 = idf(docFreq=542, maxDocs=44421)
                0.0625 = fieldNorm(doc=420)
          0.17272973 = weight(abstract_txt:page in 420) [ClassicSimilarity], result of:
            0.17272973 = score(doc=420,freq=2.0), product of:
              0.32636946 = queryWeight, product of:
                3.3459573 = boost
                5.987735 = idf(docFreq=302, maxDocs=44421)
                0.016290206 = queryNorm
              0.529246 = fieldWeight in 420, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.987735 = idf(docFreq=302, maxDocs=44421)
                0.0625 = fieldNorm(doc=420)
          0.26037103 = weight(abstract_txt:pages in 420) [ClassicSimilarity], result of:
            0.26037103 = score(doc=420,freq=3.0), product of:
              0.42906842 = queryWeight, product of:
                4.698665 = boost
                5.6056433 = idf(docFreq=443, maxDocs=44421)
                0.016290206 = queryNorm
              0.6068287 = fieldWeight in 420, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.6056433 = idf(docFreq=443, maxDocs=44421)
                0.0625 = fieldNorm(doc=420)
        0.24 = coord(6/25)