Document (#39820)

Author
Anderson, C.
Title
¬The end of theory : the data deluge makes the scientific method obsolete
Source
Wired. 23.06.2008 [http://www.wired.com/2008/06/pb-theory/]
Year
2008
Abstract
So proclaimed statistician George Box 30 years ago, and he was right. But what choice did we have? Only models, from cosmological equations to theories of human behavior, seemed to be able to consistently, if imperfectly, explain the world around us. Until now. Today companies like Google, which have grown up in an era of massively abundant data, don't have to settle for wrong models. Indeed, they don't have to settle for models at all. Sixty years ago, digital computers made information readable. Twenty years ago, the Internet made it reachable. Ten years ago, the first search engine crawlers made it a single database. Now Google and like-minded companies are sifting through the most measured age in history, treating this massive corpus as a laboratory of the human condition. They are the children of the Petabyte Age. The Petabyte Age is different because more is different. Kilobytes were stored on floppy disks. Megabytes were stored on hard disks. Terabytes were stored in disk arrays. Petabytes are stored in the cloud. As we moved along that progression, we went from the folder analogy to the file cabinet analogy to the library analogy to - well, at petabytes we ran out of organizational analogies.

Similar documents (author)

  1. Anderson, J.D.: Indexing, teaching of, See: Information retrieval design (2002) 4.76
    4.7620935 = sum of:
      4.7620935 = weight(author_txt:anderson in 550) [ClassicSimilarity], result of:
        4.7620935 = fieldWeight in 550, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.61935 = idf(docFreq=58, maxDocs=44218)
          0.625 = fieldNorm(doc=550)
    
  2. Anderson, J.D.: Indexing and classification : file organization and display for information retrieval (1989) 4.76
    4.7620935 = sum of:
      4.7620935 = weight(author_txt:anderson in 873) [ClassicSimilarity], result of:
        4.7620935 = fieldWeight in 873, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.61935 = idf(docFreq=58, maxDocs=44218)
          0.625 = fieldNorm(doc=873)
    
  3. Anderson, M.D.: Book indexing (1971) 4.76
    4.7620935 = sum of:
      4.7620935 = weight(author_txt:anderson in 2742) [ClassicSimilarity], result of:
        4.7620935 = fieldWeight in 2742, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.61935 = idf(docFreq=58, maxDocs=44218)
          0.625 = fieldNorm(doc=2742)
    
  4. Anderson, B.: CD-ROM LANs : a new challenge for reference librarians (1992) 4.76
    4.7620935 = sum of:
      4.7620935 = weight(author_txt:anderson in 3951) [ClassicSimilarity], result of:
        4.7620935 = fieldWeight in 3951, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.61935 = idf(docFreq=58, maxDocs=44218)
          0.625 = fieldNorm(doc=3951)
    
  5. Anderson, C.: Reference on disc : the New Grolier Electronic Encyclopedia (1991) 4.76
    4.7620935 = sum of:
      4.7620935 = weight(author_txt:anderson in 4937) [ClassicSimilarity], result of:
        4.7620935 = fieldWeight in 4937, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.61935 = idf(docFreq=58, maxDocs=44218)
          0.625 = fieldNorm(doc=4937)
    

Similar documents (content)

  1. Gnoli, C.: ISKO News (2007) 0.11
    0.11139331 = sum of:
      0.11139331 = product of:
        0.55696654 = sum of:
          0.021797398 = weight(abstract_txt:human in 1092) [ClassicSimilarity], result of:
            0.021797398 = score(doc=1092,freq=1.0), product of:
              0.084948964 = queryWeight, product of:
                1.0587044 = boost
                4.692005 = idf(docFreq=1101, maxDocs=44218)
                0.017101133 = queryNorm
              0.25659403 = fieldWeight in 1092, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.692005 = idf(docFreq=1101, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1092)
          0.029881706 = weight(abstract_txt:like in 1092) [ClassicSimilarity], result of:
            0.029881706 = score(doc=1092,freq=1.0), product of:
              0.10483153 = queryWeight, product of:
                1.1760929 = boost
                5.212252 = idf(docFreq=654, maxDocs=44218)
                0.017101133 = queryNorm
              0.28504503 = fieldWeight in 1092, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.212252 = idf(docFreq=654, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1092)
          0.104544915 = weight(abstract_txt:don't in 1092) [ClassicSimilarity], result of:
            0.104544915 = score(doc=1092,freq=1.0), product of:
              0.24159628 = queryWeight, product of:
                1.7854216 = boost
                7.912698 = idf(docFreq=43, maxDocs=44218)
                0.017101133 = queryNorm
              0.43272567 = fieldWeight in 1092, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.912698 = idf(docFreq=43, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1092)
          0.043594796 = weight(abstract_txt:years in 1092) [ClassicSimilarity], result of:
            0.043594796 = score(doc=1092,freq=1.0), product of:
              0.16989793 = queryWeight, product of:
                2.1174088 = boost
                4.692005 = idf(docFreq=1101, maxDocs=44218)
                0.017101133 = queryNorm
              0.25659403 = fieldWeight in 1092, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.692005 = idf(docFreq=1101, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1092)
          0.35714775 = weight(abstract_txt:analogy in 1092) [ClassicSimilarity], result of:
            0.35714775 = score(doc=1092,freq=4.0), product of:
              0.3951822 = queryWeight, product of:
                2.7966619 = boost
                8.2629 = idf(docFreq=30, maxDocs=44218)
                0.017101133 = queryNorm
              0.9037547 = fieldWeight in 1092, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                8.2629 = idf(docFreq=30, maxDocs=44218)
                0.0546875 = fieldNorm(doc=1092)
        0.2 = coord(5/25)
    
  2. Jha, A.: Why GPT-4 isn't all it's cracked up to be (2023) 0.11
    0.11014643 = sum of:
      0.11014643 = product of:
        0.3059623 = sum of:
          0.022018697 = weight(abstract_txt:human in 923) [ClassicSimilarity], result of:
            0.022018697 = score(doc=923,freq=2.0), product of:
              0.084948964 = queryWeight, product of:
                1.0587044 = boost
                4.692005 = idf(docFreq=1101, maxDocs=44218)
                0.017101133 = queryNorm
              0.2591991 = fieldWeight in 923, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.692005 = idf(docFreq=1101, maxDocs=44218)
                0.0390625 = fieldNorm(doc=923)
          0.021344077 = weight(abstract_txt:like in 923) [ClassicSimilarity], result of:
            0.021344077 = score(doc=923,freq=1.0), product of:
              0.10483153 = queryWeight, product of:
                1.1760929 = boost
                5.212252 = idf(docFreq=654, maxDocs=44218)
                0.017101133 = queryNorm
              0.2036036 = fieldWeight in 923, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.212252 = idf(docFreq=654, maxDocs=44218)
                0.0390625 = fieldNorm(doc=923)
          0.023327561 = weight(abstract_txt:google in 923) [ClassicSimilarity], result of:
            0.023327561 = score(doc=923,freq=1.0), product of:
              0.11122948 = queryWeight, product of:
                1.2114503 = boost
                5.3689504 = idf(docFreq=559, maxDocs=44218)
                0.017101133 = queryNorm
              0.20972462 = fieldWeight in 923, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.3689504 = idf(docFreq=559, maxDocs=44218)
                0.0390625 = fieldNorm(doc=923)
          0.017183593 = weight(abstract_txt:have in 923) [ClassicSimilarity], result of:
            0.017183593 = score(doc=923,freq=3.0), product of:
              0.079253644 = queryWeight, product of:
                1.4461731 = boost
                3.2046018 = idf(docFreq=4876, maxDocs=44218)
                0.017101133 = queryNorm
              0.21681769 = fieldWeight in 923, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.2046018 = idf(docFreq=4876, maxDocs=44218)
                0.0390625 = fieldNorm(doc=923)
          0.041154128 = weight(abstract_txt:companies in 923) [ClassicSimilarity], result of:
            0.041154128 = score(doc=923,freq=1.0), product of:
              0.16239873 = queryWeight, product of:
                1.4638176 = boost
                6.487401 = idf(docFreq=182, maxDocs=44218)
                0.017101133 = queryNorm
              0.2534141 = fieldWeight in 923, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.487401 = idf(docFreq=182, maxDocs=44218)
                0.0390625 = fieldNorm(doc=923)
          0.021579431 = weight(abstract_txt:made in 923) [ClassicSimilarity], result of:
            0.021579431 = score(doc=923,freq=1.0), product of:
              0.12088269 = queryWeight, product of:
                1.5467615 = boost
                4.5699964 = idf(docFreq=1244, maxDocs=44218)
                0.017101133 = queryNorm
              0.17851548 = fieldWeight in 923, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5699964 = idf(docFreq=1244, maxDocs=44218)
                0.0390625 = fieldNorm(doc=923)
          0.022609364 = weight(abstract_txt:models in 923) [ClassicSimilarity], result of:
            0.022609364 = score(doc=923,freq=1.0), product of:
              0.124699004 = queryWeight, product of:
                1.5709877 = boost
                4.6415744 = idf(docFreq=1158, maxDocs=44218)
                0.017101133 = queryNorm
              0.1813115 = fieldWeight in 923, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6415744 = idf(docFreq=1158, maxDocs=44218)
                0.0390625 = fieldNorm(doc=923)
          0.10560631 = weight(abstract_txt:don't in 923) [ClassicSimilarity], result of:
            0.10560631 = score(doc=923,freq=2.0), product of:
              0.24159628 = queryWeight, product of:
                1.7854216 = boost
                7.912698 = idf(docFreq=43, maxDocs=44218)
                0.017101133 = queryNorm
              0.43711895 = fieldWeight in 923, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.912698 = idf(docFreq=43, maxDocs=44218)
                0.0390625 = fieldNorm(doc=923)
          0.03113914 = weight(abstract_txt:years in 923) [ClassicSimilarity], result of:
            0.03113914 = score(doc=923,freq=1.0), product of:
              0.16989793 = queryWeight, product of:
                2.1174088 = boost
                4.692005 = idf(docFreq=1101, maxDocs=44218)
                0.017101133 = queryNorm
              0.18328145 = fieldWeight in 923, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.692005 = idf(docFreq=1101, maxDocs=44218)
                0.0390625 = fieldNorm(doc=923)
        0.36 = coord(9/25)
    
  3. Whaley, J.H.: Digitizing history (1994) 0.08
    0.08323783 = sum of:
      0.08323783 = product of:
        0.6936486 = sum of:
          0.06905418 = weight(abstract_txt:made in 5467) [ClassicSimilarity], result of:
            0.06905418 = score(doc=5467,freq=1.0), product of:
              0.12088269 = queryWeight, product of:
                1.5467615 = boost
                4.5699964 = idf(docFreq=1244, maxDocs=44218)
                0.017101133 = queryNorm
              0.57124954 = fieldWeight in 5467, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5699964 = idf(docFreq=1244, maxDocs=44218)
                0.125 = fieldNorm(doc=5467)
          0.37705973 = weight(abstract_txt:disks in 5467) [ClassicSimilarity], result of:
            0.37705973 = score(doc=5467,freq=1.0), product of:
              0.3274516 = queryWeight, product of:
                2.0785918 = boost
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.017101133 = queryNorm
              1.1514976 = fieldWeight in 5467, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.125 = fieldNorm(doc=5467)
          0.24753469 = weight(abstract_txt:stored in 5467) [ClassicSimilarity], result of:
            0.24753469 = score(doc=5467,freq=1.0), product of:
              0.3116313 = queryWeight, product of:
                2.8676834 = boost
                6.3545527 = idf(docFreq=208, maxDocs=44218)
                0.017101133 = queryNorm
              0.7943191 = fieldWeight in 5467, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.3545527 = idf(docFreq=208, maxDocs=44218)
                0.125 = fieldNorm(doc=5467)
        0.12 = coord(3/25)
    
  4. Ford, D.A.; Christodoukalis, S.: File organizations for optical disks (1992) 0.07
    0.07312587 = sum of:
      0.07312587 = product of:
        0.6093823 = sum of:
          0.17577723 = weight(abstract_txt:cabinet in 3501) [ClassicSimilarity], result of:
            0.17577723 = score(doc=3501,freq=1.0), product of:
              0.18929157 = queryWeight, product of:
                1.1174968 = boost
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.017101133 = queryNorm
              0.9286057 = fieldWeight in 3501, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.09375 = fieldNorm(doc=3501)
          0.033672825 = weight(abstract_txt:have in 3501) [ClassicSimilarity], result of:
            0.033672825 = score(doc=3501,freq=2.0), product of:
              0.079253644 = queryWeight, product of:
                1.4461731 = boost
                3.2046018 = idf(docFreq=4876, maxDocs=44218)
                0.017101133 = queryNorm
              0.42487416 = fieldWeight in 3501, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.2046018 = idf(docFreq=4876, maxDocs=44218)
                0.09375 = fieldNorm(doc=3501)
          0.3999322 = weight(abstract_txt:disks in 3501) [ClassicSimilarity], result of:
            0.3999322 = score(doc=3501,freq=2.0), product of:
              0.3274516 = queryWeight, product of:
                2.0785918 = boost
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.017101133 = queryNorm
              1.2213476 = fieldWeight in 3501, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.09375 = fieldNorm(doc=3501)
        0.12 = coord(3/25)
    
  5. Robertson, C.: ¬The filing cabinet : a vertical history of information (2021) 0.07
    0.07113065 = sum of:
      0.07113065 = product of:
        0.44456658 = sum of:
          0.287043 = weight(abstract_txt:cabinet in 641) [ClassicSimilarity], result of:
            0.287043 = score(doc=641,freq=6.0), product of:
              0.18929157 = queryWeight, product of:
                1.1174968 = boost
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.017101133 = queryNorm
              1.5164068 = fieldWeight in 641, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.0625 = fieldNorm(doc=641)
          0.017882705 = weight(abstract_txt:were in 641) [ClassicSimilarity], result of:
            0.017882705 = score(doc=641,freq=1.0), product of:
              0.07796139 = queryWeight, product of:
                1.2421701 = boost
                3.6700637 = idf(docFreq=3061, maxDocs=44218)
                0.017101133 = queryNorm
              0.22937898 = fieldWeight in 641, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6700637 = idf(docFreq=3061, maxDocs=44218)
                0.0625 = fieldNorm(doc=641)
          0.015873523 = weight(abstract_txt:have in 641) [ClassicSimilarity], result of:
            0.015873523 = score(doc=641,freq=1.0), product of:
              0.079253644 = queryWeight, product of:
                1.4461731 = boost
                3.2046018 = idf(docFreq=4876, maxDocs=44218)
                0.017101133 = queryNorm
              0.20028761 = fieldWeight in 641, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.2046018 = idf(docFreq=4876, maxDocs=44218)
                0.0625 = fieldNorm(doc=641)
          0.123767346 = weight(abstract_txt:stored in 641) [ClassicSimilarity], result of:
            0.123767346 = score(doc=641,freq=1.0), product of:
              0.3116313 = queryWeight, product of:
                2.8676834 = boost
                6.3545527 = idf(docFreq=208, maxDocs=44218)
                0.017101133 = queryNorm
              0.39715955 = fieldWeight in 641, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.3545527 = idf(docFreq=208, maxDocs=44218)
                0.0625 = fieldNorm(doc=641)
        0.16 = coord(4/25)