Document (#38812)

Author
Frické, M.
Title
Big data and its epistemology
Source
Journal of the Association for Information Science and Technology. 66(2015) no.4, pp. 651-661
Year
2015
Abstract
The article considers whether Big Data, in the form of data-driven science, will enable the discovery, or appraisal, of universal scientific theories, instrumentalist tools, or inductive inferences. It points out, initially, that such aspirations are similar to the now-discredited inductivist approach to science. On the positive side, Big Data may permit larger sample sizes, cheaper and more extensive testing of theories, and the continuous assessment of theories. On the negative side, data-driven science encourages passive data collection, as opposed to experimentation and testing, and hornswoggling ("unsound statistical fiddling"). The roles of theory and data in inductive algorithms, statistical modeling, and scientific discoveries are analyzed, and it is argued that theory is needed at every turn. Data-driven science is a chimera.
Content
Cf.: http://onlinelibrary.wiley.com/doi/10.1002/asi.23212/abstract.
Theme
Data Mining

Similar documents (author)

  1. Frické, M.: Faceted classification : orthogonal facets and graphs of foci? (2011) 5.76
    5.7603507 = sum of:
      5.7603507 = weight(author_txt:frické in 850) [ClassicSimilarity], result of:
        5.7603507 = fieldWeight in 850, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.216561 = idf(docFreq=11, maxDocs=44421)
          0.625 = fieldNorm(doc=850)
    
  2. Frické, M.: Reflections on classification : Thomas Reid and bibliographic description (2013) 5.76
    5.7603507 = sum of:
      5.7603507 = weight(author_txt:frické in 2766) [ClassicSimilarity], result of:
        5.7603507 = fieldWeight in 2766, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.216561 = idf(docFreq=11, maxDocs=44421)
          0.625 = fieldNorm(doc=2766)
    
  3. Frické, M.: Logic and the organization of information (2012) 5.76
    5.7603507 = sum of:
      5.7603507 = weight(author_txt:frické in 2782) [ClassicSimilarity], result of:
        5.7603507 = fieldWeight in 2782, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.216561 = idf(docFreq=11, maxDocs=44421)
          0.625 = fieldNorm(doc=2782)
    
  4. Frické, M.: Logical division (2016) 5.76
    5.7603507 = sum of:
      5.7603507 = weight(author_txt:frické in 4183) [ClassicSimilarity], result of:
        5.7603507 = fieldWeight in 4183, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.216561 = idf(docFreq=11, maxDocs=44421)
          0.625 = fieldNorm(doc=4183)
    
  5. Frické, M.: Logic and librarianship (2017) 5.76
    5.7603507 = sum of:
      5.7603507 = weight(author_txt:frické in 4504) [ClassicSimilarity], result of:
        5.7603507 = fieldWeight in 4504, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.216561 = idf(docFreq=11, maxDocs=44421)
          0.625 = fieldNorm(doc=4504)
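
The five author matches above share one score because each explain tree multiplies the same three factors: tf, idf and fieldNorm. The following is a minimal sketch of that arithmetic, assuming the ClassicSimilarity conventions tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)), which reproduce the figures in this listing. The function names are illustrative and are not Lucene API calls.

```python
import math


def classic_tf(freq: float) -> float:
    """Term-frequency factor: square root of the raw term count."""
    return math.sqrt(freq)


def classic_idf(doc_freq: int, max_docs: int) -> float:
    """Inverse document frequency in the form that matches the listing."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def field_weight(freq: float, doc_freq: int, max_docs: int, field_norm: float) -> float:
    """fieldWeight = tf * idf * fieldNorm, as shown in the explain tree."""
    return classic_tf(freq) * classic_idf(doc_freq, max_docs) * field_norm


# Values copied from the author_txt:frické entries above.
author_score = field_weight(freq=1.0, doc_freq=11, max_docs=44421, field_norm=0.625)
print(f"idf   = {classic_idf(11, 44421):.6f}")
print(f"score = {author_score:.7f}")
```

The result should agree with the listed 5.7603507 up to the rounding introduced by the single-precision floats in the engine's output.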
    

Similar documents (content)

  1. Fonseca, F.; Marcinkowski, M.; Davis, C.: Cyber-human systems of thought and understanding (2019) 0.10
    0.098893635 = sum of:
      0.098893635 = product of:
        0.49446815 = sum of:
          0.08523098 = weight(abstract_txt:encourages in 11) [ClassicSimilarity], result of:
            0.08523098 = score(doc=11,freq=1.0), product of:
              0.172243 = queryWeight, product of:
                1.1981865 = boost
                7.917278 = idf(docFreq=43, maxDocs=44421)
                0.01815688 = queryNorm
              0.49482986 = fieldWeight in 11, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.917278 = idf(docFreq=43, maxDocs=44421)
                0.0625 = fieldNorm(doc=11)
          0.059141327 = weight(abstract_txt:scientific in 11) [ClassicSimilarity], result of:
            0.059141327 = score(doc=11,freq=3.0), product of:
              0.11793432 = queryWeight, product of:
                1.4021314 = boost
                4.6324444 = idf(docFreq=1174, maxDocs=44421)
                0.01815688 = queryNorm
              0.5014768 = fieldWeight in 11, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.6324444 = idf(docFreq=1174, maxDocs=44421)
                0.0625 = fieldNorm(doc=11)
          0.05546575 = weight(abstract_txt:science in 11) [ClassicSimilarity], result of:
            0.05546575 = score(doc=11,freq=2.0), product of:
              0.16296831 = queryWeight, product of:
                2.3309622 = boost
                3.850585 = idf(docFreq=2567, maxDocs=44421)
                0.01815688 = queryNorm
              0.34034684 = fieldWeight in 11, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.850585 = idf(docFreq=2567, maxDocs=44421)
                0.0625 = fieldNorm(doc=11)
          0.12633285 = weight(abstract_txt:driven in 11) [ClassicSimilarity], result of:
            0.12633285 = score(doc=11,freq=1.0), product of:
              0.32294446 = queryWeight, product of:
                2.8416998 = boost
                6.25905 = idf(docFreq=230, maxDocs=44421)
                0.01815688 = queryNorm
              0.39119062 = fieldWeight in 11, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.25905 = idf(docFreq=230, maxDocs=44421)
                0.0625 = fieldNorm(doc=11)
          0.16829726 = weight(abstract_txt:data in 11) [ClassicSimilarity], result of:
            0.16829726 = score(doc=11,freq=11.0), product of:
              0.24379626 = queryWeight, product of:
                4.0319223 = boost
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.01815688 = queryNorm
              0.6903193 = fieldWeight in 11, product of:
                3.3166249 = tf(freq=11.0), with freq of:
                  11.0 = termFreq=11.0
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.0625 = fieldNorm(doc=11)
        0.2 = coord(5/25)
    
  2. Szostak, R.: Classifying science : phenomena, data, theory, method, practice (2004) 0.09
    0.08764543 = sum of:
      0.08764543 = product of:
        0.43822712 = sum of:
          0.045149714 = weight(abstract_txt:theory in 1325) [ClassicSimilarity], result of:
            0.045149714 = score(doc=1325,freq=2.0), product of:
              0.11276645 = queryWeight, product of:
                1.3710668 = boost
                4.529811 = idf(docFreq=1301, maxDocs=44421)
                0.01815688 = queryNorm
              0.4003825 = fieldWeight in 1325, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.529811 = idf(docFreq=1301, maxDocs=44421)
                0.0625 = fieldNorm(doc=1325)
          0.04828869 = weight(abstract_txt:scientific in 1325) [ClassicSimilarity], result of:
            0.04828869 = score(doc=1325,freq=2.0), product of:
              0.11793432 = queryWeight, product of:
                1.4021314 = boost
                4.6324444 = idf(docFreq=1174, maxDocs=44421)
                0.01815688 = queryNorm
              0.4094541 = fieldWeight in 1325, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.6324444 = idf(docFreq=1174, maxDocs=44421)
                0.0625 = fieldNorm(doc=1325)
          0.087699056 = weight(abstract_txt:science in 1325) [ClassicSimilarity], result of:
            0.087699056 = score(doc=1325,freq=5.0), product of:
              0.16296831 = queryWeight, product of:
                2.3309622 = boost
                3.850585 = idf(docFreq=2567, maxDocs=44421)
                0.01815688 = queryNorm
              0.53813565 = fieldWeight in 1325, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.850585 = idf(docFreq=2567, maxDocs=44421)
                0.0625 = fieldNorm(doc=1325)
          0.18532747 = weight(abstract_txt:theories in 1325) [ClassicSimilarity], result of:
            0.18532747 = score(doc=1325,freq=4.0), product of:
              0.26265773 = queryWeight, product of:
                2.5627685 = boost
                5.6446834 = idf(docFreq=426, maxDocs=44421)
                0.01815688 = queryNorm
              0.7055854 = fieldWeight in 1325, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.6446834 = idf(docFreq=426, maxDocs=44421)
                0.0625 = fieldNorm(doc=1325)
          0.07176219 = weight(abstract_txt:data in 1325) [ClassicSimilarity], result of:
            0.07176219 = score(doc=1325,freq=2.0), product of:
              0.24379626 = queryWeight, product of:
                4.0319223 = boost
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.01815688 = queryNorm
              0.29435313 = fieldWeight in 1325, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.0625 = fieldNorm(doc=1325)
        0.2 = coord(5/25)
    
  3. Wu, D.; Xu, H.; Sun, Y.; Lv, S.: What should we teach? : A human-centered data science graduate curriculum model design for iField schools (2023) 0.08
    0.084636614 = sum of:
      0.084636614 = product of:
        0.5289788 = sum of:
          0.1553547 = weight(abstract_txt:inductive in 1963) [ClassicSimilarity], result of:
            0.1553547 = score(doc=1963,freq=1.0), product of:
              0.32381937 = queryWeight, product of:
                2.323379 = boost
                7.676116 = idf(docFreq=55, maxDocs=44421)
                0.01815688 = queryNorm
              0.47975725 = fieldWeight in 1963, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.676116 = idf(docFreq=55, maxDocs=44421)
                0.0625 = fieldNorm(doc=1963)
          0.10376691 = weight(abstract_txt:science in 1963) [ClassicSimilarity], result of:
            0.10376691 = score(doc=1963,freq=7.0), product of:
              0.16296831 = queryWeight, product of:
                2.3309622 = boost
                3.850585 = idf(docFreq=2567, maxDocs=44421)
                0.01815688 = queryNorm
              0.6367306 = fieldWeight in 1963, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                3.850585 = idf(docFreq=2567, maxDocs=44421)
                0.0625 = fieldNorm(doc=1963)
          0.12633285 = weight(abstract_txt:driven in 1963) [ClassicSimilarity], result of:
            0.12633285 = score(doc=1963,freq=1.0), product of:
              0.32294446 = queryWeight, product of:
                2.8416998 = boost
                6.25905 = idf(docFreq=230, maxDocs=44421)
                0.01815688 = queryNorm
              0.39119062 = fieldWeight in 1963, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.25905 = idf(docFreq=230, maxDocs=44421)
                0.0625 = fieldNorm(doc=1963)
          0.14352438 = weight(abstract_txt:data in 1963) [ClassicSimilarity], result of:
            0.14352438 = score(doc=1963,freq=8.0), product of:
              0.24379626 = queryWeight, product of:
                4.0319223 = boost
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.01815688 = queryNorm
              0.58870625 = fieldWeight in 1963, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.0625 = fieldNorm(doc=1963)
        0.16 = coord(4/25)
    
  4. Vakkari, P.; Kuokkanen, M.: Theory growth in information science : applications of the theory of science to a theory of information seeking (1997) 0.08
    0.083680496 = sum of:
      0.083680496 = product of:
        0.5230031 = sum of:
          0.1368528 = weight(abstract_txt:theory in 5710) [ClassicSimilarity], result of:
            0.1368528 = score(doc=5710,freq=6.0), product of:
              0.11276645 = queryWeight, product of:
                1.3710668 = boost
                4.529811 = idf(docFreq=1301, maxDocs=44421)
                0.01815688 = queryNorm
              1.213595 = fieldWeight in 5710, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                4.529811 = idf(docFreq=1301, maxDocs=44421)
                0.109375 = fieldNorm(doc=5710)
          0.0597542 = weight(abstract_txt:scientific in 5710) [ClassicSimilarity], result of:
            0.0597542 = score(doc=5710,freq=1.0), product of:
              0.11793432 = queryWeight, product of:
                1.4021314 = boost
                4.6324444 = idf(docFreq=1174, maxDocs=44421)
                0.01815688 = queryNorm
              0.5066736 = fieldWeight in 5710, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6324444 = idf(docFreq=1174, maxDocs=44421)
                0.109375 = fieldNorm(doc=5710)
          0.09706506 = weight(abstract_txt:science in 5710) [ClassicSimilarity], result of:
            0.09706506 = score(doc=5710,freq=2.0), product of:
              0.16296831 = queryWeight, product of:
                2.3309622 = boost
                3.850585 = idf(docFreq=2567, maxDocs=44421)
                0.01815688 = queryNorm
              0.595607 = fieldWeight in 5710, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.850585 = idf(docFreq=2567, maxDocs=44421)
                0.109375 = fieldNorm(doc=5710)
          0.22933103 = weight(abstract_txt:theories in 5710) [ClassicSimilarity], result of:
            0.22933103 = score(doc=5710,freq=2.0), product of:
              0.26265773 = queryWeight, product of:
                2.5627685 = boost
                5.6446834 = idf(docFreq=426, maxDocs=44421)
                0.01815688 = queryNorm
              0.8731174 = fieldWeight in 5710, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.6446834 = idf(docFreq=426, maxDocs=44421)
                0.109375 = fieldNorm(doc=5710)
        0.16 = coord(4/25)
    
  5. Fattahi, R.: Towards developing theories about data : a philosophical and scientific approach (2022) 0.08
    0.08286381 = sum of:
      0.08286381 = product of:
        0.5178988 = sum of:
          0.04828869 = weight(abstract_txt:scientific in 2103) [ClassicSimilarity], result of:
            0.04828869 = score(doc=2103,freq=2.0), product of:
              0.11793432 = queryWeight, product of:
                1.4021314 = boost
                4.6324444 = idf(docFreq=1174, maxDocs=44421)
                0.01815688 = queryNorm
              0.4094541 = fieldWeight in 2103, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.6324444 = idf(docFreq=1174, maxDocs=44421)
                0.0625 = fieldNorm(doc=2103)
          0.039220206 = weight(abstract_txt:science in 2103) [ClassicSimilarity], result of:
            0.039220206 = score(doc=2103,freq=1.0), product of:
              0.16296831 = queryWeight, product of:
                2.3309622 = boost
                3.850585 = idf(docFreq=2567, maxDocs=44421)
                0.01815688 = queryNorm
              0.24066156 = fieldWeight in 2103, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.850585 = idf(docFreq=2567, maxDocs=44421)
                0.0625 = fieldNorm(doc=2103)
          0.26209262 = weight(abstract_txt:theories in 2103) [ClassicSimilarity], result of:
            0.26209262 = score(doc=2103,freq=8.0), product of:
              0.26265773 = queryWeight, product of:
                2.5627685 = boost
                5.6446834 = idf(docFreq=426, maxDocs=44421)
                0.01815688 = queryNorm
              0.99784845 = fieldWeight in 2103, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                5.6446834 = idf(docFreq=426, maxDocs=44421)
                0.0625 = fieldNorm(doc=2103)
          0.16829726 = weight(abstract_txt:data in 2103) [ClassicSimilarity], result of:
            0.16829726 = score(doc=2103,freq=11.0), product of:
              0.24379626 = queryWeight, product of:
                4.0319223 = boost
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.01815688 = queryNorm
              0.6903193 = fieldWeight in 2103, product of:
                3.3166249 = tf(freq=11.0), with freq of:
                  11.0 = termFreq=11.0
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.0625 = fieldNorm(doc=2103)
        0.16 = coord(4/25)
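
The content scores above combine several matched terms. Each term contributes queryWeight * fieldWeight, the contributions are summed, and coord(n/25) scales the sum by the fraction of query clauses that matched. The sketch below recomputes the score of the first content hit (doc 11) from the values in its explain tree, under the same assumed tf and idf formulas as the listing implies. All names are hypothetical helpers, not part of any library.

```python
import math

QUERY_NORM = 0.01815688   # queryNorm, copied from the explain output
MAX_DOCS = 44421          # maxDocs, copied from the explain output
QUERY_CLAUSES = 25        # denominator of coord(5/25)


def idf(doc_freq: int, max_docs: int = MAX_DOCS) -> float:
    """Assumed idf form: 1 + ln(maxDocs / (docFreq + 1))."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(freq: float, doc_freq: int, boost: float, field_norm: float) -> float:
    """Score of one matched term: queryWeight * fieldWeight."""
    term_idf = idf(doc_freq)
    query_weight = boost * term_idf * QUERY_NORM
    field_weight = math.sqrt(freq) * term_idf * field_norm
    return query_weight * field_weight


# (term, freq, docFreq, boost) for the terms matched in doc 11.
matched = [
    ("encourages", 1.0, 43, 1.1981865),
    ("scientific", 3.0, 1174, 1.4021314),
    ("science", 2.0, 2567, 2.3309622),
    ("driven", 1.0, 230, 2.8416998),
    ("data", 11.0, 4320, 4.0319223),
]

field_norm = 0.0625  # fieldNorm(doc=11)
coord = len(matched) / QUERY_CLAUSES
total = coord * sum(term_score(f, df, b, field_norm) for _, f, df, b in matched)
print(f"coord = {coord}, score = {total:.9f}")
```

The computed total should match the listed 0.098893635 to within floating-point rounding. The same helper applies to the other hits once their own freq, fieldNorm and coord values are substituted.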