Document (#43411)

Author
Fonseca, F.
Title
Whether or when : the question on the use of theories in data science
Source
Journal of the Association for Information Science and Technology. 72(2021) no.12, S.1593-1604
Year
2021
Abstract
Data Science can be considered a technique or a science. As a technique, it is more interested in the "what" than in the "why" of data. It does not need theories that explain how things work, it just needs the results. As a science, however, working strictly from data and without theories contradicts the post-empiricist view of science. In this view, theories come before data and data is used to corroborate or falsify theories. Nevertheless, one of the most controversial statements about Data Science is that it is a science that can work without theories. In this conceptual paper, we focus on the science aspect of Data Science. How is Data Science as a science? We propose a three-phased view of Data Science that shows that different theories have different roles in each of the phases we consider. We focus on when theories are used in Data Science rather than the controversy of whether theories are used in Data Science or not. In the end, we will see that the statement "Data Science works without theories" is better put as "in some of its phases, Data Science works without the theories that originally motivated the creation of the data."
Content
Vgl.: https://asistdl.onlinelibrary.wiley.com/doi/10.1002/asi.24537.

Similar documents (author)

  1. Fonseca, F.: ¬The double role of ontologies in information science research (2007) 5.66
    5.664006 = sum of:
      5.664006 = weight(author_txt:fonseca in 1277) [ClassicSimilarity], result of:
        5.664006 = fieldWeight in 1277, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.06241 = idf(docFreq=13, maxDocs=44421)
          0.625 = fieldNorm(doc=1277)
    
  2. Scott, M.; Fonseca, F.: Methodology for functional appraisal of records and creation of a functional thesaurus (1992) 4.53
    4.531205 = sum of:
      4.531205 = weight(author_txt:fonseca in 2095) [ClassicSimilarity], result of:
        4.531205 = fieldWeight in 2095, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.06241 = idf(docFreq=13, maxDocs=44421)
          0.5 = fieldNorm(doc=2095)
    
  3. Fonseca, F.T.; Martin, J.E.: Toward an alternative notion of information systems ontologies : information engineering as a hermeneutic enterprise (2005) 4.53
    4.531205 = sum of:
      4.531205 = weight(author_txt:fonseca in 4266) [ClassicSimilarity], result of:
        4.531205 = fieldWeight in 4266, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.06241 = idf(docFreq=13, maxDocs=44421)
          0.5 = fieldNorm(doc=4266)
    
  4. Câmara, G.; Fonseca, F.: Information policies and open source software in developing countries (2007) 4.53
    4.531205 = sum of:
      4.531205 = weight(author_txt:fonseca in 1090) [ClassicSimilarity], result of:
        4.531205 = fieldWeight in 1090, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.06241 = idf(docFreq=13, maxDocs=44421)
          0.5 = fieldNorm(doc=1090)
    
  5. Marcinkowski, M.; Fonseca, F.: ¬The conditions of peak empiricism in big data and interaction design (2016) 4.53
    4.531205 = sum of:
      4.531205 = weight(author_txt:fonseca in 3924) [ClassicSimilarity], result of:
        4.531205 = fieldWeight in 3924, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.06241 = idf(docFreq=13, maxDocs=44421)
          0.5 = fieldNorm(doc=3924)
    

Similar documents (content)

  1. Fattahi, R.: Towards developing theories about data : a philosophical and scientific approach (2022) 0.28
    0.2845892 = sum of:
      0.2845892 = product of:
        1.01639 = sum of:
          0.011102594 = weight(abstract_txt:different in 2103) [ClassicSimilarity], result of:
            0.011102594 = score(doc=2103,freq=1.0), product of:
              0.048539475 = queryWeight, product of:
                1.0508518 = boost
                3.6597328 = idf(docFreq=3107, maxDocs=44421)
                0.012621303 = queryNorm
              0.2287333 = fieldWeight in 2103, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6597328 = idf(docFreq=3107, maxDocs=44421)
                0.0625 = fieldNorm(doc=2103)
          0.012855804 = weight(abstract_txt:used in 2103) [ClassicSimilarity], result of:
            0.012855804 = score(doc=2103,freq=1.0), product of:
              0.061269168 = queryWeight, product of:
                1.4459742 = boost
                3.3572001 = idf(docFreq=4205, maxDocs=44421)
                0.012621303 = queryNorm
              0.20982501 = fieldWeight in 2103, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3572001 = idf(docFreq=4205, maxDocs=44421)
                0.0625 = fieldNorm(doc=2103)
          0.032329075 = weight(abstract_txt:works in 2103) [ClassicSimilarity], result of:
            0.032329075 = score(doc=2103,freq=1.0), product of:
              0.09897852 = queryWeight, product of:
                1.5005982 = boost
                5.226035 = idf(docFreq=648, maxDocs=44421)
                0.012621303 = queryNorm
              0.3266272 = fieldWeight in 2103, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.226035 = idf(docFreq=648, maxDocs=44421)
                0.0625 = fieldNorm(doc=2103)
          0.014829178 = weight(abstract_txt:that in 2103) [ClassicSimilarity], result of:
            0.014829178 = score(doc=2103,freq=2.0), product of:
              0.0709419 = queryWeight, product of:
                2.3767276 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.012621303 = queryNorm
              0.20903271 = fieldWeight in 2103, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.0625 = fieldNorm(doc=2103)
          0.20809174 = weight(abstract_txt:data in 2103) [ClassicSimilarity], result of:
            0.20809174 = score(doc=2103,freq=11.0), product of:
              0.30144274 = queryWeight, product of:
                7.1717806 = boost
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.012621303 = queryNorm
              0.6903193 = fieldWeight in 2103, product of:
                3.3166249 = tf(freq=11.0), with freq of:
                  11.0 = termFreq=11.0
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.0625 = fieldNorm(doc=2103)
          0.1034538 = weight(abstract_txt:science in 2103) [ClassicSimilarity], result of:
            0.1034538 = score(doc=2103,freq=1.0), product of:
              0.42987254 = queryWeight, product of:
                8.845223 = boost
                3.850585 = idf(docFreq=2567, maxDocs=44421)
                0.012621303 = queryNorm
              0.24066156 = fieldWeight in 2103, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.850585 = idf(docFreq=2567, maxDocs=44421)
                0.0625 = fieldNorm(doc=2103)
          0.6337278 = weight(abstract_txt:theories in 2103) [ClassicSimilarity], result of:
            0.6337278 = score(doc=2103,freq=8.0), product of:
              0.6350942 = queryWeight, product of:
                8.914447 = boost
                5.6446834 = idf(docFreq=426, maxDocs=44421)
                0.012621303 = queryNorm
              0.99784845 = fieldWeight in 2103, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                5.6446834 = idf(docFreq=426, maxDocs=44421)
                0.0625 = fieldNorm(doc=2103)
        0.28 = coord(7/25)
    
  2. Krebs, J.: Information transfer as a metaphor (2014) 0.22
    0.2223554 = sum of:
      0.2223554 = product of:
        0.69486064 = sum of:
          0.011102594 = weight(abstract_txt:different in 4395) [ClassicSimilarity], result of:
            0.011102594 = score(doc=4395,freq=1.0), product of:
              0.048539475 = queryWeight, product of:
                1.0508518 = boost
                3.6597328 = idf(docFreq=3107, maxDocs=44421)
                0.012621303 = queryNorm
              0.2287333 = fieldWeight in 4395, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6597328 = idf(docFreq=3107, maxDocs=44421)
                0.0625 = fieldNorm(doc=4395)
          0.016143255 = weight(abstract_txt:when in 4395) [ClassicSimilarity], result of:
            0.016143255 = score(doc=4395,freq=1.0), product of:
              0.062297817 = queryWeight, product of:
                1.1905026 = boost
                4.1460857 = idf(docFreq=1910, maxDocs=44421)
                0.012621303 = queryNorm
              0.25913036 = fieldWeight in 4395, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1460857 = idf(docFreq=1910, maxDocs=44421)
                0.0625 = fieldNorm(doc=4395)
          0.012855804 = weight(abstract_txt:used in 4395) [ClassicSimilarity], result of:
            0.012855804 = score(doc=4395,freq=1.0), product of:
              0.061269168 = queryWeight, product of:
                1.4459742 = boost
                3.3572001 = idf(docFreq=4205, maxDocs=44421)
                0.012621303 = queryNorm
              0.20982501 = fieldWeight in 4395, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3572001 = idf(docFreq=4205, maxDocs=44421)
                0.0625 = fieldNorm(doc=4395)
          0.04280456 = weight(abstract_txt:view in 4395) [ClassicSimilarity], result of:
            0.04280456 = score(doc=4395,freq=1.0), product of:
              0.1366163 = queryWeight, product of:
                2.15919 = boost
                5.013113 = idf(docFreq=802, maxDocs=44421)
                0.012621303 = queryNorm
              0.31331956 = fieldWeight in 4395, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.013113 = idf(docFreq=802, maxDocs=44421)
                0.0625 = fieldNorm(doc=4395)
          0.014829178 = weight(abstract_txt:that in 4395) [ClassicSimilarity], result of:
            0.014829178 = score(doc=4395,freq=2.0), product of:
              0.0709419 = queryWeight, product of:
                2.3767276 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.012621303 = queryNorm
              0.20903271 = fieldWeight in 4395, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.0625 = fieldNorm(doc=4395)
          0.06274202 = weight(abstract_txt:data in 4395) [ClassicSimilarity], result of:
            0.06274202 = score(doc=4395,freq=1.0), product of:
              0.30144274 = queryWeight, product of:
                7.1717806 = boost
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.012621303 = queryNorm
              0.20813909 = fieldWeight in 4395, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.0625 = fieldNorm(doc=4395)
          0.14630577 = weight(abstract_txt:science in 4395) [ClassicSimilarity], result of:
            0.14630577 = score(doc=4395,freq=2.0), product of:
              0.42987254 = queryWeight, product of:
                8.845223 = boost
                3.850585 = idf(docFreq=2567, maxDocs=44421)
                0.012621303 = queryNorm
              0.34034684 = fieldWeight in 4395, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.850585 = idf(docFreq=2567, maxDocs=44421)
                0.0625 = fieldNorm(doc=4395)
          0.38807744 = weight(abstract_txt:theories in 4395) [ClassicSimilarity], result of:
            0.38807744 = score(doc=4395,freq=3.0), product of:
              0.6350942 = queryWeight, product of:
                8.914447 = boost
                5.6446834 = idf(docFreq=426, maxDocs=44421)
                0.012621303 = queryNorm
              0.6110549 = fieldWeight in 4395, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.6446834 = idf(docFreq=426, maxDocs=44421)
                0.0625 = fieldNorm(doc=4395)
        0.32 = coord(8/25)
    
  3. Hjoerland, B.: Concept theory (2009) 0.20
    0.2020259 = sum of:
      0.2020259 = product of:
        0.84177464 = sum of:
          0.01570144 = weight(abstract_txt:different in 448) [ClassicSimilarity], result of:
            0.01570144 = score(doc=448,freq=2.0), product of:
              0.048539475 = queryWeight, product of:
                1.0508518 = boost
                3.6597328 = idf(docFreq=3107, maxDocs=44421)
                0.012621303 = queryNorm
              0.32347775 = fieldWeight in 448, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.6597328 = idf(docFreq=3107, maxDocs=44421)
                0.0625 = fieldNorm(doc=448)
          0.06053479 = weight(abstract_txt:view in 448) [ClassicSimilarity], result of:
            0.06053479 = score(doc=448,freq=2.0), product of:
              0.1366163 = queryWeight, product of:
                2.15919 = boost
                5.013113 = idf(docFreq=802, maxDocs=44421)
                0.012621303 = queryNorm
              0.44310078 = fieldWeight in 448, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.013113 = idf(docFreq=802, maxDocs=44421)
                0.0625 = fieldNorm(doc=448)
          0.020971624 = weight(abstract_txt:that in 448) [ClassicSimilarity], result of:
            0.020971624 = score(doc=448,freq=4.0), product of:
              0.0709419 = queryWeight, product of:
                2.3767276 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.012621303 = queryNorm
              0.2956169 = fieldWeight in 448, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.0625 = fieldNorm(doc=448)
          0.06437371 = weight(abstract_txt:without in 448) [ClassicSimilarity], result of:
            0.06437371 = score(doc=448,freq=1.0), product of:
              0.19737604 = queryWeight, product of:
                2.996789 = boost
                5.2183604 = idf(docFreq=653, maxDocs=44421)
                0.012621303 = queryNorm
              0.32614753 = fieldWeight in 448, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2183604 = idf(docFreq=653, maxDocs=44421)
                0.0625 = fieldNorm(doc=448)
          0.17918724 = weight(abstract_txt:science in 448) [ClassicSimilarity], result of:
            0.17918724 = score(doc=448,freq=3.0), product of:
              0.42987254 = queryWeight, product of:
                8.845223 = boost
                3.850585 = idf(docFreq=2567, maxDocs=44421)
                0.012621303 = queryNorm
              0.41683805 = fieldWeight in 448, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.850585 = idf(docFreq=2567, maxDocs=44421)
                0.0625 = fieldNorm(doc=448)
          0.5010058 = weight(abstract_txt:theories in 448) [ClassicSimilarity], result of:
            0.5010058 = score(doc=448,freq=5.0), product of:
              0.6350942 = queryWeight, product of:
                8.914447 = boost
                5.6446834 = idf(docFreq=426, maxDocs=44421)
                0.012621303 = queryNorm
              0.7888685 = fieldWeight in 448, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.6446834 = idf(docFreq=426, maxDocs=44421)
                0.0625 = fieldNorm(doc=448)
        0.24 = coord(6/25)
    
  4. Frické, M.: Big data and its epistemology (2015) 0.20
    0.2003895 = sum of:
      0.2003895 = product of:
        1.0019475 = sum of:
          0.032180015 = weight(abstract_txt:whether in 2811) [ClassicSimilarity], result of:
            0.032180015 = score(doc=2811,freq=1.0), product of:
              0.08503471 = queryWeight, product of:
                1.3908877 = boost
                4.8439536 = idf(docFreq=950, maxDocs=44421)
                0.012621303 = queryNorm
              0.37843388 = fieldWeight in 2811, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8439536 = idf(docFreq=950, maxDocs=44421)
                0.078125 = fieldNorm(doc=2811)
          0.018536475 = weight(abstract_txt:that in 2811) [ClassicSimilarity], result of:
            0.018536475 = score(doc=2811,freq=2.0), product of:
              0.0709419 = queryWeight, product of:
                2.3767276 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.012621303 = queryNorm
              0.2612909 = fieldWeight in 2811, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.078125 = fieldNorm(doc=2811)
          0.20749971 = weight(abstract_txt:data in 2811) [ClassicSimilarity], result of:
            0.20749971 = score(doc=2811,freq=7.0), product of:
              0.30144274 = queryWeight, product of:
                7.1717806 = boost
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.012621303 = queryNorm
              0.6883553 = fieldWeight in 2811, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.078125 = fieldNorm(doc=2811)
          0.2586345 = weight(abstract_txt:science in 2811) [ClassicSimilarity], result of:
            0.2586345 = score(doc=2811,freq=4.0), product of:
              0.42987254 = queryWeight, product of:
                8.845223 = boost
                3.850585 = idf(docFreq=2567, maxDocs=44421)
                0.012621303 = queryNorm
              0.60165393 = fieldWeight in 2811, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.850585 = idf(docFreq=2567, maxDocs=44421)
                0.078125 = fieldNorm(doc=2811)
          0.48509678 = weight(abstract_txt:theories in 2811) [ClassicSimilarity], result of:
            0.48509678 = score(doc=2811,freq=3.0), product of:
              0.6350942 = queryWeight, product of:
                8.914447 = boost
                5.6446834 = idf(docFreq=426, maxDocs=44421)
                0.012621303 = queryNorm
              0.7638186 = fieldWeight in 2811, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.6446834 = idf(docFreq=426, maxDocs=44421)
                0.078125 = fieldNorm(doc=2811)
        0.2 = coord(5/25)
    
  5. Szostak, R.: Classifying science : phenomena, data, theory, method, practice (2004) 0.20
    0.19501 = sum of:
      0.19501 = product of:
        0.8125417 = sum of:
          0.01570144 = weight(abstract_txt:different in 1325) [ClassicSimilarity], result of:
            0.01570144 = score(doc=1325,freq=2.0), product of:
              0.048539475 = queryWeight, product of:
                1.0508518 = boost
                3.6597328 = idf(docFreq=3107, maxDocs=44421)
                0.012621303 = queryNorm
              0.32347775 = fieldWeight in 1325, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.6597328 = idf(docFreq=3107, maxDocs=44421)
                0.0625 = fieldNorm(doc=1325)
          0.018180853 = weight(abstract_txt:used in 1325) [ClassicSimilarity], result of:
            0.018180853 = score(doc=1325,freq=2.0), product of:
              0.061269168 = queryWeight, product of:
                1.4459742 = boost
                3.3572001 = idf(docFreq=4205, maxDocs=44421)
                0.012621303 = queryNorm
              0.29673737 = fieldWeight in 1325, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.3572001 = idf(docFreq=4205, maxDocs=44421)
                0.0625 = fieldNorm(doc=1325)
          0.010485812 = weight(abstract_txt:that in 1325) [ClassicSimilarity], result of:
            0.010485812 = score(doc=1325,freq=1.0), product of:
              0.0709419 = queryWeight, product of:
                2.3767276 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.012621303 = queryNorm
              0.14780845 = fieldWeight in 1325, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.0625 = fieldNorm(doc=1325)
          0.08873061 = weight(abstract_txt:data in 1325) [ClassicSimilarity], result of:
            0.08873061 = score(doc=1325,freq=2.0), product of:
              0.30144274 = queryWeight, product of:
                7.1717806 = boost
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.012621303 = queryNorm
              0.29435313 = fieldWeight in 1325, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.0625 = fieldNorm(doc=1325)
          0.23132974 = weight(abstract_txt:science in 1325) [ClassicSimilarity], result of:
            0.23132974 = score(doc=1325,freq=5.0), product of:
              0.42987254 = queryWeight, product of:
                8.845223 = boost
                3.850585 = idf(docFreq=2567, maxDocs=44421)
                0.012621303 = queryNorm
              0.53813565 = fieldWeight in 1325, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.850585 = idf(docFreq=2567, maxDocs=44421)
                0.0625 = fieldNorm(doc=1325)
          0.44811323 = weight(abstract_txt:theories in 1325) [ClassicSimilarity], result of:
            0.44811323 = score(doc=1325,freq=4.0), product of:
              0.6350942 = queryWeight, product of:
                8.914447 = boost
                5.6446834 = idf(docFreq=426, maxDocs=44421)
                0.012621303 = queryNorm
              0.7055854 = fieldWeight in 1325, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.6446834 = idf(docFreq=426, maxDocs=44421)
                0.0625 = fieldNorm(doc=1325)
        0.24 = coord(6/25)