Document (#43411)

Author
Fonseca, F.
Title
Whether or when : the question on the use of theories in data science
Source
Journal of the Association for Information Science and Technology. 72(2021) no.12, S.1593-1604
Year
2021
Abstract
Data Science can be considered a technique or a science. As a technique, it is more interested in the "what" than in the "why" of data. It does not need theories that explain how things work, it just needs the results. As a science, however, working strictly from data and without theories contradicts the post-empiricist view of science. In this view, theories come before data and data is used to corroborate or falsify theories. Nevertheless, one of the most controversial statements about Data Science is that it is a science that can work without theories. In this conceptual paper, we focus on the science aspect of Data Science. How is Data Science as a science? We propose a three-phased view of Data Science that shows that different theories have different roles in each of the phases we consider. We focus on when theories are used in Data Science rather than the controversy of whether theories are used in Data Science or not. In the end, we will see that the statement "Data Science works without theories" is better put as "in some of its phases, Data Science works without the theories that originally motivated the creation of the data."
Content
Vgl.: https://asistdl.onlinelibrary.wiley.com/doi/10.1002/asi.24537.

Similar documents (author)

  1. Fonseca, F.: ¬The double role of ontologies in information science research (2007) 5.66
    5.661144 = sum of:
      5.661144 = weight(author_txt:fonseca in 277) [ClassicSimilarity], result of:
        5.661144 = fieldWeight in 277, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.05783 = idf(docFreq=13, maxDocs=44218)
          0.625 = fieldNorm(doc=277)
    
  2. Scott, M.; Fonseca, F.: Methodology for functional appraisal of records and creation of a functional thesaurus (1992) 4.53
    4.528915 = sum of:
      4.528915 = weight(author_txt:fonseca in 2096) [ClassicSimilarity], result of:
        4.528915 = fieldWeight in 2096, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.05783 = idf(docFreq=13, maxDocs=44218)
          0.5 = fieldNorm(doc=2096)
    
  3. Fonseca, F.T.; Martin, J.E.: Toward an alternative notion of information systems ontologies : information engineering as a hermeneutic enterprise (2005) 4.53
    4.528915 = sum of:
      4.528915 = weight(author_txt:fonseca in 3266) [ClassicSimilarity], result of:
        4.528915 = fieldWeight in 3266, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.05783 = idf(docFreq=13, maxDocs=44218)
          0.5 = fieldNorm(doc=3266)
    
  4. Câmara, G.; Fonseca, F.: Information policies and open source software in developing countries (2007) 4.53
    4.528915 = sum of:
      4.528915 = weight(author_txt:fonseca in 90) [ClassicSimilarity], result of:
        4.528915 = fieldWeight in 90, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.05783 = idf(docFreq=13, maxDocs=44218)
          0.5 = fieldNorm(doc=90)
    
  5. Marcinkowski, M.; Fonseca, F.: ¬The conditions of peak empiricism in big data and interaction design (2016) 4.53
    4.528915 = sum of:
      4.528915 = weight(author_txt:fonseca in 2924) [ClassicSimilarity], result of:
        4.528915 = fieldWeight in 2924, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.05783 = idf(docFreq=13, maxDocs=44218)
          0.5 = fieldNorm(doc=2924)
    

Similar documents (content)

  1. Fattahi, R.: Towards developing theories about data : a philosophical and scientific approach (2022) 0.29
    0.2850801 = sum of:
      0.2850801 = product of:
        1.0181432 = sum of:
          0.011131165 = weight(abstract_txt:different in 1101) [ClassicSimilarity], result of:
            0.011131165 = score(doc=1101,freq=1.0), product of:
              0.04858779 = queryWeight, product of:
                1.053201 = boost
                3.6655018 = idf(docFreq=3075, maxDocs=44218)
                0.012585848 = queryNorm
              0.22909386 = fieldWeight in 1101, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6655018 = idf(docFreq=3075, maxDocs=44218)
                0.0625 = fieldNorm(doc=1101)
          0.012852204 = weight(abstract_txt:used in 1101) [ClassicSimilarity], result of:
            0.012852204 = score(doc=1101,freq=1.0), product of:
              0.06121374 = queryWeight, product of:
                1.4478306 = boost
                3.3592992 = idf(docFreq=4177, maxDocs=44218)
                0.012585848 = queryNorm
              0.2099562 = fieldWeight in 1101, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3592992 = idf(docFreq=4177, maxDocs=44218)
                0.0625 = fieldNorm(doc=1101)
          0.032433562 = weight(abstract_txt:works in 1101) [ClassicSimilarity], result of:
            0.032433562 = score(doc=1101,freq=1.0), product of:
              0.09912043 = queryWeight, product of:
                1.504282 = boost
                5.2354193 = idf(docFreq=639, maxDocs=44218)
                0.012585848 = queryNorm
              0.3272137 = fieldWeight in 1101, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2354193 = idf(docFreq=639, maxDocs=44218)
                0.0625 = fieldNorm(doc=1101)
          0.014882633 = weight(abstract_txt:that in 1101) [ClassicSimilarity], result of:
            0.014882633 = score(doc=1101,freq=2.0), product of:
              0.07106122 = queryWeight, product of:
                2.382857 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.012585848 = queryNorm
              0.20943399 = fieldWeight in 1101, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.0625 = fieldNorm(doc=1101)
          0.20879105 = weight(abstract_txt:data in 1101) [ClassicSimilarity], result of:
            0.20879105 = score(doc=1101,freq=11.0), product of:
              0.30190074 = queryWeight, product of:
                7.1896935 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.012585848 = queryNorm
              0.6915884 = fieldWeight in 1101, product of:
                3.3166249 = tf(freq=11.0), with freq of:
                  11.0 = termFreq=11.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.0625 = fieldNorm(doc=1101)
          0.10406391 = weight(abstract_txt:science in 1101) [ClassicSimilarity], result of:
            0.10406391 = score(doc=1101,freq=1.0), product of:
              0.43125105 = queryWeight, product of:
                8.874783 = boost
                3.8609126 = idf(docFreq=2529, maxDocs=44218)
                0.012585848 = queryNorm
              0.24130704 = fieldWeight in 1101, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.8609126 = idf(docFreq=2529, maxDocs=44218)
                0.0625 = fieldNorm(doc=1101)
          0.6339886 = weight(abstract_txt:theories in 1101) [ClassicSimilarity], result of:
            0.6339886 = score(doc=1101,freq=8.0), product of:
              0.63481224 = queryWeight, product of:
                8.927948 = boost
                5.649515 = idf(docFreq=422, maxDocs=44218)
                0.012585848 = queryNorm
              0.9987026 = fieldWeight in 1101, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                5.649515 = idf(docFreq=422, maxDocs=44218)
                0.0625 = fieldNorm(doc=1101)
        0.28 = coord(7/25)
    
  2. Krebs, J.: Information transfer as a metaphor (2014) 0.22
    0.22275656 = sum of:
      0.22275656 = product of:
        0.6961143 = sum of:
          0.011131165 = weight(abstract_txt:different in 3395) [ClassicSimilarity], result of:
            0.011131165 = score(doc=3395,freq=1.0), product of:
              0.04858779 = queryWeight, product of:
                1.053201 = boost
                3.6655018 = idf(docFreq=3075, maxDocs=44218)
                0.012585848 = queryNorm
              0.22909386 = fieldWeight in 3395, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6655018 = idf(docFreq=3075, maxDocs=44218)
                0.0625 = fieldNorm(doc=3395)
          0.016134687 = weight(abstract_txt:when in 3395) [ClassicSimilarity], result of:
            0.016134687 = score(doc=3395,freq=1.0), product of:
              0.06223105 = queryWeight, product of:
                1.1919312 = boost
                4.148331 = idf(docFreq=1897, maxDocs=44218)
                0.012585848 = queryNorm
              0.2592707 = fieldWeight in 3395, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.148331 = idf(docFreq=1897, maxDocs=44218)
                0.0625 = fieldNorm(doc=3395)
          0.012852204 = weight(abstract_txt:used in 3395) [ClassicSimilarity], result of:
            0.012852204 = score(doc=3395,freq=1.0), product of:
              0.06121374 = queryWeight, product of:
                1.4478306 = boost
                3.3592992 = idf(docFreq=4177, maxDocs=44218)
                0.012585848 = queryNorm
              0.2099562 = fieldWeight in 3395, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3592992 = idf(docFreq=4177, maxDocs=44218)
                0.0625 = fieldNorm(doc=3395)
          0.042754993 = weight(abstract_txt:view in 3395) [ClassicSimilarity], result of:
            0.042754993 = score(doc=3395,freq=1.0), product of:
              0.13641278 = queryWeight, product of:
                2.1613288 = boost
                5.0147786 = idf(docFreq=797, maxDocs=44218)
                0.012585848 = queryNorm
              0.31342366 = fieldWeight in 3395, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.0147786 = idf(docFreq=797, maxDocs=44218)
                0.0625 = fieldNorm(doc=3395)
          0.014882633 = weight(abstract_txt:that in 3395) [ClassicSimilarity], result of:
            0.014882633 = score(doc=3395,freq=2.0), product of:
              0.07106122 = queryWeight, product of:
                2.382857 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.012585848 = queryNorm
              0.20943399 = fieldWeight in 3395, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.0625 = fieldNorm(doc=3395)
          0.06295287 = weight(abstract_txt:data in 3395) [ClassicSimilarity], result of:
            0.06295287 = score(doc=3395,freq=1.0), product of:
              0.30190074 = queryWeight, product of:
                7.1896935 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.012585848 = queryNorm
              0.20852174 = fieldWeight in 3395, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.0625 = fieldNorm(doc=3395)
          0.14716859 = weight(abstract_txt:science in 3395) [ClassicSimilarity], result of:
            0.14716859 = score(doc=3395,freq=2.0), product of:
              0.43125105 = queryWeight, product of:
                8.874783 = boost
                3.8609126 = idf(docFreq=2529, maxDocs=44218)
                0.012585848 = queryNorm
              0.3412597 = fieldWeight in 3395, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.8609126 = idf(docFreq=2529, maxDocs=44218)
                0.0625 = fieldNorm(doc=3395)
          0.38823715 = weight(abstract_txt:theories in 3395) [ClassicSimilarity], result of:
            0.38823715 = score(doc=3395,freq=3.0), product of:
              0.63481224 = queryWeight, product of:
                8.927948 = boost
                5.649515 = idf(docFreq=422, maxDocs=44218)
                0.012585848 = queryNorm
              0.6115779 = fieldWeight in 3395, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.649515 = idf(docFreq=422, maxDocs=44218)
                0.0625 = fieldNorm(doc=3395)
        0.32 = coord(8/25)
    
  3. Hjoerland, B.: Concept theory (2009) 0.20
    0.20237535 = sum of:
      0.20237535 = product of:
        0.84323066 = sum of:
          0.015741844 = weight(abstract_txt:different in 3461) [ClassicSimilarity], result of:
            0.015741844 = score(doc=3461,freq=2.0), product of:
              0.04858779 = queryWeight, product of:
                1.053201 = boost
                3.6655018 = idf(docFreq=3075, maxDocs=44218)
                0.012585848 = queryNorm
              0.32398763 = fieldWeight in 3461, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.6655018 = idf(docFreq=3075, maxDocs=44218)
                0.0625 = fieldNorm(doc=3461)
          0.06046469 = weight(abstract_txt:view in 3461) [ClassicSimilarity], result of:
            0.06046469 = score(doc=3461,freq=2.0), product of:
              0.13641278 = queryWeight, product of:
                2.1613288 = boost
                5.0147786 = idf(docFreq=797, maxDocs=44218)
                0.012585848 = queryNorm
              0.44324797 = fieldWeight in 3461, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.0147786 = idf(docFreq=797, maxDocs=44218)
                0.0625 = fieldNorm(doc=3461)
          0.021047223 = weight(abstract_txt:that in 3461) [ClassicSimilarity], result of:
            0.021047223 = score(doc=3461,freq=4.0), product of:
              0.07106122 = queryWeight, product of:
                2.382857 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.012585848 = queryNorm
              0.2961844 = fieldWeight in 3461, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.0625 = fieldNorm(doc=3461)
          0.06452089 = weight(abstract_txt:without in 3461) [ClassicSimilarity], result of:
            0.06452089 = score(doc=3461,freq=1.0), product of:
              0.1975348 = queryWeight, product of:
                3.0032015 = boost
                5.2260876 = idf(docFreq=645, maxDocs=44218)
                0.012585848 = queryNorm
              0.32663047 = fieldWeight in 3461, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2260876 = idf(docFreq=645, maxDocs=44218)
                0.0625 = fieldNorm(doc=3461)
          0.18024397 = weight(abstract_txt:science in 3461) [ClassicSimilarity], result of:
            0.18024397 = score(doc=3461,freq=3.0), product of:
              0.43125105 = queryWeight, product of:
                8.874783 = boost
                3.8609126 = idf(docFreq=2529, maxDocs=44218)
                0.012585848 = queryNorm
              0.41795602 = fieldWeight in 3461, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.8609126 = idf(docFreq=2529, maxDocs=44218)
                0.0625 = fieldNorm(doc=3461)
          0.50121206 = weight(abstract_txt:theories in 3461) [ClassicSimilarity], result of:
            0.50121206 = score(doc=3461,freq=5.0), product of:
              0.63481224 = queryWeight, product of:
                8.927948 = boost
                5.649515 = idf(docFreq=422, maxDocs=44218)
                0.012585848 = queryNorm
              0.78954375 = fieldWeight in 3461, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.649515 = idf(docFreq=422, maxDocs=44218)
                0.0625 = fieldNorm(doc=3461)
        0.24 = coord(6/25)
    
  4. Frické, M.: Big data and its epistemology (2015) 0.20
    0.20086782 = sum of:
      0.20086782 = product of:
        1.0043391 = sum of:
          0.032082483 = weight(abstract_txt:whether in 1811) [ClassicSimilarity], result of:
            0.032082483 = score(doc=1811,freq=1.0), product of:
              0.08480186 = queryWeight, product of:
                1.3913947 = boost
                4.8425326 = idf(docFreq=947, maxDocs=44218)
                0.012585848 = queryNorm
              0.37832287 = fieldWeight in 1811, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8425326 = idf(docFreq=947, maxDocs=44218)
                0.078125 = fieldNorm(doc=1811)
          0.018603291 = weight(abstract_txt:that in 1811) [ClassicSimilarity], result of:
            0.018603291 = score(doc=1811,freq=2.0), product of:
              0.07106122 = queryWeight, product of:
                2.382857 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.012585848 = queryNorm
              0.26179248 = fieldWeight in 1811, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.078125 = fieldNorm(doc=1811)
          0.20819704 = weight(abstract_txt:data in 1811) [ClassicSimilarity], result of:
            0.20819704 = score(doc=1811,freq=7.0), product of:
              0.30190074 = queryWeight, product of:
                7.1896935 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.012585848 = queryNorm
              0.68962085 = fieldWeight in 1811, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.078125 = fieldNorm(doc=1811)
          0.2601598 = weight(abstract_txt:science in 1811) [ClassicSimilarity], result of:
            0.2601598 = score(doc=1811,freq=4.0), product of:
              0.43125105 = queryWeight, product of:
                8.874783 = boost
                3.8609126 = idf(docFreq=2529, maxDocs=44218)
                0.012585848 = queryNorm
              0.6032676 = fieldWeight in 1811, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.8609126 = idf(docFreq=2529, maxDocs=44218)
                0.078125 = fieldNorm(doc=1811)
          0.48529646 = weight(abstract_txt:theories in 1811) [ClassicSimilarity], result of:
            0.48529646 = score(doc=1811,freq=3.0), product of:
              0.63481224 = queryWeight, product of:
                8.927948 = boost
                5.649515 = idf(docFreq=422, maxDocs=44218)
                0.012585848 = queryNorm
              0.7644724 = fieldWeight in 1811, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.649515 = idf(docFreq=422, maxDocs=44218)
                0.078125 = fieldNorm(doc=1811)
        0.2 = coord(5/25)
    
  5. Szostak, R.: Classifying science : phenomena, data, theory, method, practice (2004) 0.20
    0.19547081 = sum of:
      0.19547081 = product of:
        0.8144617 = sum of:
          0.015741844 = weight(abstract_txt:different in 325) [ClassicSimilarity], result of:
            0.015741844 = score(doc=325,freq=2.0), product of:
              0.04858779 = queryWeight, product of:
                1.053201 = boost
                3.6655018 = idf(docFreq=3075, maxDocs=44218)
                0.012585848 = queryNorm
              0.32398763 = fieldWeight in 325, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.6655018 = idf(docFreq=3075, maxDocs=44218)
                0.0625 = fieldNorm(doc=325)
          0.01817576 = weight(abstract_txt:used in 325) [ClassicSimilarity], result of:
            0.01817576 = score(doc=325,freq=2.0), product of:
              0.06121374 = queryWeight, product of:
                1.4478306 = boost
                3.3592992 = idf(docFreq=4177, maxDocs=44218)
                0.012585848 = queryNorm
              0.2969229 = fieldWeight in 325, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.3592992 = idf(docFreq=4177, maxDocs=44218)
                0.0625 = fieldNorm(doc=325)
          0.010523612 = weight(abstract_txt:that in 325) [ClassicSimilarity], result of:
            0.010523612 = score(doc=325,freq=1.0), product of:
              0.07106122 = queryWeight, product of:
                2.382857 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.012585848 = queryNorm
              0.1480922 = fieldWeight in 325, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.0625 = fieldNorm(doc=325)
          0.089028805 = weight(abstract_txt:data in 325) [ClassicSimilarity], result of:
            0.089028805 = score(doc=325,freq=2.0), product of:
              0.30190074 = queryWeight, product of:
                7.1896935 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.012585848 = queryNorm
              0.29489428 = fieldWeight in 325, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.0625 = fieldNorm(doc=325)
          0.23269397 = weight(abstract_txt:science in 325) [ClassicSimilarity], result of:
            0.23269397 = score(doc=325,freq=5.0), product of:
              0.43125105 = queryWeight, product of:
                8.874783 = boost
                3.8609126 = idf(docFreq=2529, maxDocs=44218)
                0.012585848 = queryNorm
              0.5395789 = fieldWeight in 325, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.8609126 = idf(docFreq=2529, maxDocs=44218)
                0.0625 = fieldNorm(doc=325)
          0.44829768 = weight(abstract_txt:theories in 325) [ClassicSimilarity], result of:
            0.44829768 = score(doc=325,freq=4.0), product of:
              0.63481224 = queryWeight, product of:
                8.927948 = boost
                5.649515 = idf(docFreq=422, maxDocs=44218)
                0.012585848 = queryNorm
              0.7061894 = fieldWeight in 325, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.649515 = idf(docFreq=422, maxDocs=44218)
                0.0625 = fieldNorm(doc=325)
        0.24 = coord(6/25)