Document (#38402)

Jacobfeuerborn, B.
Muraszkiewicz, M.
Big data and knowledge : Extracting to automate innovation. An outline of a formal model
Knowledge organization in the 21st century: between historical patterns and future prospects. Proceedings of the Thirteenth International ISKO Conference 19-22 May 2014, Kraków, Poland. Ed.: Wieslaw Babik
Würzburg : Ergon Verlag
Advances in knowledge organization; vol. 14
This paper aims to elaborate a preliminary methodology for significantly increasing the number of created innovations, be they of a technological, economic, social, or any other nature. The need for such a methodology stems from the fact that innovation is the most important factor in the growth of contemporary and future economies and societies. A sine qua non of any type of innovativeness is knowledge. And there's the rub. The increase of knowledge worldwide is relatively slow and falls short of the expectations and needs of scientific, business and other communities whose operations and growth are intrinsically knowledge dependent. We still suffer from the problem that a quick upsurge of data, towards what is dubbed big data, does not entail a comparable increase in knowledge. Thus we need methods and tools to discover and/or create more knowledge faster, and then, on top of that knowledge, to generate more innovations in a semi-automatic way. We propose a framework for a methodology to achieve this objective by tapping into big datasets to discover, explore and analyse knowledge and, through combinatorial operations, to obtain innovative solutions and objects. In the paper we briefly present our approach to defining data, information, and knowledge and show how these notions could be applied to define the notion of innovation; then we present the process of generating innovations. We close the discussion with a few remarks on the role big data could play in boosting not only innovation but also science, perhaps leading to a new scientific paradigm and significant changes in knowledge organisation and in library and information sciences.

Similar documents (content)

  1. Jaffe, A.B.; Rassenfosse, G. de: Patent citation data in social science research : overview and best practices (2017) 0.13
    0.12898062 = sum of:
      0.12898062 = product of:
        0.64490306 = sum of:
          0.06733875 = weight(abstract_txt:data in 4646) [ClassicSimilarity], result of:
            0.06733875 = score(doc=4646,freq=4.0), product of:
              0.12941106 = queryWeight, product of:
                2.0368085 = boost
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.019078646 = queryNorm
              0.5203477 = fieldWeight in 4646, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.078125 = fieldNorm(doc=4646)
          0.09646408 = weight(abstract_txt:increase in 4646) [ClassicSimilarity], result of:
            0.09646408 = score(doc=4646,freq=1.0), product of:
              0.2201788 = queryWeight, product of:
                2.057917 = boost
                5.6078978 = idf(docFreq=442, maxDocs=44421)
                0.019078646 = queryNorm
              0.43811703 = fieldWeight in 4646, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6078978 = idf(docFreq=442, maxDocs=44421)
                0.078125 = fieldNorm(doc=4646)
          0.1997305 = weight(abstract_txt:innovations in 4646) [ClassicSimilarity], result of:
            0.1997305 = score(doc=4646,freq=1.0), product of:
              0.35768002 = queryWeight, product of:
                2.6229348 = boost
                7.1475906 = idf(docFreq=94, maxDocs=44421)
                0.019078646 = queryNorm
              0.5584055 = fieldWeight in 4646, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.1475906 = idf(docFreq=94, maxDocs=44421)
                0.078125 = fieldNorm(doc=4646)
          0.20004842 = weight(abstract_txt:innovation in 4646) [ClassicSimilarity], result of:
            0.20004842 = score(doc=4646,freq=1.0), product of:
              0.39409545 = queryWeight, product of:
                3.1791441 = boost
                6.497461 = idf(docFreq=181, maxDocs=44421)
                0.019078646 = queryNorm
              0.50761414 = fieldWeight in 4646, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.497461 = idf(docFreq=181, maxDocs=44421)
                0.078125 = fieldNorm(doc=4646)
          0.081321366 = weight(abstract_txt:knowledge in 4646) [ClassicSimilarity], result of:
            0.081321366 = score(doc=4646,freq=1.0), product of:
              0.2935133 = queryWeight, product of:
                4.3380384 = boost
                3.5463927 = idf(docFreq=3480, maxDocs=44421)
                0.019078646 = queryNorm
              0.27706194 = fieldWeight in 4646, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.5463927 = idf(docFreq=3480, maxDocs=44421)
                0.078125 = fieldNorm(doc=4646)
        0.2 = coord(5/25)
  2. Evans, P.M.: Facilitating scientific communication for industrial innovation (2000) 0.12
    0.11675012 = sum of:
      0.11675012 = product of:
        0.5837506 = sum of:
          0.02658317 = weight(abstract_txt:need in 845) [ClassicSimilarity], result of:
            0.02658317 = score(doc=845,freq=1.0), product of:
              0.08145277 = queryWeight, product of:
                1.0219918 = boost
                4.1774464 = idf(docFreq=1851, maxDocs=44421)
                0.019078646 = queryNorm
              0.326363 = fieldWeight in 845, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1774464 = idf(docFreq=1851, maxDocs=44421)
                0.078125 = fieldNorm(doc=845)
          0.06278636 = weight(abstract_txt:scientific in 845) [ClassicSimilarity], result of:
            0.06278636 = score(doc=845,freq=3.0), product of:
              0.10016234 = queryWeight, product of:
                1.1333048 = boost
                4.6324444 = idf(docFreq=1174, maxDocs=44421)
                0.019078646 = queryNorm
              0.626846 = fieldWeight in 845, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.6324444 = idf(docFreq=1174, maxDocs=44421)
                0.078125 = fieldNorm(doc=845)
          0.09646408 = weight(abstract_txt:increase in 845) [ClassicSimilarity], result of:
            0.09646408 = score(doc=845,freq=1.0), product of:
              0.2201788 = queryWeight, product of:
                2.057917 = boost
                5.6078978 = idf(docFreq=442, maxDocs=44421)
                0.019078646 = queryNorm
              0.43811703 = fieldWeight in 845, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6078978 = idf(docFreq=442, maxDocs=44421)
                0.078125 = fieldNorm(doc=845)
          0.28291118 = weight(abstract_txt:innovation in 845) [ClassicSimilarity], result of:
            0.28291118 = score(doc=845,freq=2.0), product of:
              0.39409545 = queryWeight, product of:
                3.1791441 = boost
                6.497461 = idf(docFreq=181, maxDocs=44421)
                0.019078646 = queryNorm
              0.71787477 = fieldWeight in 845, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.497461 = idf(docFreq=181, maxDocs=44421)
                0.078125 = fieldNorm(doc=845)
          0.11500577 = weight(abstract_txt:knowledge in 845) [ClassicSimilarity], result of:
            0.11500577 = score(doc=845,freq=2.0), product of:
              0.2935133 = queryWeight, product of:
                4.3380384 = boost
                3.5463927 = idf(docFreq=3480, maxDocs=44421)
                0.019078646 = queryNorm
              0.39182472 = fieldWeight in 845, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5463927 = idf(docFreq=3480, maxDocs=44421)
                0.078125 = fieldNorm(doc=845)
        0.2 = coord(5/25)
  3. Martin, K.; Quan-Haase, A.: Are e-books replacing print books? : tradition, serendipity, and opportunity in the adoption and use of e-books for historical research and teaching (2013) 0.11
    0.10937845 = sum of:
      0.10937845 = product of:
        0.5468922 = sum of:
          0.021266537 = weight(abstract_txt:need in 1748) [ClassicSimilarity], result of:
            0.021266537 = score(doc=1748,freq=1.0), product of:
              0.08145277 = queryWeight, product of:
                1.0219918 = boost
                4.1774464 = idf(docFreq=1851, maxDocs=44421)
                0.019078646 = queryNorm
              0.2610904 = fieldWeight in 1748, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1774464 = idf(docFreq=1851, maxDocs=44421)
                0.0625 = fieldNorm(doc=1748)
          0.0269355 = weight(abstract_txt:data in 1748) [ClassicSimilarity], result of:
            0.0269355 = score(doc=1748,freq=1.0), product of:
              0.12941106 = queryWeight, product of:
                2.0368085 = boost
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.019078646 = queryNorm
              0.20813909 = fieldWeight in 1748, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.0625 = fieldNorm(doc=1748)
          0.22596925 = weight(abstract_txt:innovations in 1748) [ClassicSimilarity], result of:
            0.22596925 = score(doc=1748,freq=2.0), product of:
              0.35768002 = queryWeight, product of:
                2.6229348 = boost
                7.1475906 = idf(docFreq=94, maxDocs=44421)
                0.019078646 = queryNorm
              0.6317637 = fieldWeight in 1748, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.1475906 = idf(docFreq=94, maxDocs=44421)
                0.0625 = fieldNorm(doc=1748)
          0.16003874 = weight(abstract_txt:innovation in 1748) [ClassicSimilarity], result of:
            0.16003874 = score(doc=1748,freq=1.0), product of:
              0.39409545 = queryWeight, product of:
                3.1791441 = boost
                6.497461 = idf(docFreq=181, maxDocs=44421)
                0.019078646 = queryNorm
              0.4060913 = fieldWeight in 1748, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.497461 = idf(docFreq=181, maxDocs=44421)
                0.0625 = fieldNorm(doc=1748)
          0.112682186 = weight(abstract_txt:knowledge in 1748) [ClassicSimilarity], result of:
            0.112682186 = score(doc=1748,freq=3.0), product of:
              0.2935133 = queryWeight, product of:
                4.3380384 = boost
                3.5463927 = idf(docFreq=3480, maxDocs=44421)
                0.019078646 = queryNorm
              0.38390827 = fieldWeight in 1748, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.5463927 = idf(docFreq=3480, maxDocs=44421)
                0.0625 = fieldNorm(doc=1748)
        0.2 = coord(5/25)
  4. Huang, T.; Nie, R.; Zhao, Y.: Archival knowledge in the field of personal archiving : an exploratory study based on grounded theory (2021) 0.10
    0.098894276 = sum of:
      0.098894276 = product of:
        0.49447137 = sum of:
          0.021266537 = weight(abstract_txt:need in 1174) [ClassicSimilarity], result of:
            0.021266537 = score(doc=1174,freq=1.0), product of:
              0.08145277 = queryWeight, product of:
                1.0219918 = boost
                4.1774464 = idf(docFreq=1851, maxDocs=44421)
                0.019078646 = queryNorm
              0.2610904 = fieldWeight in 1174, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1774464 = idf(docFreq=1851, maxDocs=44421)
                0.0625 = fieldNorm(doc=1174)
          0.059303597 = weight(abstract_txt:methodology in 1174) [ClassicSimilarity], result of:
            0.059303597 = score(doc=1174,freq=2.0), product of:
              0.14661637 = queryWeight, product of:
                1.6793119 = boost
                4.5761847 = idf(docFreq=1242, maxDocs=44421)
                0.019078646 = queryNorm
              0.4044814 = fieldWeight in 1174, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.5761847 = idf(docFreq=1242, maxDocs=44421)
                0.0625 = fieldNorm(doc=1174)
          0.03809255 = weight(abstract_txt:data in 1174) [ClassicSimilarity], result of:
            0.03809255 = score(doc=1174,freq=2.0), product of:
              0.12941106 = queryWeight, product of:
                2.0368085 = boost
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.019078646 = queryNorm
              0.29435313 = fieldWeight in 1174, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.0625 = fieldNorm(doc=1174)
          0.16003874 = weight(abstract_txt:innovation in 1174) [ClassicSimilarity], result of:
            0.16003874 = score(doc=1174,freq=1.0), product of:
              0.39409545 = queryWeight, product of:
                3.1791441 = boost
                6.497461 = idf(docFreq=181, maxDocs=44421)
                0.019078646 = queryNorm
              0.4060913 = fieldWeight in 1174, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.497461 = idf(docFreq=181, maxDocs=44421)
                0.0625 = fieldNorm(doc=1174)
          0.21576996 = weight(abstract_txt:knowledge in 1174) [ClassicSimilarity], result of:
            0.21576996 = score(doc=1174,freq=11.0), product of:
              0.2935133 = queryWeight, product of:
                4.3380384 = boost
                3.5463927 = idf(docFreq=3480, maxDocs=44421)
                0.019078646 = queryNorm
              0.7351284 = fieldWeight in 1174, product of:
                3.3166249 = tf(freq=11.0), with freq of:
                  11.0 = termFreq=11.0
                3.5463927 = idf(docFreq=3480, maxDocs=44421)
                0.0625 = fieldNorm(doc=1174)
        0.2 = coord(5/25)
  5. Zhang, Y.; Zhang, G.; Zhu, D.; Lu, J.: Scientific evolutionary pathways : identifying and visualizing relationships for scientific topics (2017) 0.10
    0.09751605 = sum of:
      0.09751605 = product of:
        0.4063169 = sum of:
          0.040433325 = weight(abstract_txt:then in 4758) [ClassicSimilarity], result of:
            0.040433325 = score(doc=4758,freq=2.0), product of:
              0.09921812 = queryWeight, product of:
                1.1279504 = boost
                4.6105576 = idf(docFreq=1200, maxDocs=44421)
                0.019078646 = queryNorm
              0.40751955 = fieldWeight in 4758, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.6105576 = idf(docFreq=1200, maxDocs=44421)
                0.0625 = fieldNorm(doc=4758)
          0.07103466 = weight(abstract_txt:scientific in 4758) [ClassicSimilarity], result of:
            0.07103466 = score(doc=4758,freq=6.0), product of:
              0.10016234 = queryWeight, product of:
                1.1333048 = boost
                4.6324444 = idf(docFreq=1174, maxDocs=44421)
                0.019078646 = queryNorm
              0.7091953 = fieldWeight in 4758, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                4.6324444 = idf(docFreq=1174, maxDocs=44421)
                0.0625 = fieldNorm(doc=4758)
          0.031660523 = weight(abstract_txt:could in 4758) [ClassicSimilarity], result of:
            0.031660523 = score(doc=4758,freq=1.0), product of:
              0.10619892 = queryWeight, product of:
                1.1669562 = boost
                4.7699957 = idf(docFreq=1023, maxDocs=44421)
                0.019078646 = queryNorm
              0.29812473 = fieldWeight in 4758, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7699957 = idf(docFreq=1023, maxDocs=44421)
                0.0625 = fieldNorm(doc=4758)
          0.03809255 = weight(abstract_txt:data in 4758) [ClassicSimilarity], result of:
            0.03809255 = score(doc=4758,freq=2.0), product of:
              0.12941106 = queryWeight, product of:
                2.0368085 = boost
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.019078646 = queryNorm
              0.29435313 = fieldWeight in 4758, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.0625 = fieldNorm(doc=4758)
          0.16003874 = weight(abstract_txt:innovation in 4758) [ClassicSimilarity], result of:
            0.16003874 = score(doc=4758,freq=1.0), product of:
              0.39409545 = queryWeight, product of:
                3.1791441 = boost
                6.497461 = idf(docFreq=181, maxDocs=44421)
                0.019078646 = queryNorm
              0.4060913 = fieldWeight in 4758, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.497461 = idf(docFreq=181, maxDocs=44421)
                0.0625 = fieldNorm(doc=4758)
          0.06505709 = weight(abstract_txt:knowledge in 4758) [ClassicSimilarity], result of:
            0.06505709 = score(doc=4758,freq=1.0), product of:
              0.2935133 = queryWeight, product of:
                4.3380384 = boost
                3.5463927 = idf(docFreq=3480, maxDocs=44421)
                0.019078646 = queryNorm
              0.22164954 = fieldWeight in 4758, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.5463927 = idf(docFreq=3480, maxDocs=44421)
                0.0625 = fieldNorm(doc=4758)
        0.24 = coord(6/25)
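
The score breakdowns above appear to follow the TF-IDF arithmetic of Lucene's ClassicSimilarity: each term score is queryWeight (boost x idf x queryNorm) times fieldWeight (tf x idf x fieldNorm), with tf = sqrt(termFreq) and idf = 1 + ln(maxDocs / (docFreq + 1)), and the summed term scores are scaled by coord(matched/total). The following is a minimal Python sketch of that arithmetic, not Lucene code; the function names are hypothetical, and the input figures are copied from the listing for doc 4646.

```python
import math


def classic_idf(doc_freq, max_docs):
    """Inverse document frequency: 1 + ln(maxDocs / (docFreq + 1))."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(freq, doc_freq, max_docs, boost, query_norm, field_norm):
    """Score of one query term in one document (hypothetical helper)."""
    idf = classic_idf(doc_freq, max_docs)
    query_weight = boost * idf * query_norm            # queryWeight
    field_weight = math.sqrt(freq) * idf * field_norm  # fieldWeight, tf = sqrt(freq)
    return query_weight * field_weight


def document_score(term_scores, matched, total_clauses):
    """Sum of term scores scaled by the coordination factor coord(matched/total)."""
    return sum(term_scores) * (matched / total_clauses)


# Values taken from the listing for entry 1 (doc 4646).
MAX_DOCS = 44421
QUERY_NORM = 0.019078646
FIELD_NORM_4646 = 0.078125

# (termFreq, docFreq, boost) per matched term, as reported in the listing.
TERMS_4646 = [
    (4.0, 4320, 2.0368085),  # data
    (1.0, 442, 2.057917),    # increase
    (1.0, 94, 2.6229348),    # innovations
    (1.0, 181, 3.1791441),   # innovation
    (1.0, 3480, 4.3380384),  # knowledge
]

scores_4646 = [
    term_score(freq, doc_freq, MAX_DOCS, boost, QUERY_NORM, FIELD_NORM_4646)
    for freq, doc_freq, boost in TERMS_4646
]

# 5 of the 25 query clauses matched, hence coord(5/25) = 0.2.
total_4646 = document_score(scores_4646, matched=5, total_clauses=25)

if __name__ == "__main__":
    print(scores_4646)
    print(total_4646)
```

The results should agree with the listed figures (0.06733875 for "data" and 0.12898062 overall) up to small rounding differences, since the listing was presumably produced with single-precision floats while Python uses double precision.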