Document (#39569)

Author
Bauckhage, C.
Title
Moderne Textanalyse : neues Wissen für intelligente Lösungen
Source
https://login.mailingwork.de/public/a_5668_LVrTK/file/data/1125_Textanalyse_Christian-Bauckhage.pdf
Year
2016
Abstract
Im Zuge der immer größeren Verfügbarkeit von Daten (Big Data) und rasanter Fortschritte im Daten-basierten maschinellen Lernen haben wir in den letzten Jahren Durchbrüche in der künstlichen Intelligenz erlebt. Dieser Vortrag beleuchtet diese Entwicklungen insbesondere im Hinblick auf die automatische Analyse von Textdaten. Anhand einfacher Beispiele illustrieren wir, wie moderne Textanalyse abläuft und zeigen wiederum anhand von Beispielen, welche praktischen Anwendungsmöglichkeiten sich heutzutage in Branchen wie dem Verlagswesen, der Finanzindustrie oder dem Consulting ergeben.
Content
Folien der Präsentation anlässlich des GENIOS Datenbankfrühstücks 2016, 19. Oktober 2016.
Theme
Wissensrepräsentation
Data Mining
Field
Informatik

Similar documents (content)

  1. Giesselbach, S.; Estler-Ziegler, T.: Dokumente schneller analysieren mit Künstlicher Intelligenz (2021) 0.07
    0.073933646 = sum of:
      0.073933646 = product of:
        0.46208528 = sum of:
          0.063097 = weight(abstract_txt:vortrag in 1129) [ClassicSimilarity], result of:
            0.063097 = score(doc=1129,freq=1.0), product of:
              0.13816196 = queryWeight, product of:
                1.0114738 = boost
                7.3070183 = idf(docFreq=80, maxDocs=44421)
                0.018693632 = queryNorm
              0.45668864 = fieldWeight in 1129, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.3070183 = idf(docFreq=80, maxDocs=44421)
                0.0625 = fieldNorm(doc=1129)
          0.078918725 = weight(abstract_txt:basierten in 1129) [ClassicSimilarity], result of:
            0.078918725 = score(doc=1129,freq=1.0), product of:
              0.16038708 = queryWeight, product of:
                1.0897956 = boost
                7.872826 = idf(docFreq=45, maxDocs=44421)
                0.018693632 = queryNorm
              0.49205163 = fieldWeight in 1129, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.872826 = idf(docFreq=45, maxDocs=44421)
                0.0625 = fieldNorm(doc=1129)
          0.05959376 = weight(abstract_txt:anhand in 1129) [ClassicSimilarity], result of:
            0.05959376 = score(doc=1129,freq=1.0), product of:
              0.16756882 = queryWeight, product of:
                1.5753316 = boost
                5.6902003 = idf(docFreq=407, maxDocs=44421)
                0.018693632 = queryNorm
              0.35563752 = fieldWeight in 1129, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6902003 = idf(docFreq=407, maxDocs=44421)
                0.0625 = fieldNorm(doc=1129)
          0.26047578 = weight(abstract_txt:textanalyse in 1129) [ClassicSimilarity], result of:
            0.26047578 = score(doc=1129,freq=1.0), product of:
              0.4479583 = queryWeight, product of:
                2.5756934 = boost
                9.303573 = idf(docFreq=10, maxDocs=44421)
                0.018693632 = queryNorm
              0.5814733 = fieldWeight in 1129, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.303573 = idf(docFreq=10, maxDocs=44421)
                0.0625 = fieldNorm(doc=1129)
        0.16 = coord(4/25)
    
  2. Boltzendahl, S.: Ontologien in digitalen Bibliotheken unter dem Schwerpunkt Inhaltserschliessung und Recherche (2004) 0.07
    0.07138632 = sum of:
      0.07138632 = product of:
        0.3569316 = sum of:
          0.064672716 = weight(abstract_txt:künstlichen in 2414) [ClassicSimilarity], result of:
            0.064672716 = score(doc=2414,freq=2.0), product of:
              0.13504523 = queryWeight, product of:
                7.2241306 = idf(docFreq=87, maxDocs=44421)
                0.018693632 = queryNorm
              0.4788967 = fieldWeight in 2414, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.2241306 = idf(docFreq=87, maxDocs=44421)
                0.046875 = fieldNorm(doc=2414)
          0.09177052 = weight(abstract_txt:intelligente in 2414) [ClassicSimilarity], result of:
            0.09177052 = score(doc=2414,freq=2.0), product of:
              0.17052995 = queryWeight, product of:
                1.1237267 = boost
                8.117949 = idf(docFreq=35, maxDocs=44421)
                0.018693632 = queryNorm
              0.538149 = fieldWeight in 2414, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.117949 = idf(docFreq=35, maxDocs=44421)
                0.046875 = fieldNorm(doc=2414)
          0.08459872 = weight(abstract_txt:anwendungsmöglichkeiten in 2414) [ClassicSimilarity], result of:
            0.08459872 = score(doc=2414,freq=1.0), product of:
              0.20350936 = queryWeight, product of:
                1.2275878 = boost
                8.868255 = idf(docFreq=16, maxDocs=44421)
                0.018693632 = queryNorm
              0.41569942 = fieldWeight in 2414, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.868255 = idf(docFreq=16, maxDocs=44421)
                0.046875 = fieldNorm(doc=2414)
          0.035124056 = weight(abstract_txt:daten in 2414) [ClassicSimilarity], result of:
            0.035124056 = score(doc=2414,freq=1.0), product of:
              0.14269923 = queryWeight, product of:
                1.4537381 = boost
                5.250997 = idf(docFreq=632, maxDocs=44421)
                0.018693632 = queryNorm
              0.24614048 = fieldWeight in 2414, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.250997 = idf(docFreq=632, maxDocs=44421)
                0.046875 = fieldNorm(doc=2414)
          0.08076557 = weight(abstract_txt:moderne in 2414) [ClassicSimilarity], result of:
            0.08076557 = score(doc=2414,freq=1.0), product of:
              0.24860089 = queryWeight, product of:
                1.9187866 = boost
                6.930783 = idf(docFreq=117, maxDocs=44421)
                0.018693632 = queryNorm
              0.32488045 = fieldWeight in 2414, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.930783 = idf(docFreq=117, maxDocs=44421)
                0.046875 = fieldNorm(doc=2414)
        0.2 = coord(5/25)
    
  3. Sack, H.: Hybride Künstliche Intelligenz in der automatisierten Inhaltserschließung (2021) 0.06
    0.05786479 = sum of:
      0.05786479 = product of:
        0.36165494 = sum of:
          0.107787855 = weight(abstract_txt:künstlichen in 1373) [ClassicSimilarity], result of:
            0.107787855 = score(doc=1373,freq=2.0), product of:
              0.13504523 = queryWeight, product of:
                7.2241306 = idf(docFreq=87, maxDocs=44421)
                0.018693632 = queryNorm
              0.79816115 = fieldWeight in 1373, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.2241306 = idf(docFreq=87, maxDocs=44421)
                0.078125 = fieldNorm(doc=1373)
          0.08729234 = weight(abstract_txt:maschinellen in 1373) [ClassicSimilarity], result of:
            0.08729234 = score(doc=1373,freq=1.0), product of:
              0.14782916 = queryWeight, product of:
                1.0462619 = boost
                7.558333 = idf(docFreq=62, maxDocs=44421)
                0.018693632 = queryNorm
              0.59049475 = fieldWeight in 1373, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.558333 = idf(docFreq=62, maxDocs=44421)
                0.078125 = fieldNorm(doc=1373)
          0.09208256 = weight(abstract_txt:fortschritte in 1373) [ClassicSimilarity], result of:
            0.09208256 = score(doc=1373,freq=1.0), product of:
              0.153189 = queryWeight, product of:
                1.0650603 = boost
                7.694134 = idf(docFreq=54, maxDocs=44421)
                0.018693632 = queryNorm
              0.60110426 = fieldWeight in 1373, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.694134 = idf(docFreq=54, maxDocs=44421)
                0.078125 = fieldNorm(doc=1373)
          0.0744922 = weight(abstract_txt:anhand in 1373) [ClassicSimilarity], result of:
            0.0744922 = score(doc=1373,freq=1.0), product of:
              0.16756882 = queryWeight, product of:
                1.5753316 = boost
                5.6902003 = idf(docFreq=407, maxDocs=44421)
                0.018693632 = queryNorm
              0.4445469 = fieldWeight in 1373, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6902003 = idf(docFreq=407, maxDocs=44421)
                0.078125 = fieldNorm(doc=1373)
        0.16 = coord(4/25)
    
  4. Eckert, K.: Linked Open Projects : Nachnutzung von Projektergebnissen als Linked Data (2010) 0.04
    0.044910394 = sum of:
      0.044910394 = product of:
        0.3742533 = sum of:
          0.12801202 = weight(abstract_txt:einfacher in 278) [ClassicSimilarity], result of:
            0.12801202 = score(doc=278,freq=1.0), product of:
              0.15247238 = queryWeight, product of:
                1.062566 = boost
                7.676116 = idf(docFreq=55, maxDocs=44421)
                0.018693632 = queryNorm
              0.8395752 = fieldWeight in 278, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.676116 = idf(docFreq=55, maxDocs=44421)
                0.109375 = fieldNorm(doc=278)
          0.14195219 = weight(abstract_txt:daten in 278) [ClassicSimilarity], result of:
            0.14195219 = score(doc=278,freq=3.0), product of:
              0.14269923 = queryWeight, product of:
                1.4537381 = boost
                5.250997 = idf(docFreq=632, maxDocs=44421)
                0.018693632 = queryNorm
              0.9947649 = fieldWeight in 278, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.250997 = idf(docFreq=632, maxDocs=44421)
                0.109375 = fieldNorm(doc=278)
          0.10428908 = weight(abstract_txt:anhand in 278) [ClassicSimilarity], result of:
            0.10428908 = score(doc=278,freq=1.0), product of:
              0.16756882 = queryWeight, product of:
                1.5753316 = boost
                5.6902003 = idf(docFreq=407, maxDocs=44421)
                0.018693632 = queryNorm
              0.62236565 = fieldWeight in 278, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6902003 = idf(docFreq=407, maxDocs=44421)
                0.109375 = fieldNorm(doc=278)
        0.12 = coord(3/25)
    
  5. Hartmann, S.; Haffner, A.: Linked-RDA-Data in der Praxis (2010) 0.04
    0.039673474 = sum of:
      0.039673474 = product of:
        0.3306123 = sum of:
          0.11041974 = weight(abstract_txt:vortrag in 2679) [ClassicSimilarity], result of:
            0.11041974 = score(doc=2679,freq=1.0), product of:
              0.13816196 = queryWeight, product of:
                1.0114738 = boost
                7.3070183 = idf(docFreq=80, maxDocs=44421)
                0.018693632 = queryNorm
              0.7992051 = fieldWeight in 2679, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.3070183 = idf(docFreq=80, maxDocs=44421)
                0.109375 = fieldNorm(doc=2679)
          0.11590347 = weight(abstract_txt:daten in 2679) [ClassicSimilarity], result of:
            0.11590347 = score(doc=2679,freq=2.0), product of:
              0.14269923 = queryWeight, product of:
                1.4537381 = boost
                5.250997 = idf(docFreq=632, maxDocs=44421)
                0.018693632 = queryNorm
              0.8122221 = fieldWeight in 2679, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.250997 = idf(docFreq=632, maxDocs=44421)
                0.109375 = fieldNorm(doc=2679)
          0.10428908 = weight(abstract_txt:anhand in 2679) [ClassicSimilarity], result of:
            0.10428908 = score(doc=2679,freq=1.0), product of:
              0.16756882 = queryWeight, product of:
                1.5753316 = boost
                5.6902003 = idf(docFreq=407, maxDocs=44421)
                0.018693632 = queryNorm
              0.62236565 = fieldWeight in 2679, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6902003 = idf(docFreq=407, maxDocs=44421)
                0.109375 = fieldNorm(doc=2679)
        0.12 = coord(3/25)