Document (#43792)

Author
Hahn, U.
Title
Automatische Sprachverarbeitung
Source
Grundlagen der Informationswissenschaft. Hrsg.: Rainer Kuhlen, Dirk Lewandowski, Wolfgang Semar und Christa Womser-Hacker. 7., völlig neu gefasste Ausg
Imprint
Berlin : DeGruyter
Year
2023
Pages
S.281-292
Abstract
Dieses Kapitel gibt eine Übersicht über die maschinelle Verarbeitung natürlicher Sprachen (wie das Deutsche oder Englische; natural language - NL) durch Computer. Grundlegende Konzepte der automatischen Sprachverarbeitung (natural language processing - NLP) stammen aus der Sprachwissenschaft (s. Abschnitt 2) und sind in zunehmend selbstständiger Weise mit formalen Methoden und technischen Grundlagen der Informatik in einer eigenständigen Disziplin, der Computerlinguistik (CL; s. Abschnitte 3 und 4), verknüpft worden. Natürlichsprachliche Systeme (NatS) mit anwendungsbezogenen Funktionalitätsvorgaben bilden den Kern der informationswissenschaftlich geprägten NLP, die häufig als Sprachtechnologie oder im Deutschen auch (mittlerweile veraltet) als Informationslinguistik bezeichnet wird (s. Abschnitt 5).
Footnote
Vgl.: https://doi.org/10.1515/9783110769043.
Theme
Computerlinguistik

Similar documents (author)

  1. Hahn, G.: ¬Die Bibliothek des Wissenschaftlichen Dienstes des US-Kongresses : Eine Bibliothek in der Library of Congress (1985) 4.88
    4.881029 = sum of:
      4.881029 = weight(author_txt:hahn in 1310) [ClassicSimilarity], result of:
        4.881029 = score(doc=1310,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            7.809647 = idf(docFreq=48, maxDocs=44421)
            0.12804675 = queryNorm
          4.8810296 = fieldWeight in 1310, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            7.809647 = idf(docFreq=48, maxDocs=44421)
            0.625 = fieldNorm(doc=1310)
    
  2. Hahn, G.: ¬Die Entwicklung der Wirtschaftswissenschaften im Spiegel von Klassifikationssystemen : ein Beitrag zur Wissenschafts- und Klassifikationskunde der Nationalökonomie (1978) 4.88
    4.881029 = sum of:
      4.881029 = weight(author_txt:hahn in 1697) [ClassicSimilarity], result of:
        4.881029 = score(doc=1697,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            7.809647 = idf(docFreq=48, maxDocs=44421)
            0.12804675 = queryNorm
          4.8810296 = fieldWeight in 1697, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            7.809647 = idf(docFreq=48, maxDocs=44421)
            0.625 = fieldNorm(doc=1697)
    
  3. Hahn, G.: Sacherschließung durch Schlagwortkataloge : theoretische und praktische Fragen, dargestellt am Beispiel der Bibliotheken der Industrie- und Handelskammern (1983) 4.88
    4.881029 = sum of:
      4.881029 = weight(author_txt:hahn in 1698) [ClassicSimilarity], result of:
        4.881029 = score(doc=1698,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            7.809647 = idf(docFreq=48, maxDocs=44421)
            0.12804675 = queryNorm
          4.8810296 = fieldWeight in 1698, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            7.809647 = idf(docFreq=48, maxDocs=44421)
            0.625 = fieldNorm(doc=1698)
    
  4. Hahn, G.: ¬Die Bibliothek des Deutschen Bundestages : Informationsbasis für die parlamentarische Arbeit (1983) 4.88
    4.881029 = sum of:
      4.881029 = weight(author_txt:hahn in 1699) [ClassicSimilarity], result of:
        4.881029 = score(doc=1699,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            7.809647 = idf(docFreq=48, maxDocs=44421)
            0.12804675 = queryNorm
          4.8810296 = fieldWeight in 1699, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            7.809647 = idf(docFreq=48, maxDocs=44421)
            0.625 = fieldNorm(doc=1699)
    
  5. Hahn, G.: Information und Dokumentation in der Bibliothek des Deutschen Bundestages : ein Beispiel der Praxis für die Einheit bibliothekarischer und dokumentarischer Prinzipien (1978-79) 4.88
    4.881029 = sum of:
      4.881029 = weight(author_txt:hahn in 1700) [ClassicSimilarity], result of:
        4.881029 = score(doc=1700,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            7.809647 = idf(docFreq=48, maxDocs=44421)
            0.12804675 = queryNorm
          4.8810296 = fieldWeight in 1700, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            7.809647 = idf(docFreq=48, maxDocs=44421)
            0.625 = fieldNorm(doc=1700)
    

Similar documents (content)

  1. Sprachtechnologie : ein Überblick (2012) 0.50
    0.50016963 = sum of:
      0.50016963 = product of:
        1.7863201 = sum of:
          0.15555736 = weight(abstract_txt:maschinelle in 2750) [ClassicSimilarity], result of:
            0.15555736 = score(doc=2750,freq=4.0), product of:
              0.15892932 = queryWeight, product of:
                1.0804659 = boost
                7.8302665 = idf(docFreq=47, maxDocs=44421)
                0.018785225 = queryNorm
              0.9787833 = fieldWeight in 2750, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.8302665 = idf(docFreq=47, maxDocs=44421)
                0.0625 = fieldNorm(doc=2750)
          0.03325007 = weight(abstract_txt:language in 2750) [ClassicSimilarity], result of:
            0.03325007 = score(doc=2750,freq=2.0), product of:
              0.09019006 = queryWeight, product of:
                1.1510744 = boost
                4.1709876 = idf(docFreq=1863, maxDocs=44421)
                0.018785225 = queryNorm
              0.3686667 = fieldWeight in 2750, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.1709876 = idf(docFreq=1863, maxDocs=44421)
                0.0625 = fieldNorm(doc=2750)
          0.042824883 = weight(abstract_txt:oder in 2750) [ClassicSimilarity], result of:
            0.042824883 = score(doc=2750,freq=3.0), product of:
              0.093267575 = queryWeight, product of:
                1.1705484 = boost
                4.241553 = idf(docFreq=1736, maxDocs=44421)
                0.018785225 = queryNorm
              0.45916155 = fieldWeight in 2750, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.241553 = idf(docFreq=1736, maxDocs=44421)
                0.0625 = fieldNorm(doc=2750)
          0.20063266 = weight(abstract_txt:computerlinguistik in 2750) [ClassicSimilarity], result of:
            0.20063266 = score(doc=2750,freq=4.0), product of:
              0.18831202 = queryWeight, product of:
                1.1761104 = boost
                8.523414 = idf(docFreq=23, maxDocs=44421)
                0.018785225 = queryNorm
              1.0654267 = fieldWeight in 2750, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                8.523414 = idf(docFreq=23, maxDocs=44421)
                0.0625 = fieldNorm(doc=2750)
          1.0146767 = weight(title_txt:sprachtechnologie in 2750) [ClassicSimilarity], result of:
            1.0146767 = score(doc=2750,freq=1.0), product of:
              0.22018552 = queryWeight, product of:
                1.271755 = boost
                9.216561 = idf(docFreq=11, maxDocs=44421)
                0.018785225 = queryNorm
              4.6082807 = fieldWeight in 2750, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.216561 = idf(docFreq=11, maxDocs=44421)
                0.5 = fieldNorm(doc=2750)
          0.059698377 = weight(abstract_txt:natural in 2750) [ClassicSimilarity], result of:
            0.059698377 = score(doc=2750,freq=2.0), product of:
              0.13323101 = queryWeight, product of:
                1.399029 = boost
                5.0694656 = idf(docFreq=758, maxDocs=44421)
                0.018785225 = queryNorm
              0.44808167 = fieldWeight in 2750, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.0694656 = idf(docFreq=758, maxDocs=44421)
                0.0625 = fieldNorm(doc=2750)
          0.2796801 = weight(abstract_txt:sprachverarbeitung in 2750) [ClassicSimilarity], result of:
            0.2796801 = score(doc=2750,freq=2.0), product of:
              0.37302506 = queryWeight, product of:
                2.340955 = boost
                8.482592 = idf(docFreq=24, maxDocs=44421)
                0.018785225 = queryNorm
              0.74976224 = fieldWeight in 2750, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.482592 = idf(docFreq=24, maxDocs=44421)
                0.0625 = fieldNorm(doc=2750)
        0.28 = coord(7/25)
    
  2. Computerlinguistik und Sprachtechnologie : Eine Einführung (2010) 0.35
    0.34809658 = sum of:
      0.34809658 = product of:
        1.4504025 = sum of:
          0.087205194 = weight(abstract_txt:verarbeitung in 2735) [ClassicSimilarity], result of:
            0.087205194 = score(doc=2735,freq=2.0), product of:
              0.13613878 = queryWeight, product of:
                7.2471204 = idf(docFreq=85, maxDocs=44421)
                0.018785225 = queryNorm
              0.640561 = fieldWeight in 2735, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.2471204 = idf(docFreq=85, maxDocs=44421)
                0.0625 = fieldNorm(doc=2735)
          0.06908927 = weight(abstract_txt:formalen in 2735) [ClassicSimilarity], result of:
            0.06908927 = score(doc=2735,freq=1.0), product of:
              0.14686017 = queryWeight, product of:
                1.0386305 = boost
                7.5270805 = idf(docFreq=64, maxDocs=44421)
                0.018785225 = queryNorm
              0.47044253 = fieldWeight in 2735, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5270805 = idf(docFreq=64, maxDocs=44421)
                0.0625 = fieldNorm(doc=2735)
          0.092641935 = weight(abstract_txt:natürlicher in 2735) [ClassicSimilarity], result of:
            0.092641935 = score(doc=2735,freq=1.0), product of:
              0.17858104 = queryWeight, product of:
                1.1453197 = boost
                8.30027 = idf(docFreq=29, maxDocs=44421)
                0.018785225 = queryNorm
              0.5187669 = fieldWeight in 2735, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.30027 = idf(docFreq=29, maxDocs=44421)
                0.0625 = fieldNorm(doc=2735)
          0.20063266 = weight(abstract_txt:computerlinguistik in 2735) [ClassicSimilarity], result of:
            0.20063266 = score(doc=2735,freq=4.0), product of:
              0.18831202 = queryWeight, product of:
                1.1761104 = boost
                8.523414 = idf(docFreq=23, maxDocs=44421)
                0.018785225 = queryNorm
              1.0654267 = fieldWeight in 2735, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                8.523414 = idf(docFreq=23, maxDocs=44421)
                0.0625 = fieldNorm(doc=2735)
          0.11299141 = weight(abstract_txt:eigenständigen in 2735) [ClassicSimilarity], result of:
            0.11299141 = score(doc=2735,freq=1.0), product of:
              0.20385776 = queryWeight, product of:
                1.2236936 = boost
                8.868255 = idf(docFreq=16, maxDocs=44421)
                0.018785225 = queryNorm
              0.5542659 = fieldWeight in 2735, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.868255 = idf(docFreq=16, maxDocs=44421)
                0.0625 = fieldNorm(doc=2735)
          0.8878421 = weight(title_txt:sprachtechnologie in 2735) [ClassicSimilarity], result of:
            0.8878421 = score(doc=2735,freq=1.0), product of:
              0.22018552 = queryWeight, product of:
                1.271755 = boost
                9.216561 = idf(docFreq=11, maxDocs=44421)
                0.018785225 = queryNorm
              4.0322456 = fieldWeight in 2735, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.216561 = idf(docFreq=11, maxDocs=44421)
                0.4375 = fieldNorm(doc=2735)
        0.24 = coord(6/25)
    
  3. Ludwig, B.; Reischer, J.: Informationslinguistik in Regensburg (2012) 0.29
    0.28997177 = sum of:
      0.28997177 = product of:
        1.8123236 = sum of:
          0.13611269 = weight(abstract_txt:maschinelle in 1555) [ClassicSimilarity], result of:
            0.13611269 = score(doc=1555,freq=1.0), product of:
              0.15892932 = queryWeight, product of:
                1.0804659 = boost
                7.8302665 = idf(docFreq=47, maxDocs=44421)
                0.018785225 = queryNorm
              0.8564354 = fieldWeight in 1555, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.8302665 = idf(docFreq=47, maxDocs=44421)
                0.109375 = fieldNorm(doc=1555)
          0.17555358 = weight(abstract_txt:computerlinguistik in 1555) [ClassicSimilarity], result of:
            0.17555358 = score(doc=1555,freq=1.0), product of:
              0.18831202 = queryWeight, product of:
                1.1761104 = boost
                8.523414 = idf(docFreq=23, maxDocs=44421)
                0.018785225 = queryNorm
              0.93224835 = fieldWeight in 1555, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.523414 = idf(docFreq=23, maxDocs=44421)
                0.109375 = fieldNorm(doc=1555)
          1.1545708 = weight(title_txt:informationslinguistik in 1555) [ClassicSimilarity], result of:
            1.1545708 = score(doc=1555,freq=1.0), product of:
              0.23998496 = queryWeight, product of:
                1.3277034 = boost
                9.622026 = idf(docFreq=7, maxDocs=44421)
                0.018785225 = queryNorm
              4.811013 = fieldWeight in 1555, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.622026 = idf(docFreq=7, maxDocs=44421)
                0.5 = fieldNorm(doc=1555)
          0.3460865 = weight(abstract_txt:sprachverarbeitung in 1555) [ClassicSimilarity], result of:
            0.3460865 = score(doc=1555,freq=1.0), product of:
              0.37302506 = queryWeight, product of:
                2.340955 = boost
                8.482592 = idf(docFreq=24, maxDocs=44421)
                0.018785225 = queryNorm
              0.9277835 = fieldWeight in 1555, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.482592 = idf(docFreq=24, maxDocs=44421)
                0.109375 = fieldNorm(doc=1555)
        0.16 = coord(4/25)
    
  4. Computerlinguistik und Sprachtechnologie : Eine Einführung (2001) 0.26
    0.2621063 = sum of:
      0.2621063 = product of:
        1.3105315 = sum of:
          0.087205194 = weight(abstract_txt:verarbeitung in 2749) [ClassicSimilarity], result of:
            0.087205194 = score(doc=2749,freq=2.0), product of:
              0.13613878 = queryWeight, product of:
                7.2471204 = idf(docFreq=85, maxDocs=44421)
                0.018785225 = queryNorm
              0.640561 = fieldWeight in 2749, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.2471204 = idf(docFreq=85, maxDocs=44421)
                0.0625 = fieldNorm(doc=2749)
          0.06908927 = weight(abstract_txt:formalen in 2749) [ClassicSimilarity], result of:
            0.06908927 = score(doc=2749,freq=1.0), product of:
              0.14686017 = queryWeight, product of:
                1.0386305 = boost
                7.5270805 = idf(docFreq=64, maxDocs=44421)
                0.018785225 = queryNorm
              0.47044253 = fieldWeight in 2749, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5270805 = idf(docFreq=64, maxDocs=44421)
                0.0625 = fieldNorm(doc=2749)
          0.092641935 = weight(abstract_txt:natürlicher in 2749) [ClassicSimilarity], result of:
            0.092641935 = score(doc=2749,freq=1.0), product of:
              0.17858104 = queryWeight, product of:
                1.1453197 = boost
                8.30027 = idf(docFreq=29, maxDocs=44421)
                0.018785225 = queryNorm
              0.5187669 = fieldWeight in 2749, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.30027 = idf(docFreq=29, maxDocs=44421)
                0.0625 = fieldNorm(doc=2749)
          0.17375298 = weight(abstract_txt:computerlinguistik in 2749) [ClassicSimilarity], result of:
            0.17375298 = score(doc=2749,freq=3.0), product of:
              0.18831202 = queryWeight, product of:
                1.1761104 = boost
                8.523414 = idf(docFreq=23, maxDocs=44421)
                0.018785225 = queryNorm
              0.9226866 = fieldWeight in 2749, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.523414 = idf(docFreq=23, maxDocs=44421)
                0.0625 = fieldNorm(doc=2749)
          0.8878421 = weight(title_txt:sprachtechnologie in 2749) [ClassicSimilarity], result of:
            0.8878421 = score(doc=2749,freq=1.0), product of:
              0.22018552 = queryWeight, product of:
                1.271755 = boost
                9.216561 = idf(docFreq=11, maxDocs=44421)
                0.018785225 = queryNorm
              4.0322456 = fieldWeight in 2749, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.216561 = idf(docFreq=11, maxDocs=44421)
                0.4375 = fieldNorm(doc=2749)
        0.2 = coord(5/25)
    
  5. Hausser, R.: Grundlagen der Computerlinguistik : Mensch-Maschine-Kommunikation in natürlicher Sprache; mit 772 Übungen (2000) 0.11
    0.10763848 = sum of:
      0.10763848 = product of:
        0.8969873 = sum of:
          0.20063266 = weight(abstract_txt:computerlinguistik in 352) [ClassicSimilarity], result of:
            0.20063266 = score(doc=352,freq=1.0), product of:
              0.18831202 = queryWeight, product of:
                1.1761104 = boost
                8.523414 = idf(docFreq=23, maxDocs=44421)
                0.018785225 = queryNorm
              1.0654267 = fieldWeight in 352, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.523414 = idf(docFreq=23, maxDocs=44421)
                0.125 = fieldNorm(doc=352)
          0.30082723 = weight(abstract_txt:natürlichsprachliche in 352) [ClassicSimilarity], result of:
            0.30082723 = score(doc=352,freq=1.0), product of:
              0.246692 = queryWeight, product of:
                1.3461287 = boost
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.018785225 = queryNorm
              1.2194446 = fieldWeight in 352, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.125 = fieldNorm(doc=352)
          0.3955274 = weight(abstract_txt:sprachverarbeitung in 352) [ClassicSimilarity], result of:
            0.3955274 = score(doc=352,freq=1.0), product of:
              0.37302506 = queryWeight, product of:
                2.340955 = boost
                8.482592 = idf(docFreq=24, maxDocs=44421)
                0.018785225 = queryNorm
              1.060324 = fieldWeight in 352, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.482592 = idf(docFreq=24, maxDocs=44421)
                0.125 = fieldNorm(doc=352)
        0.12 = coord(3/25)