Document (#23283)

Author
Larroche-Boutet, V.
Pöhl, K.
Title
Das Nominalsyntagma : über die Nutzbarmachung eines logico-semantischen Konzeptes für dokumentarische Fragestellungen
Source
Nachrichten für Dokumentation. 44(1993) no.5, pp.269-276
Year
1993
Abstract
The paper begins by setting out the strategic decisions required when indexing large volumes of text: both the indexing method (human or automatic indexing) and the indexing language (free, controlled, or natural language) must be chosen. The SYDO-LYON research group opted for automatic full indexing in natural language. Starting from the distinction between predicative and referential parts of a text, the noun phrase (Nominalsyntagma) is defined as the smallest referential text unit. The paper then explains actualization, the phenomenon that is decisive for constituting a noun phrase, and finally points to the morphological means of recognizing noun phrases. All noun phrases in a text are extracted as its potential descriptors, and aids for users of a database that works with this indexing method are presented. In addition, the concept of anaphora (i.e. the resumption of noun phrases by pronouns) is briefly defined, its use as a means of weighting descriptor terms (by counting their frequency in the text) is shown, and morphological and syntactic rules are set up for automatically determining the noun phrase taken up by an anaphoric pronoun. Before the aims and limits of the work are discussed in closing, a difference between noun phrase and descriptor term is pointed out: the noun phrase refers to an object, which may be an individual object or a class, whereas the descriptor term always refers to a class.
Theme
Automatisches Indexieren
Computerlinguistik

Similar documents (content)

  1. Panyr, J.: Automatische Indexierung und Klassifikation (1983) 0.16
    0.16366549 = sum of:
      0.16366549 = product of:
        0.8183274 = sum of:
          0.06465737 = weight(abstract_txt:zwischen in 761) [ClassicSimilarity], result of:
            0.06465737 = score(doc=761,freq=1.0), product of:
              0.10242963 = queryWeight, product of:
                1.0958308 = boost
                5.049896 = idf(docFreq=773, maxDocs=44421)
                0.018509714 = queryNorm
              0.631237 = fieldWeight in 761, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.049896 = idf(docFreq=773, maxDocs=44421)
                0.125 = fieldNorm(doc=761)
          0.05719187 = weight(abstract_txt:wird in 761) [ClassicSimilarity], result of:
            0.05719187 = score(doc=761,freq=2.0), product of:
              0.08575449 = queryWeight, product of:
                1.228018 = boost
                3.7727013 = idf(docFreq=2775, maxDocs=44421)
                0.018509714 = queryNorm
              0.66692567 = fieldWeight in 761, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.7727013 = idf(docFreq=2775, maxDocs=44421)
                0.125 = fieldNorm(doc=761)
          0.2686197 = weight(abstract_txt:indexierung in 761) [ClassicSimilarity], result of:
            0.2686197 = score(doc=761,freq=3.0), product of:
              0.18354042 = queryWeight, product of:
                1.4668866 = boost
                6.759825 = idf(docFreq=139, maxDocs=44421)
                0.018509714 = queryNorm
              1.4635451 = fieldWeight in 761, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.759825 = idf(docFreq=139, maxDocs=44421)
                0.125 = fieldNorm(doc=761)
          0.05417663 = weight(abstract_txt:werden in 761) [ClassicSimilarity], result of:
            0.05417663 = score(doc=761,freq=1.0), product of:
              0.123557255 = queryWeight, product of:
                1.9029826 = boost
                3.507791 = idf(docFreq=3617, maxDocs=44421)
                0.018509714 = queryNorm
              0.43847388 = fieldWeight in 761, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.507791 = idf(docFreq=3617, maxDocs=44421)
                0.125 = fieldNorm(doc=761)
          0.37368184 = weight(abstract_txt:indexierungsverfahren in 761) [ClassicSimilarity], result of:
            0.37368184 = score(doc=761,freq=1.0), product of:
              0.32987413 = queryWeight, product of:
                1.9665492 = boost
                9.06241 = idf(docFreq=13, maxDocs=44421)
                0.018509714 = queryNorm
              1.1328013 = fieldWeight in 761, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.06241 = idf(docFreq=13, maxDocs=44421)
                0.125 = fieldNorm(doc=761)
        0.2 = coord(5/25)
    
  2. Fuhr, N.: Modelle im Information Retrieval (2023) 0.16
    0.1611219 = sum of:
      0.1611219 = product of:
        0.50350595 = sum of:
          0.122149974 = weight(abstract_txt:natürlichsprachige in 1801) [ClassicSimilarity], result of:
            0.122149974 = score(doc=1801,freq=1.0), product of:
              0.1972207 = queryWeight, product of:
                1.0752066 = boost
                9.909708 = idf(docFreq=5, maxDocs=44421)
                0.018509714 = queryNorm
              0.61935675 = fieldWeight in 1801, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.909708 = idf(docFreq=5, maxDocs=44421)
                0.0625 = fieldNorm(doc=1801)
          0.032328684 = weight(abstract_txt:zwischen in 1801) [ClassicSimilarity], result of:
            0.032328684 = score(doc=1801,freq=1.0), product of:
              0.10242963 = queryWeight, product of:
                1.0958308 = boost
                5.049896 = idf(docFreq=773, maxDocs=44421)
                0.018509714 = queryNorm
              0.3156185 = fieldWeight in 1801, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.049896 = idf(docFreq=773, maxDocs=44421)
                0.0625 = fieldNorm(doc=1801)
          0.04044076 = weight(abstract_txt:wird in 1801) [ClassicSimilarity], result of:
            0.04044076 = score(doc=1801,freq=4.0), product of:
              0.08575449 = queryWeight, product of:
                1.228018 = boost
                3.7727013 = idf(docFreq=2775, maxDocs=44421)
                0.018509714 = queryNorm
              0.47158766 = fieldWeight in 1801, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.7727013 = idf(docFreq=2775, maxDocs=44421)
                0.0625 = fieldNorm(doc=1801)
          0.05746941 = weight(abstract_txt:oder in 1801) [ClassicSimilarity], result of:
            0.05746941 = score(doc=1801,freq=4.0), product of:
              0.10839315 = queryWeight, product of:
                1.3806298 = boost
                4.241553 = idf(docFreq=1736, maxDocs=44421)
                0.018509714 = queryNorm
              0.5301941 = fieldWeight in 1801, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.241553 = idf(docFreq=1736, maxDocs=44421)
                0.0625 = fieldNorm(doc=1801)
          0.077543825 = weight(abstract_txt:indexierung in 1801) [ClassicSimilarity], result of:
            0.077543825 = score(doc=1801,freq=1.0), product of:
              0.18354042 = queryWeight, product of:
                1.4668866 = boost
                6.759825 = idf(docFreq=139, maxDocs=44421)
                0.018509714 = queryNorm
              0.42248908 = fieldWeight in 1801, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.759825 = idf(docFreq=139, maxDocs=44421)
                0.0625 = fieldNorm(doc=1801)
          0.036124438 = weight(abstract_txt:eines in 1801) [ClassicSimilarity], result of:
            0.036124438 = score(doc=1801,freq=1.0), product of:
              0.1262597 = queryWeight, product of:
                1.4900769 = boost
                4.577795 = idf(docFreq=1240, maxDocs=44421)
                0.018509714 = queryNorm
              0.2861122 = fieldWeight in 1801, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.577795 = idf(docFreq=1240, maxDocs=44421)
                0.0625 = fieldNorm(doc=1801)
          0.08327226 = weight(abstract_txt:automatische in 1801) [ClassicSimilarity], result of:
            0.08327226 = score(doc=1801,freq=1.0), product of:
              0.19247183 = queryWeight, product of:
                1.5021534 = boost
                6.922344 = idf(docFreq=118, maxDocs=44421)
                0.018509714 = queryNorm
              0.4326465 = fieldWeight in 1801, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.922344 = idf(docFreq=118, maxDocs=44421)
                0.0625 = fieldNorm(doc=1801)
          0.05417663 = weight(abstract_txt:werden in 1801) [ClassicSimilarity], result of:
            0.05417663 = score(doc=1801,freq=4.0), product of:
              0.123557255 = queryWeight, product of:
                1.9029826 = boost
                3.507791 = idf(docFreq=3617, maxDocs=44421)
                0.018509714 = queryNorm
              0.43847388 = fieldWeight in 1801, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.507791 = idf(docFreq=3617, maxDocs=44421)
                0.0625 = fieldNorm(doc=1801)
        0.32 = coord(8/25)
    
  3. Lepsky, K.: Automatisches Indexieren (2023) 0.16
    0.15931855 = sum of:
      0.15931855 = product of:
        0.7965927 = sum of:
          0.06095551 = weight(abstract_txt:oder in 1782) [ClassicSimilarity], result of:
            0.06095551 = score(doc=1782,freq=2.0), product of:
              0.10839315 = queryWeight, product of:
                1.3806298 = boost
                4.241553 = idf(docFreq=1736, maxDocs=44421)
                0.018509714 = queryNorm
              0.56235576 = fieldWeight in 1782, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.241553 = idf(docFreq=1736, maxDocs=44421)
                0.09375 = fieldNorm(doc=1782)
          0.2600899 = weight(abstract_txt:indexierung in 1782) [ClassicSimilarity], result of:
            0.2600899 = score(doc=1782,freq=5.0), product of:
              0.18354042 = queryWeight, product of:
                1.4668866 = boost
                6.759825 = idf(docFreq=139, maxDocs=44421)
                0.018509714 = queryNorm
              1.4170715 = fieldWeight in 1782, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                6.759825 = idf(docFreq=139, maxDocs=44421)
                0.09375 = fieldNorm(doc=1782)
          0.1249084 = weight(abstract_txt:automatische in 1782) [ClassicSimilarity], result of:
            0.1249084 = score(doc=1782,freq=1.0), product of:
              0.19247183 = queryWeight, product of:
                1.5021534 = boost
                6.922344 = idf(docFreq=118, maxDocs=44421)
                0.018509714 = queryNorm
              0.64896977 = fieldWeight in 1782, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.922344 = idf(docFreq=118, maxDocs=44421)
                0.09375 = fieldNorm(doc=1782)
          0.07037751 = weight(abstract_txt:werden in 1782) [ClassicSimilarity], result of:
            0.07037751 = score(doc=1782,freq=3.0), product of:
              0.123557255 = queryWeight, product of:
                1.9029826 = boost
                3.507791 = idf(docFreq=3617, maxDocs=44421)
                0.018509714 = queryNorm
              0.56959426 = fieldWeight in 1782, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.507791 = idf(docFreq=3617, maxDocs=44421)
                0.09375 = fieldNorm(doc=1782)
          0.28026137 = weight(abstract_txt:indexierungsverfahren in 1782) [ClassicSimilarity], result of:
            0.28026137 = score(doc=1782,freq=1.0), product of:
              0.32987413 = queryWeight, product of:
                1.9665492 = boost
                9.06241 = idf(docFreq=13, maxDocs=44421)
                0.018509714 = queryNorm
              0.849601 = fieldWeight in 1782, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.06241 = idf(docFreq=13, maxDocs=44421)
                0.09375 = fieldNorm(doc=1782)
        0.2 = coord(5/25)
    
  4. Jüngling, H.: Verbesserung der sachlichen Erschließung von Bibliotheksbeständen durch Automatisierung der DK-Nutzung (1983) 0.13
    0.12900124 = sum of:
      0.12900124 = product of:
        0.6450062 = sum of:
          0.04044076 = weight(abstract_txt:wird in 1540) [ClassicSimilarity], result of:
            0.04044076 = score(doc=1540,freq=1.0), product of:
              0.08575449 = queryWeight, product of:
                1.228018 = boost
                3.7727013 = idf(docFreq=2775, maxDocs=44421)
                0.018509714 = queryNorm
              0.47158766 = fieldWeight in 1540, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.7727013 = idf(docFreq=2775, maxDocs=44421)
                0.125 = fieldNorm(doc=1540)
          0.14691083 = weight(abstract_txt:aufgezeigt in 1540) [ClassicSimilarity], result of:
            0.14691083 = score(doc=1540,freq=1.0), product of:
              0.17703106 = queryWeight, product of:
                1.4406399 = boost
                6.6388726 = idf(docFreq=157, maxDocs=44421)
                0.018509714 = queryNorm
              0.8298591 = fieldWeight in 1540, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.6388726 = idf(docFreq=157, maxDocs=44421)
                0.125 = fieldNorm(doc=1540)
          0.12513871 = weight(abstract_txt:eines in 1540) [ClassicSimilarity], result of:
            0.12513871 = score(doc=1540,freq=3.0), product of:
              0.1262597 = queryWeight, product of:
                1.4900769 = boost
                4.577795 = idf(docFreq=1240, maxDocs=44421)
                0.018509714 = queryNorm
              0.99112165 = fieldWeight in 1540, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.577795 = idf(docFreq=1240, maxDocs=44421)
                0.125 = fieldNorm(doc=1540)
          0.2558986 = weight(abstract_txt:hingewiesen in 1540) [ClassicSimilarity], result of:
            0.2558986 = score(doc=1540,freq=1.0), product of:
              0.25628638 = queryWeight, product of:
                1.7333786 = boost
                7.9878955 = idf(docFreq=40, maxDocs=44421)
                0.018509714 = queryNorm
              0.99848694 = fieldWeight in 1540, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.9878955 = idf(docFreq=40, maxDocs=44421)
                0.125 = fieldNorm(doc=1540)
          0.076617315 = weight(abstract_txt:werden in 1540) [ClassicSimilarity], result of:
            0.076617315 = score(doc=1540,freq=2.0), product of:
              0.123557255 = queryWeight, product of:
                1.9029826 = boost
                3.507791 = idf(docFreq=3617, maxDocs=44421)
                0.018509714 = queryNorm
              0.6200957 = fieldWeight in 1540, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.507791 = idf(docFreq=3617, maxDocs=44421)
                0.125 = fieldNorm(doc=1540)
        0.2 = coord(5/25)
    
  5. Manecke, H.-J.: Klassifikation, Klassieren (2004) 0.12
    0.12446005 = sum of:
      0.12446005 = product of:
        0.51858354 = sum of:
          0.034289747 = weight(abstract_txt:zwischen in 3902) [ClassicSimilarity], result of:
            0.034289747 = score(doc=3902,freq=2.0), product of:
              0.10242963 = queryWeight, product of:
                1.0958308 = boost
                5.049896 = idf(docFreq=773, maxDocs=44421)
                0.018509714 = queryNorm
              0.33476394 = fieldWeight in 3902, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.049896 = idf(docFreq=773, maxDocs=44421)
                0.046875 = fieldNorm(doc=3902)
          0.021446953 = weight(abstract_txt:wird in 3902) [ClassicSimilarity], result of:
            0.021446953 = score(doc=3902,freq=2.0), product of:
              0.08575449 = queryWeight, product of:
                1.228018 = boost
                3.7727013 = idf(docFreq=2775, maxDocs=44421)
                0.018509714 = queryNorm
              0.25009713 = fieldWeight in 3902, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.7727013 = idf(docFreq=2775, maxDocs=44421)
                0.046875 = fieldNorm(doc=3902)
          0.021551028 = weight(abstract_txt:oder in 3902) [ClassicSimilarity], result of:
            0.021551028 = score(doc=3902,freq=1.0), product of:
              0.10839315 = queryWeight, product of:
                1.3806298 = boost
                4.241553 = idf(docFreq=1736, maxDocs=44421)
                0.018509714 = queryNorm
              0.1988228 = fieldWeight in 3902, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.241553 = idf(docFreq=1736, maxDocs=44421)
                0.046875 = fieldNorm(doc=3902)
          0.03831575 = weight(abstract_txt:eines in 3902) [ClassicSimilarity], result of:
            0.03831575 = score(doc=3902,freq=2.0), product of:
              0.1262597 = queryWeight, product of:
                1.4900769 = boost
                4.577795 = idf(docFreq=1240, maxDocs=44421)
                0.018509714 = queryNorm
              0.30346778 = fieldWeight in 3902, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.577795 = idf(docFreq=1240, maxDocs=44421)
                0.046875 = fieldNorm(doc=3902)
          0.04063247 = weight(abstract_txt:werden in 3902) [ClassicSimilarity], result of:
            0.04063247 = score(doc=3902,freq=4.0), product of:
              0.123557255 = queryWeight, product of:
                1.9029826 = boost
                3.507791 = idf(docFreq=3617, maxDocs=44421)
                0.018509714 = queryNorm
              0.3288554 = fieldWeight in 3902, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.507791 = idf(docFreq=3617, maxDocs=44421)
                0.046875 = fieldNorm(doc=3902)
          0.3623476 = weight(abstract_txt:klasse in 3902) [ClassicSimilarity], result of:
            0.3623476 = score(doc=3902,freq=7.0), product of:
              0.32487053 = queryWeight, product of:
                1.9515777 = boost
                8.993418 = idf(docFreq=14, maxDocs=44421)
                0.018509714 = queryNorm
              1.11536 = fieldWeight in 3902, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                8.993418 = idf(docFreq=14, maxDocs=44421)
                0.046875 = fieldNorm(doc=3902)
        0.24 = coord(6/25)
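
The score trees above are labelled [ClassicSimilarity], and their numbers are consistent with the classic Lucene TF-IDF arithmetic: tf = sqrt(freq), idf = 1 + ln(maxDocs / (docFreq + 1)), queryWeight = boost * idf * queryNorm, fieldWeight = tf * idf * fieldNorm, and a final coord factor (matched terms / query terms). The following is a minimal sketch that recomputes the score of the first similar document (doc=761) from the values printed in its tree. The function and variable names are illustrative assumptions, not part of the database or of Lucene's API, and the result can differ from Lucene's output in the last digits because Lucene works in single precision.

```python
import math

def idf(doc_freq: int, max_docs: int) -> float:
    # Classic Lucene idf, as implied by the idf(docFreq=..., maxDocs=...) lines above.
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def term_score(freq, doc_freq, max_docs, boost, query_norm, field_norm):
    # score = queryWeight * fieldWeight for one matching term.
    i = idf(doc_freq, max_docs)
    query_weight = boost * i * query_norm
    field_weight = math.sqrt(freq) * i * field_norm  # tf = sqrt(freq)
    return query_weight * field_weight

# Constants copied from the explain tree of doc=761.
MAX_DOCS = 44421
QUERY_NORM = 0.018509714
FIELD_NORM = 0.125

# (term, freq, docFreq, boost), copied from the same tree.
terms = [
    ("zwischen", 1.0, 773, 1.0958308),
    ("wird", 2.0, 2775, 1.228018),
    ("indexierung", 3.0, 139, 1.4668866),
    ("werden", 1.0, 3617, 1.9029826),
    ("indexierungsverfahren", 1.0, 13, 1.9665492),
]

total = sum(
    term_score(freq, df, MAX_DOCS, boost, QUERY_NORM, FIELD_NORM)
    for _, freq, df, boost in terms
)
coord = len(terms) / 25  # 5 of the 25 query terms matched
score = coord * total

print(f"sum of term scores: {total:.7f}")
print(f"final score:        {score:.7f}")
```

The two printed values can be compared with the "sum of" and top-level lines of the first tree (0.8183274 and 0.16366549). The same function applies to the other four documents once their freq, docFreq, boost, fieldNorm, and coord values are substituted.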