Document (#23685)

Author
Wilhelmy, A.
Title
Phonetische Ähnlichkeitssuche in Datenbanken
Source
Bibliotheken mit und ohne Grenzen: Informationsgesellschaft und Bibliothek. Der österreichische Bibliothekartag 1990, Bregenz, 4.-8.9.1990, Vorträge und Kommissionssitzungen
Imprint
Wien : VÖB
Year
1991
Pages
S.329-338
Series
Biblos-Schriften; Bd.154
Abstract
In dialoggesteuerten Systemen zur Informationswiedergewinnung (Information Retrieval Systems, IRS) kann man - vergröbernd - das Wechselspiel zwischen Mensch und Computer als iterativen Prozess zur Erhöhung von Genauigkeit (Precision) auf der einen und Vollständigkeit (Recall) der Nachweise auf der anderen Seite verstehen. Vorgestellt wird ein maschinell anwendbares Verfahren, das auf phonologische Untersuchungen des Sprachwissenschaftlers Nikolaj S. Trubetzkoy (1890-1938) zurückgeht. In den Grundzügen kann es erheblich zur Verbesserung der Nachweisvollständigkeit beitragen. Dadurch, daß es die 'Ähnlichkeitsumgebungen' von Suchbegriffen in die Recherche mit einbezieht, zeigt es sich vor allem für Systeme mit koordinativer maschineller Indexierung als vorteilhaft. Bei alphabetischen Begriffen erweist sich die Einführung eines solchen zunächst nur auf den Benutzer hin orientierten Verfahrens auch aus technischer Sicht als günstig, da damit die Anzahl der Zugriffe bei den Suchvorgängen auch für große Datenvolumina niedrig gehalten werden kann
Theme
Retrievalalgorithmen

Similar documents (content)

  1. Munkelt, J.: Erstellung einer DNB-Retrieval-Testkollektion (2018) 0.09
    0.09351487 = sum of:
      0.09351487 = product of:
        0.46757433 = sum of:
          0.020759342 = weight(abstract_txt:auch in 310) [ClassicSimilarity], result of:
            0.020759342 = score(doc=310,freq=1.0), product of:
              0.07101303 = queryWeight, product of:
                3.7418423 = idf(docFreq=2862, maxDocs=44421)
                0.018978093 = queryNorm
              0.29233143 = fieldWeight in 310, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.7418423 = idf(docFreq=2862, maxDocs=44421)
                0.078125 = fieldNorm(doc=310)
          0.14989288 = weight(abstract_txt:verfahrens in 310) [ClassicSimilarity], result of:
            0.14989288 = score(doc=310,freq=2.0), product of:
              0.16712049 = queryWeight, product of:
                1.0847529 = boost
                8.117949 = idf(docFreq=35, maxDocs=44421)
                0.018978093 = queryNorm
              0.8969151 = fieldWeight in 310, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.117949 = idf(docFreq=35, maxDocs=44421)
                0.078125 = fieldNorm(doc=310)
          0.109435074 = weight(abstract_txt:maschinell in 310) [ClassicSimilarity], result of:
            0.109435074 = score(doc=310,freq=1.0), product of:
              0.17072222 = queryWeight, product of:
                1.0963798 = boost
                8.20496 = idf(docFreq=32, maxDocs=44421)
                0.018978093 = queryNorm
              0.6410125 = fieldWeight in 310, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.20496 = idf(docFreq=32, maxDocs=44421)
                0.078125 = fieldNorm(doc=310)
          0.13304465 = weight(abstract_txt:maschineller in 310) [ClassicSimilarity], result of:
            0.13304465 = score(doc=310,freq=1.0), product of:
              0.19446911 = queryWeight, product of:
                1.1701493 = boost
                8.757029 = idf(docFreq=18, maxDocs=44421)
                0.018978093 = queryNorm
              0.6841428 = fieldWeight in 310, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.757029 = idf(docFreq=18, maxDocs=44421)
                0.078125 = fieldNorm(doc=310)
          0.054442376 = weight(abstract_txt:kann in 310) [ClassicSimilarity], result of:
            0.054442376 = score(doc=310,freq=1.0), product of:
              0.15459098 = queryWeight, product of:
                1.8070438 = boost
                4.507782 = idf(docFreq=1330, maxDocs=44421)
                0.018978093 = queryNorm
              0.35217047 = fieldWeight in 310, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.507782 = idf(docFreq=1330, maxDocs=44421)
                0.078125 = fieldNorm(doc=310)
        0.2 = coord(5/25)
    
  2. Latour, B.: ¬Das terrestrische Manifest (2018) 0.06
    0.064818 = sum of:
      0.064818 = product of:
        0.40511253 = sum of:
          0.020759342 = weight(abstract_txt:auch in 480) [ClassicSimilarity], result of:
            0.020759342 = score(doc=480,freq=1.0), product of:
              0.07101303 = queryWeight, product of:
                3.7418423 = idf(docFreq=2862, maxDocs=44421)
                0.018978093 = queryNorm
              0.29233143 = fieldWeight in 480, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.7418423 = idf(docFreq=2862, maxDocs=44421)
                0.078125 = fieldNorm(doc=480)
          0.10388661 = weight(abstract_txt:erweist in 480) [ClassicSimilarity], result of:
            0.10388661 = score(doc=480,freq=1.0), product of:
              0.16490181 = queryWeight, product of:
                1.0775284 = boost
                8.063882 = idf(docFreq=37, maxDocs=44421)
                0.018978093 = queryNorm
              0.62999076 = fieldWeight in 480, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.063882 = idf(docFreq=37, maxDocs=44421)
                0.078125 = fieldNorm(doc=480)
          0.12092407 = weight(abstract_txt:orientierten in 480) [ClassicSimilarity], result of:
            0.12092407 = score(doc=480,freq=1.0), product of:
              0.18247114 = queryWeight, product of:
                1.1334779 = boost
                8.482592 = idf(docFreq=24, maxDocs=44421)
                0.018978093 = queryNorm
              0.66270244 = fieldWeight in 480, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.482592 = idf(docFreq=24, maxDocs=44421)
                0.078125 = fieldNorm(doc=480)
          0.1595425 = weight(abstract_txt:einbezieht in 480) [ClassicSimilarity], result of:
            0.1595425 = score(doc=480,freq=1.0), product of:
              0.21950105 = queryWeight, product of:
                1.2431808 = boost
                9.303573 = idf(docFreq=10, maxDocs=44421)
                0.018978093 = queryNorm
              0.7268416 = fieldWeight in 480, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.303573 = idf(docFreq=10, maxDocs=44421)
                0.078125 = fieldNorm(doc=480)
        0.16 = coord(4/25)
    
  3. Linden, J.; Patt-Bohlscheid, S.: Ständige Weiterentwicklung gehört zum täglichen Geschäft : Das Intranet der Hochschul- und Kreisbibliothek Bonn-Rhein-Sieg (2001) 0.05
    0.04857588 = sum of:
      0.04857588 = product of:
        0.40479898 = sum of:
          0.029063078 = weight(abstract_txt:auch in 6727) [ClassicSimilarity], result of:
            0.029063078 = score(doc=6727,freq=1.0), product of:
              0.07101303 = queryWeight, product of:
                3.7418423 = idf(docFreq=2862, maxDocs=44421)
                0.018978093 = queryNorm
              0.409264 = fieldWeight in 6727, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.7418423 = idf(docFreq=2862, maxDocs=44421)
                0.109375 = fieldNorm(doc=6727)
          0.14544126 = weight(abstract_txt:erweist in 6727) [ClassicSimilarity], result of:
            0.14544126 = score(doc=6727,freq=1.0), product of:
              0.16490181 = queryWeight, product of:
                1.0775284 = boost
                8.063882 = idf(docFreq=37, maxDocs=44421)
                0.018978093 = queryNorm
              0.8819871 = fieldWeight in 6727, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.063882 = idf(docFreq=37, maxDocs=44421)
                0.109375 = fieldNorm(doc=6727)
          0.23029466 = weight(abstract_txt:zugriffe in 6727) [ClassicSimilarity], result of:
            0.23029466 = score(doc=6727,freq=1.0), product of:
              0.22402142 = queryWeight, product of:
                1.2559165 = boost
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.018978093 = queryNorm
              1.0280029 = fieldWeight in 6727, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.109375 = fieldNorm(doc=6727)
        0.12 = coord(3/25)
    
  4. A A A A: Ich bin auch noch ein Dummie (0000) 0.05
    0.046673156 = sum of:
      0.046673156 = product of:
        0.388943 = sum of:
          0.05871628 = weight(abstract_txt:auch in 6064) [ClassicSimilarity], result of:
            0.05871628 = score(doc=6064,freq=2.0), product of:
              0.07101303 = queryWeight, product of:
                3.7418423 = idf(docFreq=2862, maxDocs=44421)
                0.018978093 = queryNorm
              0.8268381 = fieldWeight in 6064, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.7418423 = idf(docFreq=2862, maxDocs=44421)
                0.15625 = fieldNorm(doc=6064)
          0.22134197 = weight(abstract_txt:alphabetischen in 6064) [ClassicSimilarity], result of:
            0.22134197 = score(doc=6064,freq=1.0), product of:
              0.17200518 = queryWeight, product of:
                1.1004916 = boost
                8.235732 = idf(docFreq=31, maxDocs=44421)
                0.018978093 = queryNorm
              1.2868332 = fieldWeight in 6064, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.235732 = idf(docFreq=31, maxDocs=44421)
                0.15625 = fieldNorm(doc=6064)
          0.10888475 = weight(abstract_txt:kann in 6064) [ClassicSimilarity], result of:
            0.10888475 = score(doc=6064,freq=1.0), product of:
              0.15459098 = queryWeight, product of:
                1.8070438 = boost
                4.507782 = idf(docFreq=1330, maxDocs=44421)
                0.018978093 = queryNorm
              0.70434093 = fieldWeight in 6064, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.507782 = idf(docFreq=1330, maxDocs=44421)
                0.15625 = fieldNorm(doc=6064)
        0.12 = coord(3/25)
    
  5. A A A: Ich bin noch ein Dummie (0000) 0.04
    0.044609446 = sum of:
      0.044609446 = product of:
        0.3717454 = sum of:
          0.041518684 = weight(abstract_txt:auch in 2973) [ClassicSimilarity], result of:
            0.041518684 = score(doc=2973,freq=1.0), product of:
              0.07101303 = queryWeight, product of:
                3.7418423 = idf(docFreq=2862, maxDocs=44421)
                0.018978093 = queryNorm
              0.58466285 = fieldWeight in 2973, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.7418423 = idf(docFreq=2862, maxDocs=44421)
                0.15625 = fieldNorm(doc=2973)
          0.22134197 = weight(abstract_txt:alphabetischen in 2973) [ClassicSimilarity], result of:
            0.22134197 = score(doc=2973,freq=1.0), product of:
              0.17200518 = queryWeight, product of:
                1.1004916 = boost
                8.235732 = idf(docFreq=31, maxDocs=44421)
                0.018978093 = queryNorm
              1.2868332 = fieldWeight in 2973, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.235732 = idf(docFreq=31, maxDocs=44421)
                0.15625 = fieldNorm(doc=2973)
          0.10888475 = weight(abstract_txt:kann in 2973) [ClassicSimilarity], result of:
            0.10888475 = score(doc=2973,freq=1.0), product of:
              0.15459098 = queryWeight, product of:
                1.8070438 = boost
                4.507782 = idf(docFreq=1330, maxDocs=44421)
                0.018978093 = queryNorm
              0.70434093 = fieldWeight in 2973, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.507782 = idf(docFreq=1330, maxDocs=44421)
                0.15625 = fieldNorm(doc=2973)
        0.12 = coord(3/25)