Document (#18733)

Author
Baumgarten, C.
Title
Probabilistische Modellierung der effizienten Informationssuche in verteilten multimedialen Dokumentbeständen durch Einschränkung des Suchraums
Source
Hypertext - Information Retrieval - Multimedia '97: Theorien, Modelle und Implementierungen integrierter elektronischer Informationssysteme. Proceedings HIM '97. Ed. by N. Fuhr et al.
Imprint
Konstanz : Universitätsverlag
Year
1997
Pages
pp. 121-134
Series
Schriften zur Informationswissenschaft; Vol. 30
Abstract
A model for information retrieval in a distributed multimedia document collection is presented. The model is based on the probability ranking principle. After individual rankings have been computed for each subcollection, they are merged step by step into a final ranking in which the documents are ordered by their probabilities of relevance. Documents (or document passages, in the case of multimedia documents) from different subcollections may be indexed with different methods. Different probabilistic methods can likewise be used to compute the subcollection-specific rankings. This supports the integration of documents of arbitrary type. Moreover, the underlying data volume is arbitrarily scalable. The model is extended by a criterion for restricting the search space in order to enable efficient retrieval. Various cost factors are taken into account.
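
The merging step mentioned in the abstract can be illustrated with a minimal Python sketch. It is not the paper's procedure, which the abstract does not spell out. It assumes that every subcollection already returns its hits sorted by descending probability of relevance, and that these probabilities are comparable across subcollections. The name `merge_rankings`, the document ids, and the probabilities are hypothetical.

```python
import heapq
from typing import Iterable, List, Tuple

# (document id, estimated probability of relevance); illustrative type only
Ranked = Tuple[str, float]


def merge_rankings(rankings: Iterable[List[Ranked]]) -> List[Ranked]:
    """Merge per-subcollection rankings into one final ranking.

    Each input list must already be sorted by descending probability.
    heapq.merge then only compares the current heads of the lists, so no
    global re-sort of all hits is needed.
    """
    return list(heapq.merge(*rankings, key=lambda item: item[1], reverse=True))


# Hypothetical hits from two subcollections indexed with different methods.
text_hits = [("t1", 0.91), ("t2", 0.40)]
image_hits = [("i1", 0.75), ("i2", 0.10)]

merged = merge_rankings([text_hits, image_hits])
for doc_id, probability in merged:
    print(doc_id, probability)
```

Because only the heads of the lists are compared, the merge could in principle stop early once enough documents have been emitted, which is one simple way to think about restricting the search space. The cost-based criterion of the paper itself is not reproduced here.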

Similar documents (content)

  1. Panyr, J.: Probabilistische Modelle in Information-Retrieval-Systemen (1986) 0.16
    0.1617194 = sum of:
      0.1617194 = product of:
        0.80859697 = sum of:
          0.041448437 = weight(abstract_txt:dabei in 1459) [ClassicSimilarity], result of:
            0.041448437 = score(doc=1459,freq=1.0), product of:
              0.080874294 = queryWeight, product of:
                1.1244681 = boost
                4.6857553 = idf(docFreq=1113, maxDocs=44421)
                0.015349129 = queryNorm
              0.51250446 = fieldWeight in 1459, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6857553 = idf(docFreq=1113, maxDocs=44421)
                0.109375 = fieldNorm(doc=1459)
          0.31081328 = weight(abstract_txt:probabilistischen in 1459) [ClassicSimilarity], result of:
            0.31081328 = score(doc=1459,freq=3.0), product of:
              0.17051177 = queryWeight, product of:
                1.154527 = boost
                9.622026 = idf(docFreq=7, maxDocs=44421)
                0.015349129 = queryNorm
              1.822826 = fieldWeight in 1459, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                9.622026 = idf(docFreq=7, maxDocs=44421)
                0.109375 = fieldNorm(doc=1459)
          0.026083251 = weight(abstract_txt:werden in 1459) [ClassicSimilarity], result of:
            0.026083251 = score(doc=1459,freq=1.0), product of:
              0.06798451 = queryWeight, product of:
                1.2626776 = boost
                3.507791 = idf(docFreq=3617, maxDocs=44421)
                0.015349129 = queryNorm
              0.38366464 = fieldWeight in 1459, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.507791 = idf(docFreq=3617, maxDocs=44421)
                0.109375 = fieldNorm(doc=1459)
          0.056205455 = weight(abstract_txt:wird in 1459) [ClassicSimilarity], result of:
            0.056205455 = score(doc=1459,freq=3.0), product of:
              0.0786407 = queryWeight, product of:
                1.3580357 = boost
                3.7727013 = idf(docFreq=2775, maxDocs=44421)
                0.015349129 = queryNorm
              0.714712 = fieldWeight in 1459, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.7727013 = idf(docFreq=2775, maxDocs=44421)
                0.109375 = fieldNorm(doc=1459)
          0.3740465 = weight(abstract_txt:probabilistische in 1459) [ClassicSimilarity], result of:
            0.3740465 = score(doc=1459,freq=1.0), product of:
              0.3505544 = queryWeight, product of:
                2.341098 = boost
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.015349129 = queryNorm
              1.0670141 = fieldWeight in 1459, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.109375 = fieldNorm(doc=1459)
        0.2 = coord(5/25)
    
  2. Seelbach, H.E.: Von der Stichwortliste zum halbautomatisch kontrollierten Wortschatz (1977) 0.11
    0.10653683 = sum of:
      0.10653683 = product of:
        0.53268415 = sum of:
          0.16387853 = weight(abstract_txt:überführt in 564) [ClassicSimilarity], result of:
            0.16387853 = score(doc=564,freq=1.0), product of:
              0.14683011 = queryWeight, product of:
                1.0713576 = boost
                8.928879 = idf(docFreq=15, maxDocs=44421)
                0.015349129 = queryNorm
              1.1161098 = fieldWeight in 564, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.928879 = idf(docFreq=15, maxDocs=44421)
                0.125 = fieldNorm(doc=564)
          0.21374084 = weight(abstract_txt:dokumentbeständen in 564) [ClassicSimilarity], result of:
            0.21374084 = score(doc=564,freq=1.0), product of:
              0.1752772 = queryWeight, product of:
                1.170549 = boost
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.015349129 = queryNorm
              1.2194446 = fieldWeight in 564, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.125 = fieldNorm(doc=564)
          0.02980943 = weight(abstract_txt:werden in 564) [ClassicSimilarity], result of:
            0.02980943 = score(doc=564,freq=1.0), product of:
              0.06798451 = queryWeight, product of:
                1.2626776 = boost
                3.507791 = idf(docFreq=3617, maxDocs=44421)
                0.015349129 = queryNorm
              0.43847388 = fieldWeight in 564, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.507791 = idf(docFreq=3617, maxDocs=44421)
                0.125 = fieldNorm(doc=564)
          0.037085984 = weight(abstract_txt:wird in 564) [ClassicSimilarity], result of:
            0.037085984 = score(doc=564,freq=1.0), product of:
              0.0786407 = queryWeight, product of:
                1.3580357 = boost
                3.7727013 = idf(docFreq=2775, maxDocs=44421)
                0.015349129 = queryNorm
              0.47158766 = fieldWeight in 564, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.7727013 = idf(docFreq=2775, maxDocs=44421)
                0.125 = fieldNorm(doc=564)
          0.08816935 = weight(abstract_txt:verfahren in 564) [ClassicSimilarity], result of:
            0.08816935 = score(doc=564,freq=1.0), product of:
              0.12237391 = queryWeight, product of:
                1.3832042 = boost
                5.7639313 = idf(docFreq=378, maxDocs=44421)
                0.015349129 = queryNorm
              0.7204914 = fieldWeight in 564, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7639313 = idf(docFreq=378, maxDocs=44421)
                0.125 = fieldNorm(doc=564)
        0.2 = coord(5/25)
    
  3. Thiel, M.: Bedingt wahrscheinliche Syntaxbäume (2006) 0.10
    0.104251586 = sum of:
      0.104251586 = product of:
        0.3723271 = sum of:
          0.013218567 = weight(abstract_txt:durch in 69) [ClassicSimilarity], result of:
            0.013218567 = score(doc=69,freq=1.0), product of:
              0.06641188 = queryWeight, product of:
                1.018978 = boost
                4.246169 = idf(docFreq=1728, maxDocs=44421)
                0.015349129 = queryNorm
              0.19903918 = fieldWeight in 69, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.246169 = idf(docFreq=1728, maxDocs=44421)
                0.046875 = fieldNorm(doc=69)
          0.076906346 = weight(abstract_txt:probabilistischen in 69) [ClassicSimilarity], result of:
            0.076906346 = score(doc=69,freq=1.0), product of:
              0.17051177 = queryWeight, product of:
                1.154527 = boost
                9.622026 = idf(docFreq=7, maxDocs=44421)
                0.015349129 = queryNorm
              0.4510325 = fieldWeight in 69, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.622026 = idf(docFreq=7, maxDocs=44421)
                0.046875 = fieldNorm(doc=69)
          0.02499597 = weight(abstract_txt:werden in 69) [ClassicSimilarity], result of:
            0.02499597 = score(doc=69,freq=5.0), product of:
              0.06798451 = queryWeight, product of:
                1.2626776 = boost
                3.507791 = idf(docFreq=3617, maxDocs=44421)
                0.015349129 = queryNorm
              0.36767155 = fieldWeight in 69, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.507791 = idf(docFreq=3617, maxDocs=44421)
                0.046875 = fieldNorm(doc=69)
          0.029771408 = weight(abstract_txt:verschiedenen in 69) [ClassicSimilarity], result of:
            0.029771408 = score(doc=69,freq=1.0), product of:
              0.11410968 = queryWeight, product of:
                1.3356822 = boost
                5.5659027 = idf(docFreq=461, maxDocs=44421)
                0.015349129 = queryNorm
              0.2609017 = fieldWeight in 69, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5659027 = idf(docFreq=461, maxDocs=44421)
                0.046875 = fieldNorm(doc=69)
          0.034065653 = weight(abstract_txt:wird in 69) [ClassicSimilarity], result of:
            0.034065653 = score(doc=69,freq=6.0), product of:
              0.0786407 = queryWeight, product of:
                1.3580357 = boost
                3.7727013 = idf(docFreq=2775, maxDocs=44421)
                0.015349129 = queryNorm
              0.43318096 = fieldWeight in 69, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                3.7727013 = idf(docFreq=2775, maxDocs=44421)
                0.046875 = fieldNorm(doc=69)
          0.033063505 = weight(abstract_txt:verfahren in 69) [ClassicSimilarity], result of:
            0.033063505 = score(doc=69,freq=1.0), product of:
              0.12237391 = queryWeight, product of:
                1.3832042 = boost
                5.7639313 = idf(docFreq=378, maxDocs=44421)
                0.015349129 = queryNorm
              0.27018428 = fieldWeight in 69, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7639313 = idf(docFreq=378, maxDocs=44421)
                0.046875 = fieldNorm(doc=69)
          0.16030563 = weight(abstract_txt:probabilistische in 69) [ClassicSimilarity], result of:
            0.16030563 = score(doc=69,freq=1.0), product of:
              0.3505544 = queryWeight, product of:
                2.341098 = boost
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.015349129 = queryNorm
              0.45729172 = fieldWeight in 69, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.046875 = fieldNorm(doc=69)
        0.28 = coord(7/25)
    
  4. Enderle, W.: Auf dem Weg zur digitalen Bibliothek : Projekte in Deutschland (1997) 0.10
    0.10408226 = sum of:
      0.10408226 = product of:
        0.4336761 = sum of:
          0.022030946 = weight(abstract_txt:durch in 2650) [ClassicSimilarity], result of:
            0.022030946 = score(doc=2650,freq=1.0), product of:
              0.06641188 = queryWeight, product of:
                1.018978 = boost
                4.246169 = idf(docFreq=1728, maxDocs=44421)
                0.015349129 = queryNorm
              0.33173198 = fieldWeight in 2650, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.246169 = idf(docFreq=1728, maxDocs=44421)
                0.078125 = fieldNorm(doc=2650)
          0.029606026 = weight(abstract_txt:dabei in 2650) [ClassicSimilarity], result of:
            0.029606026 = score(doc=2650,freq=1.0), product of:
              0.080874294 = queryWeight, product of:
                1.1244681 = boost
                4.6857553 = idf(docFreq=1113, maxDocs=44421)
                0.015349129 = queryNorm
              0.36607462 = fieldWeight in 2650, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6857553 = idf(docFreq=1113, maxDocs=44421)
                0.078125 = fieldNorm(doc=2650)
          0.032269653 = weight(abstract_txt:werden in 2650) [ClassicSimilarity], result of:
            0.032269653 = score(doc=2650,freq=3.0), product of:
              0.06798451 = queryWeight, product of:
                1.2626776 = boost
                3.507791 = idf(docFreq=3617, maxDocs=44421)
                0.015349129 = queryNorm
              0.4746619 = fieldWeight in 2650, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.507791 = idf(docFreq=3617, maxDocs=44421)
                0.078125 = fieldNorm(doc=2650)
          0.040146753 = weight(abstract_txt:wird in 2650) [ClassicSimilarity], result of:
            0.040146753 = score(doc=2650,freq=3.0), product of:
              0.0786407 = queryWeight, product of:
                1.3580357 = boost
                3.7727013 = idf(docFreq=2775, maxDocs=44421)
                0.015349129 = queryNorm
              0.5105086 = fieldWeight in 2650, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.7727013 = idf(docFreq=2775, maxDocs=44421)
                0.078125 = fieldNorm(doc=2650)
          0.1266784 = weight(abstract_txt:verteilten in 2650) [ClassicSimilarity], result of:
            0.1266784 = score(doc=2650,freq=1.0), product of:
              0.21315335 = queryWeight, product of:
                1.8255258 = boost
                7.607123 = idf(docFreq=59, maxDocs=44421)
                0.015349129 = queryNorm
              0.59430647 = fieldWeight in 2650, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.607123 = idf(docFreq=59, maxDocs=44421)
                0.078125 = fieldNorm(doc=2650)
          0.18294433 = weight(abstract_txt:dokumente in 2650) [ClassicSimilarity], result of:
            0.18294433 = score(doc=2650,freq=3.0), product of:
              0.21615222 = queryWeight, product of:
                2.2514763 = boost
                6.25473 = idf(docFreq=231, maxDocs=44421)
                0.015349129 = queryNorm
              0.846368 = fieldWeight in 2650, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.25473 = idf(docFreq=231, maxDocs=44421)
                0.078125 = fieldNorm(doc=2650)
        0.24 = coord(6/25)
    
  5. Roth, A.: Modellierung und Anwendung von Ontologien am Beispiel "Operations Research & Management Science" (2002) 0.10
    0.09592488 = sum of:
      0.09592488 = product of:
        0.34258884 = sum of:
          0.024925167 = weight(abstract_txt:durch in 11) [ClassicSimilarity], result of:
            0.024925167 = score(doc=11,freq=2.0), product of:
              0.06641188 = queryWeight, product of:
                1.018978 = boost
                4.246169 = idf(docFreq=1728, maxDocs=44421)
                0.015349129 = queryNorm
              0.37531185 = fieldWeight in 11, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.246169 = idf(docFreq=1728, maxDocs=44421)
                0.0625 = fieldNorm(doc=11)
          0.023684822 = weight(abstract_txt:dabei in 11) [ClassicSimilarity], result of:
            0.023684822 = score(doc=11,freq=1.0), product of:
              0.080874294 = queryWeight, product of:
                1.1244681 = boost
                4.6857553 = idf(docFreq=1113, maxDocs=44421)
                0.015349129 = queryNorm
              0.2928597 = fieldWeight in 11, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6857553 = idf(docFreq=1113, maxDocs=44421)
                0.0625 = fieldNorm(doc=11)
          0.03943417 = weight(abstract_txt:werden in 11) [ClassicSimilarity], result of:
            0.03943417 = score(doc=11,freq=7.0), product of:
              0.06798451 = queryWeight, product of:
                1.2626776 = boost
                3.507791 = idf(docFreq=3617, maxDocs=44421)
                0.015349129 = queryNorm
              0.5800464 = fieldWeight in 11, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                3.507791 = idf(docFreq=3617, maxDocs=44421)
                0.0625 = fieldNorm(doc=11)
          0.03969521 = weight(abstract_txt:verschiedenen in 11) [ClassicSimilarity], result of:
            0.03969521 = score(doc=11,freq=1.0), product of:
              0.11410968 = queryWeight, product of:
                1.3356822 = boost
                5.5659027 = idf(docFreq=461, maxDocs=44421)
                0.015349129 = queryNorm
              0.34786892 = fieldWeight in 11, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5659027 = idf(docFreq=461, maxDocs=44421)
                0.0625 = fieldNorm(doc=11)
          0.018542992 = weight(abstract_txt:wird in 11) [ClassicSimilarity], result of:
            0.018542992 = score(doc=11,freq=1.0), product of:
              0.0786407 = queryWeight, product of:
                1.3580357 = boost
                3.7727013 = idf(docFreq=2775, maxDocs=44421)
                0.015349129 = queryNorm
              0.23579383 = fieldWeight in 11, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.7727013 = idf(docFreq=2775, maxDocs=44421)
                0.0625 = fieldNorm(doc=11)
          0.10134273 = weight(abstract_txt:verteilten in 11) [ClassicSimilarity], result of:
            0.10134273 = score(doc=11,freq=1.0), product of:
              0.21315335 = queryWeight, product of:
                1.8255258 = boost
                7.607123 = idf(docFreq=59, maxDocs=44421)
                0.015349129 = queryNorm
              0.47544518 = fieldWeight in 11, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.607123 = idf(docFreq=59, maxDocs=44421)
                0.0625 = fieldNorm(doc=11)
          0.09496377 = weight(abstract_txt:modell in 11) [ClassicSimilarity], result of:
            0.09496377 = score(doc=11,freq=1.0), product of:
              0.23365018 = queryWeight, product of:
                2.3408337 = boost
                6.5029707 = idf(docFreq=180, maxDocs=44421)
                0.015349129 = queryNorm
              0.40643567 = fieldWeight in 11, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5029707 = idf(docFreq=180, maxDocs=44421)
                0.0625 = fieldNorm(doc=11)
        0.28 = coord(7/25)
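
The score breakdowns above are in the explain format of Lucene's ClassicSimilarity (TF-IDF). As a rough cross-check, the following Python sketch recomputes the score of the first entry (doc 1459) from the values shown in the listing. The formula `idf = 1 + ln(maxDocs / (docFreq + 1))` is an assumption: it reproduces the idf values listed here, but it may differ between Lucene versions. All function and variable names are illustrative.

```python
import math

MAX_DOCS = 44421
QUERY_NORM = 0.015349129
FIELD_NORM = 0.109375   # fieldNorm(doc=1459)
COORD = 5 / 25          # 5 of the 25 query terms matched

# term: (boost, docFreq, freq in doc 1459), copied from the listing above
TERMS = {
    "dabei": (1.1244681, 1113, 1.0),
    "probabilistischen": (1.154527, 7, 3.0),
    "werden": (1.2626776, 3617, 1.0),
    "wird": (1.3580357, 2775, 3.0),
    "probabilistische": (2.341098, 6, 1.0),
}


def idf(doc_freq: int, max_docs: int = MAX_DOCS) -> float:
    # Assumed idf formula; it matches the idf values printed in the listing.
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(boost: float, doc_freq: int, freq: float) -> float:
    term_idf = idf(doc_freq)
    query_weight = boost * term_idf * QUERY_NORM             # queryWeight
    field_weight = math.sqrt(freq) * term_idf * FIELD_NORM   # tf * idf * fieldNorm
    return query_weight * field_weight


# Final score: coord factor times the sum of the per-term scores.
score = COORD * sum(term_score(*values) for values in TERMS.values())
print(score)
```

Under these assumptions the result should come very close to the 0.1617194 listed for entry 1. Small deviations are expected because Lucene computes in single precision.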