Document (#18793)

Author
Tzeras, K.
Title
Zur Aufwandsabschätzung bei der Entwicklung eines Indexierungswörterbuches
Source
Information retrieval: GI/GMD-Workshop, Darmstadt, 23.-24.6.1991: Proceedings. Ed.: N. Fuhr
Imprint
Berlin : Springer
Year
1991
Pages
S.23-37
Series
Informatik-Fachberichte; 289
Abstract
Für die automatische Indexierung mit einem vorgegebenen Deskriptorensystem wird ein Wörterbuch benötigt, das möglichst viele Fachausdrücke des Anwendungsgebietes durch Relationen mit Deskriptoren verbindet. Werden die in einem solchen Indexierungswörterbuch erfaßten Relationen aus der Verarbeitung von Texten gewonnen, so ergibt sich eine Beziehung zwischen der Anzahl der Texte und der Größe und Leistungsfähigkeit des Wörterbuches. Die beschreibung derartiger Beziehungen ist besonders vor Beginn der Entwicklung eines automatischen Indexierungssystems von großem Interesse. H. Hüther hat sich in mehreren Arbeiten mit diesem Problem beschäftigt und verschiedene Schätzverfahren theoretische hergeleitet. Für eines der von ihm vorgeschlagenen Schätzverfahren zur Abschätzung der Größe eines Indexierungswörterbuches in Abhängigkeit von der Anzahl der zugrundeliegenden Texte werden im vorliegenden beitrag die Leistungsfähigkeit und die Anwendbarkeit untersucht
Theme
Automatisches Indexieren
Object
AIR/PHYS

Similar documents (content)

  1. Albrecht, R.: Digitale Auskunft im Verbund : Ein Jahr InfoPoint Rhein-Main (2005) 0.09
    0.08617625 = sum of:
      0.08617625 = product of:
        0.43088123 = sum of:
          0.025690531 = weight(abstract_txt:einem in 5305) [ClassicSimilarity], result of:
            0.025690531 = score(doc=5305,freq=1.0), product of:
              0.09480656 = queryWeight, product of:
                1.174341 = boost
                4.3356547 = idf(docFreq=1580, maxDocs=44421)
                0.01862042 = queryNorm
              0.27097842 = fieldWeight in 5305, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3356547 = idf(docFreq=1580, maxDocs=44421)
                0.0625 = fieldNorm(doc=5305)
          0.040132347 = weight(abstract_txt:entwicklung in 5305) [ClassicSimilarity], result of:
            0.040132347 = score(doc=5305,freq=1.0), product of:
              0.12763977 = queryWeight, product of:
                1.3625989 = boost
                5.030701 = idf(docFreq=788, maxDocs=44421)
                0.01862042 = queryNorm
              0.31441882 = fieldWeight in 5305, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.030701 = idf(docFreq=788, maxDocs=44421)
                0.0625 = fieldNorm(doc=5305)
          0.107746124 = weight(abstract_txt:anzahl in 5305) [ClassicSimilarity], result of:
            0.107746124 = score(doc=5305,freq=1.0), product of:
              0.24656084 = queryWeight, product of:
                1.8938129 = boost
                6.9919376 = idf(docFreq=110, maxDocs=44421)
                0.01862042 = queryNorm
              0.4369961 = fieldWeight in 5305, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9919376 = idf(docFreq=110, maxDocs=44421)
                0.0625 = fieldNorm(doc=5305)
          0.15255871 = weight(abstract_txt:größe in 5305) [ClassicSimilarity], result of:
            0.15255871 = score(doc=5305,freq=1.0), product of:
              0.31089544 = queryWeight, product of:
                2.1265824 = boost
                7.85132 = idf(docFreq=46, maxDocs=44421)
                0.01862042 = queryNorm
              0.4907075 = fieldWeight in 5305, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.85132 = idf(docFreq=46, maxDocs=44421)
                0.0625 = fieldNorm(doc=5305)
          0.10475353 = weight(abstract_txt:eines in 5305) [ClassicSimilarity], result of:
            0.10475353 = score(doc=5305,freq=3.0), product of:
              0.2113838 = queryWeight, product of:
                2.4798527 = boost
                4.577795 = idf(docFreq=1240, maxDocs=44421)
                0.01862042 = queryNorm
              0.49556082 = fieldWeight in 5305, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.577795 = idf(docFreq=1240, maxDocs=44421)
                0.0625 = fieldNorm(doc=5305)
        0.2 = coord(5/25)
    
  2. Meyer, A.: Begriffsrelationen im Kategoriensystem der Wikipedia : Entwicklung eines Relationeninventars zur kollaborativen Anwendung (2010) 0.08
    0.08256973 = sum of:
      0.08256973 = product of:
        0.5160608 = sum of:
          0.075071186 = weight(abstract_txt:deskriptoren in 429) [ClassicSimilarity], result of:
            0.075071186 = score(doc=429,freq=1.0), product of:
              0.15380195 = queryWeight, product of:
                1.0576475 = boost
                7.809647 = idf(docFreq=48, maxDocs=44421)
                0.01862042 = queryNorm
              0.48810294 = fieldWeight in 429, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.809647 = idf(docFreq=48, maxDocs=44421)
                0.0625 = fieldNorm(doc=429)
          0.040132347 = weight(abstract_txt:entwicklung in 429) [ClassicSimilarity], result of:
            0.040132347 = score(doc=429,freq=1.0), product of:
              0.12763977 = queryWeight, product of:
                1.3625989 = boost
                5.030701 = idf(docFreq=788, maxDocs=44421)
                0.01862042 = queryNorm
              0.31441882 = fieldWeight in 429, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.030701 = idf(docFreq=788, maxDocs=44421)
                0.0625 = fieldNorm(doc=429)
          0.26562104 = weight(abstract_txt:relationen in 429) [ClassicSimilarity], result of:
            0.26562104 = score(doc=429,freq=4.0), product of:
              0.283451 = queryWeight, product of:
                2.0305514 = boost
                7.496775 = idf(docFreq=66, maxDocs=44421)
                0.01862042 = queryNorm
              0.9370969 = fieldWeight in 429, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.496775 = idf(docFreq=66, maxDocs=44421)
                0.0625 = fieldNorm(doc=429)
          0.13523625 = weight(abstract_txt:eines in 429) [ClassicSimilarity], result of:
            0.13523625 = score(doc=429,freq=5.0), product of:
              0.2113838 = queryWeight, product of:
                2.4798527 = boost
                4.577795 = idf(docFreq=1240, maxDocs=44421)
                0.01862042 = queryNorm
              0.63976634 = fieldWeight in 429, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.577795 = idf(docFreq=1240, maxDocs=44421)
                0.0625 = fieldNorm(doc=429)
        0.16 = coord(4/25)
    
  3. Scholz, O.R.: Bild, Darstellung, Zeichen : Philosophische Theorien bildlicher Darstellung (2004) 0.07
    0.073618926 = sum of:
      0.073618926 = product of:
        0.36809462 = sum of:
          0.07931593 = weight(abstract_txt:theoretische in 2436) [ClassicSimilarity], result of:
            0.07931593 = score(doc=2436,freq=1.0), product of:
              0.13749279 = queryWeight, product of:
                7.3839793 = idf(docFreq=74, maxDocs=44421)
                0.01862042 = queryNorm
              0.57687336 = fieldWeight in 2436, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.3839793 = idf(docFreq=74, maxDocs=44421)
                0.078125 = fieldNorm(doc=2436)
          0.08156007 = weight(abstract_txt:ergibt in 2436) [ClassicSimilarity], result of:
            0.08156007 = score(doc=2436,freq=1.0), product of:
              0.14007416 = queryWeight, product of:
                1.0093436 = boost
                7.4529724 = idf(docFreq=69, maxDocs=44421)
                0.01862042 = queryNorm
              0.58226347 = fieldWeight in 2436, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.4529724 = idf(docFreq=69, maxDocs=44421)
                0.078125 = fieldNorm(doc=2436)
          0.09950611 = weight(abstract_txt:verbindet in 2436) [ClassicSimilarity], result of:
            0.09950611 = score(doc=2436,freq=1.0), product of:
              0.15993352 = queryWeight, product of:
                1.0785239 = boost
                7.963798 = idf(docFreq=41, maxDocs=44421)
                0.01862042 = queryNorm
              0.6221717 = fieldWeight in 2436, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.963798 = idf(docFreq=41, maxDocs=44421)
                0.078125 = fieldNorm(doc=2436)
          0.032113165 = weight(abstract_txt:einem in 2436) [ClassicSimilarity], result of:
            0.032113165 = score(doc=2436,freq=1.0), product of:
              0.09480656 = queryWeight, product of:
                1.174341 = boost
                4.3356547 = idf(docFreq=1580, maxDocs=44421)
                0.01862042 = queryNorm
              0.33872303 = fieldWeight in 2436, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3356547 = idf(docFreq=1580, maxDocs=44421)
                0.078125 = fieldNorm(doc=2436)
          0.07559936 = weight(abstract_txt:eines in 2436) [ClassicSimilarity], result of:
            0.07559936 = score(doc=2436,freq=1.0), product of:
              0.2113838 = queryWeight, product of:
                2.4798527 = boost
                4.577795 = idf(docFreq=1240, maxDocs=44421)
                0.01862042 = queryNorm
              0.35764024 = fieldWeight in 2436, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.577795 = idf(docFreq=1240, maxDocs=44421)
                0.078125 = fieldNorm(doc=2436)
        0.2 = coord(5/25)
    
  4. Coulon, C.-H.: ¬Die Rolle des Anpassungswissens im CBR : am Beispiel der Ausnutzung von Struktur im CBR (1996) 0.07
    0.06548334 = sum of:
      0.06548334 = product of:
        0.54569453 = sum of:
          0.13125336 = weight(abstract_txt:benötigt in 5271) [ClassicSimilarity], result of:
            0.13125336 = score(doc=5271,freq=1.0), product of:
              0.14061552 = queryWeight, product of:
                1.0112922 = boost
                7.467361 = idf(docFreq=68, maxDocs=44421)
                0.01862042 = queryNorm
              0.9334201 = fieldWeight in 5271, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.467361 = idf(docFreq=68, maxDocs=44421)
                0.125 = fieldNorm(doc=5271)
          0.2934822 = weight(abstract_txt:leistungsfähigkeit in 5271) [ClassicSimilarity], result of:
            0.2934822 = score(doc=5271,freq=1.0), product of:
              0.3029406 = queryWeight, product of:
                2.0991998 = boost
                7.750224 = idf(docFreq=51, maxDocs=44421)
                0.01862042 = queryNorm
              0.968778 = fieldWeight in 5271, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.750224 = idf(docFreq=51, maxDocs=44421)
                0.125 = fieldNorm(doc=5271)
          0.12095897 = weight(abstract_txt:eines in 5271) [ClassicSimilarity], result of:
            0.12095897 = score(doc=5271,freq=1.0), product of:
              0.2113838 = queryWeight, product of:
                2.4798527 = boost
                4.577795 = idf(docFreq=1240, maxDocs=44421)
                0.01862042 = queryNorm
              0.5722244 = fieldWeight in 5271, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.577795 = idf(docFreq=1240, maxDocs=44421)
                0.125 = fieldNorm(doc=5271)
        0.12 = coord(3/25)
    
  5. Schmitz-Esser, W.: Publikumsfragen an Literatur zur Zeitgeschichte (1993) 0.06
    0.063245885 = sum of:
      0.063245885 = product of:
        0.52704906 = sum of:
          0.22521356 = weight(abstract_txt:deskriptoren in 4410) [ClassicSimilarity], result of:
            0.22521356 = score(doc=4410,freq=1.0), product of:
              0.15380195 = queryWeight, product of:
                1.0576475 = boost
                7.809647 = idf(docFreq=48, maxDocs=44421)
                0.01862042 = queryNorm
              1.4643089 = fieldWeight in 4410, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.809647 = idf(docFreq=48, maxDocs=44421)
                0.1875 = fieldNorm(doc=4410)
          0.120397046 = weight(abstract_txt:entwicklung in 4410) [ClassicSimilarity], result of:
            0.120397046 = score(doc=4410,freq=1.0), product of:
              0.12763977 = queryWeight, product of:
                1.3625989 = boost
                5.030701 = idf(docFreq=788, maxDocs=44421)
                0.01862042 = queryNorm
              0.9432565 = fieldWeight in 4410, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.030701 = idf(docFreq=788, maxDocs=44421)
                0.1875 = fieldNorm(doc=4410)
          0.18143845 = weight(abstract_txt:eines in 4410) [ClassicSimilarity], result of:
            0.18143845 = score(doc=4410,freq=1.0), product of:
              0.2113838 = queryWeight, product of:
                2.4798527 = boost
                4.577795 = idf(docFreq=1240, maxDocs=44421)
                0.01862042 = queryNorm
              0.85833657 = fieldWeight in 4410, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.577795 = idf(docFreq=1240, maxDocs=44421)
                0.1875 = fieldNorm(doc=4410)
        0.12 = coord(3/25)