Document (#35915)

Author
Nhongkai, S.N.
Bentz, H.-J.
Title
Bilinguale Suche mittels Konzeptnetzen
Source
Effektive Information Retrieval Verfahren in Theorie und Praxis: ausgewählte und erweiterte Beiträge des Vierten Hildesheimer Evaluierungs- und Retrievalworkshop (HIER 2005), Hildesheim, 20.7.2005. Hrsg.: T. Mandl u. C. Womser-Hacker
Imprint
Konstanz : UVK Verlagsgesellschaft
Year
2006
Pages
S.203-222
Series
Schriften zur Informationswissenschaft; Bd.45
Abstract
Eine neue Methode der Volltextsuche in bilingualen Textsammlungen wird vorgestellt und anhand eines parallelen Textkorpus (Englisch-Deutsch) geprüft. Die Brücke liefern passende Wortcluster, die aus einer Kookkurrenzanalyse stammen, geliefert von der neuartigen Suchmaschine SENTRAX (Essente Extractor Engine). Diese Cluster repräsentieren Konzepte, die sich in beiden Textsammlungen finden. Die Hypothese ist, dass das Finden mittels solcher Strukturvergleiche erfolgreich möglich ist.
Theme
Computerlinguistik
Sprachretrieval
Object
SENTRAX

Similar documents (content)

  1. Glogau, R.: Suchmaschine mit Köpfchen (1996) 0.12
    0.11525625 = sum of:
      0.11525625 = product of:
        0.7203516 = sum of:
          0.08900876 = weight(abstract_txt:anhand in 4903) [ClassicSimilarity], result of:
            0.08900876 = score(doc=4903,freq=1.0), product of:
              0.10011178 = queryWeight, product of:
                5.6902003 = idf(docFreq=407, maxDocs=44421)
                0.017593719 = queryNorm
              0.8890938 = fieldWeight in 4903, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6902003 = idf(docFreq=407, maxDocs=44421)
                0.15625 = fieldNorm(doc=4903)
          0.12773292 = weight(abstract_txt:suchmaschine in 4903) [ClassicSimilarity], result of:
            0.12773292 = score(doc=4903,freq=1.0), product of:
              0.12736943 = queryWeight, product of:
                1.1279504 = boost
                6.418264 = idf(docFreq=196, maxDocs=44421)
                0.017593719 = queryNorm
              1.0028538 = fieldWeight in 4903, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.418264 = idf(docFreq=196, maxDocs=44421)
                0.15625 = fieldNorm(doc=4903)
          0.33047602 = weight(abstract_txt:volltextsuche in 4903) [ClassicSimilarity], result of:
            0.33047602 = score(doc=4903,freq=1.0), product of:
              0.24004352 = queryWeight, product of:
                1.5484686 = boost
                8.811096 = idf(docFreq=17, maxDocs=44421)
                0.017593719 = queryNorm
              1.3767338 = fieldWeight in 4903, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.811096 = idf(docFreq=17, maxDocs=44421)
                0.15625 = fieldNorm(doc=4903)
          0.17313382 = weight(abstract_txt:finden in 4903) [ClassicSimilarity], result of:
            0.17313382 = score(doc=4903,freq=1.0), product of:
              0.19654468 = queryWeight, product of:
                1.9815409 = boost
                5.6376824 = idf(docFreq=429, maxDocs=44421)
                0.017593719 = queryNorm
              0.88088787 = fieldWeight in 4903, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6376824 = idf(docFreq=429, maxDocs=44421)
                0.15625 = fieldNorm(doc=4903)
        0.16 = coord(4/25)
    
  2. Bredack, J.: Terminologieextraktion von Mehrwortgruppen in kunsthistorischen Fachtexten (2013) 0.10
    0.10460999 = sum of:
      0.10460999 = product of:
        0.3736071 = sum of:
          0.038541924 = weight(abstract_txt:anhand in 2054) [ClassicSimilarity], result of:
            0.038541924 = score(doc=2054,freq=3.0), product of:
              0.10011178 = queryWeight, product of:
                5.6902003 = idf(docFreq=407, maxDocs=44421)
                0.017593719 = queryNorm
              0.3849889 = fieldWeight in 2054, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.6902003 = idf(docFreq=407, maxDocs=44421)
                0.0390625 = fieldNorm(doc=2054)
          0.0363686 = weight(abstract_txt:möglich in 2054) [ClassicSimilarity], result of:
            0.0363686 = score(doc=2054,freq=2.0), product of:
              0.11024978 = queryWeight, product of:
                1.0494126 = boost
                5.971368 = idf(docFreq=307, maxDocs=44421)
                0.017593719 = queryNorm
              0.32987458 = fieldWeight in 2054, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.971368 = idf(docFreq=307, maxDocs=44421)
                0.0390625 = fieldNorm(doc=2054)
          0.057719007 = weight(abstract_txt:methode in 2054) [ClassicSimilarity], result of:
            0.057719007 = score(doc=2054,freq=2.0), product of:
              0.15000506 = queryWeight, product of:
                1.2240815 = boost
                6.965269 = idf(docFreq=113, maxDocs=44421)
                0.017593719 = queryNorm
              0.3847804 = fieldWeight in 2054, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.965269 = idf(docFreq=113, maxDocs=44421)
                0.0390625 = fieldNorm(doc=2054)
          0.043160405 = weight(abstract_txt:liefern in 2054) [ClassicSimilarity], result of:
            0.043160405 = score(doc=2054,freq=1.0), product of:
              0.15570182 = queryWeight, product of:
                1.2471085 = boost
                7.0962973 = idf(docFreq=99, maxDocs=44421)
                0.017593719 = queryNorm
              0.27719912 = fieldWeight in 2054, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.0962973 = idf(docFreq=99, maxDocs=44421)
                0.0390625 = fieldNorm(doc=2054)
          0.043718565 = weight(abstract_txt:erfolgreich in 2054) [ClassicSimilarity], result of:
            0.043718565 = score(doc=2054,freq=1.0), product of:
              0.15704133 = queryWeight, product of:
                1.2524614 = boost
                7.1267567 = idf(docFreq=96, maxDocs=44421)
                0.017593719 = queryNorm
              0.27838892 = fieldWeight in 2054, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.1267567 = idf(docFreq=96, maxDocs=44421)
                0.0390625 = fieldNorm(doc=2054)
          0.110815145 = weight(abstract_txt:repräsentieren in 2054) [ClassicSimilarity], result of:
            0.110815145 = score(doc=2054,freq=2.0), product of:
              0.23171782 = queryWeight, product of:
                1.5213779 = boost
                8.656945 = idf(docFreq=20, maxDocs=44421)
                0.017593719 = queryNorm
              0.47823316 = fieldWeight in 2054, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.656945 = idf(docFreq=20, maxDocs=44421)
                0.0390625 = fieldNorm(doc=2054)
          0.043283455 = weight(abstract_txt:finden in 2054) [ClassicSimilarity], result of:
            0.043283455 = score(doc=2054,freq=1.0), product of:
              0.19654468 = queryWeight, product of:
                1.9815409 = boost
                5.6376824 = idf(docFreq=429, maxDocs=44421)
                0.017593719 = queryNorm
              0.22022197 = fieldWeight in 2054, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6376824 = idf(docFreq=429, maxDocs=44421)
                0.0390625 = fieldNorm(doc=2054)
        0.28 = coord(7/25)
    
  3. Schiffhauer, N.: Microsofts Encarta ist eine zuverlässige Enzyklopädie auf CD-ROM - Die Suchfunktionen sind noch verbesserungswürdig : ¬Ein Suchspiel mit 14 Millionen Wörtern (2001) 0.07
    0.06893506 = sum of:
      0.06893506 = product of:
        0.28722942 = sum of:
          0.023030281 = weight(abstract_txt:beiden in 6683) [ClassicSimilarity], result of:
            0.023030281 = score(doc=6683,freq=1.0), product of:
              0.11886195 = queryWeight, product of:
                1.0896294 = boost
                6.2002096 = idf(docFreq=244, maxDocs=44421)
                0.017593719 = queryNorm
              0.19375655 = fieldWeight in 6683, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2002096 = idf(docFreq=244, maxDocs=44421)
                0.03125 = fieldNorm(doc=6683)
          0.046022937 = weight(abstract_txt:englisch in 6683) [ClassicSimilarity], result of:
            0.046022937 = score(doc=6683,freq=1.0), product of:
              0.18857881 = queryWeight, product of:
                1.3724731 = boost
                7.809647 = idf(docFreq=48, maxDocs=44421)
                0.017593719 = queryNorm
              0.24405147 = fieldWeight in 6683, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.809647 = idf(docFreq=48, maxDocs=44421)
                0.03125 = fieldNorm(doc=6683)
          0.057384036 = weight(abstract_txt:geprüft in 6683) [ClassicSimilarity], result of:
            0.057384036 = score(doc=6683,freq=1.0), product of:
              0.21845941 = queryWeight, product of:
                1.4772118 = boost
                8.405631 = idf(docFreq=26, maxDocs=44421)
                0.017593719 = queryNorm
              0.26267597 = fieldWeight in 6683, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.405631 = idf(docFreq=26, maxDocs=44421)
                0.03125 = fieldNorm(doc=6683)
          0.057384036 = weight(abstract_txt:passende in 6683) [ClassicSimilarity], result of:
            0.057384036 = score(doc=6683,freq=1.0), product of:
              0.21845941 = queryWeight, product of:
                1.4772118 = boost
                8.405631 = idf(docFreq=26, maxDocs=44421)
                0.017593719 = queryNorm
              0.26267597 = fieldWeight in 6683, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.405631 = idf(docFreq=26, maxDocs=44421)
                0.03125 = fieldNorm(doc=6683)
          0.06878138 = weight(abstract_txt:geliefert in 6683) [ClassicSimilarity], result of:
            0.06878138 = score(doc=6683,freq=1.0), product of:
              0.24650398 = queryWeight, product of:
                1.5691677 = boost
                8.928879 = idf(docFreq=15, maxDocs=44421)
                0.017593719 = queryNorm
              0.27902746 = fieldWeight in 6683, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.928879 = idf(docFreq=15, maxDocs=44421)
                0.03125 = fieldNorm(doc=6683)
          0.034626763 = weight(abstract_txt:finden in 6683) [ClassicSimilarity], result of:
            0.034626763 = score(doc=6683,freq=1.0), product of:
              0.19654468 = queryWeight, product of:
                1.9815409 = boost
                5.6376824 = idf(docFreq=429, maxDocs=44421)
                0.017593719 = queryNorm
              0.17617758 = fieldWeight in 6683, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6376824 = idf(docFreq=429, maxDocs=44421)
                0.03125 = fieldNorm(doc=6683)
        0.24 = coord(6/25)
    
  4. Gesell, J.: Neuauflage der Internationalen Patentklassifikation : incompatibility issues of library classification systems and subject headings in subject cataloguing (1986) 0.06
    0.06416332 = sum of:
      0.06416332 = product of:
        0.40102074 = sum of:
          0.051432967 = weight(abstract_txt:möglich in 3644) [ClassicSimilarity], result of:
            0.051432967 = score(doc=3644,freq=1.0), product of:
              0.11024978 = queryWeight, product of:
                1.0494126 = boost
                5.971368 = idf(docFreq=307, maxDocs=44421)
                0.017593719 = queryNorm
              0.4665131 = fieldWeight in 3644, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.971368 = idf(docFreq=307, maxDocs=44421)
                0.078125 = fieldNorm(doc=3644)
          0.091070324 = weight(abstract_txt:deutsch in 3644) [ClassicSimilarity], result of:
            0.091070324 = score(doc=3644,freq=1.0), product of:
              0.161362 = queryWeight, product of:
                1.269574 = boost
                7.2241306 = idf(docFreq=87, maxDocs=44421)
                0.017593719 = queryNorm
              0.5643852 = fieldWeight in 3644, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.2241306 = idf(docFreq=87, maxDocs=44421)
                0.078125 = fieldNorm(doc=3644)
          0.11505735 = weight(abstract_txt:englisch in 3644) [ClassicSimilarity], result of:
            0.11505735 = score(doc=3644,freq=1.0), product of:
              0.18857881 = queryWeight, product of:
                1.3724731 = boost
                7.809647 = idf(docFreq=48, maxDocs=44421)
                0.017593719 = queryNorm
              0.6101287 = fieldWeight in 3644, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.809647 = idf(docFreq=48, maxDocs=44421)
                0.078125 = fieldNorm(doc=3644)
          0.1434601 = weight(abstract_txt:passende in 3644) [ClassicSimilarity], result of:
            0.1434601 = score(doc=3644,freq=1.0), product of:
              0.21845941 = queryWeight, product of:
                1.4772118 = boost
                8.405631 = idf(docFreq=26, maxDocs=44421)
                0.017593719 = queryNorm
              0.65668994 = fieldWeight in 3644, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.405631 = idf(docFreq=26, maxDocs=44421)
                0.078125 = fieldNorm(doc=3644)
        0.16 = coord(4/25)
    
  5. Stock, W.G.: Informationelle Städte im 21. Jahrhundert (2011) 0.06
    0.060278416 = sum of:
      0.060278416 = product of:
        0.3767401 = sum of:
          0.054258917 = weight(abstract_txt:cluster in 511) [ClassicSimilarity], result of:
            0.054258917 = score(doc=511,freq=1.0), product of:
              0.13257779 = queryWeight, product of:
                1.1507813 = boost
                6.548176 = idf(docFreq=172, maxDocs=44421)
                0.017593719 = queryNorm
              0.409261 = fieldWeight in 511, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.548176 = idf(docFreq=172, maxDocs=44421)
                0.0625 = fieldNorm(doc=511)
          0.06683413 = weight(abstract_txt:solcher in 511) [ClassicSimilarity], result of:
            0.06683413 = score(doc=511,freq=1.0), product of:
              0.1523429 = queryWeight, product of:
                1.2335833 = boost
                7.019336 = idf(docFreq=107, maxDocs=44421)
                0.017593719 = queryNorm
              0.4387085 = fieldWeight in 511, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.019336 = idf(docFreq=107, maxDocs=44421)
                0.0625 = fieldNorm(doc=511)
          0.14056733 = weight(abstract_txt:hypothese in 511) [ClassicSimilarity], result of:
            0.14056733 = score(doc=511,freq=1.0), product of:
              0.25008038 = queryWeight, product of:
                1.5805099 = boost
                8.993418 = idf(docFreq=14, maxDocs=44421)
                0.017593719 = queryNorm
              0.5620886 = fieldWeight in 511, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.993418 = idf(docFreq=14, maxDocs=44421)
                0.0625 = fieldNorm(doc=511)
          0.115079716 = weight(abstract_txt:mittels in 511) [ClassicSimilarity], result of:
            0.115079716 = score(doc=511,freq=1.0), product of:
              0.27573964 = queryWeight, product of:
                2.347048 = boost
                6.677587 = idf(docFreq=151, maxDocs=44421)
                0.017593719 = queryNorm
              0.4173492 = fieldWeight in 511, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.677587 = idf(docFreq=151, maxDocs=44421)
                0.0625 = fieldNorm(doc=511)
        0.16 = coord(4/25)