Document (#685)

Author
Faraj, N.
Title
Analyse d'une methode d'indexation automatique basée sur une analyse syntaxique de texte
Source
Canadian journal of information and library science. 21(1996) no.1, S.1-21
Year
1996
Abstract
Evaluates an automatic indexing method based on syntactical text analysis combined with statistical analysis. Tests many combinations for the choice of term categories and weighting methods. The experiment, conducted on a software engineering corpus, shows systematic improvement in the use of syntactic term phrases compared to using only individual words as index terms
Footnote
Übers. d. Titels: Analysis of an automatic indexing method based on syntactic analysis of text
Theme
Automatisches Indexieren

Similar documents (content)

  1. Coret, A.; Menon, B.; Schibler, D.; Terrasse, C.: ¬Un système d'indexation structurée à l'INIST : bilan d'une étude préalable (1994) 0.18
    0.18140268 = sum of:
      0.18140268 = product of:
        2.2675335 = sum of:
          1.112823 = weight(title_txt:d'indexation in 371) [ClassicSimilarity], result of:
            1.112823 = score(doc=371,freq=1.0), product of:
              0.3746783 = queryWeight, product of:
                1.8355383 = boost
                9.504243 = idf(docFreq=8, maxDocs=44421)
                0.021477196 = queryNorm
              2.9700758 = fieldWeight in 371, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.504243 = idf(docFreq=8, maxDocs=44421)
                0.3125 = fieldNorm(doc=371)
          1.1547105 = weight(title_txt:d'une in 371) [ClassicSimilarity], result of:
            1.1547105 = score(doc=371,freq=1.0), product of:
              0.3840224 = queryWeight, product of:
                1.8582855 = boost
                9.622026 = idf(docFreq=7, maxDocs=44421)
                0.021477196 = queryNorm
              3.0068831 = fieldWeight in 371, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.622026 = idf(docFreq=7, maxDocs=44421)
                0.3125 = fieldNorm(doc=371)
        0.08 = coord(2/25)
    
  2. Lavallee, C.: Indexation manuelle et indexation assistee par ordinateur : comparison de la performance de deux index d'une monographie (1996) 0.14
    0.14110598 = sum of:
      0.14110598 = product of:
        1.1758832 = sum of:
          0.09395386 = weight(abstract_txt:experiment in 740) [ClassicSimilarity], result of:
            0.09395386 = score(doc=740,freq=1.0), product of:
              0.13282432 = queryWeight, product of:
                1.0928812 = boost
                5.658835 = idf(docFreq=420, maxDocs=44421)
                0.021477196 = queryNorm
              0.70735437 = fieldWeight in 740, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.658835 = idf(docFreq=420, maxDocs=44421)
                0.125 = fieldNorm(doc=740)
          0.15816082 = weight(abstract_txt:evaluates in 740) [ClassicSimilarity], result of:
            0.15816082 = score(doc=740,freq=1.0), product of:
              0.18796071 = queryWeight, product of:
                1.3000729 = boost
                6.731654 = idf(docFreq=143, maxDocs=44421)
                0.021477196 = queryNorm
              0.8414568 = fieldWeight in 740, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.731654 = idf(docFreq=143, maxDocs=44421)
                0.125 = fieldNorm(doc=740)
          0.92376846 = weight(title_txt:d'une in 740) [ClassicSimilarity], result of:
            0.92376846 = score(doc=740,freq=1.0), product of:
              0.3840224 = queryWeight, product of:
                1.8582855 = boost
                9.622026 = idf(docFreq=7, maxDocs=44421)
                0.021477196 = queryNorm
              2.4055066 = fieldWeight in 740, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.622026 = idf(docFreq=7, maxDocs=44421)
                0.25 = fieldNorm(doc=740)
        0.12 = coord(3/25)
    
  3. Grefenstette, G.: Explorations in automatic thesaurus discovery (1994) 0.14
    0.13806699 = sum of:
      0.13806699 = product of:
        0.5752791 = sum of:
          0.054540966 = weight(abstract_txt:automatic in 1170) [ClassicSimilarity], result of:
            0.054540966 = score(doc=1170,freq=1.0), product of:
              0.111971855 = queryWeight, product of:
                1.0034335 = boost
                5.1956835 = idf(docFreq=668, maxDocs=44421)
                0.021477196 = queryNorm
              0.48709533 = fieldWeight in 1170, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1956835 = idf(docFreq=668, maxDocs=44421)
                0.09375 = fieldNorm(doc=1170)
          0.0597414 = weight(abstract_txt:words in 1170) [ClassicSimilarity], result of:
            0.0597414 = score(doc=1170,freq=1.0), product of:
              0.1189809 = queryWeight, product of:
                1.0343626 = boost
                5.355831 = idf(docFreq=569, maxDocs=44421)
                0.021477196 = queryNorm
              0.50210917 = fieldWeight in 1170, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.355831 = idf(docFreq=569, maxDocs=44421)
                0.09375 = fieldNorm(doc=1170)
          0.18762517 = weight(abstract_txt:syntactic in 1170) [ClassicSimilarity], result of:
            0.18762517 = score(doc=1170,freq=3.0), product of:
              0.17692152 = queryWeight, product of:
                1.2613177 = boost
                6.5309834 = idf(docFreq=175, maxDocs=44421)
                0.021477196 = queryNorm
              1.0604994 = fieldWeight in 1170, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.5309834 = idf(docFreq=175, maxDocs=44421)
                0.09375 = fieldNorm(doc=1170)
          0.037684143 = weight(abstract_txt:analysis in 1170) [ClassicSimilarity], result of:
            0.037684143 = score(doc=1170,freq=1.0), product of:
              0.11025783 = queryWeight, product of:
                1.408166 = boost
                3.6456752 = idf(docFreq=3151, maxDocs=44421)
                0.021477196 = queryNorm
              0.34178203 = fieldWeight in 1170, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6456752 = idf(docFreq=3151, maxDocs=44421)
                0.09375 = fieldNorm(doc=1170)
          0.08572601 = weight(abstract_txt:term in 1170) [ClassicSimilarity], result of:
            0.08572601 = score(doc=1170,freq=1.0), product of:
              0.1907123 = queryWeight, product of:
                1.8519895 = boost
                4.794713 = idf(docFreq=998, maxDocs=44421)
                0.021477196 = queryNorm
              0.44950435 = fieldWeight in 1170, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.794713 = idf(docFreq=998, maxDocs=44421)
                0.09375 = fieldNorm(doc=1170)
          0.14996143 = weight(abstract_txt:analyse in 1170) [ClassicSimilarity], result of:
            0.14996143 = score(doc=1170,freq=1.0), product of:
              0.276879 = queryWeight, product of:
                2.231486 = boost
                5.7772117 = idf(docFreq=373, maxDocs=44421)
                0.021477196 = queryNorm
              0.5416136 = fieldWeight in 1170, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7772117 = idf(docFreq=373, maxDocs=44421)
                0.09375 = fieldNorm(doc=1170)
        0.24 = coord(6/25)
    
  4. Lioma, C.; Ounis, I.: ¬A syntactically-based query reformulation technique for information retrieval (2008) 0.13
    0.13188119 = sum of:
      0.13188119 = product of:
        0.47100428 = sum of:
          0.071141765 = weight(abstract_txt:automatic in 3031) [ClassicSimilarity], result of:
            0.071141765 = score(doc=3031,freq=5.0), product of:
              0.111971855 = queryWeight, product of:
                1.0034335 = boost
                5.1956835 = idf(docFreq=668, maxDocs=44421)
                0.021477196 = queryNorm
              0.635354 = fieldWeight in 3031, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.1956835 = idf(docFreq=668, maxDocs=44421)
                0.0546875 = fieldNorm(doc=3031)
          0.034849152 = weight(abstract_txt:words in 3031) [ClassicSimilarity], result of:
            0.034849152 = score(doc=3031,freq=1.0), product of:
              0.1189809 = queryWeight, product of:
                1.0343626 = boost
                5.355831 = idf(docFreq=569, maxDocs=44421)
                0.021477196 = queryNorm
              0.29289702 = fieldWeight in 3031, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.355831 = idf(docFreq=569, maxDocs=44421)
                0.0546875 = fieldNorm(doc=3031)
          0.03870743 = weight(abstract_txt:statistical in 3031) [ClassicSimilarity], result of:
            0.03870743 = score(doc=3031,freq=1.0), product of:
              0.12760822 = queryWeight, product of:
                1.0712073 = boost
                5.5466094 = idf(docFreq=470, maxDocs=44421)
                0.021477196 = queryNorm
              0.3033302 = fieldWeight in 3031, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5466094 = idf(docFreq=470, maxDocs=44421)
                0.0546875 = fieldNorm(doc=3031)
          0.041104812 = weight(abstract_txt:experiment in 3031) [ClassicSimilarity], result of:
            0.041104812 = score(doc=3031,freq=1.0), product of:
              0.13282432 = queryWeight, product of:
                1.0928812 = boost
                5.658835 = idf(docFreq=420, maxDocs=44421)
                0.021477196 = queryNorm
              0.30946752 = fieldWeight in 3031, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.658835 = idf(docFreq=420, maxDocs=44421)
                0.0546875 = fieldNorm(doc=3031)
          0.1263797 = weight(abstract_txt:syntactic in 3031) [ClassicSimilarity], result of:
            0.1263797 = score(doc=3031,freq=4.0), product of:
              0.17692152 = queryWeight, product of:
                1.2613177 = boost
                6.5309834 = idf(docFreq=175, maxDocs=44421)
                0.021477196 = queryNorm
              0.7143263 = fieldWeight in 3031, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.5309834 = idf(docFreq=175, maxDocs=44421)
                0.0546875 = fieldNorm(doc=3031)
          0.10881461 = weight(abstract_txt:weighting in 3031) [ClassicSimilarity], result of:
            0.10881461 = score(doc=3031,freq=2.0), product of:
              0.20174244 = queryWeight, product of:
                1.3468921 = boost
                6.9740796 = idf(docFreq=112, maxDocs=44421)
                0.021477196 = queryNorm
              0.53937393 = fieldWeight in 3031, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.9740796 = idf(docFreq=112, maxDocs=44421)
                0.0546875 = fieldNorm(doc=3031)
          0.05000684 = weight(abstract_txt:term in 3031) [ClassicSimilarity], result of:
            0.05000684 = score(doc=3031,freq=1.0), product of:
              0.1907123 = queryWeight, product of:
                1.8519895 = boost
                4.794713 = idf(docFreq=998, maxDocs=44421)
                0.021477196 = queryNorm
              0.26221088 = fieldWeight in 3031, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.794713 = idf(docFreq=998, maxDocs=44421)
                0.0546875 = fieldNorm(doc=3031)
        0.28 = coord(7/25)
    
  5. Menillet, D.: Grilles d'indexation et de préindexation : l'exemple de PASCAL (1992) 0.09
    0.094740905 = sum of:
      0.094740905 = product of:
        1.1842613 = sum of:
          1.112823 = weight(title_txt:d'indexation in 5805) [ClassicSimilarity], result of:
            1.112823 = score(doc=5805,freq=1.0), product of:
              0.3746783 = queryWeight, product of:
                1.8355383 = boost
                9.504243 = idf(docFreq=8, maxDocs=44421)
                0.021477196 = queryNorm
              2.9700758 = fieldWeight in 5805, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.504243 = idf(docFreq=8, maxDocs=44421)
                0.3125 = fieldNorm(doc=5805)
          0.071438335 = weight(abstract_txt:term in 5805) [ClassicSimilarity], result of:
            0.071438335 = score(doc=5805,freq=1.0), product of:
              0.1907123 = queryWeight, product of:
                1.8519895 = boost
                4.794713 = idf(docFreq=998, maxDocs=44421)
                0.021477196 = queryNorm
              0.37458694 = fieldWeight in 5805, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.794713 = idf(docFreq=998, maxDocs=44421)
                0.078125 = fieldNorm(doc=5805)
        0.08 = coord(2/25)