Document (#20638)

Author
Gil-Leiva, I.
Munoz, J.V.R.
Title
Analisis de los descriptores de diferentes areas del conocimiento indizades en bases de datos del CSIC : Aplicacion a la indizacion automatica
Source
Revista Española de Documentaçion Cientifica. 20(1997) no.2, S.150-160
Year
1997
Abstract
Studies the value of scientific articles' titles and abstracts as sources of terms for document indexing in relation to 6 areas of knowledge: library and information science, medicine, chemistry, biology, psychology and physics, indexed in the databases ISOC, IME and ICYT of the CSIC. Also examines the syntagmatic structures of the indexing terms found in the field 'descriptors'. as well as the relationship between length of document and number of descriptors. Concludes that if the abstracts are not well made and the titles are not precise, they are not definitive sources for the extractions of concepts; the most common syntactic structure is the noun phrase, followed by noun+adjective and noun+noun; and no significant relationship was found between length of document and number of descriptors assigned to it
Footnote
Übers. d. Titels: Descriptors analysis on different knowledge ares in CSIC databases: application on automatic indexing
Theme
Automatisches Indexieren
Field
Physik
Chemie
Medizin
Psychologie
Biologie
Informationswissenschaft
Bibliothekswesen

Similar documents (author)

  1. Gil-Leiva, I.; Munoz, V.R.: ¬Los origines del almacenamiento y recuperacion de informacion (1996) 5.69
    5.6898336 = sum of:
      5.6898336 = sum of:
        2.6570084 = weight(author_txt:leiva in 5585) [ClassicSimilarity], result of:
          2.6570084 = score(doc=5585,freq=1.0), product of:
            0.67528963 = queryWeight, product of:
              8.993418 = idf(docFreq=14, maxDocs=44421)
              0.0750871 = queryNorm
            3.9346204 = fieldWeight in 5585, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.993418 = idf(docFreq=14, maxDocs=44421)
              0.4375 = fieldNorm(doc=5585)
        3.032825 = weight(author_txt:munoz in 5585) [ClassicSimilarity], result of:
          3.032825 = score(doc=5585,freq=1.0), product of:
            0.73755264 = queryWeight, product of:
              1.0450846 = boost
              9.398883 = idf(docFreq=9, maxDocs=44421)
              0.0750871 = queryNorm
            4.1120114 = fieldWeight in 5585, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              9.398883 = idf(docFreq=9, maxDocs=44421)
              0.4375 = fieldNorm(doc=5585)
    
  2. Munoz, A.M.; Munoz, F.A.: Nuevas areas de conocimiento y la problematica documental : la prospectiva de la paz en la Universidad de Granada (1997) 2.45
    2.4508924 = sum of:
      2.4508924 = product of:
        4.901785 = sum of:
          4.901785 = weight(author_txt:munoz in 1340) [ClassicSimilarity], result of:
            4.901785 = score(doc=1340,freq=2.0), product of:
              0.73755264 = queryWeight, product of:
                1.0450846 = boost
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.0750871 = queryNorm
              6.6460137 = fieldWeight in 1340, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.5 = fieldNorm(doc=1340)
        0.5 = coord(1/2)
    
  3. Munoz, J.V.R.: Documentos electronicos y normalizacion : informacion y conocimiento (1997) 2.17
    2.1663034 = sum of:
      2.1663034 = product of:
        4.332607 = sum of:
          4.332607 = weight(author_txt:munoz in 3813) [ClassicSimilarity], result of:
            4.332607 = score(doc=3813,freq=1.0), product of:
              0.73755264 = queryWeight, product of:
                1.0450846 = boost
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.0750871 = queryNorm
              5.874302 = fieldWeight in 3813, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.625 = fieldNorm(doc=3813)
        0.5 = coord(1/2)
    
  4. Leiva, I.G. -> Gil-Leiva, I.: 1.88
    1.8787885 = sum of:
      1.8787885 = product of:
        3.757577 = sum of:
          3.757577 = weight(author_txt:leiva in 98) [ClassicSimilarity], result of:
            3.757577 = score(doc=98,freq=2.0), product of:
              0.67528963 = queryWeight, product of:
                8.993418 = idf(docFreq=14, maxDocs=44421)
                0.0750871 = queryNorm
              5.564393 = fieldWeight in 98, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.993418 = idf(docFreq=14, maxDocs=44421)
                0.4375 = fieldNorm(doc=98)
        0.5 = coord(1/2)
    
  5. Fernández, F.J. Munoz- -> Munoz-Fernández, F.J.: 1.84
    1.8381695 = sum of:
      1.8381695 = product of:
        3.676339 = sum of:
          3.676339 = weight(author_txt:munoz in 3707) [ClassicSimilarity], result of:
            3.676339 = score(doc=3707,freq=2.0), product of:
              0.73755264 = queryWeight, product of:
                1.0450846 = boost
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.0750871 = queryNorm
              4.9845104 = fieldWeight in 3707, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.375 = fieldNorm(doc=3707)
        0.5 = coord(1/2)
    

Similar documents (content)

  1. Mesquita, L.A.P.; Souza, R.R.; Baracho Porto, R.M.A.: Noun phrases in automatic indexing: : a structural analysis of the distribution of relevant terms in doctoral theses (2014) 0.18
    0.1794924 = sum of:
      0.1794924 = product of:
        0.64104426 = sum of:
          0.013312143 = weight(abstract_txt:between in 2442) [ClassicSimilarity], result of:
            0.013312143 = score(doc=2442,freq=2.0), product of:
              0.05808213 = queryWeight, product of:
                3.4573963 = idf(docFreq=3804, maxDocs=44421)
                0.016799385 = queryNorm
              0.22919516 = fieldWeight in 2442, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4573963 = idf(docFreq=3804, maxDocs=44421)
                0.046875 = fieldNorm(doc=2442)
          0.0135371005 = weight(abstract_txt:well in 2442) [ClassicSimilarity], result of:
            0.0135371005 = score(doc=2442,freq=1.0), product of:
              0.074001014 = queryWeight, product of:
                1.1287495 = boost
                3.9025342 = idf(docFreq=2437, maxDocs=44421)
                0.016799385 = queryNorm
              0.18293129 = fieldWeight in 2442, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9025342 = idf(docFreq=2437, maxDocs=44421)
                0.046875 = fieldNorm(doc=2442)
          0.03688942 = weight(abstract_txt:terms in 2442) [ClassicSimilarity], result of:
            0.03688942 = score(doc=2442,freq=6.0), product of:
              0.07945197 = queryWeight, product of:
                1.1695831 = boost
                4.043712 = idf(docFreq=2116, maxDocs=44421)
                0.016799385 = queryNorm
              0.46429837 = fieldWeight in 2442, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                4.043712 = idf(docFreq=2116, maxDocs=44421)
                0.046875 = fieldNorm(doc=2442)
          0.026519226 = weight(abstract_txt:indexing in 2442) [ClassicSimilarity], result of:
            0.026519226 = score(doc=2442,freq=2.0), product of:
              0.09195693 = queryWeight, product of:
                1.2582617 = boost
                4.3503094 = idf(docFreq=1557, maxDocs=44421)
                0.016799385 = queryNorm
              0.28838748 = fieldWeight in 2442, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.3503094 = idf(docFreq=1557, maxDocs=44421)
                0.046875 = fieldNorm(doc=2442)
          0.02036492 = weight(abstract_txt:found in 2442) [ClassicSimilarity], result of:
            0.02036492 = score(doc=2442,freq=1.0), product of:
              0.09715735 = queryWeight, product of:
                1.2933515 = boost
                4.4716287 = idf(docFreq=1379, maxDocs=44421)
                0.016799385 = queryNorm
              0.2096076 = fieldWeight in 2442, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4716287 = idf(docFreq=1379, maxDocs=44421)
                0.046875 = fieldNorm(doc=2442)
          0.05266841 = weight(abstract_txt:areas in 2442) [ClassicSimilarity], result of:
            0.05266841 = score(doc=2442,freq=4.0), product of:
              0.115318954 = queryWeight, product of:
                1.4090587 = boost
                4.871674 = idf(docFreq=924, maxDocs=44421)
                0.016799385 = queryNorm
              0.45671946 = fieldWeight in 2442, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.871674 = idf(docFreq=924, maxDocs=44421)
                0.046875 = fieldNorm(doc=2442)
          0.47775307 = weight(abstract_txt:noun in 2442) [ClassicSimilarity], result of:
            0.47775307 = score(doc=2442,freq=5.0), product of:
              0.58664614 = queryWeight, product of:
                4.494505 = boost
                7.769642 = idf(docFreq=50, maxDocs=44421)
                0.016799385 = queryNorm
              0.81438035 = fieldWeight in 2442, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                7.769642 = idf(docFreq=50, maxDocs=44421)
                0.046875 = fieldNorm(doc=2442)
        0.28 = coord(7/25)
    
  2. Souza, R.R.; Raghavan, K.S.: ¬A methodology for noun phrase-based automatic indexing (2006) 0.18
    0.17735368 = sum of:
      0.17735368 = product of:
        0.8867684 = sum of:
          0.025100071 = weight(abstract_txt:terms in 298) [ClassicSimilarity], result of:
            0.025100071 = score(doc=298,freq=1.0), product of:
              0.07945197 = queryWeight, product of:
                1.1695831 = boost
                4.043712 = idf(docFreq=2116, maxDocs=44421)
                0.016799385 = queryNorm
              0.31591502 = fieldWeight in 298, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.043712 = idf(docFreq=2116, maxDocs=44421)
                0.078125 = fieldNorm(doc=298)
          0.03125321 = weight(abstract_txt:indexing in 298) [ClassicSimilarity], result of:
            0.03125321 = score(doc=298,freq=1.0), product of:
              0.09195693 = queryWeight, product of:
                1.2582617 = boost
                4.3503094 = idf(docFreq=1557, maxDocs=44421)
                0.016799385 = queryNorm
              0.33986792 = fieldWeight in 298, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3503094 = idf(docFreq=1557, maxDocs=44421)
                0.078125 = fieldNorm(doc=298)
          0.045087595 = weight(abstract_txt:document in 298) [ClassicSimilarity], result of:
            0.045087595 = score(doc=298,freq=1.0), product of:
              0.13439709 = queryWeight, product of:
                1.8630277 = boost
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.016799385 = queryNorm
              0.33548045 = fieldWeight in 298, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.078125 = fieldNorm(doc=298)
          0.16855095 = weight(abstract_txt:descriptors in 298) [ClassicSimilarity], result of:
            0.16855095 = score(doc=298,freq=1.0), product of:
              0.3237223 = queryWeight, product of:
                2.8914165 = boost
                6.664515 = idf(docFreq=153, maxDocs=44421)
                0.016799385 = queryNorm
              0.5206652 = fieldWeight in 298, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.664515 = idf(docFreq=153, maxDocs=44421)
                0.078125 = fieldNorm(doc=298)
          0.6167766 = weight(abstract_txt:noun in 298) [ClassicSimilarity], result of:
            0.6167766 = score(doc=298,freq=3.0), product of:
              0.58664614 = queryWeight, product of:
                4.494505 = boost
                7.769642 = idf(docFreq=50, maxDocs=44421)
                0.016799385 = queryNorm
              1.0513605 = fieldWeight in 298, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.769642 = idf(docFreq=50, maxDocs=44421)
                0.078125 = fieldNorm(doc=298)
        0.2 = coord(5/25)
    
  3. Rodriguez Bravo, B.: ¬The visibility of women in indexing languages (2006) 0.15
    0.15414898 = sum of:
      0.15414898 = product of:
        0.7707449 = sum of:
          0.03125321 = weight(abstract_txt:indexing in 1263) [ClassicSimilarity], result of:
            0.03125321 = score(doc=1263,freq=1.0), product of:
              0.09195693 = queryWeight, product of:
                1.2582617 = boost
                4.3503094 = idf(docFreq=1557, maxDocs=44421)
                0.016799385 = queryNorm
              0.33986792 = fieldWeight in 1263, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3503094 = idf(docFreq=1557, maxDocs=44421)
                0.078125 = fieldNorm(doc=1263)
          0.16908452 = weight(abstract_txt:adjective in 1263) [ClassicSimilarity], result of:
            0.16908452 = score(doc=1263,freq=1.0), product of:
              0.22492996 = queryWeight, product of:
                1.3915133 = boost
                9.622026 = idf(docFreq=7, maxDocs=44421)
                0.016799385 = queryNorm
              0.7517208 = fieldWeight in 1263, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.622026 = idf(docFreq=7, maxDocs=44421)
                0.078125 = fieldNorm(doc=1263)
          0.04576014 = weight(abstract_txt:relationship in 1263) [ClassicSimilarity], result of:
            0.04576014 = score(doc=1263,freq=1.0), product of:
              0.11857131 = queryWeight, product of:
                1.4287905 = boost
                4.9398947 = idf(docFreq=863, maxDocs=44421)
                0.016799385 = queryNorm
              0.3859293 = fieldWeight in 1263, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.9398947 = idf(docFreq=863, maxDocs=44421)
                0.078125 = fieldNorm(doc=1263)
          0.16855095 = weight(abstract_txt:descriptors in 1263) [ClassicSimilarity], result of:
            0.16855095 = score(doc=1263,freq=1.0), product of:
              0.3237223 = queryWeight, product of:
                2.8914165 = boost
                6.664515 = idf(docFreq=153, maxDocs=44421)
                0.016799385 = queryNorm
              0.5206652 = fieldWeight in 1263, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.664515 = idf(docFreq=153, maxDocs=44421)
                0.078125 = fieldNorm(doc=1263)
          0.35609612 = weight(abstract_txt:noun in 1263) [ClassicSimilarity], result of:
            0.35609612 = score(doc=1263,freq=1.0), product of:
              0.58664614 = queryWeight, product of:
                4.494505 = boost
                7.769642 = idf(docFreq=50, maxDocs=44421)
                0.016799385 = queryNorm
              0.6070033 = fieldWeight in 1263, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.769642 = idf(docFreq=50, maxDocs=44421)
                0.078125 = fieldNorm(doc=1263)
        0.2 = coord(5/25)
    
  4. Lopez-Ostenero, F.; Gonzalo, J.; Verdejo, F.: Noun phrases as building blocks for cross-language search assistance (2005) 0.14
    0.14176935 = sum of:
      0.14176935 = product of:
        0.70884675 = sum of:
          0.06811514 = weight(abstract_txt:phrase in 2021) [ClassicSimilarity], result of:
            0.06811514 = score(doc=2021,freq=1.0), product of:
              0.122689426 = queryWeight, product of:
                1.0277022 = boost
                7.1063476 = idf(docFreq=98, maxDocs=44421)
                0.016799385 = queryNorm
              0.5551834 = fieldWeight in 2021, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.1063476 = idf(docFreq=98, maxDocs=44421)
                0.078125 = fieldNorm(doc=2021)
          0.025100071 = weight(abstract_txt:terms in 2021) [ClassicSimilarity], result of:
            0.025100071 = score(doc=2021,freq=1.0), product of:
              0.07945197 = queryWeight, product of:
                1.1695831 = boost
                4.043712 = idf(docFreq=2116, maxDocs=44421)
                0.016799385 = queryNorm
              0.31591502 = fieldWeight in 2021, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.043712 = idf(docFreq=2116, maxDocs=44421)
                0.078125 = fieldNorm(doc=2021)
          0.03394153 = weight(abstract_txt:found in 2021) [ClassicSimilarity], result of:
            0.03394153 = score(doc=2021,freq=1.0), product of:
              0.09715735 = queryWeight, product of:
                1.2933515 = boost
                4.4716287 = idf(docFreq=1379, maxDocs=44421)
                0.016799385 = queryNorm
              0.34934598 = fieldWeight in 2021, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4716287 = idf(docFreq=1379, maxDocs=44421)
                0.078125 = fieldNorm(doc=2021)
          0.078094006 = weight(abstract_txt:document in 2021) [ClassicSimilarity], result of:
            0.078094006 = score(doc=2021,freq=3.0), product of:
              0.13439709 = queryWeight, product of:
                1.8630277 = boost
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.016799385 = queryNorm
              0.5810692 = fieldWeight in 2021, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.078125 = fieldNorm(doc=2021)
          0.503596 = weight(abstract_txt:noun in 2021) [ClassicSimilarity], result of:
            0.503596 = score(doc=2021,freq=2.0), product of:
              0.58664614 = queryWeight, product of:
                4.494505 = boost
                7.769642 = idf(docFreq=50, maxDocs=44421)
                0.016799385 = queryNorm
              0.8584323 = fieldWeight in 2021, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.769642 = idf(docFreq=50, maxDocs=44421)
                0.078125 = fieldNorm(doc=2021)
        0.2 = coord(5/25)
    
  5. Larouk, O.: Modelling users need : schemas of interrogation and filtering of answers from the WEB in co-operative mode (1998) 0.14
    0.1358399 = sum of:
      0.1358399 = product of:
        0.56599957 = sum of:
          0.017570049 = weight(abstract_txt:terms in 1060) [ClassicSimilarity], result of:
            0.017570049 = score(doc=1060,freq=1.0), product of:
              0.07945197 = queryWeight, product of:
                1.1695831 = boost
                4.043712 = idf(docFreq=2116, maxDocs=44421)
                0.016799385 = queryNorm
              0.2211405 = fieldWeight in 1060, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.043712 = idf(docFreq=2116, maxDocs=44421)
                0.0546875 = fieldNorm(doc=1060)
          0.030939098 = weight(abstract_txt:indexing in 1060) [ClassicSimilarity], result of:
            0.030939098 = score(doc=1060,freq=2.0), product of:
              0.09195693 = queryWeight, product of:
                1.2582617 = boost
                4.3503094 = idf(docFreq=1557, maxDocs=44421)
                0.016799385 = queryNorm
              0.33645207 = fieldWeight in 1060, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.3503094 = idf(docFreq=1557, maxDocs=44421)
                0.0546875 = fieldNorm(doc=1060)
          0.07061199 = weight(abstract_txt:titles in 1060) [ClassicSimilarity], result of:
            0.07061199 = score(doc=1060,freq=2.0), product of:
              0.15940368 = queryWeight, product of:
                1.6566391 = boost
                5.727658 = idf(docFreq=392, maxDocs=44421)
                0.016799385 = queryNorm
              0.4429759 = fieldWeight in 1060, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.727658 = idf(docFreq=392, maxDocs=44421)
                0.0546875 = fieldNorm(doc=1060)
          0.07962549 = weight(abstract_txt:abstracts in 1060) [ClassicSimilarity], result of:
            0.07962549 = score(doc=1060,freq=2.0), product of:
              0.1726954 = queryWeight, product of:
                1.724325 = boost
                5.9616747 = idf(docFreq=310, maxDocs=44421)
                0.016799385 = queryNorm
              0.46107474 = fieldWeight in 1060, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.9616747 = idf(docFreq=310, maxDocs=44421)
                0.0546875 = fieldNorm(doc=1060)
          0.11798566 = weight(abstract_txt:descriptors in 1060) [ClassicSimilarity], result of:
            0.11798566 = score(doc=1060,freq=1.0), product of:
              0.3237223 = queryWeight, product of:
                2.8914165 = boost
                6.664515 = idf(docFreq=153, maxDocs=44421)
                0.016799385 = queryNorm
              0.36446565 = fieldWeight in 1060, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.664515 = idf(docFreq=153, maxDocs=44421)
                0.0546875 = fieldNorm(doc=1060)
          0.2492673 = weight(abstract_txt:noun in 1060) [ClassicSimilarity], result of:
            0.2492673 = score(doc=1060,freq=1.0), product of:
              0.58664614 = queryWeight, product of:
                4.494505 = boost
                7.769642 = idf(docFreq=50, maxDocs=44421)
                0.016799385 = queryNorm
              0.4249023 = fieldWeight in 1060, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.769642 = idf(docFreq=50, maxDocs=44421)
                0.0546875 = fieldNorm(doc=1060)
        0.24 = coord(6/25)