Document (#1604)

Author
Byrne, J.R.
Title
Relative effectiveness of titles, abstracts, and subject headings for machine retrieval from the COMPENDEX services
Source
Journal of the American Society for Information Science. 26(1975), pp. 223-229
Year
1975
Abstract
We have investigated the relative merits of searching on titles, subject headings, abstracts, free-language terms, and combinations of these elements. The COMPENDEX data base was used for this study since it combined all of these data elements of interest. In general, the results obtained from the experiments indicate that, as expected, titles alone are not satisfactory for efficient retrieval. The combination of titles and abstracts came the closest to 100% retrieval, with searching of abstracts alone doing almost as well. Indexer input, although necessary for 100% retrieval in almost all cases, was found to be relatively unimportant.
Theme
Retrieval studies
Object
COMPENDEX

Similar documents (content)

  1. Orton, D.: Database review : engineering (1995) 0.14
    0.14397883 = sum of:
      0.14397883 = product of:
        1.1998236 = sum of:
          0.067366876 = weight(abstract_txt:searching in 3931) [ClassicSimilarity], result of:
            0.067366876 = score(doc=3931,freq=1.0), product of:
              0.10058762 = queryWeight, product of:
                1.4387238 = boost
                4.2862926 = idf(docFreq=1660, maxDocs=44421)
                0.016311178 = queryNorm
              0.6697332 = fieldWeight in 3931, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2862926 = idf(docFreq=1660, maxDocs=44421)
                0.15625 = fieldNorm(doc=3931)
          0.20976171 = weight(abstract_txt:almost in 3931) [ClassicSimilarity], result of:
            0.20976171 = score(doc=3931,freq=1.0), product of:
              0.21448542 = queryWeight, product of:
                2.1008935 = boost
                6.25905 = idf(docFreq=230, maxDocs=44421)
                0.016311178 = queryNorm
              0.97797656 = fieldWeight in 3931, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.25905 = idf(docFreq=230, maxDocs=44421)
                0.15625 = fieldNorm(doc=3931)
          0.92269504 = weight(abstract_txt:compendex in 3931) [ClassicSimilarity], result of:
            0.92269504 = score(doc=3931,freq=2.0), product of:
              0.45702758 = queryWeight, product of:
                3.0667357 = boost
                9.1365185 = idf(docFreq=12, maxDocs=44421)
                0.016311178 = queryNorm
              2.0189044 = fieldWeight in 3931, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.1365185 = idf(docFreq=12, maxDocs=44421)
                0.15625 = fieldNorm(doc=3931)
        0.12 = coord(3/25)
    
  2. Hook, P.A.; Gantchev, A.: Using combined metadata sources to visualize a small library (OBL's English Language Books) (2017) 0.12
    0.12181849 = sum of:
      0.12181849 = product of:
        0.43506604 = sum of:
          0.06268948 = weight(abstract_txt:combined in 4870) [ClassicSimilarity], result of:
            0.06268948 = score(doc=4870,freq=3.0), product of:
              0.097189575 = queryWeight, product of:
                5.9584646 = idf(docFreq=311, maxDocs=44421)
                0.016311178 = queryNorm
              0.6450227 = fieldWeight in 4870, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.9584646 = idf(docFreq=311, maxDocs=44421)
                0.0625 = fieldNorm(doc=4870)
          0.011033668 = weight(abstract_txt:these in 4870) [ClassicSimilarity], result of:
            0.011033668 = score(doc=4870,freq=1.0), product of:
              0.055465158 = queryWeight, product of:
                1.0683542 = boost
                3.1828754 = idf(docFreq=5006, maxDocs=44421)
                0.016311178 = queryNorm
              0.19892971 = fieldWeight in 4870, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.1828754 = idf(docFreq=5006, maxDocs=44421)
                0.0625 = fieldNorm(doc=4870)
          0.028259655 = weight(abstract_txt:data in 4870) [ClassicSimilarity], result of:
            0.028259655 = score(doc=4870,freq=5.0), product of:
              0.060719505 = queryWeight, product of:
                1.1178132 = boost
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.016311178 = queryNorm
              0.46541315 = fieldWeight in 4870, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.0625 = fieldNorm(doc=4870)
          0.04573597 = weight(abstract_txt:subject in 4870) [ClassicSimilarity], result of:
            0.04573597 = score(doc=4870,freq=5.0), product of:
              0.083699375 = queryWeight, product of:
                1.3124001 = boost
                3.9099448 = idf(docFreq=2419, maxDocs=44421)
                0.016311178 = queryNorm
              0.5464314 = fieldWeight in 4870, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.9099448 = idf(docFreq=2419, maxDocs=44421)
                0.0625 = fieldNorm(doc=4870)
          0.081108205 = weight(abstract_txt:headings in 4870) [ClassicSimilarity], result of:
            0.081108205 = score(doc=4870,freq=3.0), product of:
              0.14539286 = queryWeight, product of:
                1.7297236 = boost
                5.1532483 = idf(docFreq=697, maxDocs=44421)
                0.016311178 = queryNorm
              0.5578555 = fieldWeight in 4870, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.1532483 = idf(docFreq=697, maxDocs=44421)
                0.0625 = fieldNorm(doc=4870)
          0.07764463 = weight(abstract_txt:relative in 4870) [ClassicSimilarity], result of:
            0.07764463 = score(doc=4870,freq=1.0), product of:
              0.20367979 = queryWeight, product of:
                2.047289 = boost
                6.099349 = idf(docFreq=270, maxDocs=44421)
                0.016311178 = queryNorm
              0.3812093 = fieldWeight in 4870, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.099349 = idf(docFreq=270, maxDocs=44421)
                0.0625 = fieldNorm(doc=4870)
          0.12859441 = weight(abstract_txt:titles in 4870) [ClassicSimilarity], result of:
            0.12859441 = score(doc=4870,freq=1.0), product of:
              0.35922375 = queryWeight, product of:
                3.845056 = boost
                5.727658 = idf(docFreq=392, maxDocs=44421)
                0.016311178 = queryNorm
              0.3579786 = fieldWeight in 4870, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.727658 = idf(docFreq=392, maxDocs=44421)
                0.0625 = fieldNorm(doc=4870)
        0.28 = coord(7/25)
    
  3. Ekmekcioglu, F.C.; Robertson, A.M.; Willett, P.: Effectiveness of query expansion in ranked-output document retrieval systems (1992) 0.12
    0.12011176 = sum of:
      0.12011176 = product of:
        0.6005588 = sum of:
          0.01930892 = weight(abstract_txt:these in 6689) [ClassicSimilarity], result of:
            0.01930892 = score(doc=6689,freq=1.0), product of:
              0.055465158 = queryWeight, product of:
                1.0683542 = boost
                3.1828754 = idf(docFreq=5006, maxDocs=44421)
                0.016311178 = queryNorm
              0.348127 = fieldWeight in 6689, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.1828754 = idf(docFreq=5006, maxDocs=44421)
                0.109375 = fieldNorm(doc=6689)
          0.03127771 = weight(abstract_txt:data in 6689) [ClassicSimilarity], result of:
            0.03127771 = score(doc=6689,freq=2.0), product of:
              0.060719505 = queryWeight, product of:
                1.1178132 = boost
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.016311178 = queryNorm
              0.515118 = fieldWeight in 6689, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.109375 = fieldNorm(doc=6689)
          0.0711657 = weight(abstract_txt:retrieval in 6689) [ClassicSimilarity], result of:
            0.0711657 = score(doc=6689,freq=2.0), product of:
              0.13234131 = queryWeight, product of:
                2.3338227 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.016311178 = queryNorm
              0.5377437 = fieldWeight in 6689, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.109375 = fieldNorm(doc=6689)
          0.22504024 = weight(abstract_txt:titles in 6689) [ClassicSimilarity], result of:
            0.22504024 = score(doc=6689,freq=1.0), product of:
              0.35922375 = queryWeight, product of:
                3.845056 = boost
                5.727658 = idf(docFreq=392, maxDocs=44421)
                0.016311178 = queryNorm
              0.6264626 = fieldWeight in 6689, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.727658 = idf(docFreq=392, maxDocs=44421)
                0.109375 = fieldNorm(doc=6689)
          0.25376624 = weight(abstract_txt:abstracts in 6689) [ClassicSimilarity], result of:
            0.25376624 = score(doc=6689,freq=1.0), product of:
              0.3891773 = queryWeight, product of:
                4.002155 = boost
                5.9616747 = idf(docFreq=310, maxDocs=44421)
                0.016311178 = queryNorm
              0.6520582 = fieldWeight in 6689, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9616747 = idf(docFreq=310, maxDocs=44421)
                0.109375 = fieldNorm(doc=6689)
        0.2 = coord(5/25)
    
  4. Roberts, D.; Souter, C.: The automation of controlled vocabulary subject indexing of medical journal articles (2000) 0.11
    0.11488749 = sum of:
      0.11488749 = product of:
        0.4786979 = sum of:
          0.048794083 = weight(abstract_txt:input in 836) [ClassicSimilarity], result of:
            0.048794083 = score(doc=836,freq=1.0), product of:
              0.10221197 = queryWeight, product of:
                1.0255127 = boost
                6.110481 = idf(docFreq=267, maxDocs=44421)
                0.016311178 = queryNorm
              0.47738132 = fieldWeight in 836, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.110481 = idf(docFreq=267, maxDocs=44421)
                0.078125 = fieldNorm(doc=836)
          0.015797628 = weight(abstract_txt:data in 836) [ClassicSimilarity], result of:
            0.015797628 = score(doc=836,freq=1.0), product of:
              0.060719505 = queryWeight, product of:
                1.1178132 = boost
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.016311178 = queryNorm
              0.26017386 = fieldWeight in 836, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.078125 = fieldNorm(doc=836)
          0.036157455 = weight(abstract_txt:subject in 836) [ClassicSimilarity], result of:
            0.036157455 = score(doc=836,freq=2.0), product of:
              0.083699375 = queryWeight, product of:
                1.3124001 = boost
                3.9099448 = idf(docFreq=2419, maxDocs=44421)
                0.016311178 = queryNorm
              0.43199193 = fieldWeight in 836, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.9099448 = idf(docFreq=2419, maxDocs=44421)
                0.078125 = fieldNorm(doc=836)
          0.035944104 = weight(abstract_txt:retrieval in 836) [ClassicSimilarity], result of:
            0.035944104 = score(doc=836,freq=1.0), product of:
              0.13234131 = queryWeight, product of:
                2.3338227 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.016311178 = queryNorm
              0.27160156 = fieldWeight in 836, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.078125 = fieldNorm(doc=836)
          0.16074303 = weight(abstract_txt:titles in 836) [ClassicSimilarity], result of:
            0.16074303 = score(doc=836,freq=1.0), product of:
              0.35922375 = queryWeight, product of:
                3.845056 = boost
                5.727658 = idf(docFreq=392, maxDocs=44421)
                0.016311178 = queryNorm
              0.44747326 = fieldWeight in 836, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.727658 = idf(docFreq=392, maxDocs=44421)
                0.078125 = fieldNorm(doc=836)
          0.18126158 = weight(abstract_txt:abstracts in 836) [ClassicSimilarity], result of:
            0.18126158 = score(doc=836,freq=1.0), product of:
              0.3891773 = queryWeight, product of:
                4.002155 = boost
                5.9616747 = idf(docFreq=310, maxDocs=44421)
                0.016311178 = queryNorm
              0.46575582 = fieldWeight in 836, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9616747 = idf(docFreq=310, maxDocs=44421)
                0.078125 = fieldNorm(doc=836)
        0.24 = coord(6/25)
    
  5. Voorbij, H.: Een goede titel behoeft geen trefwoord, of toch wel? : een vergelijkend onderzoek titelwoorden - trefwoorden (1997) 0.11
    0.11163755 = sum of:
      0.11163755 = product of:
        0.5581877 = sum of:
          0.06136124 = weight(abstract_txt:subject in 2446) [ClassicSimilarity], result of:
            0.06136124 = score(doc=2446,freq=4.0), product of:
              0.083699375 = queryWeight, product of:
                1.3124001 = boost
                3.9099448 = idf(docFreq=2419, maxDocs=44421)
                0.016311178 = queryNorm
              0.73311466 = fieldWeight in 2446, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.9099448 = idf(docFreq=2419, maxDocs=44421)
                0.09375 = fieldNorm(doc=2446)
          0.040420122 = weight(abstract_txt:searching in 2446) [ClassicSimilarity], result of:
            0.040420122 = score(doc=2446,freq=1.0), product of:
              0.10058762 = queryWeight, product of:
                1.4387238 = boost
                4.2862926 = idf(docFreq=1660, maxDocs=44421)
                0.016311178 = queryNorm
              0.4018399 = fieldWeight in 2446, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2862926 = idf(docFreq=1660, maxDocs=44421)
                0.09375 = fieldNorm(doc=2446)
          0.14048354 = weight(abstract_txt:headings in 2446) [ClassicSimilarity], result of:
            0.14048354 = score(doc=2446,freq=4.0), product of:
              0.14539286 = queryWeight, product of:
                1.7297236 = boost
                5.1532483 = idf(docFreq=697, maxDocs=44421)
                0.016311178 = queryNorm
              0.9662341 = fieldWeight in 2446, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.1532483 = idf(docFreq=697, maxDocs=44421)
                0.09375 = fieldNorm(doc=2446)
          0.04313293 = weight(abstract_txt:retrieval in 2446) [ClassicSimilarity], result of:
            0.04313293 = score(doc=2446,freq=1.0), product of:
              0.13234131 = queryWeight, product of:
                2.3338227 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.016311178 = queryNorm
              0.3259219 = fieldWeight in 2446, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.09375 = fieldNorm(doc=2446)
          0.27278993 = weight(abstract_txt:titles in 2446) [ClassicSimilarity], result of:
            0.27278993 = score(doc=2446,freq=2.0), product of:
              0.35922375 = queryWeight, product of:
                3.845056 = boost
                5.727658 = idf(docFreq=392, maxDocs=44421)
                0.016311178 = queryNorm
              0.75938725 = fieldWeight in 2446, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.727658 = idf(docFreq=392, maxDocs=44421)
                0.09375 = fieldNorm(doc=2446)
        0.2 = coord(5/25)
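
The score trees above appear to follow the classic Lucene TF-IDF scheme named in each line ([ClassicSimilarity]): tf is the square root of the term frequency, idf is 1 + ln(maxDocs / (docFreq + 1)), queryWeight is boost * idf * queryNorm, fieldWeight is tf * idf * fieldNorm, and the summed term scores are scaled by coord (matched terms / query terms). The sketch below recomputes entry 1 (doc 3931) from the numbers printed in its tree. The function names are illustrative and not part of any library, and where a tree shows no boost line (as for "combined" in entry 2) a boost of 1.0 is assumed.

```python
import math

MAX_DOCS = 44421          # maxDocs as printed in every idf(...) line
QUERY_NORM = 0.016311178  # queryNorm as printed in every queryWeight block


def idf(doc_freq: int, max_docs: int = MAX_DOCS) -> float:
    """Inverse document frequency in the form the trees appear to use."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(freq: float, doc_freq: int, field_norm: float,
               boost: float = 1.0) -> float:
    """Score of one matching term: queryWeight * fieldWeight."""
    term_idf = idf(doc_freq)
    query_weight = boost * term_idf * QUERY_NORM
    field_weight = math.sqrt(freq) * term_idf * field_norm
    return query_weight * field_weight


# (term, freq, docFreq, boost) copied from the tree of entry 1 (doc 3931)
entry_1_terms = [
    ("searching", 1.0, 1660, 1.4387238),
    ("almost",    1.0, 230,  2.1008935),
    ("compendex", 2.0, 12,   3.0667357),
]
FIELD_NORM_3931 = 0.15625

term_sum = sum(
    term_score(freq, doc_freq, FIELD_NORM_3931, boost)
    for _, freq, doc_freq, boost in entry_1_terms
)
coord = 3 / 25  # 3 of the 25 query terms matched
final_score = term_sum * coord

print(f"sum of term scores: {term_sum:.6f}")
print(f"final score:        {final_score:.6f}")
```

The result should land close to the 0.14397883 shown for entry 1; small differences in the last digits are expected because Lucene computes in single precision while Python uses doubles.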