Document (#12421)

Author
McJunkin, M.C.
Title
Precision and recall in title keyword searching
Source
Information technology and libraries. 14(1995) no.3, pp.161-171
Year
1995
Abstract
Investigates the extent to which title keywords convey subject content and compares the effectiveness of two title keyword search strategies, examining whether adjacency operators improve the recall and precision of online searching. Title keywords from a random sample of titles in the field of economics were searched on FirstSearch in the WorldCat database, which is equivalent in coverage to the OCLC OLUC, both with and without adjacency of the keywords specified. The LCSH of the retrieved items were compared with the subject headings of the sample titles to determine the degree of match or relevance, and values for precision and recall were calculated. Results indicated that, when keywords were discipline-specific, adjacency operators improved precision with little degradation of recall. Systems that allow positional operators or rank output by proximity of terms may increase search success.
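The precision and recall values mentioned in the abstract can be illustrated with a minimal sketch. It assumes relevance has already been judged and is given as a plain set of record identifiers; the study itself judged relevance by comparing LCSH, which is not modelled here. The function name and all identifiers in the sample data are hypothetical and are not taken from the paper.

```python
def precision_recall(retrieved, relevant):
    """Return (precision, recall) for one search.

    precision = relevant items retrieved / all items retrieved
    recall    = relevant items retrieved / all relevant items
    """
    retrieved, relevant = set(retrieved), set(relevant)
    hits = len(retrieved & relevant)
    precision = hits / len(retrieved) if retrieved else 0.0
    recall = hits / len(relevant) if relevant else 0.0
    return precision, recall


# Made-up data for illustration only (not results from the study).
relevant = {"r1", "r2", "r3", "r4"}
without_adjacency = ["r1", "r2", "r3", "x1", "x2", "x3", "x4", "x5"]
with_adjacency = ["r1", "r2", "r3", "x1"]

print(precision_recall(without_adjacency, relevant))  # (0.375, 0.75)
print(precision_recall(with_adjacency, relevant))     # (0.75, 0.75)
```

In this invented example the narrower search drops only non-relevant records, so precision rises while recall stays the same, which is the kind of effect the abstract reports for discipline-specific keywords.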
Theme
Verbal documentation languages in online retrieval
Retrieval studies

Similar documents (content)

  1. Chan, L.M.: Library of Congress class numbers in online catalog searching (1989) 0.36
    0.3578157 = sum of:
      0.3578157 = product of:
        1.2779132 = sum of:
          0.02080228 = weight(abstract_txt:using in 1146) [ClassicSimilarity], result of:
            0.02080228 = score(doc=1146,freq=1.0), product of:
              0.054919366 = queryWeight, product of:
                1.0584757 = boost
                3.4631186 = idf(docFreq=3765, maxDocs=44218)
                0.014982257 = queryNorm
              0.37877858 = fieldWeight in 1146, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4631186 = idf(docFreq=3765, maxDocs=44218)
                0.109375 = fieldNorm(doc=1146)
          0.042243533 = weight(abstract_txt:subject in 1146) [ClassicSimilarity], result of:
            0.042243533 = score(doc=1146,freq=2.0), product of:
              0.06990073 = queryWeight, product of:
                1.1941503 = boost
                3.9070187 = idf(docFreq=2415, maxDocs=44218)
                0.014982257 = queryNorm
              0.6043361 = fieldWeight in 1146, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.9070187 = idf(docFreq=2415, maxDocs=44218)
                0.109375 = fieldNorm(doc=1146)
          0.10235998 = weight(abstract_txt:searching in 1146) [ClassicSimilarity], result of:
            0.10235998 = score(doc=1146,freq=3.0), product of:
              0.12610385 = queryWeight, product of:
                1.9643911 = boost
                4.284727 = idf(docFreq=1655, maxDocs=44218)
                0.014982257 = queryNorm
              0.8117118 = fieldWeight in 1146, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.284727 = idf(docFreq=1655, maxDocs=44218)
                0.109375 = fieldNorm(doc=1146)
          0.16895837 = weight(abstract_txt:precision in 1146) [ClassicSimilarity], result of:
            0.16895837 = score(doc=1146,freq=1.0), product of:
              0.2795855 = queryWeight, product of:
                3.377462 = boost
                5.5251865 = idf(docFreq=478, maxDocs=44218)
                0.014982257 = queryNorm
              0.6043173 = fieldWeight in 1146, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5251865 = idf(docFreq=478, maxDocs=44218)
                0.109375 = fieldNorm(doc=1146)
          0.19031906 = weight(abstract_txt:recall in 1146) [ClassicSimilarity], result of:
            0.19031906 = score(doc=1146,freq=1.0), product of:
              0.30267954 = queryWeight, product of:
                3.5141854 = boost
                5.7488523 = idf(docFreq=382, maxDocs=44218)
                0.014982257 = queryNorm
              0.6287807 = fieldWeight in 1146, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7488523 = idf(docFreq=382, maxDocs=44218)
                0.109375 = fieldNorm(doc=1146)
          0.4715739 = weight(abstract_txt:keywords in 1146) [ClassicSimilarity], result of:
            0.4715739 = score(doc=1146,freq=3.0), product of:
              0.4139593 = queryWeight, product of:
                4.594804 = boost
                6.0133076 = idf(docFreq=293, maxDocs=44218)
                0.014982257 = queryNorm
              1.1391793 = fieldWeight in 1146, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.0133076 = idf(docFreq=293, maxDocs=44218)
                0.109375 = fieldNorm(doc=1146)
          0.28165606 = weight(abstract_txt:title in 1146) [ClassicSimilarity], result of:
            0.28165606 = score(doc=1146,freq=1.0), product of:
              0.44995734 = queryWeight, product of:
                5.247645 = boost
                5.723078 = idf(docFreq=392, maxDocs=44218)
                0.014982257 = queryNorm
              0.62596166 = fieldWeight in 1146, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.723078 = idf(docFreq=392, maxDocs=44218)
                0.109375 = fieldNorm(doc=1146)
        0.28 = coord(7/25)
    
  2. Perry, S.; Salisbury, L.: The ten most effective ways to search WorldCat on FirstSearch (1996) 0.33
    0.3296456 = sum of:
      0.3296456 = product of:
        1.0301425 = sum of:
          0.021013476 = weight(abstract_txt:using in 6520) [ClassicSimilarity], result of:
            0.021013476 = score(doc=6520,freq=2.0), product of:
              0.054919366 = queryWeight, product of:
                1.0584757 = boost
                3.4631186 = idf(docFreq=3765, maxDocs=44218)
                0.014982257 = queryNorm
              0.38262415 = fieldWeight in 6520, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4631186 = idf(docFreq=3765, maxDocs=44218)
                0.078125 = fieldNorm(doc=6520)
          0.01751179 = weight(abstract_txt:search in 6520) [ClassicSimilarity], result of:
            0.01751179 = score(doc=6520,freq=1.0), product of:
              0.061276026 = queryWeight, product of:
                1.1180557 = boost
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.014982257 = queryNorm
              0.28578535 = fieldWeight in 6520, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.078125 = fieldNorm(doc=6520)
          0.12450146 = weight(abstract_txt:worldcat in 6520) [ClassicSimilarity], result of:
            0.12450146 = score(doc=6520,freq=3.0), product of:
              0.12468172 = queryWeight, product of:
                1.1277285 = boost
                7.3793993 = idf(docFreq=74, maxDocs=44218)
                0.014982257 = queryNorm
              0.9985542 = fieldWeight in 6520, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.3793993 = idf(docFreq=74, maxDocs=44218)
                0.078125 = fieldNorm(doc=6520)
          0.35146248 = weight(title_txt:firstsearch in 6520) [ClassicSimilarity], result of:
            0.35146248 = score(doc=6520,freq=1.0), product of:
              0.14254092 = queryWeight, product of:
                1.2057935 = boost
                7.890225 = idf(docFreq=44, maxDocs=44218)
                0.014982257 = queryNorm
              2.4656954 = fieldWeight in 6520, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.890225 = idf(docFreq=44, maxDocs=44218)
                0.3125 = fieldNorm(doc=6520)
          0.12713732 = weight(abstract_txt:oluc in 6520) [ClassicSimilarity], result of:
            0.12713732 = score(doc=6520,freq=1.0), product of:
              0.18235134 = queryWeight, product of:
                1.3638217 = boost
                8.924298 = idf(docFreq=15, maxDocs=44218)
                0.014982257 = queryNorm
              0.6972108 = fieldWeight in 6520, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.924298 = idf(docFreq=15, maxDocs=44218)
                0.078125 = fieldNorm(doc=6520)
          0.07311427 = weight(abstract_txt:searching in 6520) [ClassicSimilarity], result of:
            0.07311427 = score(doc=6520,freq=3.0), product of:
              0.12610385 = queryWeight, product of:
                1.9643911 = boost
                4.284727 = idf(docFreq=1655, maxDocs=44218)
                0.014982257 = queryNorm
              0.5797941 = fieldWeight in 6520, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.284727 = idf(docFreq=1655, maxDocs=44218)
                0.078125 = fieldNorm(doc=6520)
          0.1947173 = weight(abstract_txt:operators in 6520) [ClassicSimilarity], result of:
            0.1947173 = score(doc=6520,freq=1.0), product of:
              0.3494382 = queryWeight, product of:
                3.270009 = boost
                7.132539 = idf(docFreq=95, maxDocs=44218)
                0.014982257 = queryNorm
              0.5572296 = fieldWeight in 6520, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.132539 = idf(docFreq=95, maxDocs=44218)
                0.078125 = fieldNorm(doc=6520)
          0.12068454 = weight(abstract_txt:precision in 6520) [ClassicSimilarity], result of:
            0.12068454 = score(doc=6520,freq=1.0), product of:
              0.2795855 = queryWeight, product of:
                3.377462 = boost
                5.5251865 = idf(docFreq=478, maxDocs=44218)
                0.014982257 = queryNorm
              0.4316552 = fieldWeight in 6520, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5251865 = idf(docFreq=478, maxDocs=44218)
                0.078125 = fieldNorm(doc=6520)
        0.32 = coord(8/25)
    
  3. Voorbij, H.: Title keywords and subject descriptors : a comparison of subject search entries of books in the humanities and social sciences (1998) 0.32
    0.32324696 = sum of:
      0.32324696 = product of:
        1.0101467 = sum of:
          0.01040114 = weight(abstract_txt:using in 4721) [ClassicSimilarity], result of:
            0.01040114 = score(doc=4721,freq=1.0), product of:
              0.054919366 = queryWeight, product of:
                1.0584757 = boost
                3.4631186 = idf(docFreq=3765, maxDocs=44218)
                0.014982257 = queryNorm
              0.18938929 = fieldWeight in 4721, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4631186 = idf(docFreq=3765, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4721)
          0.047229707 = weight(abstract_txt:subject in 4721) [ClassicSimilarity], result of:
            0.047229707 = score(doc=4721,freq=10.0), product of:
              0.06990073 = queryWeight, product of:
                1.1941503 = boost
                3.9070187 = idf(docFreq=2415, maxDocs=44218)
                0.014982257 = queryNorm
              0.6756683 = fieldWeight in 4721, product of:
                3.1622777 = tf(freq=10.0), with freq of:
                  10.0 = termFreq=10.0
                3.9070187 = idf(docFreq=2415, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4721)
          0.07793719 = weight(abstract_txt:keyword in 4721) [ClassicSimilarity], result of:
            0.07793719 = score(doc=4721,freq=2.0), product of:
              0.16691346 = queryWeight, product of:
                1.8452866 = boost
                6.037405 = idf(docFreq=286, maxDocs=44218)
                0.014982257 = queryNorm
              0.46693173 = fieldWeight in 4721, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.037405 = idf(docFreq=286, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4721)
          0.041788287 = weight(abstract_txt:searching in 4721) [ClassicSimilarity], result of:
            0.041788287 = score(doc=4721,freq=2.0), product of:
              0.12610385 = queryWeight, product of:
                1.9643911 = boost
                4.284727 = idf(docFreq=1655, maxDocs=44218)
                0.014982257 = queryNorm
              0.33137995 = fieldWeight in 4721, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.284727 = idf(docFreq=1655, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4721)
          0.0428835 = weight(abstract_txt:were in 4721) [ClassicSimilarity], result of:
            0.0428835 = score(doc=4721,freq=3.0), product of:
              0.123358175 = queryWeight, product of:
                2.2434537 = boost
                3.6700637 = idf(docFreq=3061, maxDocs=44218)
                0.014982257 = queryNorm
              0.34763405 = fieldWeight in 4721, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.6700637 = idf(docFreq=3061, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4721)
          0.09515953 = weight(abstract_txt:recall in 4721) [ClassicSimilarity], result of:
            0.09515953 = score(doc=4721,freq=1.0), product of:
              0.30267954 = queryWeight, product of:
                3.5141854 = boost
                5.7488523 = idf(docFreq=382, maxDocs=44218)
                0.014982257 = queryNorm
              0.31439036 = fieldWeight in 4721, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7488523 = idf(docFreq=382, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4721)
          0.27226332 = weight(abstract_txt:keywords in 4721) [ClassicSimilarity], result of:
            0.27226332 = score(doc=4721,freq=4.0), product of:
              0.4139593 = queryWeight, product of:
                4.594804 = boost
                6.0133076 = idf(docFreq=293, maxDocs=44218)
                0.014982257 = queryNorm
              0.65770555 = fieldWeight in 4721, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.0133076 = idf(docFreq=293, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4721)
          0.42248404 = weight(abstract_txt:title in 4721) [ClassicSimilarity], result of:
            0.42248404 = score(doc=4721,freq=9.0), product of:
              0.44995734 = queryWeight, product of:
                5.247645 = boost
                5.723078 = idf(docFreq=392, maxDocs=44218)
                0.014982257 = queryNorm
              0.93894243 = fieldWeight in 4721, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                5.723078 = idf(docFreq=392, maxDocs=44218)
                0.0546875 = fieldNorm(doc=4721)
        0.32 = coord(8/25)
    
  4. Lu, K.; Kipp, M.E.I.: Understanding the retrieval effectiveness of collaborative tags and author keywords in different retrieval environments : an experimental study on medical collections (2014) 0.28
    0.28345254 = sum of:
      0.28345254 = product of:
        1.0123305 = sum of:
          0.01681078 = weight(abstract_txt:using in 1215) [ClassicSimilarity], result of:
            0.01681078 = score(doc=1215,freq=2.0), product of:
              0.054919366 = queryWeight, product of:
                1.0584757 = boost
                3.4631186 = idf(docFreq=3765, maxDocs=44218)
                0.014982257 = queryNorm
              0.30609933 = fieldWeight in 1215, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4631186 = idf(docFreq=3765, maxDocs=44218)
                0.0625 = fieldNorm(doc=1215)
          0.047758043 = weight(abstract_txt:searching in 1215) [ClassicSimilarity], result of:
            0.047758043 = score(doc=1215,freq=2.0), product of:
              0.12610385 = queryWeight, product of:
                1.9643911 = boost
                4.284727 = idf(docFreq=1655, maxDocs=44218)
                0.014982257 = queryNorm
              0.37871996 = fieldWeight in 1215, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.284727 = idf(docFreq=1655, maxDocs=44218)
                0.0625 = fieldNorm(doc=1215)
          0.028295772 = weight(abstract_txt:were in 1215) [ClassicSimilarity], result of:
            0.028295772 = score(doc=1215,freq=1.0), product of:
              0.123358175 = queryWeight, product of:
                2.2434537 = boost
                3.6700637 = idf(docFreq=3061, maxDocs=44218)
                0.014982257 = queryNorm
              0.22937898 = fieldWeight in 1215, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6700637 = idf(docFreq=3061, maxDocs=44218)
                0.0625 = fieldNorm(doc=1215)
          0.19309527 = weight(abstract_txt:precision in 1215) [ClassicSimilarity], result of:
            0.19309527 = score(doc=1215,freq=4.0), product of:
              0.2795855 = queryWeight, product of:
                3.377462 = boost
                5.5251865 = idf(docFreq=478, maxDocs=44218)
                0.014982257 = queryNorm
              0.6906483 = fieldWeight in 1215, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.5251865 = idf(docFreq=478, maxDocs=44218)
                0.0625 = fieldNorm(doc=1215)
          0.15380102 = weight(abstract_txt:recall in 1215) [ClassicSimilarity], result of:
            0.15380102 = score(doc=1215,freq=2.0), product of:
              0.30267954 = queryWeight, product of:
                3.5141854 = boost
                5.7488523 = idf(docFreq=382, maxDocs=44218)
                0.014982257 = queryNorm
              0.50813156 = fieldWeight in 1215, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.7488523 = idf(docFreq=382, maxDocs=44218)
                0.0625 = fieldNorm(doc=1215)
          0.41162342 = weight(abstract_txt:keywords in 1215) [ClassicSimilarity], result of:
            0.41162342 = score(doc=1215,freq=7.0), product of:
              0.4139593 = queryWeight, product of:
                4.594804 = boost
                6.0133076 = idf(docFreq=293, maxDocs=44218)
                0.014982257 = queryNorm
              0.9943572 = fieldWeight in 1215, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                6.0133076 = idf(docFreq=293, maxDocs=44218)
                0.0625 = fieldNorm(doc=1215)
          0.16094631 = weight(abstract_txt:title in 1215) [ClassicSimilarity], result of:
            0.16094631 = score(doc=1215,freq=1.0), product of:
              0.44995734 = queryWeight, product of:
                5.247645 = boost
                5.723078 = idf(docFreq=392, maxDocs=44218)
                0.014982257 = queryNorm
              0.35769236 = fieldWeight in 1215, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.723078 = idf(docFreq=392, maxDocs=44218)
                0.0625 = fieldNorm(doc=1215)
        0.28 = coord(7/25)
    
  5. Voorbij, H.: Een goede titel behoeft geen trefwoord, of toch wel? : een vergelijkend onderzoek titelwoorden - trefwoorden (1997) 0.28
    0.276701 = sum of:
      0.276701 = product of:
        0.9882179 = sum of:
          0.010058528 = weight(abstract_txt:with in 1446) [ClassicSimilarity], result of:
            0.010058528 = score(doc=1446,freq=1.0), product of:
              0.042920962 = queryWeight, product of:
                1.1460366 = boost
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.014982257 = queryNorm
              0.23435001 = fieldWeight in 1446, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.09375 = fieldNorm(doc=1446)
          0.051206898 = weight(abstract_txt:subject in 1446) [ClassicSimilarity], result of:
            0.051206898 = score(doc=1446,freq=4.0), product of:
              0.06990073 = queryWeight, product of:
                1.1941503 = boost
                3.9070187 = idf(docFreq=2415, maxDocs=44218)
                0.014982257 = queryNorm
              0.732566 = fieldWeight in 1446, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.9070187 = idf(docFreq=2415, maxDocs=44218)
                0.09375 = fieldNorm(doc=1446)
          0.050655056 = weight(abstract_txt:searching in 1446) [ClassicSimilarity], result of:
            0.050655056 = score(doc=1446,freq=1.0), product of:
              0.12610385 = queryWeight, product of:
                1.9643911 = boost
                4.284727 = idf(docFreq=1655, maxDocs=44218)
                0.014982257 = queryNorm
              0.40169317 = fieldWeight in 1446, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.284727 = idf(docFreq=1655, maxDocs=44218)
                0.09375 = fieldNorm(doc=1446)
          0.060024396 = weight(abstract_txt:were in 1446) [ClassicSimilarity], result of:
            0.060024396 = score(doc=1446,freq=2.0), product of:
              0.123358175 = queryWeight, product of:
                2.2434537 = boost
                3.6700637 = idf(docFreq=3061, maxDocs=44218)
                0.014982257 = queryNorm
              0.48658627 = fieldWeight in 1446, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.6700637 = idf(docFreq=3061, maxDocs=44218)
                0.09375 = fieldNorm(doc=1446)
          0.14482145 = weight(abstract_txt:precision in 1446) [ClassicSimilarity], result of:
            0.14482145 = score(doc=1446,freq=1.0), product of:
              0.2795855 = queryWeight, product of:
                3.377462 = boost
                5.5251865 = idf(docFreq=478, maxDocs=44218)
                0.014982257 = queryNorm
              0.51798624 = fieldWeight in 1446, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5251865 = idf(docFreq=478, maxDocs=44218)
                0.09375 = fieldNorm(doc=1446)
          0.33003294 = weight(abstract_txt:keywords in 1446) [ClassicSimilarity], result of:
            0.33003294 = score(doc=1446,freq=2.0), product of:
              0.4139593 = queryWeight, product of:
                4.594804 = boost
                6.0133076 = idf(docFreq=293, maxDocs=44218)
                0.014982257 = queryNorm
              0.79725945 = fieldWeight in 1446, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.0133076 = idf(docFreq=293, maxDocs=44218)
                0.09375 = fieldNorm(doc=1446)
          0.34141862 = weight(abstract_txt:title in 1446) [ClassicSimilarity], result of:
            0.34141862 = score(doc=1446,freq=2.0), product of:
              0.44995734 = queryWeight, product of:
                5.247645 = boost
                5.723078 = idf(docFreq=392, maxDocs=44218)
                0.014982257 = queryNorm
              0.75878 = fieldWeight in 1446, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.723078 = idf(docFreq=392, maxDocs=44218)
                0.09375 = fieldNorm(doc=1446)
        0.28 = coord(7/25)
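
The score breakdowns above follow the Lucene ClassicSimilarity pattern: each term score is queryWeight (boost × idf × queryNorm) times fieldWeight (tf × idf × fieldNorm), the term scores are summed, and the sum is multiplied by the coord factor. The sketch below recomputes the first entry (doc 1146) from the values in the listing. It assumes tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)), which reproduce the listed figures; the function and variable names are illustrative and not part of any Lucene API.

```python
import math

MAX_DOCS = 44218          # maxDocs from the listing
QUERY_NORM = 0.014982257  # queryNorm from the listing


def idf(doc_freq, max_docs=MAX_DOCS):
    # Assumed ClassicSimilarity idf: 1 + ln(maxDocs / (docFreq + 1))
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(freq, doc_freq, boost, field_norm, query_norm=QUERY_NORM):
    term_idf = idf(doc_freq)
    query_weight = boost * term_idf * query_norm            # queryWeight
    field_weight = math.sqrt(freq) * term_idf * field_norm  # fieldWeight
    return query_weight * field_weight


def document_score(terms, field_norm, matched, total):
    # Sum of per-term scores, scaled by coord(matched/total).
    summed = sum(term_score(f, df, b, field_norm) for _, f, df, b in terms)
    return summed * matched / total


# (term, freq, docFreq, boost) copied from the doc 1146 explanation.
TERMS_1146 = [
    ("using", 1.0, 3765, 1.0584757),
    ("subject", 2.0, 2415, 1.1941503),
    ("searching", 3.0, 1655, 1.9643911),
    ("precision", 1.0, 478, 3.377462),
    ("recall", 1.0, 382, 3.5141854),
    ("keywords", 3.0, 293, 4.594804),
    ("title", 1.0, 392, 5.247645),
]
FIELD_NORM_1146 = 0.109375

score_1146 = document_score(TERMS_1146, FIELD_NORM_1146, matched=7, total=25)
print(round(score_1146, 4))  # 0.3578, matching the listed 0.3578157
```

The other entries can be checked the same way by substituting their term lists, fieldNorm values, and coord fractions. Small differences in the last digits are expected because the listing reflects single-precision arithmetic.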