Document (#2230)

Author
Cousins, S.A.
Title
Enhancing subject access to OPACs : controlled vocabulary vs. natural language
Source
Journal of documentation. 48(1992) no.3, S.291-309
Year
1992
Abstract
Experimental evidence suggests that enhancing the subject content of OPAC records can improve retrieval performance. This is based on the use of natural language index terms derived from the table of contents and back-of-the-book index of documents. The research reported here investigates the alternative approach of translating these natural language terms into controlled vocabulary. Subject queries were collected by interview at the catalogue, and indexing of the queries demonstrated the impressive ability of PRECIS, and to a lesser extent LCSH, to represent users' information needs. DDC performed poorly in this respect. The assumption was made that an index language adequately specific to represent users' queries should be adequate to represent document contents. Searches were carried out on three test databases, and both natural language and PRECIS enhancement of MARC records increased the number of relevant documents found, with PRECIS showing the better performance. However, with weak stemming the advantage of PRECIS was lost. Consideration must also be given to the potential advantages of controlled vocabulary, over and above basic retrieval performance measures
Theme
Verbale Doksprachen im Online-Retrieval
Kataloganreicherung
Object
LCSH
PRECIS

Similar documents (author)

  1. Cousins, S.A.: In their own words : an examination of catalogue users' subject queries (1992) 5.81
    5.814733 = sum of:
      5.814733 = weight(author_txt:cousins in 2620) [ClassicSimilarity], result of:
        5.814733 = fieldWeight in 2620, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.303573 = idf(docFreq=10, maxDocs=44421)
          0.625 = fieldNorm(doc=2620)
    
  2. Cousins, S.A.: In their own words : an examination of catalogue subject queries (1992) 5.81
    5.814733 = sum of:
      5.814733 = weight(author_txt:cousins in 3730) [ClassicSimilarity], result of:
        5.814733 = fieldWeight in 3730, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.303573 = idf(docFreq=10, maxDocs=44421)
          0.625 = fieldNorm(doc=3730)
    
  3. Cousins, G.: Professional indexing in Australia : first steps towards accreditation (1993) 5.81
    5.814733 = sum of:
      5.814733 = weight(author_txt:cousins in 7650) [ClassicSimilarity], result of:
        5.814733 = fieldWeight in 7650, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.303573 = idf(docFreq=10, maxDocs=44421)
          0.625 = fieldNorm(doc=7650)
    
  4. Cousins, S.: COPAC: new research library union catalogue (1997) 5.81
    5.814733 = sum of:
      5.814733 = weight(author_txt:cousins in 664) [ClassicSimilarity], result of:
        5.814733 = fieldWeight in 664, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.303573 = idf(docFreq=10, maxDocs=44421)
          0.625 = fieldNorm(doc=664)
    
  5. Cousins, S.A.: Duplicate detection and record consolidation in large bibliographic databases : the COPAC database experience (1998) 5.81
    5.814733 = sum of:
      5.814733 = weight(author_txt:cousins in 3833) [ClassicSimilarity], result of:
        5.814733 = fieldWeight in 3833, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.303573 = idf(docFreq=10, maxDocs=44421)
          0.625 = fieldNorm(doc=3833)
    

Similar documents (content)

  1. Austin, D.; Digger, J.A.: PRECIS: The Preserved Context Index System (1985) 0.41
    0.40743777 = sum of:
      0.40743777 = product of:
        1.6976575 = sum of:
          0.019697513 = weight(abstract_txt:were in 4652) [ClassicSimilarity], result of:
            0.019697513 = score(doc=4652,freq=4.0), product of:
              0.068746895 = queryWeight, product of:
                1.0280861 = boost
                3.6674848 = idf(docFreq=3083, maxDocs=44421)
                0.018232882 = queryNorm
              0.28652224 = fieldWeight in 4652, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.6674848 = idf(docFreq=3083, maxDocs=44421)
                0.0390625 = fieldNorm(doc=4652)
          0.026402632 = weight(abstract_txt:terms in 4652) [ClassicSimilarity], result of:
            0.026402632 = score(doc=4652,freq=4.0), product of:
              0.08357511 = queryWeight, product of:
                1.1335518 = boost
                4.043712 = idf(docFreq=2116, maxDocs=44421)
                0.018232882 = queryNorm
              0.31591502 = fieldWeight in 4652, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.043712 = idf(docFreq=2116, maxDocs=44421)
                0.0390625 = fieldNorm(doc=4652)
          0.017901098 = weight(abstract_txt:subject in 4652) [ClassicSimilarity], result of:
            0.017901098 = score(doc=4652,freq=1.0), product of:
              0.117205776 = queryWeight, product of:
                1.6440804 = boost
                3.9099448 = idf(docFreq=2419, maxDocs=44421)
                0.018232882 = queryNorm
              0.15273222 = fieldWeight in 4652, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9099448 = idf(docFreq=2419, maxDocs=44421)
                0.0390625 = fieldNorm(doc=4652)
          0.07860329 = weight(abstract_txt:index in 4652) [ClassicSimilarity], result of:
            0.07860329 = score(doc=4652,freq=6.0), product of:
              0.17295745 = queryWeight, product of:
                1.9971845 = boost
                4.7496953 = idf(docFreq=1044, maxDocs=44421)
                0.018232882 = queryNorm
              0.45446604 = fieldWeight in 4652, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                4.7496953 = idf(docFreq=1044, maxDocs=44421)
                0.0390625 = fieldNorm(doc=4652)
          0.036218733 = weight(abstract_txt:language in 4652) [ClassicSimilarity], result of:
            0.036218733 = score(doc=4652,freq=1.0), product of:
              0.22229737 = queryWeight, product of:
                2.923076 = boost
                4.1709876 = idf(docFreq=1863, maxDocs=44421)
                0.018232882 = queryNorm
              0.1629292 = fieldWeight in 4652, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1709876 = idf(docFreq=1863, maxDocs=44421)
                0.0390625 = fieldNorm(doc=4652)
          1.5188342 = weight(title_txt:precis in 4652) [ClassicSimilarity], result of:
            1.5188342 = score(doc=4652,freq=1.0), product of:
              0.5514442 = queryWeight, product of:
                4.1178327 = boost
                7.344759 = idf(docFreq=77, maxDocs=44421)
                0.018232882 = queryNorm
              2.7542846 = fieldWeight in 4652, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.344759 = idf(docFreq=77, maxDocs=44421)
                0.375 = fieldNorm(doc=4652)
        0.24 = coord(6/25)
    
  2. Austin, D.: PRECIS in a multilingual context : Pt.1: PRECIS: an overview (1976) 0.40
    0.4037579 = sum of:
      0.4037579 = product of:
        2.0187895 = sum of:
          0.026402632 = weight(abstract_txt:terms in 1983) [ClassicSimilarity], result of:
            0.026402632 = score(doc=1983,freq=1.0), product of:
              0.08357511 = queryWeight, product of:
                1.1335518 = boost
                4.043712 = idf(docFreq=2116, maxDocs=44421)
                0.018232882 = queryNorm
              0.31591502 = fieldWeight in 1983, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.043712 = idf(docFreq=2116, maxDocs=44421)
                0.078125 = fieldNorm(doc=1983)
          0.035802197 = weight(abstract_txt:subject in 1983) [ClassicSimilarity], result of:
            0.035802197 = score(doc=1983,freq=1.0), product of:
              0.117205776 = queryWeight, product of:
                1.6440804 = boost
                3.9099448 = idf(docFreq=2419, maxDocs=44421)
                0.018232882 = queryNorm
              0.30546445 = fieldWeight in 1983, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9099448 = idf(docFreq=2419, maxDocs=44421)
                0.078125 = fieldNorm(doc=1983)
          0.06417931 = weight(abstract_txt:index in 1983) [ClassicSimilarity], result of:
            0.06417931 = score(doc=1983,freq=1.0), product of:
              0.17295745 = queryWeight, product of:
                1.9971845 = boost
                4.7496953 = idf(docFreq=1044, maxDocs=44421)
                0.018232882 = queryNorm
              0.37106994 = fieldWeight in 1983, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7496953 = idf(docFreq=1044, maxDocs=44421)
                0.078125 = fieldNorm(doc=1983)
          0.10244205 = weight(abstract_txt:language in 1983) [ClassicSimilarity], result of:
            0.10244205 = score(doc=1983,freq=2.0), product of:
              0.22229737 = queryWeight, product of:
                2.923076 = boost
                4.1709876 = idf(docFreq=1863, maxDocs=44421)
                0.018232882 = queryNorm
              0.46083337 = fieldWeight in 1983, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.1709876 = idf(docFreq=1863, maxDocs=44421)
                0.078125 = fieldNorm(doc=1983)
          1.7899632 = weight(title_txt:precis in 1983) [ClassicSimilarity], result of:
            1.7899632 = score(doc=1983,freq=2.0), product of:
              0.5514442 = queryWeight, product of:
                4.1178327 = boost
                7.344759 = idf(docFreq=77, maxDocs=44421)
                0.018232882 = queryNorm
              3.2459555 = fieldWeight in 1983, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.344759 = idf(docFreq=77, maxDocs=44421)
                0.3125 = fieldNorm(doc=1983)
        0.2 = coord(5/25)
    
  3. Biswas, S.C.; Smith, F.: Efficiency and effectiveness of deep structure based indexing languages : PRECIS vs. DSIS (1991) 0.37
    0.37417012 = sum of:
      0.37417012 = product of:
        1.3363218 = sum of:
          0.018481841 = weight(abstract_txt:terms in 2186) [ClassicSimilarity], result of:
            0.018481841 = score(doc=2186,freq=1.0), product of:
              0.08357511 = queryWeight, product of:
                1.1335518 = boost
                4.043712 = idf(docFreq=2116, maxDocs=44421)
                0.018232882 = queryNorm
              0.2211405 = fieldWeight in 2186, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.043712 = idf(docFreq=2116, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2186)
          0.019595047 = weight(abstract_txt:documents in 2186) [ClassicSimilarity], result of:
            0.019595047 = score(doc=2186,freq=1.0), product of:
              0.086898245 = queryWeight, product of:
                1.1558685 = boost
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.018232882 = queryNorm
              0.22549418 = fieldWeight in 2186, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2186)
          0.043407854 = weight(abstract_txt:subject in 2186) [ClassicSimilarity], result of:
            0.043407854 = score(doc=2186,freq=3.0), product of:
              0.117205776 = queryWeight, product of:
                1.6440804 = boost
                3.9099448 = idf(docFreq=2419, maxDocs=44421)
                0.018232882 = queryNorm
              0.37035593 = fieldWeight in 2186, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.9099448 = idf(docFreq=2419, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2186)
          0.08985104 = weight(abstract_txt:index in 2186) [ClassicSimilarity], result of:
            0.08985104 = score(doc=2186,freq=4.0), product of:
              0.17295745 = queryWeight, product of:
                1.9971845 = boost
                4.7496953 = idf(docFreq=1044, maxDocs=44421)
                0.018232882 = queryNorm
              0.51949793 = fieldWeight in 2186, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.7496953 = idf(docFreq=1044, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2186)
          0.06460403 = weight(abstract_txt:vocabulary in 2186) [ClassicSimilarity], result of:
            0.06460403 = score(doc=2186,freq=1.0), product of:
              0.22035196 = queryWeight, product of:
                2.2542756 = boost
                5.3611083 = idf(docFreq=566, maxDocs=44421)
                0.018232882 = queryNorm
              0.29318562 = fieldWeight in 2186, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.3611083 = idf(docFreq=566, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2186)
          0.08782576 = weight(abstract_txt:language in 2186) [ClassicSimilarity], result of:
            0.08782576 = score(doc=2186,freq=3.0), product of:
              0.22229737 = queryWeight, product of:
                2.923076 = boost
                4.1709876 = idf(docFreq=1863, maxDocs=44421)
                0.018232882 = queryNorm
              0.39508232 = fieldWeight in 2186, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.1709876 = idf(docFreq=1863, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2186)
          1.0125562 = weight(title_txt:precis in 2186) [ClassicSimilarity], result of:
            1.0125562 = score(doc=2186,freq=1.0), product of:
              0.5514442 = queryWeight, product of:
                4.1178327 = boost
                7.344759 = idf(docFreq=77, maxDocs=44421)
                0.018232882 = queryNorm
              1.8361897 = fieldWeight in 2186, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.344759 = idf(docFreq=77, maxDocs=44421)
                0.25 = fieldNorm(doc=2186)
        0.28 = coord(7/25)
    
  4. Austin, D.: PRECIS (2009) 0.33
    0.331279 = sum of:
      0.331279 = product of:
        4.140988 = sum of:
          0.090763256 = weight(abstract_txt:index in 1985) [ClassicSimilarity], result of:
            0.090763256 = score(doc=1985,freq=2.0), product of:
              0.17295745 = queryWeight, product of:
                1.9971845 = boost
                4.7496953 = idf(docFreq=1044, maxDocs=44421)
                0.018232882 = queryNorm
              0.52477217 = fieldWeight in 1985, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.7496953 = idf(docFreq=1044, maxDocs=44421)
                0.078125 = fieldNorm(doc=1985)
          4.050225 = weight(title_txt:precis in 1985) [ClassicSimilarity], result of:
            4.050225 = score(doc=1985,freq=1.0), product of:
              0.5514442 = queryWeight, product of:
                4.1178327 = boost
                7.344759 = idf(docFreq=77, maxDocs=44421)
                0.018232882 = queryNorm
              7.344759 = fieldWeight in 1985, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.344759 = idf(docFreq=77, maxDocs=44421)
                1.0 = fieldNorm(doc=1985)
        0.08 = coord(2/25)
    
  5. Weintraub, D.K.: ¬An extended review of PRECIS (1979) 0.33
    0.32606402 = sum of:
      0.32606402 = product of:
        2.0379002 = sum of:
          0.1012639 = weight(abstract_txt:subject in 1196) [ClassicSimilarity], result of:
            0.1012639 = score(doc=1196,freq=8.0), product of:
              0.117205776 = queryWeight, product of:
                1.6440804 = boost
                3.9099448 = idf(docFreq=2419, maxDocs=44421)
                0.018232882 = queryNorm
              0.86398387 = fieldWeight in 1196, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                3.9099448 = idf(docFreq=2419, maxDocs=44421)
                0.078125 = fieldNorm(doc=1196)
          0.06417931 = weight(abstract_txt:index in 1196) [ClassicSimilarity], result of:
            0.06417931 = score(doc=1196,freq=1.0), product of:
              0.17295745 = queryWeight, product of:
                1.9971845 = boost
                4.7496953 = idf(docFreq=1044, maxDocs=44421)
                0.018232882 = queryNorm
              0.37106994 = fieldWeight in 1196, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7496953 = idf(docFreq=1044, maxDocs=44421)
                0.078125 = fieldNorm(doc=1196)
          0.10048367 = weight(abstract_txt:represent in 1196) [ClassicSimilarity], result of:
            0.10048367 = score(doc=1196,freq=1.0), product of:
              0.23320591 = queryWeight, product of:
                2.319094 = boost
                5.515259 = idf(docFreq=485, maxDocs=44421)
                0.018232882 = queryNorm
              0.4308796 = fieldWeight in 1196, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.515259 = idf(docFreq=485, maxDocs=44421)
                0.078125 = fieldNorm(doc=1196)
          1.7719733 = weight(title_txt:precis in 1196) [ClassicSimilarity], result of:
            1.7719733 = score(doc=1196,freq=1.0), product of:
              0.5514442 = queryWeight, product of:
                4.1178327 = boost
                7.344759 = idf(docFreq=77, maxDocs=44421)
                0.018232882 = queryNorm
              3.2133322 = fieldWeight in 1196, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.344759 = idf(docFreq=77, maxDocs=44421)
                0.4375 = fieldNorm(doc=1196)
        0.16 = coord(4/25)