Document (#19103)

Author
Cunningham, S.J.
Title
Approximating document descriptors : what to do when a catalog isn't available
Source
Electronic library and visual information research: Proceedings of the 4th ELVIRA Conference (ELVIRA 4), Electronic Library and Visual Information Research, De Montfort University, Milton Keynes, May 1997. Ed. by C. Davies and A. Ramsden
Imprint
London : Aslib
Year
1997
Pages
pp. 125-131
Abstract
The New Zealand Computer Science Technical Reports collection provides a central index to over 32,000 working papers distributed in archives around the world. The collection is not formally catalogued, and cataloguing information is available for only a minority of the documents. However, it is possible to access and index the full text of the documents, not simply the title and abstract, as is common in bibliographic databases. Techniques for using this expanded keyword access to the full text to create 'approximate' document descriptions are being investigated, so that the user can carry out searches similar to (although not as precise as) those supported by formally catalogued systems.

Similar documents (author)

  1. Cunningham, E.R.: Classification for medical literature (1946) 5.47
    5.4731426 = sum of:
      5.4731426 = weight(author_txt:cunningham in 3560) [ClassicSimilarity], result of:
        5.4731426 = fieldWeight in 3560, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.757029 = idf(docFreq=18, maxDocs=44421)
          0.625 = fieldNorm(doc=3560)
    
  2. Cunningham, M.: Document imaging : present and future (1994) 5.47
    5.4731426 = sum of:
      5.4731426 = weight(author_txt:cunningham in 10) [ClassicSimilarity], result of:
        5.4731426 = fieldWeight in 10, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.757029 = idf(docFreq=18, maxDocs=44421)
          0.625 = fieldNorm(doc=10)
    
  3. Cunningham, J.: Getting the most from Alta Vista (1996) 5.47
    5.4731426 = sum of:
      5.4731426 = weight(author_txt:cunningham in 768) [ClassicSimilarity], result of:
        5.4731426 = fieldWeight in 768, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.757029 = idf(docFreq=18, maxDocs=44421)
          0.625 = fieldNorm(doc=768)
    
  4. Cunningham, A.: A new direction for the National Bibliography (1997) 5.47
    5.4731426 = sum of:
      5.4731426 = weight(author_txt:cunningham in 2617) [ClassicSimilarity], result of:
        5.4731426 = fieldWeight in 2617, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.757029 = idf(docFreq=18, maxDocs=44421)
          0.625 = fieldNorm(doc=2617)
    
  5. Cunningham, S.: Hybrid WWW and CD-ROM systems (1998) 5.47
    5.4731426 = sum of:
      5.4731426 = weight(author_txt:cunningham in 6220) [ClassicSimilarity], result of:
        5.4731426 = fieldWeight in 6220, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.757029 = idf(docFreq=18, maxDocs=44421)
          0.625 = fieldNorm(doc=6220)
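
The author-match scores above all follow the same arithmetic: fieldWeight = tf × idf × fieldNorm. The sketch below recomputes that value from the numbers shown in the explain output. It assumes the standard Lucene ClassicSimilarity definitions, tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)); the function names are illustrative and are not part of any library. The result agrees with the listed 5.4731426 only up to rounding, because Lucene computes in single precision.

```python
import math


def classic_idf(doc_freq: int, max_docs: int) -> float:
    """Inverse document frequency as defined by ClassicSimilarity (assumed)."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def classic_tf(freq: float) -> float:
    """Term-frequency factor: square root of the raw frequency (assumed)."""
    return math.sqrt(freq)


def field_weight(freq: float, doc_freq: int, max_docs: int, field_norm: float) -> float:
    """fieldWeight = tf * idf * fieldNorm, as in the explain output."""
    return classic_tf(freq) * classic_idf(doc_freq, max_docs) * field_norm


# Values copied from the explain output for author_txt:cunningham
author_score = field_weight(freq=1.0, doc_freq=18, max_docs=44421, field_norm=0.625)
print(f"{author_score:.4f}")
```

Because every one of the five records contains the surname exactly once and has the same fieldNorm, they all receive the same score.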
    

Similar documents (content)

  1. Jianchao, X.; Ming, H.; Milin, S.: On indexing descriptors for document archive (1998) 0.16
    0.15812218 = sum of:
      0.15812218 = product of:
        0.65884244 = sum of:
          0.09825533 = weight(abstract_txt:keyword in 4567) [ClassicSimilarity], result of:
            0.09825533 = score(doc=4567,freq=1.0), product of:
              0.13017169 = queryWeight, product of:
                1.0403482 = boost
                6.038507 = idf(docFreq=287, maxDocs=44421)
                0.020720882 = queryNorm
              0.7548134 = fieldWeight in 4567, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.038507 = idf(docFreq=287, maxDocs=44421)
                0.125 = fieldNorm(doc=4567)
          0.22878823 = weight(abstract_txt:descriptors in 4567) [ClassicSimilarity], result of:
            0.22878823 = score(doc=4567,freq=3.0), product of:
              0.1585603 = queryWeight, product of:
                1.1482004 = boost
                6.664515 = idf(docFreq=153, maxDocs=44421)
                0.020720882 = queryNorm
              1.4429098 = fieldWeight in 4567, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.664515 = idf(docFreq=153, maxDocs=44421)
                0.125 = fieldNorm(doc=4567)
          0.058887873 = weight(abstract_txt:text in 4567) [ClassicSimilarity], result of:
            0.058887873 = score(doc=4567,freq=1.0), product of:
              0.1165842 = queryWeight, product of:
                1.392372 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.020720882 = queryNorm
              0.50511026 = fieldWeight in 4567, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.125 = fieldNorm(doc=4567)
          0.070669 = weight(abstract_txt:document in 4567) [ClassicSimilarity], result of:
            0.070669 = score(doc=4567,freq=1.0), product of:
              0.13165633 = queryWeight, product of:
                1.4796408 = boost
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.020720882 = queryNorm
              0.53676873 = fieldWeight in 4567, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.125 = fieldNorm(doc=4567)
          0.0956301 = weight(abstract_txt:index in 4567) [ClassicSimilarity], result of:
            0.0956301 = score(doc=4567,freq=1.0), product of:
              0.16107155 = queryWeight, product of:
                1.6366087 = boost
                4.7496953 = idf(docFreq=1044, maxDocs=44421)
                0.020720882 = queryNorm
              0.5937119 = fieldWeight in 4567, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7496953 = idf(docFreq=1044, maxDocs=44421)
                0.125 = fieldNorm(doc=4567)
          0.1066119 = weight(abstract_txt:full in 4567) [ClassicSimilarity], result of:
            0.1066119 = score(doc=4567,freq=1.0), product of:
              0.17317808 = queryWeight, product of:
                1.6970001 = boost
                4.9249606 = idf(docFreq=876, maxDocs=44421)
                0.020720882 = queryNorm
              0.6156201 = fieldWeight in 4567, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.9249606 = idf(docFreq=876, maxDocs=44421)
                0.125 = fieldNorm(doc=4567)
        0.24 = coord(6/25)
    
  2. Preston, L.A.; Ebbs, C.M.; Luther, J.: 'Full text' access evaluation : are we getting the real thing? (1998) 0.13
    0.13122451 = sum of:
      0.13122451 = product of:
        0.54676884 = sum of:
          0.07523557 = weight(abstract_txt:access in 3695) [ClassicSimilarity], result of:
            0.07523557 = score(doc=3695,freq=3.0), product of:
              0.09517674 = queryWeight, product of:
                1.2580585 = boost
                3.6510832 = idf(docFreq=3134, maxDocs=44421)
                0.020720882 = queryNorm
              0.7904827 = fieldWeight in 3695, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.6510832 = idf(docFreq=3134, maxDocs=44421)
                0.125 = fieldNorm(doc=3695)
          0.08328003 = weight(abstract_txt:text in 3695) [ClassicSimilarity], result of:
            0.08328003 = score(doc=3695,freq=2.0), product of:
              0.1165842 = queryWeight, product of:
                1.392372 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.020720882 = queryNorm
              0.7143338 = fieldWeight in 3695, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.125 = fieldNorm(doc=3695)
          0.070669 = weight(abstract_txt:document in 3695) [ClassicSimilarity], result of:
            0.070669 = score(doc=3695,freq=1.0), product of:
              0.13165633 = queryWeight, product of:
                1.4796408 = boost
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.020720882 = queryNorm
              0.53676873 = fieldWeight in 3695, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.125 = fieldNorm(doc=3695)
          0.07118218 = weight(abstract_txt:available in 3695) [ClassicSimilarity], result of:
            0.07118218 = score(doc=3695,freq=1.0), product of:
              0.13229293 = queryWeight, product of:
                1.4832138 = boost
                4.304519 = idf(docFreq=1630, maxDocs=44421)
                0.020720882 = queryNorm
              0.5380649 = fieldWeight in 3695, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.304519 = idf(docFreq=1630, maxDocs=44421)
                0.125 = fieldNorm(doc=3695)
          0.0956301 = weight(abstract_txt:index in 3695) [ClassicSimilarity], result of:
            0.0956301 = score(doc=3695,freq=1.0), product of:
              0.16107155 = queryWeight, product of:
                1.6366087 = boost
                4.7496953 = idf(docFreq=1044, maxDocs=44421)
                0.020720882 = queryNorm
              0.5937119 = fieldWeight in 3695, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7496953 = idf(docFreq=1044, maxDocs=44421)
                0.125 = fieldNorm(doc=3695)
          0.15077199 = weight(abstract_txt:full in 3695) [ClassicSimilarity], result of:
            0.15077199 = score(doc=3695,freq=2.0), product of:
              0.17317808 = queryWeight, product of:
                1.6970001 = boost
                4.9249606 = idf(docFreq=876, maxDocs=44421)
                0.020720882 = queryNorm
              0.8706182 = fieldWeight in 3695, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.9249606 = idf(docFreq=876, maxDocs=44421)
                0.125 = fieldNorm(doc=3695)
        0.24 = coord(6/25)
    
  3. Veenema, F.: To index or not to index (1996) 0.11
    0.107803166 = sum of:
      0.107803166 = product of:
        0.44917986 = sum of:
          0.05152689 = weight(abstract_txt:text in 316) [ClassicSimilarity], result of:
            0.05152689 = score(doc=316,freq=1.0), product of:
              0.1165842 = queryWeight, product of:
                1.392372 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.020720882 = queryNorm
              0.44197148 = fieldWeight in 316, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.109375 = fieldNorm(doc=316)
          0.054745346 = weight(abstract_txt:documents in 316) [ClassicSimilarity], result of:
            0.054745346 = score(doc=316,freq=1.0), product of:
              0.12138971 = queryWeight, product of:
                1.4207785 = boost
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.020720882 = queryNorm
              0.45098835 = fieldWeight in 316, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.109375 = fieldNorm(doc=316)
          0.08744843 = weight(abstract_txt:document in 316) [ClassicSimilarity], result of:
            0.08744843 = score(doc=316,freq=2.0), product of:
              0.13165633 = queryWeight, product of:
                1.4796408 = boost
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.020720882 = queryNorm
              0.6642174 = fieldWeight in 316, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.109375 = fieldNorm(doc=316)
          0.078497455 = weight(abstract_txt:collection in 316) [ClassicSimilarity], result of:
            0.078497455 = score(doc=316,freq=1.0), product of:
              0.15435503 = queryWeight, product of:
                1.6021229 = boost
                4.649612 = idf(docFreq=1154, maxDocs=44421)
                0.020720882 = queryNorm
              0.5085513 = fieldWeight in 316, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.649612 = idf(docFreq=1154, maxDocs=44421)
                0.109375 = fieldNorm(doc=316)
          0.08367634 = weight(abstract_txt:index in 316) [ClassicSimilarity], result of:
            0.08367634 = score(doc=316,freq=1.0), product of:
              0.16107155 = queryWeight, product of:
                1.6366087 = boost
                4.7496953 = idf(docFreq=1044, maxDocs=44421)
                0.020720882 = queryNorm
              0.51949793 = fieldWeight in 316, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7496953 = idf(docFreq=1044, maxDocs=44421)
                0.109375 = fieldNorm(doc=316)
          0.09328541 = weight(abstract_txt:full in 316) [ClassicSimilarity], result of:
            0.09328541 = score(doc=316,freq=1.0), product of:
              0.17317808 = queryWeight, product of:
                1.6970001 = boost
                4.9249606 = idf(docFreq=876, maxDocs=44421)
                0.020720882 = queryNorm
              0.53866756 = fieldWeight in 316, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.9249606 = idf(docFreq=876, maxDocs=44421)
                0.109375 = fieldNorm(doc=316)
        0.24 = coord(6/25)
    
  4. Lu, K.; Mao, J.; Li, G.: Toward effective automated weighted subject indexing : a comparison of different approaches in different environments (2018) 0.10
    0.10471269 = sum of:
      0.10471269 = product of:
        0.37397388 = sum of:
          0.07520929 = weight(abstract_txt:abstract in 292) [ClassicSimilarity], result of:
            0.07520929 = score(doc=292,freq=2.0), product of:
              0.13723665 = queryWeight, product of:
                1.0682071 = boost
                6.2002096 = idf(docFreq=244, maxDocs=44421)
                0.020720882 = queryNorm
              0.54802626 = fieldWeight in 292, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.2002096 = idf(docFreq=244, maxDocs=44421)
                0.0625 = fieldNorm(doc=292)
          0.09340239 = weight(abstract_txt:descriptors in 292) [ClassicSimilarity], result of:
            0.09340239 = score(doc=292,freq=2.0), product of:
              0.1585603 = queryWeight, product of:
                1.1482004 = boost
                6.664515 = idf(docFreq=153, maxDocs=44421)
                0.020720882 = queryNorm
              0.58906543 = fieldWeight in 292, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.664515 = idf(docFreq=153, maxDocs=44421)
                0.0625 = fieldNorm(doc=292)
          0.021718638 = weight(abstract_txt:access in 292) [ClassicSimilarity], result of:
            0.021718638 = score(doc=292,freq=1.0), product of:
              0.09517674 = queryWeight, product of:
                1.2580585 = boost
                3.6510832 = idf(docFreq=3134, maxDocs=44421)
                0.020720882 = queryNorm
              0.2281927 = fieldWeight in 292, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6510832 = idf(docFreq=3134, maxDocs=44421)
                0.0625 = fieldNorm(doc=292)
          0.041640013 = weight(abstract_txt:text in 292) [ClassicSimilarity], result of:
            0.041640013 = score(doc=292,freq=2.0), product of:
              0.1165842 = queryWeight, product of:
                1.392372 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.020720882 = queryNorm
              0.3571669 = fieldWeight in 292, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.0625 = fieldNorm(doc=292)
          0.031283055 = weight(abstract_txt:documents in 292) [ClassicSimilarity], result of:
            0.031283055 = score(doc=292,freq=1.0), product of:
              0.12138971 = queryWeight, product of:
                1.4207785 = boost
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.020720882 = queryNorm
              0.25770763 = fieldWeight in 292, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.0625 = fieldNorm(doc=292)
          0.0353345 = weight(abstract_txt:document in 292) [ClassicSimilarity], result of:
            0.0353345 = score(doc=292,freq=1.0), product of:
              0.13165633 = queryWeight, product of:
                1.4796408 = boost
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.020720882 = queryNorm
              0.26838437 = fieldWeight in 292, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.0625 = fieldNorm(doc=292)
          0.075385995 = weight(abstract_txt:full in 292) [ClassicSimilarity], result of:
            0.075385995 = score(doc=292,freq=2.0), product of:
              0.17317808 = queryWeight, product of:
                1.6970001 = boost
                4.9249606 = idf(docFreq=876, maxDocs=44421)
                0.020720882 = queryNorm
              0.4353091 = fieldWeight in 292, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.9249606 = idf(docFreq=876, maxDocs=44421)
                0.0625 = fieldNorm(doc=292)
        0.28 = coord(7/25)
    
  5. Mischo, W.H.: Expanded subject access to reference collection materials (1979) 0.10
    0.09634874 = sum of:
      0.09634874 = product of:
        0.4817437 = sum of:
          0.11557957 = weight(abstract_txt:descriptors in 836) [ClassicSimilarity], result of:
            0.11557957 = score(doc=836,freq=1.0), product of:
              0.1585603 = queryWeight, product of:
                1.1482004 = boost
                6.664515 = idf(docFreq=153, maxDocs=44421)
                0.020720882 = queryNorm
              0.7289313 = fieldWeight in 836, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.664515 = idf(docFreq=153, maxDocs=44421)
                0.109375 = fieldNorm(doc=836)
          0.11557957 = weight(abstract_txt:expanded in 836) [ClassicSimilarity], result of:
            0.11557957 = score(doc=836,freq=1.0), product of:
              0.1585603 = queryWeight, product of:
                1.1482004 = boost
                6.664515 = idf(docFreq=153, maxDocs=44421)
                0.020720882 = queryNorm
              0.7289313 = fieldWeight in 836, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.664515 = idf(docFreq=153, maxDocs=44421)
                0.109375 = fieldNorm(doc=836)
          0.053750884 = weight(abstract_txt:access in 836) [ClassicSimilarity], result of:
            0.053750884 = score(doc=836,freq=2.0), product of:
              0.09517674 = queryWeight, product of:
                1.2580585 = boost
                3.6510832 = idf(docFreq=3134, maxDocs=44421)
                0.020720882 = queryNorm
              0.5647481 = fieldWeight in 836, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.6510832 = idf(docFreq=3134, maxDocs=44421)
                0.109375 = fieldNorm(doc=836)
          0.078497455 = weight(abstract_txt:collection in 836) [ClassicSimilarity], result of:
            0.078497455 = score(doc=836,freq=1.0), product of:
              0.15435503 = queryWeight, product of:
                1.6021229 = boost
                4.649612 = idf(docFreq=1154, maxDocs=44421)
                0.020720882 = queryNorm
              0.5085513 = fieldWeight in 836, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.649612 = idf(docFreq=1154, maxDocs=44421)
                0.109375 = fieldNorm(doc=836)
          0.11833621 = weight(abstract_txt:index in 836) [ClassicSimilarity], result of:
            0.11833621 = score(doc=836,freq=2.0), product of:
              0.16107155 = queryWeight, product of:
                1.6366087 = boost
                4.7496953 = idf(docFreq=1044, maxDocs=44421)
                0.020720882 = queryNorm
              0.734681 = fieldWeight in 836, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.7496953 = idf(docFreq=1044, maxDocs=44421)
                0.109375 = fieldNorm(doc=836)
        0.2 = coord(5/25)
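
The content-match scores combine more factors: each matching term contributes queryWeight × fieldWeight, where queryWeight = boost × idf × queryNorm, and the sum is then scaled by coord (matching terms divided by query terms). The sketch below recomputes the total for result 5 (Mischo, doc 836) from the values listed above. It assumes the same ClassicSimilarity formulas for tf and idf as Lucene documents them; the names `term_score` and `mischo_terms` are hypothetical, and the result matches the listed 0.09634874 only up to single-precision rounding.

```python
import math

MAX_DOCS = 44421          # maxDocs from the explain output
QUERY_NORM = 0.020720882  # queryNorm from the explain output


def classic_idf(doc_freq: int, max_docs: int = MAX_DOCS) -> float:
    """idf = 1 + ln(maxDocs / (docFreq + 1)) (assumed ClassicSimilarity form)."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(freq: float, doc_freq: int, boost: float, field_norm: float) -> float:
    """Score of one term: queryWeight * fieldWeight."""
    idf = classic_idf(doc_freq)
    query_weight = boost * idf * QUERY_NORM
    fld_weight = math.sqrt(freq) * idf * field_norm
    return query_weight * fld_weight


# (term, freq, docFreq, boost) copied from the explain tree for doc 836
mischo_terms = [
    ("descriptors", 1.0, 153, 1.1482004),
    ("expanded", 1.0, 153, 1.1482004),
    ("access", 2.0, 3134, 1.2580585),
    ("collection", 1.0, 1154, 1.6021229),
    ("index", 2.0, 1044, 1.6366087),
]
FIELD_NORM = 0.109375  # fieldNorm(doc=836)

raw_sum = sum(term_score(f, df, b, FIELD_NORM) for _, f, df, b in mischo_terms)
coord = len(mischo_terms) / 25  # 5 of the 25 query terms matched
total = raw_sum * coord
print(f"sum={raw_sum:.5f} coord={coord} total={total:.5f}")
```

The coord factor explains why result 4, with seven matching terms, can still rank below results with six: its smaller fieldNorm (0.0625) lowers every per-term contribution.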