Document (#6503)

Author
Paijmans, H.
Title
Comparing the document representation of two IR-systems : CLARIT and TOPIC
Source
Journal of the American Society for Information Science. 44(1993) no.7, p.383-392
Year
1993
Abstract
Discusses the TOPIC and CLARIT information retrieval systems in terms of assigned versus derived and precoordinate versus postcoordinate indexing. Compares the document representation of the two systems. Reports on a test done on a small sample of Wall Street Journal articles. The positive results found for CLARIT in an earlier test on medical documents were not observed in this general database.
Theme
Automatic indexing
Object
CLARIT
TOPIC

Similar documents (content)

  1. O'Donnell, R.; Smeaton, A.F.: A linguistic approach to information retrieval (1996) 0.19
    0.18996634 = sum of:
      0.18996634 = product of:
        0.6784512 = sum of:
          0.04388975 = weight(abstract_txt:reports in 3575) [ClassicSimilarity], result of:
            0.04388975 = score(doc=3575,freq=2.0), product of:
              0.08657727 = queryWeight, product of:
                1.0260972 = boost
                4.5883255 = idf(docFreq=1227, maxDocs=44421)
                0.018389128 = queryNorm
              0.5069431 = fieldWeight in 3575, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.5883255 = idf(docFreq=1227, maxDocs=44421)
                0.078125 = fieldNorm(doc=3575)
          0.043712277 = weight(abstract_txt:journal in 3575) [ClassicSimilarity], result of:
            0.043712277 = score(doc=3575,freq=1.0), product of:
              0.10878626 = queryWeight, product of:
                1.1502006 = boost
                5.14327 = idf(docFreq=704, maxDocs=44421)
                0.018389128 = queryNorm
              0.40181798 = fieldWeight in 3575, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.14327 = idf(docFreq=704, maxDocs=44421)
                0.078125 = fieldNorm(doc=3575)
          0.17947009 = weight(abstract_txt:street in 3575) [ClassicSimilarity], result of:
            0.17947009 = score(doc=3575,freq=1.0), product of:
              0.27893296 = queryWeight, product of:
                1.8417746 = boost
                8.235732 = idf(docFreq=31, maxDocs=44421)
                0.018389128 = queryNorm
              0.6434166 = fieldWeight in 3575, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.235732 = idf(docFreq=31, maxDocs=44421)
                0.078125 = fieldNorm(doc=3575)
          0.18155366 = weight(abstract_txt:wall in 3575) [ClassicSimilarity], result of:
            0.18155366 = score(doc=3575,freq=1.0), product of:
              0.28108767 = queryWeight, product of:
                1.8488747 = boost
                8.267481 = idf(docFreq=30, maxDocs=44421)
                0.018389128 = queryNorm
              0.6458969 = fieldWeight in 3575, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.267481 = idf(docFreq=30, maxDocs=44421)
                0.078125 = fieldNorm(doc=3575)
          0.10862769 = weight(abstract_txt:representation in 3575) [ClassicSimilarity], result of:
            0.10862769 = score(doc=3575,freq=2.0), product of:
              0.19958696 = queryWeight, product of:
                2.2032695 = boost
                4.9261017 = idf(docFreq=875, maxDocs=44421)
                0.018389128 = queryNorm
              0.54426247 = fieldWeight in 3575, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.9261017 = idf(docFreq=875, maxDocs=44421)
                0.078125 = fieldNorm(doc=3575)
          0.082940035 = weight(abstract_txt:topic in 3575) [ClassicSimilarity], result of:
            0.082940035 = score(doc=3575,freq=1.0), product of:
              0.21006703 = queryWeight, product of:
                2.260375 = boost
                5.053779 = idf(docFreq=770, maxDocs=44421)
                0.018389128 = queryNorm
              0.3948265 = fieldWeight in 3575, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.053779 = idf(docFreq=770, maxDocs=44421)
                0.078125 = fieldNorm(doc=3575)
          0.03825769 = weight(abstract_txt:systems in 3575) [ClassicSimilarity], result of:
            0.03825769 = score(doc=3575,freq=1.0), product of:
              0.1435571 = queryWeight, product of:
                2.2885454 = boost
                3.411175 = idf(docFreq=3984, maxDocs=44421)
                0.018389128 = queryNorm
              0.26649806 = fieldWeight in 3575, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.411175 = idf(docFreq=3984, maxDocs=44421)
                0.078125 = fieldNorm(doc=3575)
        0.28 = coord(7/25)
    
  2. Foskett, A.C.: The subject approach to information (1996) 0.19
    0.18731011 = sum of:
      0.18731011 = product of:
        1.1706883 = sum of:
          0.08543083 = weight(abstract_txt:derived in 1749) [ClassicSimilarity], result of:
            0.08543083 = score(doc=1749,freq=1.0), product of:
              0.13588229 = queryWeight, product of:
                1.2854879 = boost
                5.7482243 = idf(docFreq=384, maxDocs=44421)
                0.018389128 = queryNorm
              0.62871206 = fieldWeight in 1749, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7482243 = idf(docFreq=384, maxDocs=44421)
                0.109375 = fieldNorm(doc=1749)
          0.3734584 = weight(abstract_txt:postcoordinate in 1749) [ClassicSimilarity], result of:
            0.3734584 = score(doc=1749,freq=1.0), product of:
              0.36328536 = queryWeight, product of:
                2.1018925 = boost
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.018389128 = queryNorm
              1.0280029 = fieldWeight in 1749, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.109375 = fieldNorm(doc=1749)
          0.6190291 = weight(abstract_txt:precoordinate in 1749) [ClassicSimilarity], result of:
            0.6190291 = score(doc=1749,freq=2.0), product of:
              0.40384728 = queryWeight, product of:
                2.2161295 = boost
                9.909708 = idf(docFreq=5, maxDocs=44421)
                0.018389128 = queryNorm
              1.5328298 = fieldWeight in 1749, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.909708 = idf(docFreq=5, maxDocs=44421)
                0.109375 = fieldNorm(doc=1749)
          0.092769966 = weight(abstract_txt:systems in 1749) [ClassicSimilarity], result of:
            0.092769966 = score(doc=1749,freq=3.0), product of:
              0.1435571 = queryWeight, product of:
                2.2885454 = boost
                3.411175 = idf(docFreq=3984, maxDocs=44421)
                0.018389128 = queryNorm
              0.6462234 = fieldWeight in 1749, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.411175 = idf(docFreq=3984, maxDocs=44421)
                0.109375 = fieldNorm(doc=1749)
        0.16 = coord(4/25)
    
  3. Ruocco, A.S.; Frieder, O.: Clustering and classification of large document bases in a parallel environment (1997) 0.17
    0.17113297 = sum of:
      0.17113297 = product of:
        0.71305406 = sum of:
          0.042101584 = weight(abstract_txt:articles in 2661) [ClassicSimilarity], result of:
            0.042101584 = score(doc=2661,freq=1.0), product of:
              0.09395429 = queryWeight, product of:
                1.0689192 = boost
                4.7798095 = idf(docFreq=1013, maxDocs=44421)
                0.018389128 = queryNorm
              0.44810712 = fieldWeight in 2661, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7798095 = idf(docFreq=1013, maxDocs=44421)
                0.09375 = fieldNorm(doc=2661)
          0.05245473 = weight(abstract_txt:journal in 2661) [ClassicSimilarity], result of:
            0.05245473 = score(doc=2661,freq=1.0), product of:
              0.10878626 = queryWeight, product of:
                1.1502006 = boost
                5.14327 = idf(docFreq=704, maxDocs=44421)
                0.018389128 = queryNorm
              0.48218155 = fieldWeight in 2661, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.14327 = idf(docFreq=704, maxDocs=44421)
                0.09375 = fieldNorm(doc=2661)
          0.2153641 = weight(abstract_txt:street in 2661) [ClassicSimilarity], result of:
            0.2153641 = score(doc=2661,freq=1.0), product of:
              0.27893296 = queryWeight, product of:
                1.8417746 = boost
                8.235732 = idf(docFreq=31, maxDocs=44421)
                0.018389128 = queryNorm
              0.77209985 = fieldWeight in 2661, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.235732 = idf(docFreq=31, maxDocs=44421)
                0.09375 = fieldNorm(doc=2661)
          0.2178644 = weight(abstract_txt:wall in 2661) [ClassicSimilarity], result of:
            0.2178644 = score(doc=2661,freq=1.0), product of:
              0.28108767 = queryWeight, product of:
                1.8488747 = boost
                8.267481 = idf(docFreq=30, maxDocs=44421)
                0.018389128 = queryNorm
              0.7750763 = fieldWeight in 2661, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.267481 = idf(docFreq=30, maxDocs=44421)
                0.09375 = fieldNorm(doc=2661)
          0.10575208 = weight(abstract_txt:document in 2661) [ClassicSimilarity], result of:
            0.10575208 = score(doc=2661,freq=3.0), product of:
              0.15166306 = queryWeight, product of:
                1.9206201 = boost
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.018389128 = queryNorm
              0.697283 = fieldWeight in 2661, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.09375 = fieldNorm(doc=2661)
          0.07951711 = weight(abstract_txt:systems in 2661) [ClassicSimilarity], result of:
            0.07951711 = score(doc=2661,freq=3.0), product of:
              0.1435571 = queryWeight, product of:
                2.2885454 = boost
                3.411175 = idf(docFreq=3984, maxDocs=44421)
                0.018389128 = queryNorm
              0.5539058 = fieldWeight in 2661, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.411175 = idf(docFreq=3984, maxDocs=44421)
                0.09375 = fieldNorm(doc=2661)
        0.24 = coord(6/25)
    
  4. Bjorner, S.: Let your fingers do the searching : call for the fax (1993) 0.14
    0.14074263 = sum of:
      0.14074263 = product of:
        0.8796414 = sum of:
          0.070169315 = weight(abstract_txt:articles in 6518) [ClassicSimilarity], result of:
            0.070169315 = score(doc=6518,freq=1.0), product of:
              0.09395429 = queryWeight, product of:
                1.0689192 = boost
                4.7798095 = idf(docFreq=1013, maxDocs=44421)
                0.018389128 = queryNorm
              0.74684525 = fieldWeight in 6518, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7798095 = idf(docFreq=1013, maxDocs=44421)
                0.15625 = fieldNorm(doc=6518)
          0.087424554 = weight(abstract_txt:journal in 6518) [ClassicSimilarity], result of:
            0.087424554 = score(doc=6518,freq=1.0), product of:
              0.10878626 = queryWeight, product of:
                1.1502006 = boost
                5.14327 = idf(docFreq=704, maxDocs=44421)
                0.018389128 = queryNorm
              0.80363595 = fieldWeight in 6518, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.14327 = idf(docFreq=704, maxDocs=44421)
                0.15625 = fieldNorm(doc=6518)
          0.35894018 = weight(abstract_txt:street in 6518) [ClassicSimilarity], result of:
            0.35894018 = score(doc=6518,freq=1.0), product of:
              0.27893296 = queryWeight, product of:
                1.8417746 = boost
                8.235732 = idf(docFreq=31, maxDocs=44421)
                0.018389128 = queryNorm
              1.2868332 = fieldWeight in 6518, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.235732 = idf(docFreq=31, maxDocs=44421)
                0.15625 = fieldNorm(doc=6518)
          0.36310732 = weight(abstract_txt:wall in 6518) [ClassicSimilarity], result of:
            0.36310732 = score(doc=6518,freq=1.0), product of:
              0.28108767 = queryWeight, product of:
                1.8488747 = boost
                8.267481 = idf(docFreq=30, maxDocs=44421)
                0.018389128 = queryNorm
              1.2917938 = fieldWeight in 6518, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.267481 = idf(docFreq=30, maxDocs=44421)
                0.15625 = fieldNorm(doc=6518)
        0.16 = coord(4/25)
    
  5. Névéol, A.; Deserno, T.M.; Darmoni, S.J.; Güld, M.O.; Aronson, A.R.: Natural language processing versus content-based image analysis for medical document retrieval (2009) 0.14
    0.13531214 = sum of:
      0.13531214 = product of:
        0.48325765 = sum of:
          0.0712819 = weight(abstract_txt:medical in 3702) [ClassicSimilarity], result of:
            0.0712819 = score(doc=3702,freq=2.0), product of:
              0.13881017 = queryWeight, product of:
                1.2992634 = boost
                5.8098235 = idf(docFreq=361, maxDocs=44421)
                0.018389128 = queryNorm
              0.5135207 = fieldWeight in 3702, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.8098235 = idf(docFreq=361, maxDocs=44421)
                0.0625 = fieldNorm(doc=3702)
          0.05698734 = weight(abstract_txt:comparing in 3702) [ClassicSimilarity], result of:
            0.05698734 = score(doc=3702,freq=1.0), product of:
              0.15064824 = queryWeight, product of:
                1.3535322 = boost
                6.0524936 = idf(docFreq=283, maxDocs=44421)
                0.018389128 = queryNorm
              0.37828085 = fieldWeight in 3702, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.0524936 = idf(docFreq=283, maxDocs=44421)
                0.0625 = fieldNorm(doc=3702)
          0.059406314 = weight(abstract_txt:assigned in 3702) [ClassicSimilarity], result of:
            0.059406314 = score(doc=3702,freq=1.0), product of:
              0.15488173 = queryWeight, product of:
                1.3724188 = boost
                6.136947 = idf(docFreq=260, maxDocs=44421)
                0.018389128 = queryNorm
              0.3835592 = fieldWeight in 3702, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.136947 = idf(docFreq=260, maxDocs=44421)
                0.0625 = fieldNorm(doc=3702)
          0.057564143 = weight(abstract_txt:document in 3702) [ClassicSimilarity], result of:
            0.057564143 = score(doc=3702,freq=2.0), product of:
              0.15166306 = queryWeight, product of:
                1.9206201 = boost
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.018389128 = queryNorm
              0.3795528 = fieldWeight in 3702, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.0625 = fieldNorm(doc=3702)
          0.06604716 = weight(abstract_txt:test in 3702) [ClassicSimilarity], result of:
            0.06604716 = score(doc=3702,freq=1.0), product of:
              0.2094231 = queryWeight, product of:
                2.256908 = boost
                5.046027 = idf(docFreq=776, maxDocs=44421)
                0.018389128 = queryNorm
              0.3153767 = fieldWeight in 3702, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.046027 = idf(docFreq=776, maxDocs=44421)
                0.0625 = fieldNorm(doc=3702)
          0.03060615 = weight(abstract_txt:systems in 3702) [ClassicSimilarity], result of:
            0.03060615 = score(doc=3702,freq=1.0), product of:
              0.1435571 = queryWeight, product of:
                2.2885454 = boost
                3.411175 = idf(docFreq=3984, maxDocs=44421)
                0.018389128 = queryNorm
              0.21319844 = fieldWeight in 3702, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.411175 = idf(docFreq=3984, maxDocs=44421)
                0.0625 = fieldNorm(doc=3702)
          0.14136465 = weight(abstract_txt:versus in 3702) [ClassicSimilarity], result of:
            0.14136465 = score(doc=3702,freq=1.0), product of:
              0.34781557 = queryWeight, product of:
                2.908547 = boost
                6.5029707 = idf(docFreq=180, maxDocs=44421)
                0.018389128 = queryNorm
              0.40643567 = fieldWeight in 3702, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5029707 = idf(docFreq=180, maxDocs=44421)
                0.0625 = fieldNorm(doc=3702)
        0.28 = coord(7/25)
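
The score breakdowns above all follow the same arithmetic, which can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the formulas (tf as the square root of the term frequency, idf as 1 + ln(maxDocs / (docFreq + 1)), and coord as matched terms over query terms) are inferred from the numbers in the listing, the boost, queryNorm, and fieldNorm values are taken as given rather than derived, and the names `idf`, `tf`, `term_score`, `doc_score`, and `FOSKETT_TERMS` are hypothetical helpers, not part of any library. The sample data is copied from hit 2 (doc 1749).

```python
import math

MAX_DOCS = 44421          # maxDocs, as reported in every idf line
QUERY_NORM = 0.018389128  # queryNorm, as reported in the listing
QUERY_TERMS = 25          # denominator of every coord(n/25) line


def idf(doc_freq: int, max_docs: int = MAX_DOCS) -> float:
    """Inverse document frequency as it appears in the listing (assumed formula)."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def tf(freq: float) -> float:
    """Term frequency factor: square root of the raw frequency."""
    return math.sqrt(freq)


def term_score(freq: float, doc_freq: int, boost: float, field_norm: float) -> float:
    """Score of one matching term: queryWeight * fieldWeight."""
    term_idf = idf(doc_freq)
    query_weight = boost * term_idf * QUERY_NORM
    field_weight = tf(freq) * term_idf * field_norm
    return query_weight * field_weight


def doc_score(term_scores: list[float], query_terms: int = QUERY_TERMS) -> float:
    """Sum of the term scores, scaled by coord = matched terms / query terms."""
    coord = len(term_scores) / query_terms
    return sum(term_scores) * coord


# Values copied from the breakdown of hit 2 (doc 1749, fieldNorm 0.109375).
FOSKETT_TERMS = [
    {"freq": 1.0, "doc_freq": 384, "boost": 1.2854879, "field_norm": 0.109375},   # derived
    {"freq": 1.0, "doc_freq": 9, "boost": 2.1018925, "field_norm": 0.109375},     # postcoordinate
    {"freq": 2.0, "doc_freq": 5, "boost": 2.2161295, "field_norm": 0.109375},     # precoordinate
    {"freq": 3.0, "doc_freq": 3984, "boost": 2.2885454, "field_norm": 0.109375},  # systems
]

if __name__ == "__main__":
    scores = [term_score(**term) for term in FOSKETT_TERMS]
    print(scores)
    print(doc_score(scores))
```

Run against the values for hit 2, the sketch should land close to the 0.18731011 shown in the listing; small differences in the last digits are expected because the listing was produced with single-precision arithmetic.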