Document (#14071)

Author
Rijsbergen, C.J. van
Title
¬A test for the separation of relevant and non-relevant documents in experimental retrieval collections
Source
Journal of documentation. 29(1973) no.3, p.251-257
Year
1973
Abstract
Many retrieval experiments are intended to discover ways of improving performance, taking the results obtained with some particular technique as a baseline. The fact that substantial alterations to a system often have little or no effect on particular collections is puzzling. This may be due to the initially poor separation of relevant and non-relevant documents. The paper presents a procedure for characterizing this separation for a collection, which can be used to show whether proposed modifications of the base system are likely to be useful.
Theme
Retrieval studies

Similar documents (author)

  1. Van Rijsbergen, C.J. -> Rijsbergen, C.J. van: 4.46
    4.457759 = sum of:
      4.457759 = weight(author_txt:rijsbergen in 4198) [ClassicSimilarity], result of:
        4.457759 = fieldWeight in 4198, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          8.405631 = idf(docFreq=26, maxDocs=44421)
          0.375 = fieldNorm(doc=4198)
    
  2. Rijsbergen, C.J. van: Foundations of evaluation (1974) 4.20
    4.2028155 = sum of:
      4.2028155 = weight(author_txt:rijsbergen in 1077) [ClassicSimilarity], result of:
        4.2028155 = fieldWeight in 1077, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.405631 = idf(docFreq=26, maxDocs=44421)
          0.5 = fieldNorm(doc=1077)
    
  3. Rijsbergen, C.J. van: Automatic classification in information retrieval (1978) 4.20
    4.2028155 = sum of:
      4.2028155 = weight(author_txt:rijsbergen in 2411) [ClassicSimilarity], result of:
        4.2028155 = fieldWeight in 2411, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.405631 = idf(docFreq=26, maxDocs=44421)
          0.5 = fieldNorm(doc=2411)
    
  4. Rijsbergen, C.J. van: ¬A fast hierarchic clustering algorithm (1970) 4.20
    4.2028155 = sum of:
      4.2028155 = weight(author_txt:rijsbergen in 3299) [ClassicSimilarity], result of:
        4.2028155 = fieldWeight in 3299, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.405631 = idf(docFreq=26, maxDocs=44421)
          0.5 = fieldNorm(doc=3299)
    
  5. Rijsbergen, C.J. van: Retrieval effectiveness (1981) 4.20
    4.2028155 = sum of:
      4.2028155 = weight(author_txt:rijsbergen in 3215) [ClassicSimilarity], result of:
        4.2028155 = fieldWeight in 3215, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.405631 = idf(docFreq=26, maxDocs=44421)
          0.5 = fieldNorm(doc=3215)
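
The author scores above can be reproduced by hand. The sketch below is a minimal reading of the explain output, not the search engine's own code: it assumes tf is the square root of the term frequency and idf is 1 + ln(maxDocs / (docFreq + 1)), which is what the printed numbers are consistent with. The function names are illustrative, and fieldNorm is taken directly from the listing rather than derived.

```python
import math


def tf(freq: float) -> float:
    """Term-frequency factor: square root of the raw frequency (assumed)."""
    return math.sqrt(freq)


def idf(doc_freq: int, max_docs: int) -> float:
    """Inverse document frequency, inferred from the values in the listing."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def field_weight(freq: float, doc_freq: int, max_docs: int, field_norm: float) -> float:
    """fieldWeight = tf * idf * fieldNorm, as in the 'product of' lines."""
    return tf(freq) * idf(doc_freq, max_docs) * field_norm


# Hit 1: 'rijsbergen' occurs twice in the author field, fieldNorm 0.375.
first_hit = field_weight(2.0, 26, 44421, 0.375)

# Hits 2 to 5: one occurrence each, fieldNorm 0.5.
other_hits = field_weight(1.0, 26, 44421, 0.5)
```

Under these assumptions `first_hit` comes out close to the listed 4.457759 and `other_hits` close to 4.2028155. The first hit ranks higher because the doubled term frequency outweighs its smaller fieldNorm.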
    

Similar documents (content)

  1. Ruthven, T.; Lalmas, M.; Rijsbergen, K.van: Incorporating user research behavior into relevance feedback (2003) 0.14
    0.1440035 = sum of:
      0.1440035 = product of:
        0.5142982 = sum of:
          0.07271848 = weight(abstract_txt:experimental in 169) [ClassicSimilarity], result of:
            0.07271848 = score(doc=169,freq=3.0), product of:
              0.12446585 = queryWeight, product of:
                5.397019 = idf(docFreq=546, maxDocs=44421)
                0.023061963 = queryNorm
              0.58424443 = fieldWeight in 169, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.397019 = idf(docFreq=546, maxDocs=44421)
                0.0625 = fieldNorm(doc=169)
          0.042503837 = weight(abstract_txt:effect in 169) [ClassicSimilarity], result of:
            0.042503837 = score(doc=169,freq=1.0), product of:
              0.12549108 = queryWeight, product of:
                1.0041101 = boost
                5.419201 = idf(docFreq=534, maxDocs=44421)
                0.023061963 = queryNorm
              0.33870006 = fieldWeight in 169, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.419201 = idf(docFreq=534, maxDocs=44421)
                0.0625 = fieldNorm(doc=169)
          0.06613149 = weight(abstract_txt:technique in 169) [ClassicSimilarity], result of:
            0.06613149 = score(doc=169,freq=2.0), product of:
              0.13373846 = queryWeight, product of:
                1.0365806 = boost
                5.5944448 = idf(docFreq=448, maxDocs=44421)
                0.023061963 = queryNorm
              0.4944837 = fieldWeight in 169, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.5944448 = idf(docFreq=448, maxDocs=44421)
                0.0625 = fieldNorm(doc=169)
          0.040986843 = weight(abstract_txt:system in 169) [ClassicSimilarity], result of:
            0.040986843 = score(doc=169,freq=4.0), product of:
              0.09721809 = queryWeight, product of:
                1.2498659 = boost
                3.372775 = idf(docFreq=4140, maxDocs=44421)
                0.023061963 = queryNorm
              0.42159688 = fieldWeight in 169, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.372775 = idf(docFreq=4140, maxDocs=44421)
                0.0625 = fieldNorm(doc=169)
          0.05295526 = weight(abstract_txt:documents in 169) [ClassicSimilarity], result of:
            0.05295526 = score(doc=169,freq=2.0), product of:
              0.1453004 = queryWeight, product of:
                1.5279999 = boost
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.023061963 = queryNorm
              0.3644536 = fieldWeight in 169, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.0625 = fieldNorm(doc=169)
          0.05536699 = weight(abstract_txt:collections in 169) [ClassicSimilarity], result of:
            0.05536699 = score(doc=169,freq=1.0), product of:
              0.18858394 = queryWeight, product of:
                1.7407734 = boost
                4.6974936 = idf(docFreq=1100, maxDocs=44421)
                0.023061963 = queryNorm
              0.29359335 = fieldWeight in 169, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6974936 = idf(docFreq=1100, maxDocs=44421)
                0.0625 = fieldNorm(doc=169)
          0.1836353 = weight(abstract_txt:relevant in 169) [ClassicSimilarity], result of:
            0.1836353 = score(doc=169,freq=3.0), product of:
              0.3663907 = queryWeight, product of:
                3.4314456 = boost
                4.6298943 = idf(docFreq=1177, maxDocs=44421)
                0.023061963 = queryNorm
              0.50120074 = fieldWeight in 169, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.6298943 = idf(docFreq=1177, maxDocs=44421)
                0.0625 = fieldNorm(doc=169)
        0.28 = coord(7/25)
    
  2. Dadashkarimia, J.; Shakery, A.; Failia, H.; Zamani, H.: ¬An expectation-maximization algorithm for query translation based on pseudo-relevant documents (2017) 0.14
    0.14179407 = sum of:
      0.14179407 = product of:
        0.5064074 = sum of:
          0.03719086 = weight(abstract_txt:effect in 4296) [ClassicSimilarity], result of:
            0.03719086 = score(doc=4296,freq=1.0), product of:
              0.12549108 = queryWeight, product of:
                1.0041101 = boost
                5.419201 = idf(docFreq=534, maxDocs=44421)
                0.023061963 = queryNorm
              0.29636255 = fieldWeight in 4296, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.419201 = idf(docFreq=534, maxDocs=44421)
                0.0546875 = fieldNorm(doc=4296)
          0.044505276 = weight(abstract_txt:obtained in 4296) [ClassicSimilarity], result of:
            0.044505276 = score(doc=4296,freq=1.0), product of:
              0.14144786 = queryWeight, product of:
                1.066039 = boost
                5.7534328 = idf(docFreq=382, maxDocs=44421)
                0.023061963 = queryNorm
              0.31464085 = fieldWeight in 4296, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7534328 = idf(docFreq=382, maxDocs=44421)
                0.0546875 = fieldNorm(doc=4296)
          0.04775826 = weight(abstract_txt:improving in 4296) [ClassicSimilarity], result of:
            0.04775826 = score(doc=4296,freq=1.0), product of:
              0.148259 = queryWeight, product of:
                1.0914037 = boost
                5.8903265 = idf(docFreq=333, maxDocs=44421)
                0.023061963 = queryNorm
              0.32212722 = fieldWeight in 4296, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8903265 = idf(docFreq=333, maxDocs=44421)
                0.0546875 = fieldNorm(doc=4296)
          0.075611405 = weight(abstract_txt:baseline in 4296) [ClassicSimilarity], result of:
            0.075611405 = score(doc=4296,freq=1.0), product of:
              0.20139419 = queryWeight, product of:
                1.272033 = boost
                6.8651857 = idf(docFreq=125, maxDocs=44421)
                0.023061963 = queryNorm
              0.37543985 = fieldWeight in 4296, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.8651857 = idf(docFreq=125, maxDocs=44421)
                0.0546875 = fieldNorm(doc=4296)
          0.056749593 = weight(abstract_txt:documents in 4296) [ClassicSimilarity], result of:
            0.056749593 = score(doc=4296,freq=3.0), product of:
              0.1453004 = queryWeight, product of:
                1.5279999 = boost
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.023061963 = queryNorm
              0.39056736 = fieldWeight in 4296, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.0546875 = fieldNorm(doc=4296)
          0.08391113 = weight(abstract_txt:collections in 4296) [ClassicSimilarity], result of:
            0.08391113 = score(doc=4296,freq=3.0), product of:
              0.18858394 = queryWeight, product of:
                1.7407734 = boost
                4.6974936 = idf(docFreq=1100, maxDocs=44421)
                0.023061963 = queryNorm
              0.44495374 = fieldWeight in 4296, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.6974936 = idf(docFreq=1100, maxDocs=44421)
                0.0546875 = fieldNorm(doc=4296)
          0.16068088 = weight(abstract_txt:relevant in 4296) [ClassicSimilarity], result of:
            0.16068088 = score(doc=4296,freq=3.0), product of:
              0.3663907 = queryWeight, product of:
                3.4314456 = boost
                4.6298943 = idf(docFreq=1177, maxDocs=44421)
                0.023061963 = queryNorm
              0.43855065 = fieldWeight in 4296, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.6298943 = idf(docFreq=1177, maxDocs=44421)
                0.0546875 = fieldNorm(doc=4296)
        0.28 = coord(7/25)
    
  3. Talvensaari, T.; Juhola, M.; Laurikkala, J.; Järvelin, K.: Corpus-based cross-language information retrieval in retrieval of highly relevant documents (2007) 0.14
    0.13654359 = sum of:
      0.13654359 = product of:
        0.56893164 = sum of:
          0.020493422 = weight(abstract_txt:system in 1139) [ClassicSimilarity], result of:
            0.020493422 = score(doc=1139,freq=1.0), product of:
              0.09721809 = queryWeight, product of:
                1.2498659 = boost
                3.372775 = idf(docFreq=4140, maxDocs=44421)
                0.023061963 = queryNorm
              0.21079844 = fieldWeight in 1139, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.372775 = idf(docFreq=4140, maxDocs=44421)
                0.0625 = fieldNorm(doc=1139)
          0.08641303 = weight(abstract_txt:baseline in 1139) [ClassicSimilarity], result of:
            0.08641303 = score(doc=1139,freq=1.0), product of:
              0.20139419 = queryWeight, product of:
                1.272033 = boost
                6.8651857 = idf(docFreq=125, maxDocs=44421)
                0.023061963 = queryNorm
              0.4290741 = fieldWeight in 1139, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.8651857 = idf(docFreq=125, maxDocs=44421)
                0.0625 = fieldNorm(doc=1139)
          0.087951064 = weight(abstract_txt:poor in 1139) [ClassicSimilarity], result of:
            0.087951064 = score(doc=1139,freq=1.0), product of:
              0.20377684 = queryWeight, product of:
                1.2795354 = boost
                6.905677 = idf(docFreq=120, maxDocs=44421)
                0.023061963 = queryNorm
              0.4316048 = fieldWeight in 1139, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.905677 = idf(docFreq=120, maxDocs=44421)
                0.0625 = fieldNorm(doc=1139)
          0.08372962 = weight(abstract_txt:documents in 1139) [ClassicSimilarity], result of:
            0.08372962 = score(doc=1139,freq=5.0), product of:
              0.1453004 = queryWeight, product of:
                1.5279999 = boost
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.023061963 = queryNorm
              0.5762518 = fieldWeight in 1139, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.0625 = fieldNorm(doc=1139)
          0.078300744 = weight(abstract_txt:collections in 1139) [ClassicSimilarity], result of:
            0.078300744 = score(doc=1139,freq=2.0), product of:
              0.18858394 = queryWeight, product of:
                1.7407734 = boost
                4.6974936 = idf(docFreq=1100, maxDocs=44421)
                0.023061963 = queryNorm
              0.4152037 = fieldWeight in 1139, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.6974936 = idf(docFreq=1100, maxDocs=44421)
                0.0625 = fieldNorm(doc=1139)
          0.21204378 = weight(abstract_txt:relevant in 1139) [ClassicSimilarity], result of:
            0.21204378 = score(doc=1139,freq=4.0), product of:
              0.3663907 = queryWeight, product of:
                3.4314456 = boost
                4.6298943 = idf(docFreq=1177, maxDocs=44421)
                0.023061963 = queryNorm
              0.5787368 = fieldWeight in 1139, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.6298943 = idf(docFreq=1177, maxDocs=44421)
                0.0625 = fieldNorm(doc=1139)
        0.24 = coord(6/25)
    
  4. Lam-Adesina, A.M.; Jones, G.J.F.: Examining and improving the effectiveness of relevance feedback for retrieval of scanned text documents (2006) 0.11
    0.11197561 = sum of:
      0.11197561 = product of:
        0.3999129 = sum of:
          0.041984037 = weight(abstract_txt:experimental in 1977) [ClassicSimilarity], result of:
            0.041984037 = score(doc=1977,freq=1.0), product of:
              0.12446585 = queryWeight, product of:
                5.397019 = idf(docFreq=546, maxDocs=44421)
                0.023061963 = queryNorm
              0.33731368 = fieldWeight in 1977, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.397019 = idf(docFreq=546, maxDocs=44421)
                0.0625 = fieldNorm(doc=1977)
          0.05458087 = weight(abstract_txt:improving in 1977) [ClassicSimilarity], result of:
            0.05458087 = score(doc=1977,freq=1.0), product of:
              0.148259 = queryWeight, product of:
                1.0914037 = boost
                5.8903265 = idf(docFreq=333, maxDocs=44421)
                0.023061963 = queryNorm
              0.3681454 = fieldWeight in 1977, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8903265 = idf(docFreq=333, maxDocs=44421)
                0.0625 = fieldNorm(doc=1977)
          0.020493422 = weight(abstract_txt:system in 1977) [ClassicSimilarity], result of:
            0.020493422 = score(doc=1977,freq=1.0), product of:
              0.09721809 = queryWeight, product of:
                1.2498659 = boost
                3.372775 = idf(docFreq=4140, maxDocs=44421)
                0.023061963 = queryNorm
              0.21079844 = fieldWeight in 1977, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.372775 = idf(docFreq=4140, maxDocs=44421)
                0.0625 = fieldNorm(doc=1977)
          0.08641303 = weight(abstract_txt:baseline in 1977) [ClassicSimilarity], result of:
            0.08641303 = score(doc=1977,freq=1.0), product of:
              0.20139419 = queryWeight, product of:
                1.272033 = boost
                6.8651857 = idf(docFreq=125, maxDocs=44421)
                0.023061963 = queryNorm
              0.4290741 = fieldWeight in 1977, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.8651857 = idf(docFreq=125, maxDocs=44421)
                0.0625 = fieldNorm(doc=1977)
          0.0979555 = weight(abstract_txt:modifications in 1977) [ClassicSimilarity], result of:
            0.0979555 = score(doc=1977,freq=1.0), product of:
              0.21895085 = queryWeight, product of:
                1.3263197 = boost
                7.1581726 = idf(docFreq=93, maxDocs=44421)
                0.023061963 = queryNorm
              0.4473858 = fieldWeight in 1977, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.1581726 = idf(docFreq=93, maxDocs=44421)
                0.0625 = fieldNorm(doc=1977)
          0.05295526 = weight(abstract_txt:documents in 1977) [ClassicSimilarity], result of:
            0.05295526 = score(doc=1977,freq=2.0), product of:
              0.1453004 = queryWeight, product of:
                1.5279999 = boost
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.023061963 = queryNorm
              0.3644536 = fieldWeight in 1977, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.0625 = fieldNorm(doc=1977)
          0.04553076 = weight(abstract_txt:particular in 1977) [ClassicSimilarity], result of:
            0.04553076 = score(doc=1977,freq=1.0), product of:
              0.16552897 = queryWeight, product of:
                1.6308984 = boost
                4.400995 = idf(docFreq=1480, maxDocs=44421)
                0.023061963 = queryNorm
              0.27506217 = fieldWeight in 1977, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.400995 = idf(docFreq=1480, maxDocs=44421)
                0.0625 = fieldNorm(doc=1977)
        0.28 = coord(7/25)
    
  5. Khan, M.S.; Khor, S.: Enhanced Web document retrieval using automatic query expansion (2004) 0.10
    0.102463126 = sum of:
      0.102463126 = product of:
        0.4269297 = sum of:
          0.041984037 = weight(abstract_txt:experimental in 3091) [ClassicSimilarity], result of:
            0.041984037 = score(doc=3091,freq=1.0), product of:
              0.12446585 = queryWeight, product of:
                5.397019 = idf(docFreq=546, maxDocs=44421)
                0.023061963 = queryNorm
              0.33731368 = fieldWeight in 3091, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.397019 = idf(docFreq=546, maxDocs=44421)
                0.0625 = fieldNorm(doc=3091)
          0.04876454 = weight(abstract_txt:likely in 3091) [ClassicSimilarity], result of:
            0.04876454 = score(doc=3091,freq=1.0), product of:
              0.13752982 = queryWeight, product of:
                1.051171 = boost
                5.673189 = idf(docFreq=414, maxDocs=44421)
                0.023061963 = queryNorm
              0.35457432 = fieldWeight in 3091, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.673189 = idf(docFreq=414, maxDocs=44421)
                0.0625 = fieldNorm(doc=3091)
          0.05458087 = weight(abstract_txt:improving in 3091) [ClassicSimilarity], result of:
            0.05458087 = score(doc=3091,freq=1.0), product of:
              0.148259 = queryWeight, product of:
                1.0914037 = boost
                5.8903265 = idf(docFreq=333, maxDocs=44421)
                0.023061963 = queryNorm
              0.3681454 = fieldWeight in 3091, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8903265 = idf(docFreq=333, maxDocs=44421)
                0.0625 = fieldNorm(doc=3091)
          0.10068832 = weight(abstract_txt:initially in 3091) [ClassicSimilarity], result of:
            0.10068832 = score(doc=3091,freq=1.0), product of:
              0.22300443 = queryWeight, product of:
                1.3385409 = boost
                7.2241306 = idf(docFreq=87, maxDocs=44421)
                0.023061963 = queryNorm
              0.45150816 = fieldWeight in 3091, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.2241306 = idf(docFreq=87, maxDocs=44421)
                0.0625 = fieldNorm(doc=3091)
          0.07489005 = weight(abstract_txt:documents in 3091) [ClassicSimilarity], result of:
            0.07489005 = score(doc=3091,freq=4.0), product of:
              0.1453004 = queryWeight, product of:
                1.5279999 = boost
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.023061963 = queryNorm
              0.51541525 = fieldWeight in 3091, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.0625 = fieldNorm(doc=3091)
          0.10602189 = weight(abstract_txt:relevant in 3091) [ClassicSimilarity], result of:
            0.10602189 = score(doc=3091,freq=1.0), product of:
              0.3663907 = queryWeight, product of:
                3.4314456 = boost
                4.6298943 = idf(docFreq=1177, maxDocs=44421)
                0.023061963 = queryNorm
              0.2893684 = fieldWeight in 3091, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6298943 = idf(docFreq=1177, maxDocs=44421)
                0.0625 = fieldNorm(doc=3091)
        0.24 = coord(6/25)
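
The content scores combine more factors than the author scores: each matching term contributes queryWeight * fieldWeight, and the sum is scaled by coord, the fraction of query terms that matched. The sketch below recomputes the total for the first content hit (doc 169) from the frequencies, document frequencies and boosts printed in its tree. It is an illustration under assumptions: the idf formula is inferred from the listed values, queryNorm is copied from the listing rather than derived, a term with no boost line is treated as boost 1.0, and all names are hypothetical.

```python
import math

MAX_DOCS = 44421          # maxDocs, as printed in every idf line
QUERY_NORM = 0.023061963  # queryNorm, copied from the listing (not derived here)


def idf(doc_freq: int, max_docs: int = MAX_DOCS) -> float:
    """Inverse document frequency, inferred from the values in the listing."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(freq: float, doc_freq: int, field_norm: float, boost: float = 1.0) -> float:
    """Score of one matching term: queryWeight * fieldWeight."""
    idf_value = idf(doc_freq)
    query_weight = boost * idf_value * QUERY_NORM
    field_weight = math.sqrt(freq) * idf_value * field_norm
    return query_weight * field_weight


# (term, freq, docFreq, boost) for doc 169; 'experimental' has no boost line,
# so boost 1.0 is assumed for it.
DOC_169_TERMS = [
    ("experimental", 3.0, 546, 1.0),
    ("effect", 1.0, 534, 1.0041101),
    ("technique", 2.0, 448, 1.0365806),
    ("system", 4.0, 4140, 1.2498659),
    ("documents", 2.0, 1954, 1.5279999),
    ("collections", 1.0, 1100, 1.7407734),
    ("relevant", 3.0, 1177, 3.4314456),
]


def document_score(terms, field_norm: float, matched: int, total: int) -> float:
    """Sum of the term scores, scaled by coord = matched / total."""
    raw = sum(term_score(freq, df, field_norm, boost) for _, freq, df, boost in terms)
    return raw * matched / total


# Doc 169: fieldNorm 0.0625, 7 of 25 query terms matched.
score_169 = document_score(DOC_169_TERMS, 0.0625, 7, 25)
```

With these inputs `score_169` lands near the listed 0.1440035. The coord factor explains why the third hit, whose raw sum is the highest of the five (0.56893164), still ranks below the first two: it matches only 6 of the 25 query terms.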