Document (#32987)

Author
Kim, G.
Title
Relationship between index term specificity and relevance judgment
Source
Information processing and management. 42(2006) no.5, S.1218-1229
Year
2006
Abstract
Concurrent concepts of specificity are discussed and differentiated from each other to investigate the relationship between index term specificity and users' relevance judgments. The identified concepts are term-document specificity, hierarchical specificity, statement specificity, and posting specificity. Among them, term-document specificity, which is a relationship between an index term and the document indexed with the term, is regarded as a fruitful research area. In an experiment involving three searches with 175 retrieved documents from 356 matched index terms, the impact of specificity on relevance judgments is analyzed and found to be statistically significant. Implications for index practice and for future research are discussed.

Similar documents (content)

  1. Sparck Jones, K.: ¬A statistical interpretation of term specificity and its application in retrieval (2004) 0.35
    0.34500533 = sum of:
      0.34500533 = product of:
        1.4375223 = sum of:
          0.050567325 = weight(abstract_txt:statistically in 4420) [ClassicSimilarity], result of:
            0.050567325 = score(doc=4420,freq=1.0), product of:
              0.079069085 = queryWeight, product of:
                1.2049464 = boost
                6.82169 = idf(docFreq=130, maxDocs=44218)
                0.0096193785 = queryNorm
              0.63953346 = fieldWeight in 4420, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.82169 = idf(docFreq=130, maxDocs=44218)
                0.09375 = fieldNorm(doc=4420)
          0.054553073 = weight(abstract_txt:regarded in 4420) [ClassicSimilarity], result of:
            0.054553073 = score(doc=4420,freq=1.0), product of:
              0.08317118 = queryWeight, product of:
                1.2358074 = boost
                6.996407 = idf(docFreq=109, maxDocs=44218)
                0.0096193785 = queryNorm
              0.6559132 = fieldWeight in 4420, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.996407 = idf(docFreq=109, maxDocs=44218)
                0.09375 = fieldNorm(doc=4420)
          0.037798893 = weight(abstract_txt:document in 4420) [ClassicSimilarity], result of:
            0.037798893 = score(doc=4420,freq=1.0), product of:
              0.093926154 = queryWeight, product of:
                2.2746692 = boost
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.0096193785 = queryNorm
              0.40243202 = fieldWeight in 4420, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.09375 = fieldNorm(doc=4420)
          0.08530154 = weight(abstract_txt:index in 4420) [ClassicSimilarity], result of:
            0.08530154 = score(doc=4420,freq=1.0), product of:
              0.19159669 = queryWeight, product of:
                4.1941442 = boost
                4.74895 = idf(docFreq=1040, maxDocs=44218)
                0.0096193785 = queryNorm
              0.44521406 = fieldWeight in 4420, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.74895 = idf(docFreq=1040, maxDocs=44218)
                0.09375 = fieldNorm(doc=4420)
          0.18321313 = weight(abstract_txt:term in 4420) [ClassicSimilarity], result of:
            0.18321313 = score(doc=4420,freq=3.0), product of:
              0.23500356 = queryWeight, product of:
                5.0883527 = boost
                4.8012047 = idf(docFreq=987, maxDocs=44218)
                0.0096193785 = queryNorm
              0.7796185 = fieldWeight in 4420, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.8012047 = idf(docFreq=987, maxDocs=44218)
                0.09375 = fieldNorm(doc=4420)
          1.0260884 = weight(abstract_txt:specificity in 4420) [ClassicSimilarity], result of:
            1.0260884 = score(doc=4420,freq=3.0), product of:
              0.8483798 = queryWeight, product of:
                11.840793 = boost
                7.448392 = idf(docFreq=69, maxDocs=44218)
                0.0096193785 = queryNorm
              1.2094681 = fieldWeight in 4420, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.448392 = idf(docFreq=69, maxDocs=44218)
                0.09375 = fieldNorm(doc=4420)
        0.24 = coord(6/25)
    
  2. Chu, H.: Factors affecting relevance judgment : a report from TREC Legal track (2011) 0.27
    0.2694518 = sum of:
      0.2694518 = product of:
        0.96232784 = sum of:
          0.027928827 = weight(abstract_txt:retrieved in 4540) [ClassicSimilarity], result of:
            0.027928827 = score(doc=4540,freq=2.0), product of:
              0.055357873 = queryWeight, product of:
                1.0082171 = boost
                5.707926 = idf(docFreq=398, maxDocs=44218)
                0.0096193785 = queryNorm
              0.5045141 = fieldWeight in 4540, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.707926 = idf(docFreq=398, maxDocs=44218)
                0.0625 = fieldNorm(doc=4540)
          0.015133328 = weight(abstract_txt:research in 4540) [ClassicSimilarity], result of:
            0.015133328 = score(doc=4540,freq=5.0), product of:
              0.034155753 = queryWeight, product of:
                1.1199826 = boost
                3.170338 = idf(docFreq=5046, maxDocs=44218)
                0.0096193785 = queryNorm
              0.4430682 = fieldWeight in 4540, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.170338 = idf(docFreq=5046, maxDocs=44218)
                0.0625 = fieldNorm(doc=4540)
          0.09390908 = weight(abstract_txt:judgment in 4540) [ClassicSimilarity], result of:
            0.09390908 = score(doc=4540,freq=5.0), product of:
              0.09154528 = queryWeight, product of:
                1.2965293 = boost
                7.3401785 = idf(docFreq=77, maxDocs=44218)
                0.0096193785 = queryNorm
              1.0258211 = fieldWeight in 4540, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                7.3401785 = idf(docFreq=77, maxDocs=44218)
                0.0625 = fieldNorm(doc=4540)
          0.06955564 = weight(abstract_txt:judgments in 4540) [ClassicSimilarity], result of:
            0.06955564 = score(doc=4540,freq=1.0), product of:
              0.16145536 = queryWeight, product of:
                2.4350371 = boost
                6.892866 = idf(docFreq=121, maxDocs=44218)
                0.0096193785 = queryNorm
              0.43080413 = fieldWeight in 4540, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.892866 = idf(docFreq=121, maxDocs=44218)
                0.0625 = fieldNorm(doc=4540)
          0.12675042 = weight(abstract_txt:relevance in 4540) [ClassicSimilarity], result of:
            0.12675042 = score(doc=4540,freq=11.0), product of:
              0.12398335 = queryWeight, product of:
                2.6134048 = boost
                4.931848 = idf(docFreq=866, maxDocs=44218)
                0.0096193785 = queryNorm
              1.0223181 = fieldWeight in 4540, product of:
                3.3166249 = tf(freq=11.0), with freq of:
                  11.0 = termFreq=11.0
                4.931848 = idf(docFreq=866, maxDocs=44218)
                0.0625 = fieldNorm(doc=4540)
          0.07051876 = weight(abstract_txt:term in 4540) [ClassicSimilarity], result of:
            0.07051876 = score(doc=4540,freq=1.0), product of:
              0.23500356 = queryWeight, product of:
                5.0883527 = boost
                4.8012047 = idf(docFreq=987, maxDocs=44218)
                0.0096193785 = queryNorm
              0.3000753 = fieldWeight in 4540, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8012047 = idf(docFreq=987, maxDocs=44218)
                0.0625 = fieldNorm(doc=4540)
          0.55853176 = weight(abstract_txt:specificity in 4540) [ClassicSimilarity], result of:
            0.55853176 = score(doc=4540,freq=2.0), product of:
              0.8483798 = queryWeight, product of:
                11.840793 = boost
                7.448392 = idf(docFreq=69, maxDocs=44218)
                0.0096193785 = queryNorm
              0.65835106 = fieldWeight in 4540, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.448392 = idf(docFreq=69, maxDocs=44218)
                0.0625 = fieldNorm(doc=4540)
        0.28 = coord(7/25)
    
  3. Savolainen, R.; Kari, J.: User-defined relevance criteria in web searching (2006) 0.16
    0.16432004 = sum of:
      0.16432004 = product of:
        0.8216002 = sum of:
          0.00676783 = weight(abstract_txt:research in 614) [ClassicSimilarity], result of:
            0.00676783 = score(doc=614,freq=1.0), product of:
              0.034155753 = queryWeight, product of:
                1.1199826 = boost
                3.170338 = idf(docFreq=5046, maxDocs=44218)
                0.0096193785 = queryNorm
              0.19814612 = fieldWeight in 614, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.170338 = idf(docFreq=5046, maxDocs=44218)
                0.0625 = fieldNorm(doc=614)
          0.059393317 = weight(abstract_txt:judgment in 614) [ClassicSimilarity], result of:
            0.059393317 = score(doc=614,freq=2.0), product of:
              0.09154528 = queryWeight, product of:
                1.2965293 = boost
                7.3401785 = idf(docFreq=77, maxDocs=44218)
                0.0096193785 = queryNorm
              0.64878625 = fieldWeight in 614, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.3401785 = idf(docFreq=77, maxDocs=44218)
                0.0625 = fieldNorm(doc=614)
          0.1204739 = weight(abstract_txt:judgments in 614) [ClassicSimilarity], result of:
            0.1204739 = score(doc=614,freq=3.0), product of:
              0.16145536 = queryWeight, product of:
                2.4350371 = boost
                6.892866 = idf(docFreq=121, maxDocs=44218)
                0.0096193785 = queryNorm
              0.74617463 = fieldWeight in 614, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.892866 = idf(docFreq=121, maxDocs=44218)
                0.0625 = fieldNorm(doc=614)
          0.07643338 = weight(abstract_txt:relevance in 614) [ClassicSimilarity], result of:
            0.07643338 = score(doc=614,freq=4.0), product of:
              0.12398335 = queryWeight, product of:
                2.6134048 = boost
                4.931848 = idf(docFreq=866, maxDocs=44218)
                0.0096193785 = queryNorm
              0.616481 = fieldWeight in 614, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.931848 = idf(docFreq=866, maxDocs=44218)
                0.0625 = fieldNorm(doc=614)
          0.55853176 = weight(abstract_txt:specificity in 614) [ClassicSimilarity], result of:
            0.55853176 = score(doc=614,freq=2.0), product of:
              0.8483798 = queryWeight, product of:
                11.840793 = boost
                7.448392 = idf(docFreq=69, maxDocs=44218)
                0.0096193785 = queryNorm
              0.65835106 = fieldWeight in 614, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.448392 = idf(docFreq=69, maxDocs=44218)
                0.0625 = fieldNorm(doc=614)
        0.2 = coord(5/25)
    
  4. Mooers, C.N.: ¬The indexing language of an information retrieval system (1985) 0.16
    0.16139157 = sum of:
      0.16139157 = product of:
        0.5763985 = sum of:
          0.018097479 = weight(abstract_txt:indexed in 3644) [ClassicSimilarity], result of:
            0.018097479 = score(doc=3644,freq=1.0), product of:
              0.06326917 = queryWeight, product of:
                1.0778552 = boost
                6.1021757 = idf(docFreq=268, maxDocs=44218)
                0.0096193785 = queryNorm
              0.28603947 = fieldWeight in 3644, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1021757 = idf(docFreq=268, maxDocs=44218)
                0.046875 = fieldNorm(doc=3644)
          0.018757064 = weight(abstract_txt:involving in 3644) [ClassicSimilarity], result of:
            0.018757064 = score(doc=3644,freq=1.0), product of:
              0.06479726 = queryWeight, product of:
                1.0907938 = boost
                6.1754265 = idf(docFreq=249, maxDocs=44218)
                0.0096193785 = queryNorm
              0.28947312 = fieldWeight in 3644, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1754265 = idf(docFreq=249, maxDocs=44218)
                0.046875 = fieldNorm(doc=3644)
          0.0071783676 = weight(abstract_txt:research in 3644) [ClassicSimilarity], result of:
            0.0071783676 = score(doc=3644,freq=2.0), product of:
              0.034155753 = queryWeight, product of:
                1.1199826 = boost
                3.170338 = idf(docFreq=5046, maxDocs=44218)
                0.0096193785 = queryNorm
              0.2101657 = fieldWeight in 3644, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.170338 = idf(docFreq=5046, maxDocs=44218)
                0.046875 = fieldNorm(doc=3644)
          0.0327348 = weight(abstract_txt:document in 3644) [ClassicSimilarity], result of:
            0.0327348 = score(doc=3644,freq=3.0), product of:
              0.093926154 = queryWeight, product of:
                2.2746692 = boost
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.0096193785 = queryNorm
              0.34851635 = fieldWeight in 3644, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.046875 = fieldNorm(doc=3644)
          0.073873304 = weight(abstract_txt:index in 3644) [ClassicSimilarity], result of:
            0.073873304 = score(doc=3644,freq=3.0), product of:
              0.19159669 = queryWeight, product of:
                4.1941442 = boost
                4.74895 = idf(docFreq=1040, maxDocs=44218)
                0.0096193785 = queryNorm
              0.3855667 = fieldWeight in 3644, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.74895 = idf(docFreq=1040, maxDocs=44218)
                0.046875 = fieldNorm(doc=3644)
          0.12955125 = weight(abstract_txt:term in 3644) [ClassicSimilarity], result of:
            0.12955125 = score(doc=3644,freq=6.0), product of:
              0.23500356 = queryWeight, product of:
                5.0883527 = boost
                4.8012047 = idf(docFreq=987, maxDocs=44218)
                0.0096193785 = queryNorm
              0.5512735 = fieldWeight in 3644, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                4.8012047 = idf(docFreq=987, maxDocs=44218)
                0.046875 = fieldNorm(doc=3644)
          0.2962062 = weight(abstract_txt:specificity in 3644) [ClassicSimilarity], result of:
            0.2962062 = score(doc=3644,freq=1.0), product of:
              0.8483798 = queryWeight, product of:
                11.840793 = boost
                7.448392 = idf(docFreq=69, maxDocs=44218)
                0.0096193785 = queryNorm
              0.3491434 = fieldWeight in 3644, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.448392 = idf(docFreq=69, maxDocs=44218)
                0.046875 = fieldNorm(doc=3644)
        0.28 = coord(7/25)
    
  5. Tamine, L.; Chouquet, C.; Palmer, T.: Analysis of biomedical and health queries : lessons learned from TREC and CLEF evaluation benchmarks (2015) 0.16
    0.15995596 = sum of:
      0.15995596 = product of:
        0.79977983 = sum of:
          0.00676783 = weight(abstract_txt:research in 2341) [ClassicSimilarity], result of:
            0.00676783 = score(doc=2341,freq=1.0), product of:
              0.034155753 = queryWeight, product of:
                1.1199826 = boost
                3.170338 = idf(docFreq=5046, maxDocs=44218)
                0.0096193785 = queryNorm
              0.19814612 = fieldWeight in 2341, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.170338 = idf(docFreq=5046, maxDocs=44218)
                0.0625 = fieldNorm(doc=2341)
          0.013235061 = weight(abstract_txt:between in 2341) [ClassicSimilarity], result of:
            0.013235061 = score(doc=2341,freq=1.0), product of:
              0.061142795 = queryWeight, product of:
                1.8352602 = boost
                3.4633842 = idf(docFreq=3764, maxDocs=44218)
                0.0096193785 = queryNorm
              0.21646151 = fieldWeight in 2341, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4633842 = idf(docFreq=3764, maxDocs=44218)
                0.0625 = fieldNorm(doc=2341)
          0.025199262 = weight(abstract_txt:document in 2341) [ClassicSimilarity], result of:
            0.025199262 = score(doc=2341,freq=1.0), product of:
              0.093926154 = queryWeight, product of:
                2.2746692 = boost
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.0096193785 = queryNorm
              0.26828802 = fieldWeight in 2341, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2926083 = idf(docFreq=1642, maxDocs=44218)
                0.0625 = fieldNorm(doc=2341)
          0.07051876 = weight(abstract_txt:term in 2341) [ClassicSimilarity], result of:
            0.07051876 = score(doc=2341,freq=1.0), product of:
              0.23500356 = queryWeight, product of:
                5.0883527 = boost
                4.8012047 = idf(docFreq=987, maxDocs=44218)
                0.0096193785 = queryNorm
              0.3000753 = fieldWeight in 2341, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8012047 = idf(docFreq=987, maxDocs=44218)
                0.0625 = fieldNorm(doc=2341)
          0.6840589 = weight(abstract_txt:specificity in 2341) [ClassicSimilarity], result of:
            0.6840589 = score(doc=2341,freq=3.0), product of:
              0.8483798 = queryWeight, product of:
                11.840793 = boost
                7.448392 = idf(docFreq=69, maxDocs=44218)
                0.0096193785 = queryNorm
              0.8063121 = fieldWeight in 2341, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.448392 = idf(docFreq=69, maxDocs=44218)
                0.0625 = fieldNorm(doc=2341)
        0.2 = coord(5/25)