Document (#32987)

Author
Kim, G.
Title
Relationship between index term specificity and relevance judgment
Source
Information processing and management. 42(2006) no.5, pp. 1218-1229
Year
2006
Abstract
Concurrent concepts of specificity are discussed and differentiated from each other to investigate the relationship between index term specificity and users' relevance judgments. The identified concepts are term-document specificity, hierarchical specificity, statement specificity, and posting specificity. Among them, term-document specificity, which is a relationship between an index term and the document indexed with the term, is regarded as a fruitful research area. In an experiment involving three searches with 175 retrieved documents from 356 matched index terms, the impact of specificity on relevance judgments is analyzed and found to be statistically significant. Implications for index practice and for future research are discussed.

Similar documents (content)

  1. Sparck Jones, K.: A statistical interpretation of term specificity and its application in retrieval (2004) 0.35
    0.34509572 = sum of:
      0.34509572 = product of:
        1.4378989 = sum of:
          0.05048341 = weight(abstract_txt:statistically in 5420) [ClassicSimilarity], result of:
            0.05048341 = score(doc=5420,freq=1.0), product of:
              0.07897288 = queryWeight, product of:
                1.2049593 = boost
                6.8186655 = idf(docFreq=131, maxDocs=44421)
                0.009611834 = queryNorm
              0.6392499 = fieldWeight in 5420, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.8186655 = idf(docFreq=131, maxDocs=44421)
                0.09375 = fieldNorm(doc=5420)
          0.05422141 = weight(abstract_txt:regarded in 5420) [ClassicSimilarity], result of:
            0.05422141 = score(doc=5420,freq=1.0), product of:
              0.08282462 = queryWeight, product of:
                1.2339941 = boost
                6.982969 = idf(docFreq=111, maxDocs=44421)
                0.009611834 = queryNorm
              0.6546533 = fieldWeight in 5420, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.982969 = idf(docFreq=111, maxDocs=44421)
                0.09375 = fieldNorm(doc=5420)
          0.037827127 = weight(abstract_txt:document in 5420) [ClassicSimilarity], result of:
            0.037827127 = score(doc=5420,freq=1.0), product of:
              0.093962565 = queryWeight, product of:
                2.2765198 = boost
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.009611834 = queryNorm
              0.40257657 = fieldWeight in 5420, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.09375 = fieldNorm(doc=5420)
          0.08531351 = weight(abstract_txt:index in 5420) [ClassicSimilarity], result of:
            0.08531351 = score(doc=5420,freq=1.0), product of:
              0.19159348 = queryWeight, product of:
                4.1967077 = boost
                4.7496953 = idf(docFreq=1044, maxDocs=44421)
                0.009611834 = queryNorm
              0.44528395 = fieldWeight in 5420, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7496953 = idf(docFreq=1044, maxDocs=44421)
                0.09375 = fieldNorm(doc=5420)
          0.18241067 = weight(abstract_txt:term in 5420) [ClassicSimilarity], result of:
            0.18241067 = score(doc=5420,freq=3.0), product of:
              0.23429108 = queryWeight, product of:
                5.0837812 = boost
                4.794713 = idf(docFreq=998, maxDocs=44421)
                0.009611834 = queryNorm
              0.77856433 = fieldWeight in 5420, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.794713 = idf(docFreq=998, maxDocs=44421)
                0.09375 = fieldNorm(doc=5420)
          1.0276427 = weight(abstract_txt:specificity in 5420) [ClassicSimilarity], result of:
            1.0276427 = score(doc=5420,freq=3.0), product of:
              0.84914285 = queryWeight, product of:
                11.8534565 = boost
                7.4529724 = idf(docFreq=69, maxDocs=44421)
                0.009611834 = queryNorm
              1.2102119 = fieldWeight in 5420, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.4529724 = idf(docFreq=69, maxDocs=44421)
                0.09375 = fieldNorm(doc=5420)
        0.24 = coord(6/25)
    
  2. Chu, H.: Factors affecting relevance judgment : a report from TREC Legal track (2011) 0.27
    0.26914555 = sum of:
      0.26914555 = product of:
        0.9612341 = sum of:
          0.02787692 = weight(abstract_txt:retrieved in 540) [ClassicSimilarity], result of:
            0.02787692 = score(doc=540,freq=2.0), product of:
              0.055283166 = queryWeight, product of:
                1.0081608 = boost
                5.7050157 = idf(docFreq=401, maxDocs=44421)
                0.009611834 = queryNorm
              0.5042569 = fieldWeight in 540, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.7050157 = idf(docFreq=401, maxDocs=44421)
                0.0625 = fieldNorm(doc=540)
          0.014974872 = weight(abstract_txt:research in 540) [ClassicSimilarity], result of:
            0.014974872 = score(doc=540,freq=5.0), product of:
              0.033913177 = queryWeight, product of:
                1.1166899 = boost
                3.159582 = idf(docFreq=5124, maxDocs=44421)
                0.009611834 = queryNorm
              0.441565 = fieldWeight in 540, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.159582 = idf(docFreq=5124, maxDocs=44421)
                0.0625 = fieldNorm(doc=540)
          0.09308463 = weight(abstract_txt:judgment in 540) [ClassicSimilarity], result of:
            0.09308463 = score(doc=540,freq=5.0), product of:
              0.09099867 = queryWeight, product of:
                1.2934537 = boost
                7.319441 = idf(docFreq=79, maxDocs=44421)
                0.009611834 = queryNorm
              1.022923 = fieldWeight in 540, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                7.319441 = idf(docFreq=79, maxDocs=44421)
                0.0625 = fieldNorm(doc=540)
          0.06917977 = weight(abstract_txt:judgments in 540) [ClassicSimilarity], result of:
            0.06917977 = score(doc=540,freq=1.0), product of:
              0.16085547 = queryWeight, product of:
                2.4320152 = boost
                6.881186 = idf(docFreq=123, maxDocs=44421)
                0.009611834 = queryNorm
              0.43007413 = fieldWeight in 540, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.881186 = idf(docFreq=123, maxDocs=44421)
                0.0625 = fieldNorm(doc=540)
          0.12653013 = weight(abstract_txt:relevance in 540) [ClassicSimilarity], result of:
            0.12653013 = score(doc=540,freq=11.0), product of:
              0.12382601 = queryWeight, product of:
                2.6133642 = boost
                4.929532 = idf(docFreq=872, maxDocs=44421)
                0.009611834 = queryNorm
              1.0218381 = fieldWeight in 540, product of:
                3.3166249 = tf(freq=11.0), with freq of:
                  11.0 = termFreq=11.0
                4.929532 = idf(docFreq=872, maxDocs=44421)
                0.0625 = fieldNorm(doc=540)
          0.070209906 = weight(abstract_txt:term in 540) [ClassicSimilarity], result of:
            0.070209906 = score(doc=540,freq=1.0), product of:
              0.23429108 = queryWeight, product of:
                5.0837812 = boost
                4.794713 = idf(docFreq=998, maxDocs=44421)
                0.009611834 = queryNorm
              0.29966956 = fieldWeight in 540, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.794713 = idf(docFreq=998, maxDocs=44421)
                0.0625 = fieldNorm(doc=540)
          0.55937785 = weight(abstract_txt:specificity in 540) [ClassicSimilarity], result of:
            0.55937785 = score(doc=540,freq=2.0), product of:
              0.84914285 = queryWeight, product of:
                11.8534565 = boost
                7.4529724 = idf(docFreq=69, maxDocs=44421)
                0.009611834 = queryNorm
              0.6587559 = fieldWeight in 540, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.4529724 = idf(docFreq=69, maxDocs=44421)
                0.0625 = fieldNorm(doc=540)
        0.28 = coord(7/25)
    
  3. Savolainen, R.; Kari, J.: User-defined relevance criteria in web searching (2006) 0.16
    0.16421403 = sum of:
      0.16421403 = product of:
        0.82107013 = sum of:
          0.006696966 = weight(abstract_txt:research in 739) [ClassicSimilarity], result of:
            0.006696966 = score(doc=739,freq=1.0), product of:
              0.033913177 = queryWeight, product of:
                1.1166899 = boost
                3.159582 = idf(docFreq=5124, maxDocs=44421)
                0.009611834 = queryNorm
              0.19747387 = fieldWeight in 739, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.159582 = idf(docFreq=5124, maxDocs=44421)
                0.0625 = fieldNorm(doc=739)
          0.05887189 = weight(abstract_txt:judgment in 739) [ClassicSimilarity], result of:
            0.05887189 = score(doc=739,freq=2.0), product of:
              0.09099867 = queryWeight, product of:
                1.2934537 = boost
                7.319441 = idf(docFreq=79, maxDocs=44421)
                0.009611834 = queryNorm
              0.6469533 = fieldWeight in 739, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.319441 = idf(docFreq=79, maxDocs=44421)
                0.0625 = fieldNorm(doc=739)
          0.11982289 = weight(abstract_txt:judgments in 739) [ClassicSimilarity], result of:
            0.11982289 = score(doc=739,freq=3.0), product of:
              0.16085547 = queryWeight, product of:
                2.4320152 = boost
                6.881186 = idf(docFreq=123, maxDocs=44421)
                0.009611834 = queryNorm
              0.74491024 = fieldWeight in 739, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.881186 = idf(docFreq=123, maxDocs=44421)
                0.0625 = fieldNorm(doc=739)
          0.07630054 = weight(abstract_txt:relevance in 739) [ClassicSimilarity], result of:
            0.07630054 = score(doc=739,freq=4.0), product of:
              0.12382601 = queryWeight, product of:
                2.6133642 = boost
                4.929532 = idf(docFreq=872, maxDocs=44421)
                0.009611834 = queryNorm
              0.6161915 = fieldWeight in 739, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.929532 = idf(docFreq=872, maxDocs=44421)
                0.0625 = fieldNorm(doc=739)
          0.55937785 = weight(abstract_txt:specificity in 739) [ClassicSimilarity], result of:
            0.55937785 = score(doc=739,freq=2.0), product of:
              0.84914285 = queryWeight, product of:
                11.8534565 = boost
                7.4529724 = idf(docFreq=69, maxDocs=44421)
                0.009611834 = queryNorm
              0.6587559 = fieldWeight in 739, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.4529724 = idf(docFreq=69, maxDocs=44421)
                0.0625 = fieldNorm(doc=739)
        0.2 = coord(5/25)
    
  4. Mooers, C.N.: The indexing language of an information retrieval system (1985) 0.16
    0.16128726 = sum of:
      0.16128726 = product of:
        0.57602596 = sum of:
          0.01809924 = weight(abstract_txt:indexed in 4644) [ClassicSimilarity], result of:
            0.01809924 = score(doc=4644,freq=1.0), product of:
              0.0632663 = queryWeight, product of:
                1.0784986 = boost
                6.1030455 = idf(docFreq=269, maxDocs=44421)
                0.009611834 = queryNorm
              0.28608024 = fieldWeight in 4644, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1030455 = idf(docFreq=269, maxDocs=44421)
                0.046875 = fieldNorm(doc=4644)
          0.01854182 = weight(abstract_txt:involving in 4644) [ClassicSimilarity], result of:
            0.01854182 = score(doc=4644,freq=1.0), product of:
              0.06429351 = queryWeight, product of:
                1.0872188 = boost
                6.1523914 = idf(docFreq=256, maxDocs=44421)
                0.009611834 = queryNorm
              0.28839335 = fieldWeight in 4644, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1523914 = idf(docFreq=256, maxDocs=44421)
                0.046875 = fieldNorm(doc=4644)
          0.0071032057 = weight(abstract_txt:research in 4644) [ClassicSimilarity], result of:
            0.0071032057 = score(doc=4644,freq=2.0), product of:
              0.033913177 = queryWeight, product of:
                1.1166899 = boost
                3.159582 = idf(docFreq=5124, maxDocs=44421)
                0.009611834 = queryNorm
              0.20945267 = fieldWeight in 4644, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.159582 = idf(docFreq=5124, maxDocs=44421)
                0.046875 = fieldNorm(doc=4644)
          0.03275925 = weight(abstract_txt:document in 4644) [ClassicSimilarity], result of:
            0.03275925 = score(doc=4644,freq=3.0), product of:
              0.093962565 = queryWeight, product of:
                2.2765198 = boost
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.009611834 = queryNorm
              0.3486415 = fieldWeight in 4644, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.046875 = fieldNorm(doc=4644)
          0.07388365 = weight(abstract_txt:index in 4644) [ClassicSimilarity], result of:
            0.07388365 = score(doc=4644,freq=3.0), product of:
              0.19159348 = queryWeight, product of:
                4.1967077 = boost
                4.7496953 = idf(docFreq=1044, maxDocs=44421)
                0.009611834 = queryNorm
              0.38562718 = fieldWeight in 4644, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.7496953 = idf(docFreq=1044, maxDocs=44421)
                0.046875 = fieldNorm(doc=4644)
          0.12898384 = weight(abstract_txt:term in 4644) [ClassicSimilarity], result of:
            0.12898384 = score(doc=4644,freq=6.0), product of:
              0.23429108 = queryWeight, product of:
                5.0837812 = boost
                4.794713 = idf(docFreq=998, maxDocs=44421)
                0.009611834 = queryNorm
              0.55052817 = fieldWeight in 4644, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                4.794713 = idf(docFreq=998, maxDocs=44421)
                0.046875 = fieldNorm(doc=4644)
          0.2966549 = weight(abstract_txt:specificity in 4644) [ClassicSimilarity], result of:
            0.2966549 = score(doc=4644,freq=1.0), product of:
              0.84914285 = queryWeight, product of:
                11.8534565 = boost
                7.4529724 = idf(docFreq=69, maxDocs=44421)
                0.009611834 = queryNorm
              0.34935808 = fieldWeight in 4644, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.4529724 = idf(docFreq=69, maxDocs=44421)
                0.046875 = fieldNorm(doc=4644)
        0.28 = coord(7/25)
    
  5. Tamine, L.; Chouquet, C.; Palmer, T.: Analysis of biomedical and health queries : lessons learned from TREC and CLEF evaluation benchmarks (2015) 0.16
    0.16007647 = sum of:
      0.16007647 = product of:
        0.8003823 = sum of:
          0.006696966 = weight(abstract_txt:research in 3341) [ClassicSimilarity], result of:
            0.006696966 = score(doc=3341,freq=1.0), product of:
              0.033913177 = queryWeight, product of:
                1.1166899 = boost
                3.159582 = idf(docFreq=5124, maxDocs=44421)
                0.009611834 = queryNorm
              0.19747387 = fieldWeight in 3341, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.159582 = idf(docFreq=5124, maxDocs=44421)
                0.0625 = fieldNorm(doc=3341)
          0.0131621845 = weight(abstract_txt:between in 3341) [ClassicSimilarity], result of:
            0.0131621845 = score(doc=3341,freq=1.0), product of:
              0.060911432 = queryWeight, product of:
                1.8329195 = boost
                3.4573963 = idf(docFreq=3804, maxDocs=44421)
                0.009611834 = queryNorm
              0.21608727 = fieldWeight in 3341, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4573963 = idf(docFreq=3804, maxDocs=44421)
                0.0625 = fieldNorm(doc=3341)
          0.025218084 = weight(abstract_txt:document in 3341) [ClassicSimilarity], result of:
            0.025218084 = score(doc=3341,freq=1.0), product of:
              0.093962565 = queryWeight, product of:
                2.2765198 = boost
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.009611834 = queryNorm
              0.26838437 = fieldWeight in 3341, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.0625 = fieldNorm(doc=3341)
          0.070209906 = weight(abstract_txt:term in 3341) [ClassicSimilarity], result of:
            0.070209906 = score(doc=3341,freq=1.0), product of:
              0.23429108 = queryWeight, product of:
                5.0837812 = boost
                4.794713 = idf(docFreq=998, maxDocs=44421)
                0.009611834 = queryNorm
              0.29966956 = fieldWeight in 3341, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.794713 = idf(docFreq=998, maxDocs=44421)
                0.0625 = fieldNorm(doc=3341)
          0.6850952 = weight(abstract_txt:specificity in 3341) [ClassicSimilarity], result of:
            0.6850952 = score(doc=3341,freq=3.0), product of:
              0.84914285 = queryWeight, product of:
                11.8534565 = boost
                7.4529724 = idf(docFreq=69, maxDocs=44421)
                0.009611834 = queryNorm
              0.80680794 = fieldWeight in 3341, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.4529724 = idf(docFreq=69, maxDocs=44421)
                0.0625 = fieldNorm(doc=3341)
        0.2 = coord(5/25)
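
The score trees above all follow the same pattern, which can be sketched in a few lines of Python. This is a minimal illustration, not the catalog's actual code: the function and variable names are hypothetical, and the idf formula `1 + ln(maxDocs / (docFreq + 1))` is an assumption inferred from the printed values (for example, docFreq=131 and maxDocs=44421 give roughly 6.8187). The inputs are copied from entry 1 (doc 5420).

```python
import math

MAX_DOCS = 44421          # maxDocs as printed in every idf line
QUERY_NORM = 0.009611834  # queryNorm as printed in every queryWeight block


def idf(doc_freq, max_docs=MAX_DOCS):
    # Assumed formula, inferred from the printed idf values.
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(freq, doc_freq, boost, field_norm):
    # score = queryWeight * fieldWeight, as in each "score(doc=...)" node.
    term_idf = idf(doc_freq)
    query_weight = boost * term_idf * QUERY_NORM           # boost * idf * queryNorm
    field_weight = math.sqrt(freq) * term_idf * field_norm  # tf * idf * fieldNorm
    return query_weight * field_weight


def document_score(terms, matched, query_terms, field_norm):
    # Sum of the per-term scores, scaled by coord(matched/query_terms).
    total = sum(term_score(f, df, b, field_norm) for f, df, b in terms)
    return total * matched / query_terms


# (termFreq, docFreq, boost) for the six matched terms of entry 1 (doc 5420).
ENTRY_1_TERMS = [
    (1.0, 131, 1.2049593),    # statistically
    (1.0, 111, 1.2339941),    # regarded
    (1.0, 1647, 2.2765198),   # document
    (1.0, 1044, 4.1967077),   # index
    (3.0, 998, 5.0837812),    # term
    (3.0, 69, 11.8534565),    # specificity
]

entry_1_score = document_score(ENTRY_1_TERMS, matched=6, query_terms=25,
                               field_norm=0.09375)
print(entry_1_score)
```

The result should land close to the 0.34509572 listed for entry 1; small differences in the last digits are expected because the listing appears to use single-precision arithmetic while Python uses double precision. The breakdown also shows why the term "specificity" dominates every ranking: its boost and idf are by far the largest in the query.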