Document (#19793)

Author
Oh, S.G.
Title
Document representation and retrieval using empirical facts : evaluation of a pilot system
Source
Journal of the American Society for Information Science. 49(1998) no.10, S.920-931
Year
1998
Abstract
This article investigates the potentialities of using empirical variables and their associated statistical relationships in document representation and retrieval. To this end, a newly devised empirical fact retrieval system (EFRS) was evaluated in comparison to a simulated traditional retrieval system (TRS) involving a set of predetermined empirical queries. Results indicate that the EFRS generally outperformed the TRS in terms of precision, search effort, and measures of user satisfaction. Possible advantages of the EFRS, as well as the necessity of establishing an efficient methos for extracting empirical facts, are discussed

Similar documents (content)

  1. Frisch, A.M.; Allen, J.F.: Knowledge retrieval as limited inference (1982) 0.11
    0.11079698 = sum of:
      0.11079698 = product of:
        0.4616541 = sum of:
          0.041594885 = weight(abstract_txt:efficient in 5804) [ClassicSimilarity], result of:
            0.041594885 = score(doc=5804,freq=1.0), product of:
              0.11502035 = queryWeight, product of:
                1.0110103 = boost
                5.7860904 = idf(docFreq=368, maxDocs=44218)
                0.019662281 = queryNorm
              0.36163065 = fieldWeight in 5804, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7860904 = idf(docFreq=368, maxDocs=44218)
                0.0625 = fieldNorm(doc=5804)
          0.017836776 = weight(abstract_txt:using in 5804) [ClassicSimilarity], result of:
            0.017836776 = score(doc=5804,freq=1.0), product of:
              0.08240792 = queryWeight, product of:
                1.2102294 = boost
                3.4631186 = idf(docFreq=3765, maxDocs=44218)
                0.019662281 = queryNorm
              0.21644491 = fieldWeight in 5804, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4631186 = idf(docFreq=3765, maxDocs=44218)
                0.0625 = fieldNorm(doc=5804)
          0.05133616 = weight(abstract_txt:representation in 5804) [ClassicSimilarity], result of:
            0.05133616 = score(doc=5804,freq=1.0), product of:
              0.16674021 = queryWeight, product of:
                1.7214856 = boost
                4.926098 = idf(docFreq=871, maxDocs=44218)
                0.019662281 = queryNorm
              0.30788112 = fieldWeight in 5804, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.926098 = idf(docFreq=871, maxDocs=44218)
                0.0625 = fieldNorm(doc=5804)
          0.034938354 = weight(abstract_txt:system in 5804) [ClassicSimilarity], result of:
            0.034938354 = score(doc=5804,freq=2.0), product of:
              0.117214166 = queryWeight, product of:
                1.767742 = boost
                3.3723085 = idf(docFreq=4123, maxDocs=44218)
                0.019662281 = queryNorm
              0.2980728 = fieldWeight in 5804, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.3723085 = idf(docFreq=4123, maxDocs=44218)
                0.0625 = fieldNorm(doc=5804)
          0.03604632 = weight(abstract_txt:retrieval in 5804) [ClassicSimilarity], result of:
            0.03604632 = score(doc=5804,freq=1.0), product of:
              0.16596201 = queryWeight, product of:
                2.4288604 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.019662281 = queryNorm
              0.21719621 = fieldWeight in 5804, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.0625 = fieldNorm(doc=5804)
          0.2799016 = weight(abstract_txt:facts in 5804) [ClassicSimilarity], result of:
            0.2799016 = score(doc=5804,freq=3.0), product of:
              0.35814145 = queryWeight, product of:
                2.522961 = boost
                7.2195506 = idf(docFreq=87, maxDocs=44218)
                0.019662281 = queryNorm
              0.78153926 = fieldWeight in 5804, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.2195506 = idf(docFreq=87, maxDocs=44218)
                0.0625 = fieldNorm(doc=5804)
        0.24 = coord(6/25)
    
  2. Kostoff, R.N.; Eberhart, H.J.; Toothman, D.R.: Database tomography for information retrieval (1997) 0.10
    0.10465188 = sum of:
      0.10465188 = product of:
        0.5232594 = sum of:
          0.075853646 = weight(abstract_txt:involving in 1) [ClassicSimilarity], result of:
            0.075853646 = score(doc=1,freq=1.0), product of:
              0.13102019 = queryWeight, product of:
                1.0790395 = boost
                6.1754265 = idf(docFreq=249, maxDocs=44218)
                0.019662281 = queryNorm
              0.57894623 = fieldWeight in 1, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1754265 = idf(docFreq=249, maxDocs=44218)
                0.09375 = fieldNorm(doc=1)
          0.10821723 = weight(abstract_txt:newly in 1) [ClassicSimilarity], result of:
            0.10821723 = score(doc=1,freq=1.0), product of:
              0.16604209 = queryWeight, product of:
                1.2147232 = boost
                6.9519553 = idf(docFreq=114, maxDocs=44218)
                0.019662281 = queryNorm
              0.6517458 = fieldWeight in 1, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9519553 = idf(docFreq=114, maxDocs=44218)
                0.09375 = fieldNorm(doc=1)
          0.22566502 = weight(abstract_txt:simulated in 1) [ClassicSimilarity], result of:
            0.22566502 = score(doc=1,freq=2.0), product of:
              0.21510644 = queryWeight, product of:
                1.3825948 = boost
                7.912698 = idf(docFreq=43, maxDocs=44218)
                0.019662281 = queryNorm
              1.0490854 = fieldWeight in 1, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.912698 = idf(docFreq=43, maxDocs=44218)
                0.09375 = fieldNorm(doc=1)
          0.037057716 = weight(abstract_txt:system in 1) [ClassicSimilarity], result of:
            0.037057716 = score(doc=1,freq=1.0), product of:
              0.117214166 = queryWeight, product of:
                1.767742 = boost
                3.3723085 = idf(docFreq=4123, maxDocs=44218)
                0.019662281 = queryNorm
              0.3161539 = fieldWeight in 1, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3723085 = idf(docFreq=4123, maxDocs=44218)
                0.09375 = fieldNorm(doc=1)
          0.076465786 = weight(abstract_txt:retrieval in 1) [ClassicSimilarity], result of:
            0.076465786 = score(doc=1,freq=2.0), product of:
              0.16596201 = queryWeight, product of:
                2.4288604 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.019662281 = queryNorm
              0.4607427 = fieldWeight in 1, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.09375 = fieldNorm(doc=1)
        0.2 = coord(5/25)
    
  3. Suchanek, F.M.; Kasneci, G.; Weikum, G.: YAGO: a core of semantic knowledge unifying WordNet and Wikipedia (2007) 0.10
    0.09829314 = sum of:
      0.09829314 = product of:
        0.61433214 = sum of:
          0.05228799 = weight(abstract_txt:fact in 3403) [ClassicSimilarity], result of:
            0.05228799 = score(doc=3403,freq=1.0), product of:
              0.1154541 = queryWeight, product of:
                1.0129148 = boost
                5.79699 = idf(docFreq=364, maxDocs=44218)
                0.019662281 = queryNorm
              0.45288983 = fieldWeight in 3403, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.79699 = idf(docFreq=364, maxDocs=44218)
                0.078125 = fieldNorm(doc=3403)
          0.02229597 = weight(abstract_txt:using in 3403) [ClassicSimilarity], result of:
            0.02229597 = score(doc=3403,freq=1.0), product of:
              0.08240792 = queryWeight, product of:
                1.2102294 = boost
                3.4631186 = idf(docFreq=3765, maxDocs=44218)
                0.019662281 = queryNorm
              0.27055615 = fieldWeight in 3403, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4631186 = idf(docFreq=3765, maxDocs=44218)
                0.078125 = fieldNorm(doc=3403)
          0.349877 = weight(abstract_txt:facts in 3403) [ClassicSimilarity], result of:
            0.349877 = score(doc=3403,freq=3.0), product of:
              0.35814145 = queryWeight, product of:
                2.522961 = boost
                7.2195506 = idf(docFreq=87, maxDocs=44218)
                0.019662281 = queryNorm
              0.97692406 = fieldWeight in 3403, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.2195506 = idf(docFreq=87, maxDocs=44218)
                0.078125 = fieldNorm(doc=3403)
          0.18987118 = weight(abstract_txt:empirical in 3403) [ClassicSimilarity], result of:
            0.18987118 = score(doc=3403,freq=1.0), product of:
              0.4664131 = queryWeight, product of:
                4.552381 = boost
                5.2107263 = idf(docFreq=655, maxDocs=44218)
                0.019662281 = queryNorm
              0.40708798 = fieldWeight in 3403, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2107263 = idf(docFreq=655, maxDocs=44218)
                0.078125 = fieldNorm(doc=3403)
        0.16 = coord(4/25)
    
  4. Hansen, P.; Järvelin, K.: Collaborative Information Retrieval in an information-intensive domain (2005) 0.09
    0.09128728 = sum of:
      0.09128728 = product of:
        0.4564364 = sum of:
          0.05031335 = weight(abstract_txt:effort in 1040) [ClassicSimilarity], result of:
            0.05031335 = score(doc=1040,freq=1.0), product of:
              0.11252876 = queryWeight, product of:
                5.723078 = idf(docFreq=392, maxDocs=44218)
                0.019662281 = queryNorm
              0.44711545 = fieldWeight in 1040, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.723078 = idf(docFreq=392, maxDocs=44218)
                0.078125 = fieldNorm(doc=1040)
          0.05228799 = weight(abstract_txt:generally in 1040) [ClassicSimilarity], result of:
            0.05228799 = score(doc=1040,freq=1.0), product of:
              0.1154541 = queryWeight, product of:
                1.0129148 = boost
                5.79699 = idf(docFreq=364, maxDocs=44218)
                0.019662281 = queryNorm
              0.45288983 = fieldWeight in 1040, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.79699 = idf(docFreq=364, maxDocs=44218)
                0.078125 = fieldNorm(doc=1040)
          0.063211374 = weight(abstract_txt:involving in 1040) [ClassicSimilarity], result of:
            0.063211374 = score(doc=1040,freq=1.0), product of:
              0.13102019 = queryWeight, product of:
                1.0790395 = boost
                6.1754265 = idf(docFreq=249, maxDocs=44218)
                0.019662281 = queryNorm
              0.4824552 = fieldWeight in 1040, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1754265 = idf(docFreq=249, maxDocs=44218)
                0.078125 = fieldNorm(doc=1040)
          0.10075253 = weight(abstract_txt:retrieval in 1040) [ClassicSimilarity], result of:
            0.10075253 = score(doc=1040,freq=5.0), product of:
              0.16596201 = queryWeight, product of:
                2.4288604 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.019662281 = queryNorm
              0.6070819 = fieldWeight in 1040, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.078125 = fieldNorm(doc=1040)
          0.18987118 = weight(abstract_txt:empirical in 1040) [ClassicSimilarity], result of:
            0.18987118 = score(doc=1040,freq=1.0), product of:
              0.4664131 = queryWeight, product of:
                4.552381 = boost
                5.2107263 = idf(docFreq=655, maxDocs=44218)
                0.019662281 = queryNorm
              0.40708798 = fieldWeight in 1040, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2107263 = idf(docFreq=655, maxDocs=44218)
                0.078125 = fieldNorm(doc=1040)
        0.2 = coord(5/25)
    
  5. Spink, A.; Saracevic, T.: Human-computer interaction in information retrieval : nature and manifestations of feedback (1998) 0.09
    0.08896583 = sum of:
      0.08896583 = product of:
        0.5560365 = sum of:
          0.075853646 = weight(abstract_txt:involving in 3763) [ClassicSimilarity], result of:
            0.075853646 = score(doc=3763,freq=1.0), product of:
              0.13102019 = queryWeight, product of:
                1.0790395 = boost
                6.1754265 = idf(docFreq=249, maxDocs=44218)
                0.019662281 = queryNorm
              0.57894623 = fieldWeight in 3763, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1754265 = idf(docFreq=249, maxDocs=44218)
                0.09375 = fieldNorm(doc=3763)
          0.037057716 = weight(abstract_txt:system in 3763) [ClassicSimilarity], result of:
            0.037057716 = score(doc=3763,freq=1.0), product of:
              0.117214166 = queryWeight, product of:
                1.767742 = boost
                3.3723085 = idf(docFreq=4123, maxDocs=44218)
                0.019662281 = queryNorm
              0.3161539 = fieldWeight in 3763, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3723085 = idf(docFreq=4123, maxDocs=44218)
                0.09375 = fieldNorm(doc=3763)
          0.12090303 = weight(abstract_txt:retrieval in 3763) [ClassicSimilarity], result of:
            0.12090303 = score(doc=3763,freq=5.0), product of:
              0.16596201 = queryWeight, product of:
                2.4288604 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.019662281 = queryNorm
              0.7284982 = fieldWeight in 3763, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.09375 = fieldNorm(doc=3763)
          0.32222205 = weight(abstract_txt:empirical in 3763) [ClassicSimilarity], result of:
            0.32222205 = score(doc=3763,freq=2.0), product of:
              0.4664131 = queryWeight, product of:
                4.552381 = boost
                5.2107263 = idf(docFreq=655, maxDocs=44218)
                0.019662281 = queryNorm
              0.6908512 = fieldWeight in 3763, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.2107263 = idf(docFreq=655, maxDocs=44218)
                0.09375 = fieldNorm(doc=3763)
        0.16 = coord(4/25)