Document (#19793)

Author
Oh, S.G.
Title
Document representation and retrieval using empirical facts : evaluation of a pilot system
Source
Journal of the American Society for Information Science. 49(1998) no.10, S.920-931
Year
1998
Abstract
This article investigates the potentialities of using empirical variables and their associated statistical relationships in document representation and retrieval. To this end, a newly devised empirical fact retrieval system (EFRS) was evaluated in comparison to a simulated traditional retrieval system (TRS) involving a set of predetermined empirical queries. Results indicate that the EFRS generally outperformed the TRS in terms of precision, search effort, and measures of user satisfaction. Possible advantages of the EFRS, as well as the necessity of establishing an efficient methos for extracting empirical facts, are discussed

Similar documents (content)

  1. Frisch, A.M.; Allen, J.F.: Knowledge retrieval as limited inference (1982) 0.11
    0.110331245 = sum of:
      0.110331245 = product of:
        0.45971352 = sum of:
          0.041710418 = weight(abstract_txt:efficient in 804) [ClassicSimilarity], result of:
            0.041710418 = score(doc=804,freq=1.0), product of:
              0.115356274 = queryWeight, product of:
                1.0100578 = boost
                5.7852654 = idf(docFreq=370, maxDocs=44421)
                0.019741116 = queryNorm
              0.3615791 = fieldWeight in 804, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7852654 = idf(docFreq=370, maxDocs=44421)
                0.0625 = fieldNorm(doc=804)
          0.017797299 = weight(abstract_txt:using in 804) [ClassicSimilarity], result of:
            0.017797299 = score(doc=804,freq=1.0), product of:
              0.08237415 = queryWeight, product of:
                1.2070801 = boost
                3.4568708 = idf(docFreq=3806, maxDocs=44421)
                0.019741116 = queryNorm
              0.21605442 = fieldWeight in 804, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4568708 = idf(docFreq=3806, maxDocs=44421)
                0.0625 = fieldNorm(doc=804)
          0.051500894 = weight(abstract_txt:representation in 804) [ClassicSimilarity], result of:
            0.051500894 = score(doc=804,freq=1.0), product of:
              0.16727513 = queryWeight, product of:
                1.7201103 = boost
                4.9261017 = idf(docFreq=875, maxDocs=44421)
                0.019741116 = queryNorm
              0.30788136 = fieldWeight in 804, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.9261017 = idf(docFreq=875, maxDocs=44421)
                0.0625 = fieldNorm(doc=804)
          0.035064936 = weight(abstract_txt:system in 804) [ClassicSimilarity], result of:
            0.035064936 = score(doc=804,freq=2.0), product of:
              0.11762257 = queryWeight, product of:
                1.766573 = boost
                3.372775 = idf(docFreq=4140, maxDocs=44421)
                0.019741116 = queryNorm
              0.298114 = fieldWeight in 804, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.372775 = idf(docFreq=4140, maxDocs=44421)
                0.0625 = fieldNorm(doc=804)
          0.0362044 = weight(abstract_txt:retrieval in 804) [ClassicSimilarity], result of:
            0.0362044 = score(doc=804,freq=1.0), product of:
              0.16662459 = queryWeight, product of:
                2.4278686 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.019741116 = queryNorm
              0.21728125 = fieldWeight in 804, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.0625 = fieldNorm(doc=804)
          0.27743557 = weight(abstract_txt:facts in 804) [ClassicSimilarity], result of:
            0.27743557 = score(doc=804,freq=3.0), product of:
              0.3564149 = queryWeight, product of:
                2.510837 = boost
                7.190608 = idf(docFreq=90, maxDocs=44421)
                0.019741116 = queryNorm
              0.77840614 = fieldWeight in 804, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.190608 = idf(docFreq=90, maxDocs=44421)
                0.0625 = fieldNorm(doc=804)
        0.24 = coord(6/25)
    
  2. Kostoff, R.N.; Eberhart, H.J.; Toothman, D.R.: Database tomography for information retrieval (1997) 0.10
    0.10420084 = sum of:
      0.10420084 = product of:
        0.5210042 = sum of:
          0.075248495 = weight(abstract_txt:involving in 1001) [ClassicSimilarity], result of:
            0.075248495 = score(doc=1001,freq=1.0), product of:
              0.13046156 = queryWeight, product of:
                1.0741549 = boost
                6.1523914 = idf(docFreq=256, maxDocs=44421)
                0.019741116 = queryNorm
              0.5767867 = fieldWeight in 1001, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1523914 = idf(docFreq=256, maxDocs=44421)
                0.09375 = fieldNorm(doc=1001)
          0.10877896 = weight(abstract_txt:newly in 1001) [ClassicSimilarity], result of:
            0.10877896 = score(doc=1001,freq=1.0), product of:
              0.16679408 = queryWeight, product of:
                1.2145514 = boost
                6.9565353 = idf(docFreq=114, maxDocs=44421)
                0.019741116 = queryNorm
              0.6521752 = fieldWeight in 1001, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9565353 = idf(docFreq=114, maxDocs=44421)
                0.09375 = fieldNorm(doc=1001)
          0.2229836 = weight(abstract_txt:simulated in 1001) [ClassicSimilarity], result of:
            0.2229836 = score(doc=1001,freq=2.0), product of:
              0.21362692 = queryWeight, product of:
                1.374528 = boost
                7.872826 = idf(docFreq=45, maxDocs=44421)
                0.019741116 = queryNorm
              1.0437992 = fieldWeight in 1001, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.872826 = idf(docFreq=45, maxDocs=44421)
                0.09375 = fieldNorm(doc=1001)
          0.037191983 = weight(abstract_txt:system in 1001) [ClassicSimilarity], result of:
            0.037191983 = score(doc=1001,freq=1.0), product of:
              0.11762257 = queryWeight, product of:
                1.766573 = boost
                3.372775 = idf(docFreq=4140, maxDocs=44421)
                0.019741116 = queryNorm
              0.31619766 = fieldWeight in 1001, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.372775 = idf(docFreq=4140, maxDocs=44421)
                0.09375 = fieldNorm(doc=1001)
          0.07680113 = weight(abstract_txt:retrieval in 1001) [ClassicSimilarity], result of:
            0.07680113 = score(doc=1001,freq=2.0), product of:
              0.16662459 = queryWeight, product of:
                2.4278686 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.019741116 = queryNorm
              0.46092314 = fieldWeight in 1001, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.09375 = fieldNorm(doc=1001)
        0.2 = coord(5/25)
    
  3. Suchanek, F.M.; Kasneci, G.; Weikum, G.: YAGO: a core of semantic knowledge unifying WordNet and Wikipedia (2007) 0.10
    0.09773043 = sum of:
      0.09773043 = product of:
        0.61081517 = sum of:
          0.0522843 = weight(abstract_txt:fact in 390) [ClassicSimilarity], result of:
            0.0522843 = score(doc=390,freq=1.0), product of:
              0.11557194 = queryWeight, product of:
                1.0110016 = boost
                5.790671 = idf(docFreq=368, maxDocs=44421)
                0.019741116 = queryNorm
              0.45239615 = fieldWeight in 390, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.790671 = idf(docFreq=368, maxDocs=44421)
                0.078125 = fieldNorm(doc=390)
          0.022246623 = weight(abstract_txt:using in 390) [ClassicSimilarity], result of:
            0.022246623 = score(doc=390,freq=1.0), product of:
              0.08237415 = queryWeight, product of:
                1.2070801 = boost
                3.4568708 = idf(docFreq=3806, maxDocs=44421)
                0.019741116 = queryNorm
              0.27006802 = fieldWeight in 390, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4568708 = idf(docFreq=3806, maxDocs=44421)
                0.078125 = fieldNorm(doc=390)
          0.34679446 = weight(abstract_txt:facts in 390) [ClassicSimilarity], result of:
            0.34679446 = score(doc=390,freq=3.0), product of:
              0.3564149 = queryWeight, product of:
                2.510837 = boost
                7.190608 = idf(docFreq=90, maxDocs=44421)
                0.019741116 = queryNorm
              0.9730077 = fieldWeight in 390, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.190608 = idf(docFreq=90, maxDocs=44421)
                0.078125 = fieldNorm(doc=390)
          0.18948975 = weight(abstract_txt:empirical in 390) [ClassicSimilarity], result of:
            0.18948975 = score(doc=390,freq=1.0), product of:
              0.4662856 = queryWeight, product of:
                4.5408444 = boost
                5.2016807 = idf(docFreq=664, maxDocs=44421)
                0.019741116 = queryNorm
              0.4063813 = fieldWeight in 390, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2016807 = idf(docFreq=664, maxDocs=44421)
                0.078125 = fieldNorm(doc=390)
        0.16 = coord(4/25)
    
  4. Hansen, P.; Järvelin, K.: Collaborative Information Retrieval in an information-intensive domain (2005) 0.09
    0.09129858 = sum of:
      0.09129858 = product of:
        0.4564929 = sum of:
          0.050595958 = weight(abstract_txt:effort in 2040) [ClassicSimilarity], result of:
            0.050595958 = score(doc=2040,freq=1.0), product of:
              0.113070354 = queryWeight, product of:
                5.727658 = idf(docFreq=392, maxDocs=44421)
                0.019741116 = queryNorm
              0.44747326 = fieldWeight in 2040, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.727658 = idf(docFreq=392, maxDocs=44421)
                0.078125 = fieldNorm(doc=2040)
          0.05250574 = weight(abstract_txt:generally in 2040) [ClassicSimilarity], result of:
            0.05250574 = score(doc=2040,freq=1.0), product of:
              0.11589803 = queryWeight, product of:
                1.0124269 = boost
                5.7988343 = idf(docFreq=365, maxDocs=44421)
                0.019741116 = queryNorm
              0.45303392 = fieldWeight in 2040, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7988343 = idf(docFreq=365, maxDocs=44421)
                0.078125 = fieldNorm(doc=2040)
          0.062707074 = weight(abstract_txt:involving in 2040) [ClassicSimilarity], result of:
            0.062707074 = score(doc=2040,freq=1.0), product of:
              0.13046156 = queryWeight, product of:
                1.0741549 = boost
                6.1523914 = idf(docFreq=256, maxDocs=44421)
                0.019741116 = queryNorm
              0.48065558 = fieldWeight in 2040, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1523914 = idf(docFreq=256, maxDocs=44421)
                0.078125 = fieldNorm(doc=2040)
          0.10119438 = weight(abstract_txt:retrieval in 2040) [ClassicSimilarity], result of:
            0.10119438 = score(doc=2040,freq=5.0), product of:
              0.16662459 = queryWeight, product of:
                2.4278686 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.019741116 = queryNorm
              0.6073196 = fieldWeight in 2040, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.078125 = fieldNorm(doc=2040)
          0.18948975 = weight(abstract_txt:empirical in 2040) [ClassicSimilarity], result of:
            0.18948975 = score(doc=2040,freq=1.0), product of:
              0.4662856 = queryWeight, product of:
                4.5408444 = boost
                5.2016807 = idf(docFreq=664, maxDocs=44421)
                0.019741116 = queryNorm
              0.4063813 = fieldWeight in 2040, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2016807 = idf(docFreq=664, maxDocs=44421)
                0.078125 = fieldNorm(doc=2040)
        0.2 = coord(5/25)
    
  5. Spink, A.; Saracevic, T.: Human-computer interaction in information retrieval : nature and manifestations of feedback (1998) 0.09
    0.088871755 = sum of:
      0.088871755 = product of:
        0.5554485 = sum of:
          0.075248495 = weight(abstract_txt:involving in 4763) [ClassicSimilarity], result of:
            0.075248495 = score(doc=4763,freq=1.0), product of:
              0.13046156 = queryWeight, product of:
                1.0741549 = boost
                6.1523914 = idf(docFreq=256, maxDocs=44421)
                0.019741116 = queryNorm
              0.5767867 = fieldWeight in 4763, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1523914 = idf(docFreq=256, maxDocs=44421)
                0.09375 = fieldNorm(doc=4763)
          0.037191983 = weight(abstract_txt:system in 4763) [ClassicSimilarity], result of:
            0.037191983 = score(doc=4763,freq=1.0), product of:
              0.11762257 = queryWeight, product of:
                1.766573 = boost
                3.372775 = idf(docFreq=4140, maxDocs=44421)
                0.019741116 = queryNorm
              0.31619766 = fieldWeight in 4763, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.372775 = idf(docFreq=4140, maxDocs=44421)
                0.09375 = fieldNorm(doc=4763)
          0.12143325 = weight(abstract_txt:retrieval in 4763) [ClassicSimilarity], result of:
            0.12143325 = score(doc=4763,freq=5.0), product of:
              0.16662459 = queryWeight, product of:
                2.4278686 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.019741116 = queryNorm
              0.7287835 = fieldWeight in 4763, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.09375 = fieldNorm(doc=4763)
          0.32157475 = weight(abstract_txt:empirical in 4763) [ClassicSimilarity], result of:
            0.32157475 = score(doc=4763,freq=2.0), product of:
              0.4662856 = queryWeight, product of:
                4.5408444 = boost
                5.2016807 = idf(docFreq=664, maxDocs=44421)
                0.019741116 = queryNorm
              0.6896519 = fieldWeight in 4763, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.2016807 = idf(docFreq=664, maxDocs=44421)
                0.09375 = fieldNorm(doc=4763)
        0.16 = coord(4/25)