Document (#3117)

Author
Latham, S.J.
Title
Beyond Boolean logic : probabilistic approaches to text retrieval
Source
Law librarian. 22(1991) no.3, S.157-163
Year
1991
Abstract
Text retrieval systems in use today are predominantly based on Boolean techniques. Search statements consisting of search terms and Boolean operators are matches against an inverted file to retrieve a set of records for full display. This approach to text retrieval has a number of limitations which may be overcome by presenting a set of documents ranked in order of their degree of similarity to the search statement. Such probabilistic techniques have not yet been incorporated into commercially available products on a large scale. This is due to inertia among users and producers and results from the nature of the market for text retrieval systems. Probabilistic techniques are likely to play an increasingly important part in text retrieval systems in the future

Similar documents (author)

  1. Latham, S.J.: Open systems and libraries : an overview (1993) 6.10
    6.0972233 = sum of:
      6.0972233 = weight(author_txt:latham in 5566) [ClassicSimilarity], result of:
        6.0972233 = fieldWeight in 5566, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.755557 = idf(docFreq=6, maxDocs=44421)
          0.625 = fieldNorm(doc=5566)
    
  2. Latham, D.: Information architectures : notes toward a new curriculum (2002) 6.10
    6.0972233 = sum of:
      6.0972233 = weight(author_txt:latham in 2009) [ClassicSimilarity], result of:
        6.0972233 = fieldWeight in 2009, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.755557 = idf(docFreq=6, maxDocs=44421)
          0.625 = fieldNorm(doc=2009)
    
  3. Latham, K.F.: Museum object as document : using Buckland's information concepts to understand museum experiences (2012) 6.10
    6.0972233 = sum of:
      6.0972233 = weight(author_txt:latham in 1298) [ClassicSimilarity], result of:
        6.0972233 = fieldWeight in 1298, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.755557 = idf(docFreq=6, maxDocs=44421)
          0.625 = fieldNorm(doc=1298)
    
  4. Gross, M.; Latham, D.: What's skill got to do with it? : information literacy skills and self-views of ability among first-year college students (2012) 4.88
    4.8777785 = sum of:
      4.8777785 = weight(author_txt:latham in 1075) [ClassicSimilarity], result of:
        4.8777785 = fieldWeight in 1075, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.755557 = idf(docFreq=6, maxDocs=44421)
          0.5 = fieldNorm(doc=1075)
    
  5. Hajibayova, L.; Latham, K.F.: Exploring museum crowdsourcing projects through Bordieu's lens (2017) 4.88
    4.8777785 = sum of:
      4.8777785 = weight(author_txt:latham in 129) [ClassicSimilarity], result of:
        4.8777785 = fieldWeight in 129, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.755557 = idf(docFreq=6, maxDocs=44421)
          0.5 = fieldNorm(doc=129)
    

Similar documents (content)

  1. Larson, R.R.: Information retrieval systems (2009) 0.27
    0.27017218 = sum of:
      0.27017218 = product of:
        0.84428805 = sum of:
          0.113147154 = weight(abstract_txt:inverted in 804) [ClassicSimilarity], result of:
            0.113147154 = score(doc=804,freq=1.0), product of:
              0.188674 = queryWeight, product of:
                1.2826791 = boost
                7.676116 = idf(docFreq=55, maxDocs=44421)
                0.019162517 = queryNorm
              0.5996966 = fieldWeight in 804, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.676116 = idf(docFreq=55, maxDocs=44421)
                0.078125 = fieldNorm(doc=804)
          0.05159564 = weight(abstract_txt:systems in 804) [ClassicSimilarity], result of:
            0.05159564 = score(doc=804,freq=3.0), product of:
              0.11177851 = queryWeight, product of:
                1.7100222 = boost
                3.411175 = idf(docFreq=3984, maxDocs=44421)
                0.019162517 = queryNorm
              0.46158817 = fieldWeight in 804, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.411175 = idf(docFreq=3984, maxDocs=44421)
                0.078125 = fieldNorm(doc=804)
          0.036631875 = weight(abstract_txt:search in 804) [ClassicSimilarity], result of:
            0.036631875 = score(doc=804,freq=1.0), product of:
              0.12830085 = queryWeight, product of:
                1.8320501 = boost
                3.654598 = idf(docFreq=3123, maxDocs=44421)
                0.019162517 = queryNorm
              0.28551546 = fieldWeight in 804, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.654598 = idf(docFreq=3123, maxDocs=44421)
                0.078125 = fieldNorm(doc=804)
          0.06986244 = weight(abstract_txt:techniques in 804) [ClassicSimilarity], result of:
            0.06986244 = score(doc=804,freq=1.0), product of:
              0.19731158 = queryWeight, product of:
                2.2719505 = boost
                4.5321174 = idf(docFreq=1298, maxDocs=44421)
                0.019162517 = queryNorm
              0.35407168 = fieldWeight in 804, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5321174 = idf(docFreq=1298, maxDocs=44421)
                0.078125 = fieldNorm(doc=804)
          0.07432429 = weight(abstract_txt:retrieval in 804) [ClassicSimilarity], result of:
            0.07432429 = score(doc=804,freq=2.0), product of:
              0.19350113 = queryWeight, product of:
                2.9046159 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.019162517 = queryNorm
              0.3841026 = fieldWeight in 804, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.078125 = fieldNorm(doc=804)
          0.1836395 = weight(abstract_txt:boolean in 804) [ClassicSimilarity], result of:
            0.1836395 = score(doc=804,freq=1.0), product of:
              0.37580925 = queryWeight, product of:
                3.1354966 = boost
                6.25473 = idf(docFreq=231, maxDocs=44421)
                0.019162517 = queryNorm
              0.4886508 = fieldWeight in 804, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.25473 = idf(docFreq=231, maxDocs=44421)
                0.078125 = fieldNorm(doc=804)
          0.082531095 = weight(abstract_txt:text in 804) [ClassicSimilarity], result of:
            0.082531095 = score(doc=804,freq=1.0), product of:
              0.26142758 = queryWeight, product of:
                3.3761573 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.019162517 = queryNorm
              0.3156939 = fieldWeight in 804, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.078125 = fieldNorm(doc=804)
          0.23255603 = weight(abstract_txt:probabilistic in 804) [ClassicSimilarity], result of:
            0.23255603 = score(doc=804,freq=1.0), product of:
              0.43988767 = queryWeight, product of:
                3.392294 = boost
                6.7669935 = idf(docFreq=138, maxDocs=44421)
                0.019162517 = queryNorm
              0.5286714 = fieldWeight in 804, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.7669935 = idf(docFreq=138, maxDocs=44421)
                0.078125 = fieldNorm(doc=804)
        0.32 = coord(8/25)
    
  2. Poynder, R.: Web research engines? (1996) 0.25
    0.2512262 = sum of:
      0.2512262 = product of:
        0.78508186 = sum of:
          0.06071264 = weight(abstract_txt:logic in 6698) [ClassicSimilarity], result of:
            0.06071264 = score(doc=6698,freq=1.0), product of:
              0.12458595 = queryWeight, product of:
                1.042309 = boost
                6.2376356 = idf(docFreq=235, maxDocs=44421)
                0.019162517 = queryNorm
              0.4873153 = fieldWeight in 6698, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2376356 = idf(docFreq=235, maxDocs=44421)
                0.078125 = fieldNorm(doc=6698)
          0.06726598 = weight(abstract_txt:overcome in 6698) [ClassicSimilarity], result of:
            0.06726598 = score(doc=6698,freq=1.0), product of:
              0.13339718 = queryWeight, product of:
                1.0785376 = boost
                6.4544435 = idf(docFreq=189, maxDocs=44421)
                0.019162517 = queryNorm
              0.5042534 = fieldWeight in 6698, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.4544435 = idf(docFreq=189, maxDocs=44421)
                0.078125 = fieldNorm(doc=6698)
          0.113147154 = weight(abstract_txt:inverted in 6698) [ClassicSimilarity], result of:
            0.113147154 = score(doc=6698,freq=1.0), product of:
              0.188674 = queryWeight, product of:
                1.2826791 = boost
                7.676116 = idf(docFreq=55, maxDocs=44421)
                0.019162517 = queryNorm
              0.5996966 = fieldWeight in 6698, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.676116 = idf(docFreq=55, maxDocs=44421)
                0.078125 = fieldNorm(doc=6698)
          0.042127665 = weight(abstract_txt:systems in 6698) [ClassicSimilarity], result of:
            0.042127665 = score(doc=6698,freq=2.0), product of:
              0.11177851 = queryWeight, product of:
                1.7100222 = boost
                3.411175 = idf(docFreq=3984, maxDocs=44421)
                0.019162517 = queryNorm
              0.37688518 = fieldWeight in 6698, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.411175 = idf(docFreq=3984, maxDocs=44421)
                0.078125 = fieldNorm(doc=6698)
          0.089729406 = weight(abstract_txt:search in 6698) [ClassicSimilarity], result of:
            0.089729406 = score(doc=6698,freq=6.0), product of:
              0.12830085 = queryWeight, product of:
                1.8320501 = boost
                3.654598 = idf(docFreq=3123, maxDocs=44421)
                0.019162517 = queryNorm
              0.6993672 = fieldWeight in 6698, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                3.654598 = idf(docFreq=3123, maxDocs=44421)
                0.078125 = fieldNorm(doc=6698)
          0.06986244 = weight(abstract_txt:techniques in 6698) [ClassicSimilarity], result of:
            0.06986244 = score(doc=6698,freq=1.0), product of:
              0.19731158 = queryWeight, product of:
                2.2719505 = boost
                4.5321174 = idf(docFreq=1298, maxDocs=44421)
                0.019162517 = queryNorm
              0.35407168 = fieldWeight in 6698, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5321174 = idf(docFreq=1298, maxDocs=44421)
                0.078125 = fieldNorm(doc=6698)
          0.25970545 = weight(abstract_txt:boolean in 6698) [ClassicSimilarity], result of:
            0.25970545 = score(doc=6698,freq=2.0), product of:
              0.37580925 = queryWeight, product of:
                3.1354966 = boost
                6.25473 = idf(docFreq=231, maxDocs=44421)
                0.019162517 = queryNorm
              0.69105655 = fieldWeight in 6698, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.25473 = idf(docFreq=231, maxDocs=44421)
                0.078125 = fieldNorm(doc=6698)
          0.082531095 = weight(abstract_txt:text in 6698) [ClassicSimilarity], result of:
            0.082531095 = score(doc=6698,freq=1.0), product of:
              0.26142758 = queryWeight, product of:
                3.3761573 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.019162517 = queryNorm
              0.3156939 = fieldWeight in 6698, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.078125 = fieldNorm(doc=6698)
        0.32 = coord(8/25)
    
  3. Samstag-Schnock, U.; Meadow, C.T.: PBS: an ecomical natural language query interpreter (1993) 0.23
    0.22743294 = sum of:
      0.22743294 = product of:
        0.94763726 = sum of:
          0.097140215 = weight(abstract_txt:logic in 5090) [ClassicSimilarity], result of:
            0.097140215 = score(doc=5090,freq=1.0), product of:
              0.12458595 = queryWeight, product of:
                1.042309 = boost
                6.2376356 = idf(docFreq=235, maxDocs=44421)
                0.019162517 = queryNorm
              0.77970445 = fieldWeight in 5090, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2376356 = idf(docFreq=235, maxDocs=44421)
                0.125 = fieldNorm(doc=5090)
          0.122476436 = weight(abstract_txt:statements in 5090) [ClassicSimilarity], result of:
            0.122476436 = score(doc=5090,freq=1.0), product of:
              0.14540233 = queryWeight, product of:
                1.126024 = boost
                6.738623 = idf(docFreq=142, maxDocs=44421)
                0.019162517 = queryNorm
              0.8423279 = fieldWeight in 5090, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.738623 = idf(docFreq=142, maxDocs=44421)
                0.125 = fieldNorm(doc=5090)
          0.14551505 = weight(abstract_txt:operators in 5090) [ClassicSimilarity], result of:
            0.14551505 = score(doc=5090,freq=1.0), product of:
              0.16310789 = queryWeight, product of:
                1.1926128 = boost
                7.1371193 = idf(docFreq=95, maxDocs=44421)
                0.019162517 = queryNorm
              0.8921399 = fieldWeight in 5090, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.1371193 = idf(docFreq=95, maxDocs=44421)
                0.125 = fieldNorm(doc=5090)
          0.08288848 = weight(abstract_txt:search in 5090) [ClassicSimilarity], result of:
            0.08288848 = score(doc=5090,freq=2.0), product of:
              0.12830085 = queryWeight, product of:
                1.8320501 = boost
                3.654598 = idf(docFreq=3123, maxDocs=44421)
                0.019162517 = queryNorm
              0.6460478 = fieldWeight in 5090, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.654598 = idf(docFreq=3123, maxDocs=44421)
                0.125 = fieldNorm(doc=5090)
          0.08408833 = weight(abstract_txt:retrieval in 5090) [ClassicSimilarity], result of:
            0.08408833 = score(doc=5090,freq=1.0), product of:
              0.19350113 = queryWeight, product of:
                2.9046159 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.019162517 = queryNorm
              0.4345625 = fieldWeight in 5090, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.125 = fieldNorm(doc=5090)
          0.4155287 = weight(abstract_txt:boolean in 5090) [ClassicSimilarity], result of:
            0.4155287 = score(doc=5090,freq=2.0), product of:
              0.37580925 = queryWeight, product of:
                3.1354966 = boost
                6.25473 = idf(docFreq=231, maxDocs=44421)
                0.019162517 = queryNorm
              1.1056905 = fieldWeight in 5090, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.25473 = idf(docFreq=231, maxDocs=44421)
                0.125 = fieldNorm(doc=5090)
        0.24 = coord(6/25)
    
  4. Losee, R.M.: Comparing Boolean and probabilistic information retrieval systems across queries and disciplines (1997) 0.23
    0.22640309 = sum of:
      0.22640309 = product of:
        0.9433462 = sum of:
          0.10913629 = weight(abstract_txt:operators in 778) [ClassicSimilarity], result of:
            0.10913629 = score(doc=778,freq=1.0), product of:
              0.16310789 = queryWeight, product of:
                1.1926128 = boost
                7.1371193 = idf(docFreq=95, maxDocs=44421)
                0.019162517 = queryNorm
              0.66910493 = fieldWeight in 778, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.1371193 = idf(docFreq=95, maxDocs=44421)
                0.09375 = fieldNorm(doc=778)
          0.035746507 = weight(abstract_txt:systems in 778) [ClassicSimilarity], result of:
            0.035746507 = score(doc=778,freq=1.0), product of:
              0.11177851 = queryWeight, product of:
                1.7100222 = boost
                3.411175 = idf(docFreq=3984, maxDocs=44421)
                0.019162517 = queryNorm
              0.31979766 = fieldWeight in 778, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.411175 = idf(docFreq=3984, maxDocs=44421)
                0.09375 = fieldNorm(doc=778)
          0.11856049 = weight(abstract_txt:techniques in 778) [ClassicSimilarity], result of:
            0.11856049 = score(doc=778,freq=2.0), product of:
              0.19731158 = queryWeight, product of:
                2.2719505 = boost
                4.5321174 = idf(docFreq=1298, maxDocs=44421)
                0.019162517 = queryNorm
              0.60087955 = fieldWeight in 778, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.5321174 = idf(docFreq=1298, maxDocs=44421)
                0.09375 = fieldNorm(doc=778)
          0.08918915 = weight(abstract_txt:retrieval in 778) [ClassicSimilarity], result of:
            0.08918915 = score(doc=778,freq=2.0), product of:
              0.19350113 = queryWeight, product of:
                2.9046159 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.019162517 = queryNorm
              0.46092314 = fieldWeight in 778, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.09375 = fieldNorm(doc=778)
          0.31164652 = weight(abstract_txt:boolean in 778) [ClassicSimilarity], result of:
            0.31164652 = score(doc=778,freq=2.0), product of:
              0.37580925 = queryWeight, product of:
                3.1354966 = boost
                6.25473 = idf(docFreq=231, maxDocs=44421)
                0.019162517 = queryNorm
              0.82926786 = fieldWeight in 778, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.25473 = idf(docFreq=231, maxDocs=44421)
                0.09375 = fieldNorm(doc=778)
          0.27906722 = weight(abstract_txt:probabilistic in 778) [ClassicSimilarity], result of:
            0.27906722 = score(doc=778,freq=1.0), product of:
              0.43988767 = queryWeight, product of:
                3.392294 = boost
                6.7669935 = idf(docFreq=138, maxDocs=44421)
                0.019162517 = queryNorm
              0.6344056 = fieldWeight in 778, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.7669935 = idf(docFreq=138, maxDocs=44421)
                0.09375 = fieldNorm(doc=778)
        0.24 = coord(6/25)
    
  5. Nahl, D.; Harada, V.H.: Composing Boolean search statements : self-confidence, concept analysis, search logic and errors (1996) 0.19
    0.1943888 = sum of:
      0.1943888 = product of:
        0.8099534 = sum of:
          0.07285516 = weight(abstract_txt:logic in 6676) [ClassicSimilarity], result of:
            0.07285516 = score(doc=6676,freq=1.0), product of:
              0.12458595 = queryWeight, product of:
                1.042309 = boost
                6.2376356 = idf(docFreq=235, maxDocs=44421)
                0.019162517 = queryNorm
              0.5847783 = fieldWeight in 6676, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2376356 = idf(docFreq=235, maxDocs=44421)
                0.09375 = fieldNorm(doc=6676)
          0.12990588 = weight(abstract_txt:statements in 6676) [ClassicSimilarity], result of:
            0.12990588 = score(doc=6676,freq=2.0), product of:
              0.14540233 = queryWeight, product of:
                1.126024 = boost
                6.738623 = idf(docFreq=142, maxDocs=44421)
                0.019162517 = queryNorm
              0.8934236 = fieldWeight in 6676, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.738623 = idf(docFreq=142, maxDocs=44421)
                0.09375 = fieldNorm(doc=6676)
          0.17001337 = weight(abstract_txt:statement in 6676) [ClassicSimilarity], result of:
            0.17001337 = score(doc=6676,freq=3.0), product of:
              0.15197673 = queryWeight, product of:
                1.1511993 = boost
                6.889283 = idf(docFreq=122, maxDocs=44421)
                0.019162517 = queryNorm
              1.1186802 = fieldWeight in 6676, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.889283 = idf(docFreq=122, maxDocs=44421)
                0.09375 = fieldNorm(doc=6676)
          0.10913629 = weight(abstract_txt:operators in 6676) [ClassicSimilarity], result of:
            0.10913629 = score(doc=6676,freq=1.0), product of:
              0.16310789 = queryWeight, product of:
                1.1926128 = boost
                7.1371193 = idf(docFreq=95, maxDocs=44421)
                0.019162517 = queryNorm
              0.66910493 = fieldWeight in 6676, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.1371193 = idf(docFreq=95, maxDocs=44421)
                0.09375 = fieldNorm(doc=6676)
          0.107675284 = weight(abstract_txt:search in 6676) [ClassicSimilarity], result of:
            0.107675284 = score(doc=6676,freq=6.0), product of:
              0.12830085 = queryWeight, product of:
                1.8320501 = boost
                3.654598 = idf(docFreq=3123, maxDocs=44421)
                0.019162517 = queryNorm
              0.8392407 = fieldWeight in 6676, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                3.654598 = idf(docFreq=3123, maxDocs=44421)
                0.09375 = fieldNorm(doc=6676)
          0.22036739 = weight(abstract_txt:boolean in 6676) [ClassicSimilarity], result of:
            0.22036739 = score(doc=6676,freq=1.0), product of:
              0.37580925 = queryWeight, product of:
                3.1354966 = boost
                6.25473 = idf(docFreq=231, maxDocs=44421)
                0.019162517 = queryNorm
              0.58638096 = fieldWeight in 6676, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.25473 = idf(docFreq=231, maxDocs=44421)
                0.09375 = fieldNorm(doc=6676)
        0.24 = coord(6/25)