Document (#21516)

Author
Stanfill, C.
Title
Parallel information retrieval algorithms
Source
Information retrieval: data structures and algorithms. Ed.: W.B. Frakes u. R. Baeza-Yates
Imprint
Englewood Cliffs, NJ : Prentice Hall
Year
1992
Pages
S.459-496
Abstract
Data Parallel computers, such as the connection Machine CM-2, can provide interactive access to text databases containign tens, hundreds or even thousands of Gigabytes of data. Starts by presenting a brief overview of data parallel computing, a performance model of the CM-2, and a model of the workload involved in searching text databases. Discusses various algorithms used in information retrieval and gives performance estimates based on the data and procssing models presented
Theme
Retrievalalgorithmen

Similar documents (content)

  1. Couvreur, T.R.; Benzel, R.N.; Miller, S.F.; Zeitler, D.N.; Lee, D.L.; Singhal, M.; Shivaratri, N.; Wong, W.Y.P.: ¬An analysis of performance and cost factors in searching large text databases using parallel search systems (1994) 0.26
    0.26407835 = sum of:
      0.26407835 = product of:
        1.1003265 = sum of:
          0.06834338 = weight(abstract_txt:text in 7656) [ClassicSimilarity], result of:
            0.06834338 = score(doc=7656,freq=2.0), product of:
              0.12756573 = queryWeight, product of:
                1.5875323 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.019885443 = queryNorm
              0.5357503 = fieldWeight in 7656, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.09375 = fieldNorm(doc=7656)
          0.4185043 = weight(abstract_txt:workload in 7656) [ClassicSimilarity], result of:
            0.4185043 = score(doc=7656,freq=3.0), product of:
              0.29604828 = queryWeight, product of:
                1.7101012 = boost
                8.705735 = idf(docFreq=19, maxDocs=44421)
                0.019885443 = queryNorm
              1.4136353 = fieldWeight in 7656, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.705735 = idf(docFreq=19, maxDocs=44421)
                0.09375 = fieldNorm(doc=7656)
          0.06298288 = weight(abstract_txt:databases in 7656) [ClassicSimilarity], result of:
            0.06298288 = score(doc=7656,freq=1.0), product of:
              0.15220469 = queryWeight, product of:
                1.7340817 = boost
                4.413907 = idf(docFreq=1461, maxDocs=44421)
                0.019885443 = queryNorm
              0.4138038 = fieldWeight in 7656, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.413907 = idf(docFreq=1461, maxDocs=44421)
                0.09375 = fieldNorm(doc=7656)
          0.1250753 = weight(abstract_txt:performance in 7656) [ClassicSimilarity], result of:
            0.1250753 = score(doc=7656,freq=3.0), product of:
              0.16673252 = queryWeight, product of:
                1.8149544 = boost
                4.619759 = idf(docFreq=1189, maxDocs=44421)
                0.019885443 = queryNorm
              0.7501554 = fieldWeight in 7656, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.619759 = idf(docFreq=1189, maxDocs=44421)
                0.09375 = fieldNorm(doc=7656)
          0.13564055 = weight(abstract_txt:algorithms in 7656) [ClassicSimilarity], result of:
            0.13564055 = score(doc=7656,freq=1.0), product of:
              0.2538279 = queryWeight, product of:
                2.2393668 = boost
                5.7000527 = idf(docFreq=403, maxDocs=44421)
                0.019885443 = queryNorm
              0.53437996 = fieldWeight in 7656, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7000527 = idf(docFreq=403, maxDocs=44421)
                0.09375 = fieldNorm(doc=7656)
          0.2897801 = weight(abstract_txt:parallel in 7656) [ClassicSimilarity], result of:
            0.2897801 = score(doc=7656,freq=1.0), product of:
              0.48197275 = queryWeight, product of:
                3.7793093 = boost
                6.4132004 = idf(docFreq=197, maxDocs=44421)
                0.019885443 = queryNorm
              0.60123754 = fieldWeight in 7656, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.4132004 = idf(docFreq=197, maxDocs=44421)
                0.09375 = fieldNorm(doc=7656)
        0.24 = coord(6/25)
    
  2. MacFarlane, A.; Robertson, S.E.; McCann, J.A.: Parallel computing in information retrieval : an updated review (1997) 0.24
    0.24488346 = sum of:
      0.24488346 = product of:
        1.2244173 = sum of:
          0.07472055 = weight(abstract_txt:gives in 519) [ClassicSimilarity], result of:
            0.07472055 = score(doc=519,freq=1.0), product of:
              0.11175593 = queryWeight, product of:
                1.050693 = boost
                5.3488383 = idf(docFreq=573, maxDocs=44421)
                0.019885443 = queryNorm
              0.6686048 = fieldWeight in 519, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.3488383 = idf(docFreq=573, maxDocs=44421)
                0.125 = fieldNorm(doc=519)
          0.15023583 = weight(abstract_txt:computing in 519) [ClassicSimilarity], result of:
            0.15023583 = score(doc=519,freq=2.0), product of:
              0.1413024 = queryWeight, product of:
                1.18145 = boost
                6.014492 = idf(docFreq=294, maxDocs=44421)
                0.019885443 = queryNorm
              1.063222 = fieldWeight in 519, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.014492 = idf(docFreq=294, maxDocs=44421)
                0.125 = fieldNorm(doc=519)
          0.07106881 = weight(abstract_txt:retrieval in 519) [ClassicSimilarity], result of:
            0.07106881 = score(doc=519,freq=3.0), product of:
              0.09442047 = queryWeight, product of:
                1.3658048 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.019885443 = queryNorm
              0.7526843 = fieldWeight in 519, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.125 = fieldNorm(doc=519)
          0.06443476 = weight(abstract_txt:text in 519) [ClassicSimilarity], result of:
            0.06443476 = score(doc=519,freq=1.0), product of:
              0.12756573 = queryWeight, product of:
                1.5875323 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.019885443 = queryNorm
              0.50511026 = fieldWeight in 519, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.125 = fieldNorm(doc=519)
          0.86395735 = weight(abstract_txt:parallel in 519) [ClassicSimilarity], result of:
            0.86395735 = score(doc=519,freq=5.0), product of:
              0.48197275 = queryWeight, product of:
                3.7793093 = boost
                6.4132004 = idf(docFreq=197, maxDocs=44421)
                0.019885443 = queryNorm
              1.792544 = fieldWeight in 519, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                6.4132004 = idf(docFreq=197, maxDocs=44421)
                0.125 = fieldNorm(doc=519)
        0.2 = coord(5/25)
    
  3. Xu, J.; Weischedel, R.: Empirical studies on the impact of lexical resources on CLIR performance (2005) 0.21
    0.2100814 = sum of:
      0.2100814 = product of:
        0.87533915 = sum of:
          0.044792354 = weight(abstract_txt:machine in 2020) [ClassicSimilarity], result of:
            0.044792354 = score(doc=2020,freq=1.0), product of:
              0.10869089 = queryWeight, product of:
                1.0361845 = boost
                5.274979 = idf(docFreq=617, maxDocs=44421)
                0.019885443 = queryNorm
              0.41210774 = fieldWeight in 2020, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.274979 = idf(docFreq=617, maxDocs=44421)
                0.078125 = fieldNorm(doc=2020)
          0.025644748 = weight(abstract_txt:retrieval in 2020) [ClassicSimilarity], result of:
            0.025644748 = score(doc=2020,freq=1.0), product of:
              0.09442047 = queryWeight, product of:
                1.3658048 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.019885443 = queryNorm
              0.27160156 = fieldWeight in 2020, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.078125 = fieldNorm(doc=2020)
          0.038559496 = weight(abstract_txt:model in 2020) [ClassicSimilarity], result of:
            0.038559496 = score(doc=2020,freq=1.0), product of:
              0.123923816 = queryWeight, product of:
                1.5647067 = boost
                3.9827821 = idf(docFreq=2249, maxDocs=44421)
                0.019885443 = queryNorm
              0.31115484 = fieldWeight in 2020, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9827821 = idf(docFreq=2249, maxDocs=44421)
                0.078125 = fieldNorm(doc=2020)
          0.040271726 = weight(abstract_txt:text in 2020) [ClassicSimilarity], result of:
            0.040271726 = score(doc=2020,freq=1.0), product of:
              0.12756573 = queryWeight, product of:
                1.5875323 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.019885443 = queryNorm
              0.3156939 = fieldWeight in 2020, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.078125 = fieldNorm(doc=2020)
          0.13455959 = weight(abstract_txt:performance in 2020) [ClassicSimilarity], result of:
            0.13455959 = score(doc=2020,freq=5.0), product of:
              0.16673252 = queryWeight, product of:
                1.8149544 = boost
                4.619759 = idf(docFreq=1189, maxDocs=44421)
                0.019885443 = queryNorm
              0.80703866 = fieldWeight in 2020, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.619759 = idf(docFreq=1189, maxDocs=44421)
                0.078125 = fieldNorm(doc=2020)
          0.59151125 = weight(abstract_txt:parallel in 2020) [ClassicSimilarity], result of:
            0.59151125 = score(doc=2020,freq=6.0), product of:
              0.48197275 = queryWeight, product of:
                3.7793093 = boost
                6.4132004 = idf(docFreq=197, maxDocs=44421)
                0.019885443 = queryNorm
              1.2272711 = fieldWeight in 2020, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                6.4132004 = idf(docFreq=197, maxDocs=44421)
                0.078125 = fieldNorm(doc=2020)
        0.24 = coord(6/25)
    
  4. Efron, M.: Eigenvalue-based model selection during Latent Semantic Indexing (2005) 0.19
    0.18948169 = sum of:
      0.18948169 = product of:
        0.78950703 = sum of:
          0.025644748 = weight(abstract_txt:retrieval in 4685) [ClassicSimilarity], result of:
            0.025644748 = score(doc=4685,freq=1.0), product of:
              0.09442047 = queryWeight, product of:
                1.3658048 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.019885443 = queryNorm
              0.27160156 = fieldWeight in 4685, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.078125 = fieldNorm(doc=4685)
          0.1370751 = weight(abstract_txt:estimates in 4685) [ClassicSimilarity], result of:
            0.1370751 = score(doc=4685,freq=1.0), product of:
              0.22910237 = queryWeight, product of:
                1.5043724 = boost
                7.6584163 = idf(docFreq=56, maxDocs=44421)
                0.019885443 = queryNorm
              0.59831375 = fieldWeight in 4685, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.6584163 = idf(docFreq=56, maxDocs=44421)
                0.078125 = fieldNorm(doc=4685)
          0.038559496 = weight(abstract_txt:model in 4685) [ClassicSimilarity], result of:
            0.038559496 = score(doc=4685,freq=1.0), product of:
              0.123923816 = queryWeight, product of:
                1.5647067 = boost
                3.9827821 = idf(docFreq=2249, maxDocs=44421)
                0.019885443 = queryNorm
              0.31115484 = fieldWeight in 4685, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9827821 = idf(docFreq=2249, maxDocs=44421)
                0.078125 = fieldNorm(doc=4685)
          0.06017688 = weight(abstract_txt:performance in 4685) [ClassicSimilarity], result of:
            0.06017688 = score(doc=4685,freq=1.0), product of:
              0.16673252 = queryWeight, product of:
                1.8149544 = boost
                4.619759 = idf(docFreq=1189, maxDocs=44421)
                0.019885443 = queryNorm
              0.36091867 = fieldWeight in 4685, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.619759 = idf(docFreq=1189, maxDocs=44421)
                0.078125 = fieldNorm(doc=4685)
          0.04508401 = weight(abstract_txt:data in 4685) [ClassicSimilarity], result of:
            0.04508401 = score(doc=4685,freq=1.0), product of:
              0.17328417 = queryWeight, product of:
                2.6166763 = boost
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.019885443 = queryNorm
              0.26017386 = fieldWeight in 4685, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.078125 = fieldNorm(doc=4685)
          0.48296684 = weight(abstract_txt:parallel in 4685) [ClassicSimilarity], result of:
            0.48296684 = score(doc=4685,freq=4.0), product of:
              0.48197275 = queryWeight, product of:
                3.7793093 = boost
                6.4132004 = idf(docFreq=197, maxDocs=44421)
                0.019885443 = queryNorm
              1.0020626 = fieldWeight in 4685, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.4132004 = idf(docFreq=197, maxDocs=44421)
                0.078125 = fieldNorm(doc=4685)
        0.24 = coord(6/25)
    
  5. MacFarlane, A.; McCann, J.A.; Robertson, S.E.: Parallel methods for the generation of partitioned inverted files (2005) 0.17
    0.17407335 = sum of:
      0.17407335 = product of:
        0.6216906 = sum of:
          0.032209393 = weight(abstract_txt:even in 776) [ClassicSimilarity], result of:
            0.032209393 = score(doc=776,freq=1.0), product of:
              0.10123225 = queryWeight, product of:
                5.0907717 = idf(docFreq=742, maxDocs=44421)
                0.019885443 = queryNorm
              0.31817323 = fieldWeight in 776, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.0907717 = idf(docFreq=742, maxDocs=44421)
                0.0625 = fieldNorm(doc=776)
          0.092000276 = weight(abstract_txt:computing in 776) [ClassicSimilarity], result of:
            0.092000276 = score(doc=776,freq=3.0), product of:
              0.1413024 = queryWeight, product of:
                1.18145 = boost
                6.014492 = idf(docFreq=294, maxDocs=44421)
                0.019885443 = queryNorm
              0.6510878 = fieldWeight in 776, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.014492 = idf(docFreq=294, maxDocs=44421)
                0.0625 = fieldNorm(doc=776)
          0.02901372 = weight(abstract_txt:retrieval in 776) [ClassicSimilarity], result of:
            0.02901372 = score(doc=776,freq=2.0), product of:
              0.09442047 = queryWeight, product of:
                1.3658048 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.019885443 = queryNorm
              0.3072821 = fieldWeight in 776, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.0625 = fieldNorm(doc=776)
          0.055802137 = weight(abstract_txt:text in 776) [ClassicSimilarity], result of:
            0.055802137 = score(doc=776,freq=3.0), product of:
              0.12756573 = queryWeight, product of:
                1.5875323 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.019885443 = queryNorm
              0.4374383 = fieldWeight in 776, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.0625 = fieldNorm(doc=776)
          0.041988585 = weight(abstract_txt:databases in 776) [ClassicSimilarity], result of:
            0.041988585 = score(doc=776,freq=1.0), product of:
              0.15220469 = queryWeight, product of:
                1.7340817 = boost
                4.413907 = idf(docFreq=1461, maxDocs=44421)
                0.019885443 = queryNorm
              0.2758692 = fieldWeight in 776, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.413907 = idf(docFreq=1461, maxDocs=44421)
                0.0625 = fieldNorm(doc=776)
          0.03606721 = weight(abstract_txt:data in 776) [ClassicSimilarity], result of:
            0.03606721 = score(doc=776,freq=1.0), product of:
              0.17328417 = queryWeight, product of:
                2.6166763 = boost
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.019885443 = queryNorm
              0.20813909 = fieldWeight in 776, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.0625 = fieldNorm(doc=776)
          0.33460924 = weight(abstract_txt:parallel in 776) [ClassicSimilarity], result of:
            0.33460924 = score(doc=776,freq=3.0), product of:
              0.48197275 = queryWeight, product of:
                3.7793093 = boost
                6.4132004 = idf(docFreq=197, maxDocs=44421)
                0.019885443 = queryNorm
              0.6942493 = fieldWeight in 776, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.4132004 = idf(docFreq=197, maxDocs=44421)
                0.0625 = fieldNorm(doc=776)
        0.28 = coord(7/25)