Document (#14177)

Author
MacFarlane, A.
Robertson, S.E.
McCann, J.A.
Title
Parallel computing for passage retrieval
Source
Aslib proceedings. 56(2004) no.4, pp.201-211
Year
2004
Abstract
In this paper, methods for both speeding up passage processing and examining more passages using parallel computers are explored. The number of passages processed is varied in order to examine the effect on retrieval effectiveness and efficiency. The particular algorithm applied has previously been used to good effect in Okapi experiments at TREC. This algorithm and the mechanism for applying parallel computing to speed up processing are described.
Theme
Retrieval algorithms
Object
Okapi
TREC

Similar documents (author)

  1. MacFarlane, A.; Robertson, S.E.; McCann, J.A.: Parallel computing in information retrieval : an updated review (1997) 4.21
    4.2128725 = sum of:
      4.2128725 = sum of:
        1.6185534 = weight(author_txt:robertson in 519) [ClassicSimilarity], result of:
          1.6185534 = score(doc=519,freq=1.0), product of:
            0.589682 = queryWeight, product of:
              7.319441 = idf(docFreq=79, maxDocs=44421)
              0.0805638 = queryNorm
            2.7447903 = fieldWeight in 519, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              7.319441 = idf(docFreq=79, maxDocs=44421)
              0.375 = fieldNorm(doc=519)
        2.5943193 = weight(author_txt:macfarlane in 519) [ClassicSimilarity], result of:
          2.5943193 = score(doc=519,freq=1.0), product of:
            0.8076356 = queryWeight, product of:
              1.1703043 = boost
              8.565973 = idf(docFreq=22, maxDocs=44421)
              0.0805638 = queryNorm
            3.21224 = fieldWeight in 519, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.565973 = idf(docFreq=22, maxDocs=44421)
              0.375 = fieldNorm(doc=519)
    
  2. MacFarlane, A.; McCann, J.A.; Robertson, S.E.: Parallel methods for the generation of partitioned inverted files (2005) 4.21
    4.2128725 = sum of:
      4.2128725 = sum of:
        1.6185534 = weight(author_txt:robertson in 776) [ClassicSimilarity], result of:
          1.6185534 = score(doc=776,freq=1.0), product of:
            0.589682 = queryWeight, product of:
              7.319441 = idf(docFreq=79, maxDocs=44421)
              0.0805638 = queryNorm
            2.7447903 = fieldWeight in 776, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              7.319441 = idf(docFreq=79, maxDocs=44421)
              0.375 = fieldNorm(doc=776)
        2.5943193 = weight(author_txt:macfarlane in 776) [ClassicSimilarity], result of:
          2.5943193 = score(doc=776,freq=1.0), product of:
            0.8076356 = queryWeight, product of:
              1.1703043 = boost
              8.565973 = idf(docFreq=22, maxDocs=44421)
              0.0805638 = queryNorm
            3.21224 = fieldWeight in 776, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.565973 = idf(docFreq=22, maxDocs=44421)
              0.375 = fieldNorm(doc=776)
    
  3. MacFarlane, A.; McCann, J.A.; Robertson, S.E.: Parallel methods for the update of partitioned inverted files (2007) 4.21
    4.2128725 = sum of:
      4.2128725 = sum of:
        1.6185534 = weight(author_txt:robertson in 1819) [ClassicSimilarity], result of:
          1.6185534 = score(doc=1819,freq=1.0), product of:
            0.589682 = queryWeight, product of:
              7.319441 = idf(docFreq=79, maxDocs=44421)
              0.0805638 = queryNorm
            2.7447903 = fieldWeight in 1819, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              7.319441 = idf(docFreq=79, maxDocs=44421)
              0.375 = fieldNorm(doc=1819)
        2.5943193 = weight(author_txt:macfarlane in 1819) [ClassicSimilarity], result of:
          2.5943193 = score(doc=1819,freq=1.0), product of:
            0.8076356 = queryWeight, product of:
              1.1703043 = boost
              8.565973 = idf(docFreq=22, maxDocs=44421)
              0.0805638 = queryNorm
            3.21224 = fieldWeight in 1819, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.565973 = idf(docFreq=22, maxDocs=44421)
              0.375 = fieldNorm(doc=1819)
    
  4. MacFarlane, A.: On open source IR (2003) 2.16
    2.1619327 = sum of:
      2.1619327 = product of:
        4.3238654 = sum of:
          4.3238654 = weight(author_txt:macfarlane in 3010) [ClassicSimilarity], result of:
            4.3238654 = score(doc=3010,freq=1.0), product of:
              0.8076356 = queryWeight, product of:
                1.1703043 = boost
                8.565973 = idf(docFreq=22, maxDocs=44421)
                0.0805638 = queryNorm
              5.353733 = fieldWeight in 3010, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.565973 = idf(docFreq=22, maxDocs=44421)
                0.625 = fieldNorm(doc=3010)
        0.5 = coord(1/2)
    
  5. MacFarlane, A.: Evaluation of web search for the information practitioner (2007) 2.16
    2.1619327 = sum of:
      2.1619327 = product of:
        4.3238654 = sum of:
          4.3238654 = weight(author_txt:macfarlane in 1817) [ClassicSimilarity], result of:
            4.3238654 = score(doc=1817,freq=1.0), product of:
              0.8076356 = queryWeight, product of:
                1.1703043 = boost
                8.565973 = idf(docFreq=22, maxDocs=44421)
                0.0805638 = queryNorm
              5.353733 = fieldWeight in 1817, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.565973 = idf(docFreq=22, maxDocs=44421)
                0.625 = fieldNorm(doc=1817)
        0.5 = coord(1/2)
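
The author-match scores above can be reproduced from the numbers in the listing itself. The following is a minimal sketch, not the search engine's own code: the function names (idf, tf, term_score) are hypothetical, and the idf formula 1 + ln(maxDocs / (docFreq + 1)) is assumed because it reproduces the listed idf values. Small differences in the last digits are expected, since the listing appears to use single-precision arithmetic.

```python
import math

MAX_DOCS = 44421          # maxDocs, as shown in the listing
QUERY_NORM = 0.0805638    # queryNorm of the author query, as shown in the listing


def idf(doc_freq: int, max_docs: int) -> float:
    # Assumed formula; it matches the listed values (e.g. docFreq=79 gives about 7.3194)
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def tf(freq: float) -> float:
    # The listing shows tf=1.4142135 for freq=2.0, i.e. the square root of the frequency
    return math.sqrt(freq)


def term_score(freq, doc_freq, field_norm, boost=1.0,
               max_docs=MAX_DOCS, query_norm=QUERY_NORM) -> float:
    term_idf = idf(doc_freq, max_docs)
    query_weight = boost * term_idf * query_norm      # "queryWeight, product of"
    field_weight = tf(freq) * term_idf * field_norm   # "fieldWeight ..., product of"
    return query_weight * field_weight


# Entry 1 (doc=519): two author terms, summed
robertson = term_score(freq=1.0, doc_freq=79, field_norm=0.375)
macfarlane = term_score(freq=1.0, doc_freq=22, field_norm=0.375, boost=1.1703043)
total = robertson + macfarlane

print(f"robertson  = {robertson:.4f}")
print(f"macfarlane = {macfarlane:.4f}")
print(f"total      = {total:.4f}")
```

Under these assumptions, the three printed values should agree with the listed 1.6185534, 2.5943193 and 4.2128725 to about three decimal places. Entries 4 and 5 match only one of the two author terms, so their single term score is additionally multiplied by coord(1/2).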
    

Similar documents (content)

  1. Melucci, M.: Passage retrieval : a probabilistic technique (1998) 0.23
    0.22838132 = sum of:
      0.22838132 = product of:
        0.95158887 = sum of:
          0.03215322 = weight(abstract_txt:effectiveness in 2150) [ClassicSimilarity], result of:
            0.03215322 = score(doc=2150,freq=1.0), product of:
              0.08075894 = queryWeight, product of:
                5.0961695 = idf(docFreq=738, maxDocs=44421)
                0.015846988 = queryNorm
              0.39813823 = fieldWeight in 2150, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.0961695 = idf(docFreq=738, maxDocs=44421)
                0.078125 = fieldNorm(doc=2150)
          0.03667654 = weight(abstract_txt:experiments in 2150) [ClassicSimilarity], result of:
            0.03667654 = score(doc=2150,freq=1.0), product of:
              0.08816574 = queryWeight, product of:
                1.0448517 = boost
                5.324741 = idf(docFreq=587, maxDocs=44421)
                0.015846988 = queryNorm
              0.4159954 = fieldWeight in 2150, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.324741 = idf(docFreq=587, maxDocs=44421)
                0.078125 = fieldNorm(doc=2150)
          0.020415008 = weight(abstract_txt:retrieval in 2150) [ClassicSimilarity], result of:
            0.020415008 = score(doc=2150,freq=1.0), product of:
              0.07516529 = queryWeight, product of:
                1.3643581 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.015846988 = queryNorm
              0.27160156 = fieldWeight in 2150, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.078125 = fieldNorm(doc=2150)
          0.090218045 = weight(abstract_txt:algorithm in 2150) [ClassicSimilarity], result of:
            0.090218045 = score(doc=2150,freq=1.0), product of:
              0.20241679 = queryWeight, product of:
                2.2389426 = boost
                5.7050157 = idf(docFreq=401, maxDocs=44421)
                0.015846988 = queryNorm
              0.44570434 = fieldWeight in 2150, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7050157 = idf(docFreq=401, maxDocs=44421)
                0.078125 = fieldNorm(doc=2150)
          0.38383496 = weight(abstract_txt:passage in 2150) [ClassicSimilarity], result of:
            0.38383496 = score(doc=2150,freq=2.0), product of:
              0.42182982 = queryWeight, product of:
                3.2321265 = boost
                8.235732 = idf(docFreq=31, maxDocs=44421)
                0.015846988 = queryNorm
              0.90992844 = fieldWeight in 2150, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.235732 = idf(docFreq=31, maxDocs=44421)
                0.078125 = fieldNorm(doc=2150)
          0.38829112 = weight(abstract_txt:passages in 2150) [ClassicSimilarity], result of:
            0.38829112 = score(doc=2150,freq=2.0), product of:
              0.4250884 = queryWeight, product of:
                3.2445862 = boost
                8.267481 = idf(docFreq=30, maxDocs=44421)
                0.015846988 = queryNorm
              0.9134362 = fieldWeight in 2150, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.267481 = idf(docFreq=30, maxDocs=44421)
                0.078125 = fieldNorm(doc=2150)
        0.24 = coord(6/25)
    
  2. Kaszkiel, M.; Zobel, J.: Effective ranking with arbitrary passages (2001) 0.18
    0.1785733 = sum of:
      0.1785733 = product of:
        0.8928665 = sum of:
          0.025722576 = weight(abstract_txt:effectiveness in 6764) [ClassicSimilarity], result of:
            0.025722576 = score(doc=6764,freq=1.0), product of:
              0.08075894 = queryWeight, product of:
                5.0961695 = idf(docFreq=738, maxDocs=44421)
                0.015846988 = queryNorm
              0.3185106 = fieldWeight in 6764, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.0961695 = idf(docFreq=738, maxDocs=44421)
                0.0625 = fieldNorm(doc=6764)
          0.029341232 = weight(abstract_txt:experiments in 6764) [ClassicSimilarity], result of:
            0.029341232 = score(doc=6764,freq=1.0), product of:
              0.08816574 = queryWeight, product of:
                1.0448517 = boost
                5.324741 = idf(docFreq=587, maxDocs=44421)
                0.015846988 = queryNorm
              0.3327963 = fieldWeight in 6764, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.324741 = idf(docFreq=587, maxDocs=44421)
                0.0625 = fieldNorm(doc=6764)
          0.023096947 = weight(abstract_txt:retrieval in 6764) [ClassicSimilarity], result of:
            0.023096947 = score(doc=6764,freq=2.0), product of:
              0.07516529 = queryWeight, product of:
                1.3643581 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.015846988 = queryNorm
              0.3072821 = fieldWeight in 6764, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.0625 = fieldNorm(doc=6764)
          0.43425968 = weight(abstract_txt:passage in 6764) [ClassicSimilarity], result of:
            0.43425968 = score(doc=6764,freq=4.0), product of:
              0.42182982 = queryWeight, product of:
                3.2321265 = boost
                8.235732 = idf(docFreq=31, maxDocs=44421)
                0.015846988 = queryNorm
              1.0294665 = fieldWeight in 6764, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                8.235732 = idf(docFreq=31, maxDocs=44421)
                0.0625 = fieldNorm(doc=6764)
          0.38044605 = weight(abstract_txt:passages in 6764) [ClassicSimilarity], result of:
            0.38044605 = score(doc=6764,freq=3.0), product of:
              0.4250884 = queryWeight, product of:
                3.2445862 = boost
                8.267481 = idf(docFreq=30, maxDocs=44421)
                0.015846988 = queryNorm
              0.894981 = fieldWeight in 6764, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.267481 = idf(docFreq=30, maxDocs=44421)
                0.0625 = fieldNorm(doc=6764)
        0.2 = coord(5/25)
    
  3. MacFarlane, A.; McCann, J.A.; Robertson, S.E.: Parallel methods for the generation of partitioned inverted files (2005) 0.18
    0.1756798 = sum of:
      0.1756798 = product of:
        0.6274279 = sum of:
          0.039794512 = weight(abstract_txt:examine in 776) [ClassicSimilarity], result of:
            0.039794512 = score(doc=776,freq=2.0), product of:
              0.08574058 = queryWeight, product of:
                1.0303812 = boost
                5.250997 = idf(docFreq=632, maxDocs=44421)
                0.015846988 = queryNorm
              0.46412694 = fieldWeight in 776, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.250997 = idf(docFreq=632, maxDocs=44421)
                0.0625 = fieldNorm(doc=776)
          0.029341232 = weight(abstract_txt:experiments in 776) [ClassicSimilarity], result of:
            0.029341232 = score(doc=776,freq=1.0), product of:
              0.08816574 = queryWeight, product of:
                1.0448517 = boost
                5.324741 = idf(docFreq=587, maxDocs=44421)
                0.015846988 = queryNorm
              0.3327963 = fieldWeight in 776, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.324741 = idf(docFreq=587, maxDocs=44421)
                0.0625 = fieldNorm(doc=776)
          0.0437041 = weight(abstract_txt:efficiency in 776) [ClassicSimilarity], result of:
            0.0437041 = score(doc=776,freq=1.0), product of:
              0.114990614 = queryWeight, product of:
                1.1932622 = boost
                6.0810666 = idf(docFreq=275, maxDocs=44421)
                0.015846988 = queryNorm
              0.38006666 = fieldWeight in 776, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.0810666 = idf(docFreq=275, maxDocs=44421)
                0.0625 = fieldNorm(doc=776)
          0.078641415 = weight(abstract_txt:speed in 776) [ClassicSimilarity], result of:
            0.078641415 = score(doc=776,freq=2.0), product of:
              0.1350222 = queryWeight, product of:
                1.2930261 = boost
                6.58948 = idf(docFreq=165, maxDocs=44421)
                0.015846988 = queryNorm
              0.5824332 = fieldWeight in 776, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.58948 = idf(docFreq=165, maxDocs=44421)
                0.0625 = fieldNorm(doc=776)
          0.023096947 = weight(abstract_txt:retrieval in 776) [ClassicSimilarity], result of:
            0.023096947 = score(doc=776,freq=2.0), product of:
              0.07516529 = queryWeight, product of:
                1.3643581 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.015846988 = queryNorm
              0.3072821 = fieldWeight in 776, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.0625 = fieldNorm(doc=776)
          0.1464773 = weight(abstract_txt:computing in 776) [ClassicSimilarity], result of:
            0.1464773 = score(doc=776,freq=3.0), product of:
              0.22497319 = queryWeight, product of:
                2.360397 = boost
                6.014492 = idf(docFreq=294, maxDocs=44421)
                0.015846988 = queryNorm
              0.6510878 = fieldWeight in 776, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.014492 = idf(docFreq=294, maxDocs=44421)
                0.0625 = fieldNorm(doc=776)
          0.26637238 = weight(abstract_txt:parallel in 776) [ClassicSimilarity], result of:
            0.26637238 = score(doc=776,freq=3.0), product of:
              0.38368407 = queryWeight, product of:
                3.7753065 = boost
                6.4132004 = idf(docFreq=197, maxDocs=44421)
                0.015846988 = queryNorm
              0.6942493 = fieldWeight in 776, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.4132004 = idf(docFreq=197, maxDocs=44421)
                0.0625 = fieldNorm(doc=776)
        0.28 = coord(7/25)
    
  4. Mengle, S.; Goharian, N.: Passage detection using text classification (2009) 0.18
    0.17543425 = sum of:
      0.17543425 = product of:
        1.4619521 = sum of:
          0.031954546 = weight(abstract_txt:retrieval in 3765) [ClassicSimilarity], result of:
            0.031954546 = score(doc=3765,freq=5.0), product of:
              0.07516529 = queryWeight, product of:
                1.3643581 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.015846988 = queryNorm
              0.42512372 = fieldWeight in 3765, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.0546875 = fieldNorm(doc=3765)
          0.7108723 = weight(abstract_txt:passage in 3765) [ClassicSimilarity], result of:
            0.7108723 = score(doc=3765,freq=14.0), product of:
              0.42182982 = queryWeight, product of:
                3.2321265 = boost
                8.235732 = idf(docFreq=31, maxDocs=44421)
                0.015846988 = queryNorm
              1.6852111 = fieldWeight in 3765, product of:
                3.7416575 = tf(freq=14.0), with freq of:
                  14.0 = termFreq=14.0
                8.235732 = idf(docFreq=31, maxDocs=44421)
                0.0546875 = fieldNorm(doc=3765)
          0.7191253 = weight(abstract_txt:passages in 3765) [ClassicSimilarity], result of:
            0.7191253 = score(doc=3765,freq=14.0), product of:
              0.4250884 = queryWeight, product of:
                3.2445862 = boost
                8.267481 = idf(docFreq=30, maxDocs=44421)
                0.015846988 = queryNorm
              1.6917076 = fieldWeight in 3765, product of:
                3.7416575 = tf(freq=14.0), with freq of:
                  14.0 = termFreq=14.0
                8.267481 = idf(docFreq=30, maxDocs=44421)
                0.0546875 = fieldNorm(doc=3765)
        0.12 = coord(3/25)
    
  5. MacFarlane, A.; Robertson, S.E.; McCann, J.A.: Parallel computing in information retrieval : an updated review (1997) 0.17
    0.1722252 = sum of:
      0.1722252 = product of:
        1.0764076 = sum of:
          0.05657573 = weight(abstract_txt:retrieval in 519) [ClassicSimilarity], result of:
            0.05657573 = score(doc=519,freq=3.0), product of:
              0.07516529 = queryWeight, product of:
                1.3643581 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.015846988 = queryNorm
              0.7526843 = fieldWeight in 519, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.125 = fieldNorm(doc=519)
          0.092864804 = weight(abstract_txt:processing in 519) [ClassicSimilarity], result of:
            0.092864804 = score(doc=519,freq=1.0), product of:
              0.15084758 = queryWeight, product of:
                1.9328088 = boost
                4.9249606 = idf(docFreq=876, maxDocs=44421)
                0.015846988 = queryNorm
              0.6156201 = fieldWeight in 519, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.9249606 = idf(docFreq=876, maxDocs=44421)
                0.125 = fieldNorm(doc=519)
          0.23919645 = weight(abstract_txt:computing in 519) [ClassicSimilarity], result of:
            0.23919645 = score(doc=519,freq=2.0), product of:
              0.22497319 = queryWeight, product of:
                2.360397 = boost
                6.014492 = idf(docFreq=294, maxDocs=44421)
                0.015846988 = queryNorm
              1.063222 = fieldWeight in 519, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.014492 = idf(docFreq=294, maxDocs=44421)
                0.125 = fieldNorm(doc=519)
          0.6877706 = weight(abstract_txt:parallel in 519) [ClassicSimilarity], result of:
            0.6877706 = score(doc=519,freq=5.0), product of:
              0.38368407 = queryWeight, product of:
                3.7753065 = boost
                6.4132004 = idf(docFreq=197, maxDocs=44421)
                0.015846988 = queryNorm
              1.792544 = fieldWeight in 519, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                6.4132004 = idf(docFreq=197, maxDocs=44421)
                0.125 = fieldNorm(doc=519)
        0.16 = coord(4/25)
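
The content-based ranking above depends strongly on the coord factor, which scales each summed term score by the fraction of the 25 query terms that the document matched. The sketch below uses only the sums and coord fractions printed in the listing; the variable names and the short labels are hypothetical and chosen for illustration.

```python
# (label, summed term scores, matched query terms), copied from the listing above
QUERY_TERMS = 25
entries = [
    ("1 Melucci 1998",    0.95158887, 6),
    ("2 Kaszkiel 2001",   0.8928665,  5),
    ("3 MacFarlane 2005", 0.6274279,  7),
    ("4 Mengle 2009",     1.4619521,  3),
    ("5 MacFarlane 1997", 1.0764076,  4),
]


def coord(overlap: int, max_overlap: int) -> float:
    # Fraction of query terms found in the document, e.g. coord(6/25) = 0.24
    return overlap / max_overlap


# Final score = summed term scores * coord
final = {label: raw * coord(matched, QUERY_TERMS) for label, raw, matched in entries}

by_raw = [label for label, raw, _ in sorted(entries, key=lambda e: e[1], reverse=True)]
by_final = sorted(final, key=final.get, reverse=True)

for label in by_final:
    print(f"{label}: {final[label]:.4f}")
print("order by raw sum:    ", by_raw)
print("order by final score:", by_final)
```

On raw term scores alone, entry 4 would rank first, because "passage" and "passages" each occur 14 times in it. It matches only 3 of the 25 query terms, however, so coord(3/25) pulls it below entries that match more terms with lower individual weights.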