Document (#38501)

Author
Wu, S.
Li, J.
Zeng, X.
Bi, Y.
Title
Adaptive data fusion methods in information retrieval
Source
Journal of the Association for Information Science and Technology. 65(2014) no.10, pp. 2048-2061
Year
2014
Abstract
Data fusion is currently used extensively in information retrieval for various tasks. It has proved to be a useful technology because it is able to improve retrieval performance frequently. However, in almost all prior research in data fusion, static search environments have been used, and dynamic search environments have generally not been considered. In this article, we investigate adaptive data fusion methods that can change their behavior when the search environment changes. Three adaptive data fusion methods are proposed and investigated. To test these proposed methods properly, we generate a benchmark from a historic Text REtrieval Conference data set. Experiments with the benchmark show that 2 of the proposed methods are good and may potentially be used in practice.

Similar documents (author)

  1. Zeng, L.: An introduction to thesauri and classification systems in the People's Republic of China (1986) 4.73
    4.7339582 = sum of:
      4.7339582 = weight(author_txt:zeng in 1730) [ClassicSimilarity], result of:
        4.7339582 = fieldWeight in 1730, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.574333 = idf(docFreq=61, maxDocs=44421)
          0.625 = fieldNorm(doc=1730)
    
  2. Zeng, L.: Achieving compatibility of indexing languages in online access environment (1992) 4.73
    4.7339582 = sum of:
      4.7339582 = weight(author_txt:zeng in 1352) [ClassicSimilarity], result of:
        4.7339582 = fieldWeight in 1352, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.574333 = idf(docFreq=61, maxDocs=44421)
          0.625 = fieldNorm(doc=1352)
    
  3. Zeng, L.: Automatic indexing for Chinese text : problems and progress (1992) 4.73
    4.7339582 = sum of:
      4.7339582 = weight(author_txt:zeng in 1357) [ClassicSimilarity], result of:
        4.7339582 = fieldWeight in 1357, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.574333 = idf(docFreq=61, maxDocs=44421)
          0.625 = fieldNorm(doc=1357)
    
  4. Zeng, M.L.: Towards a unified medical language in a diverse cultural environment (1996) 4.73
    4.7339582 = sum of:
      4.7339582 = weight(author_txt:zeng in 5224) [ClassicSimilarity], result of:
        4.7339582 = fieldWeight in 5224, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.574333 = idf(docFreq=61, maxDocs=44421)
          0.625 = fieldNorm(doc=5224)
    
  5. Zeng, M.L.: Developing control mechanisms for discipline-based virtual libraries : a study of the process (1995) 4.73
    4.7339582 = sum of:
      4.7339582 = weight(author_txt:zeng in 6905) [ClassicSimilarity], result of:
        4.7339582 = fieldWeight in 6905, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.574333 = idf(docFreq=61, maxDocs=44421)
          0.625 = fieldNorm(doc=6905)
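
The author-match scores above all follow the same arithmetic: a single-term `fieldWeight`, which is the product of tf, idf, and fieldNorm. The sketch below recomputes that value, assuming the classic Lucene TF-IDF definitions tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)). The function names are illustrative, not part of any library, and the input numbers are copied from the listing.

```python
import math

def classic_tf(freq):
    # Classic TF-IDF term-frequency factor: square root of the raw count.
    return math.sqrt(freq)

def classic_idf(doc_freq, max_docs):
    # Classic inverse document frequency: 1 + ln(maxDocs / (docFreq + 1)).
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def field_weight(freq, doc_freq, max_docs, field_norm):
    # fieldWeight = tf * idf * fieldNorm, as shown in the explain tree.
    return classic_tf(freq) * classic_idf(doc_freq, max_docs) * field_norm

# Values from the author_txt:zeng entries (freq=1, docFreq=61, maxDocs=44421).
idf_value = classic_idf(61, 44421)
score = field_weight(1.0, 61, 44421, 0.625)
print(f"idf={idf_value:.6f} score={score:.6f}")
```

Because all five author matches share the same term frequency, document frequency, and field norm, they receive identical scores. Small differences in the last digits against the listing are expected, since Lucene computes these values in single precision.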
    

Similar documents (content)

  1. Beitzel, S.M.; Jensen, E.C.; Chowdhury, A.; Grossman, D.; Frieder, O.; Goharian, N.: Fusion of effective retrieval strategies in the same information retrieval system (2004) 0.27
    0.26815584 = sum of:
      0.26815584 = product of:
        1.117316 = sum of:
          0.042859934 = weight(abstract_txt:prior in 3502) [ClassicSimilarity], result of:
            0.042859934 = score(doc=3502,freq=1.0), product of:
              0.08967142 = queryWeight, product of:
                1.0614234 = boost
                6.1179714 = idf(docFreq=265, maxDocs=44421)
                0.013808863 = queryNorm
              0.47796652 = fieldWeight in 3502, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1179714 = idf(docFreq=265, maxDocs=44421)
                0.078125 = fieldNorm(doc=3502)
          0.017337037 = weight(abstract_txt:have in 3502) [ClassicSimilarity], result of:
            0.017337037 = score(doc=3502,freq=2.0), product of:
              0.049045928 = queryWeight, product of:
                1.1101409 = boost
                3.199388 = idf(docFreq=4924, maxDocs=44421)
                0.013808863 = queryNorm
              0.35348576 = fieldWeight in 3502, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.199388 = idf(docFreq=4924, maxDocs=44421)
                0.078125 = fieldNorm(doc=3502)
          0.02499758 = weight(abstract_txt:been in 3502) [ClassicSimilarity], result of:
            0.02499758 = score(doc=3502,freq=2.0), product of:
              0.06259674 = queryWeight, product of:
                1.2541586 = boost
                3.614442 = idf(docFreq=3251, maxDocs=44421)
                0.013808863 = queryNorm
              0.3993432 = fieldWeight in 3502, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.614442 = idf(docFreq=3251, maxDocs=44421)
                0.078125 = fieldNorm(doc=3502)
          0.08897354 = weight(abstract_txt:retrieval in 3502) [ClassicSimilarity], result of:
            0.08897354 = score(doc=3502,freq=8.0), product of:
              0.115820006 = queryWeight, product of:
                2.4125896 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.013808863 = queryNorm
              0.7682052 = fieldWeight in 3502, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.078125 = fieldNorm(doc=3502)
          0.07183928 = weight(abstract_txt:data in 3502) [ClassicSimilarity], result of:
            0.07183928 = score(doc=3502,freq=3.0), product of:
              0.15941812 = queryWeight, product of:
                3.466619 = boost
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.013808863 = queryNorm
              0.45063436 = fieldWeight in 3502, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.078125 = fieldNorm(doc=3502)
          0.87130857 = weight(abstract_txt:fusion in 3502) [ClassicSimilarity], result of:
            0.87130857 = score(doc=3502,freq=4.0), product of:
              0.7195114 = queryWeight, product of:
                6.723037 = boost
                7.750224 = idf(docFreq=51, maxDocs=44421)
                0.013808863 = queryNorm
              1.2109725 = fieldWeight in 3502, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.750224 = idf(docFreq=51, maxDocs=44421)
                0.078125 = fieldNorm(doc=3502)
        0.24 = coord(6/25)
    
  2. Wu, S.; McClean, S.I.: Improving high accuracy retrieval by eliminating the uneven correlation effect in data fusion (2006) 0.26
    0.26280916 = sum of:
      0.26280916 = product of:
        1.0950382 = sum of:
          0.024492528 = weight(abstract_txt:been in 344) [ClassicSimilarity], result of:
            0.024492528 = score(doc=344,freq=3.0), product of:
              0.06259674 = queryWeight, product of:
                1.2541586 = boost
                3.614442 = idf(docFreq=3251, maxDocs=44421)
                0.013808863 = queryNorm
              0.39127484 = fieldWeight in 344, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.614442 = idf(docFreq=3251, maxDocs=44421)
                0.0625 = fieldNorm(doc=344)
          0.016996993 = weight(abstract_txt:used in 344) [ClassicSimilarity], result of:
            0.016996993 = score(doc=344,freq=1.0), product of:
              0.08100556 = queryWeight, product of:
                1.747349 = boost
                3.3572001 = idf(docFreq=4205, maxDocs=44421)
                0.013808863 = queryNorm
              0.20982501 = fieldWeight in 344, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3572001 = idf(docFreq=4205, maxDocs=44421)
                0.0625 = fieldNorm(doc=344)
          0.05033103 = weight(abstract_txt:retrieval in 344) [ClassicSimilarity], result of:
            0.05033103 = score(doc=344,freq=4.0), product of:
              0.115820006 = queryWeight, product of:
                2.4125896 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.013808863 = queryNorm
              0.4345625 = fieldWeight in 344, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.0625 = fieldNorm(doc=344)
          0.07419529 = weight(abstract_txt:data in 344) [ClassicSimilarity], result of:
            0.07419529 = score(doc=344,freq=5.0), product of:
              0.15941812 = queryWeight, product of:
                3.466619 = boost
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.013808863 = queryNorm
              0.46541315 = fieldWeight in 344, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.0625 = fieldNorm(doc=344)
          0.075317785 = weight(abstract_txt:methods in 344) [ClassicSimilarity], result of:
            0.075317785 = score(doc=344,freq=2.0), product of:
              0.20565443 = queryWeight, product of:
                3.5943112 = boost
                4.1434727 = idf(docFreq=1915, maxDocs=44421)
                0.013808863 = queryNorm
              0.3662347 = fieldWeight in 344, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.1434727 = idf(docFreq=1915, maxDocs=44421)
                0.0625 = fieldNorm(doc=344)
          0.8537045 = weight(abstract_txt:fusion in 344) [ClassicSimilarity], result of:
            0.8537045 = score(doc=344,freq=6.0), product of:
              0.7195114 = queryWeight, product of:
                6.723037 = boost
                7.750224 = idf(docFreq=51, maxDocs=44421)
                0.013808863 = queryNorm
              1.1865059 = fieldWeight in 344, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                7.750224 = idf(docFreq=51, maxDocs=44421)
                0.0625 = fieldNorm(doc=344)
        0.24 = coord(6/25)
    
  3. Wu, M.; Hawking, D.; Turpin, A.; Scholer, F.: Using anchor text for homepage and topic distillation search tasks (2012) 0.20
    0.20049468 = sum of:
      0.20049468 = product of:
        0.7160524 = sum of:
          0.013869629 = weight(abstract_txt:have in 1257) [ClassicSimilarity], result of:
            0.013869629 = score(doc=1257,freq=2.0), product of:
              0.049045928 = queryWeight, product of:
                1.1101409 = boost
                3.199388 = idf(docFreq=4924, maxDocs=44421)
                0.013808863 = queryNorm
              0.2827886 = fieldWeight in 1257, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.199388 = idf(docFreq=4924, maxDocs=44421)
                0.0625 = fieldNorm(doc=1257)
          0.014140768 = weight(abstract_txt:been in 1257) [ClassicSimilarity], result of:
            0.014140768 = score(doc=1257,freq=1.0), product of:
              0.06259674 = queryWeight, product of:
                1.2541586 = boost
                3.614442 = idf(docFreq=3251, maxDocs=44421)
                0.013808863 = queryNorm
              0.22590263 = fieldWeight in 1257, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.614442 = idf(docFreq=3251, maxDocs=44421)
                0.0625 = fieldNorm(doc=1257)
          0.024037376 = weight(abstract_txt:used in 1257) [ClassicSimilarity], result of:
            0.024037376 = score(doc=1257,freq=2.0), product of:
              0.08100556 = queryWeight, product of:
                1.747349 = boost
                3.3572001 = idf(docFreq=4205, maxDocs=44421)
                0.013808863 = queryNorm
              0.29673737 = fieldWeight in 1257, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.3572001 = idf(docFreq=4205, maxDocs=44421)
                0.0625 = fieldNorm(doc=1257)
          0.05370749 = weight(abstract_txt:search in 1257) [ClassicSimilarity], result of:
            0.05370749 = score(doc=1257,freq=6.0), product of:
              0.09599301 = queryWeight, product of:
                1.9021382 = boost
                3.654598 = idf(docFreq=3123, maxDocs=44421)
                0.013808863 = queryNorm
              0.5594938 = fieldWeight in 1257, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                3.654598 = idf(docFreq=3123, maxDocs=44421)
                0.0625 = fieldNorm(doc=1257)
          0.025165515 = weight(abstract_txt:retrieval in 1257) [ClassicSimilarity], result of:
            0.025165515 = score(doc=1257,freq=1.0), product of:
              0.115820006 = queryWeight, product of:
                2.4125896 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.013808863 = queryNorm
              0.21728125 = fieldWeight in 1257, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.0625 = fieldNorm(doc=1257)
          0.09224507 = weight(abstract_txt:methods in 1257) [ClassicSimilarity], result of:
            0.09224507 = score(doc=1257,freq=3.0), product of:
              0.20565443 = queryWeight, product of:
                3.5943112 = boost
                4.1434727 = idf(docFreq=1915, maxDocs=44421)
                0.013808863 = queryNorm
              0.44854406 = fieldWeight in 1257, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.1434727 = idf(docFreq=1915, maxDocs=44421)
                0.0625 = fieldNorm(doc=1257)
          0.49288654 = weight(abstract_txt:fusion in 1257) [ClassicSimilarity], result of:
            0.49288654 = score(doc=1257,freq=2.0), product of:
              0.7195114 = queryWeight, product of:
                6.723037 = boost
                7.750224 = idf(docFreq=51, maxDocs=44421)
                0.013808863 = queryNorm
              0.6850295 = fieldWeight in 1257, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.750224 = idf(docFreq=51, maxDocs=44421)
                0.0625 = fieldNorm(doc=1257)
        0.28 = coord(7/25)
    
  4. Larsen, B.; Ingwersen, P.; Lund, B.: Data fusion according to the principle of polyrepresentation (2009) 0.20
    0.20035544 = sum of:
      0.20035544 = product of:
        1.0017772 = sum of:
          0.0148723675 = weight(abstract_txt:used in 3752) [ClassicSimilarity], result of:
            0.0148723675 = score(doc=3752,freq=1.0), product of:
              0.08100556 = queryWeight, product of:
                1.747349 = boost
                3.3572001 = idf(docFreq=4205, maxDocs=44421)
                0.013808863 = queryNorm
              0.18359688 = fieldWeight in 3752, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3572001 = idf(docFreq=4205, maxDocs=44421)
                0.0546875 = fieldNorm(doc=3752)
          0.049237832 = weight(abstract_txt:retrieval in 3752) [ClassicSimilarity], result of:
            0.049237832 = score(doc=3752,freq=5.0), product of:
              0.115820006 = queryWeight, product of:
                2.4125896 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.013808863 = queryNorm
              0.42512372 = fieldWeight in 3752, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.0546875 = fieldNorm(doc=3752)
          0.06492088 = weight(abstract_txt:data in 3752) [ClassicSimilarity], result of:
            0.06492088 = score(doc=3752,freq=5.0), product of:
              0.15941812 = queryWeight, product of:
                3.466619 = boost
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.013808863 = queryNorm
              0.40723652 = fieldWeight in 3752, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.0546875 = fieldNorm(doc=3752)
          0.06590306 = weight(abstract_txt:methods in 3752) [ClassicSimilarity], result of:
            0.06590306 = score(doc=3752,freq=2.0), product of:
              0.20565443 = queryWeight, product of:
                3.5943112 = boost
                4.1434727 = idf(docFreq=1915, maxDocs=44421)
                0.013808863 = queryNorm
              0.32045534 = fieldWeight in 3752, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.1434727 = idf(docFreq=1915, maxDocs=44421)
                0.0546875 = fieldNorm(doc=3752)
          0.806843 = weight(abstract_txt:fusion in 3752) [ClassicSimilarity], result of:
            0.806843 = score(doc=3752,freq=7.0), product of:
              0.7195114 = queryWeight, product of:
                6.723037 = boost
                7.750224 = idf(docFreq=51, maxDocs=44421)
                0.013808863 = queryNorm
              1.1213763 = fieldWeight in 3752, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                7.750224 = idf(docFreq=51, maxDocs=44421)
                0.0546875 = fieldNorm(doc=3752)
        0.2 = coord(5/25)
    
  5. Seco de Herrera, A.G.; Schaer, R.; Müller, H.: Shangri-La : a medical case-based retrieval tool (2017) 0.18
    0.1795583 = sum of:
      0.1795583 = product of:
        0.74815965 = sum of:
          0.04016386 = weight(abstract_txt:potentially in 4924) [ClassicSimilarity], result of:
            0.04016386 = score(doc=4924,freq=1.0), product of:
              0.09964373 = queryWeight, product of:
                1.118888 = boost
                6.449194 = idf(docFreq=190, maxDocs=44421)
                0.013808863 = queryNorm
              0.40307462 = fieldWeight in 4924, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.449194 = idf(docFreq=190, maxDocs=44421)
                0.0625 = fieldNorm(doc=4924)
          0.061642677 = weight(abstract_txt:retrieval in 4924) [ClassicSimilarity], result of:
            0.061642677 = score(doc=4924,freq=6.0), product of:
              0.115820006 = queryWeight, product of:
                2.4125896 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.013808863 = queryNorm
              0.53222823 = fieldWeight in 4924, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.0625 = fieldNorm(doc=4924)
          0.11509832 = weight(abstract_txt:benchmark in 4924) [ClassicSimilarity], result of:
            0.11509832 = score(doc=4924,freq=1.0), product of:
              0.25328863 = queryWeight, product of:
                2.5228097 = boost
                7.270651 = idf(docFreq=83, maxDocs=44421)
                0.013808863 = queryNorm
              0.45441568 = fieldWeight in 4924, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.270651 = idf(docFreq=83, maxDocs=44421)
                0.0625 = fieldNorm(doc=4924)
          0.033181142 = weight(abstract_txt:data in 4924) [ClassicSimilarity], result of:
            0.033181142 = score(doc=4924,freq=1.0), product of:
              0.15941812 = queryWeight, product of:
                3.466619 = boost
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.013808863 = queryNorm
              0.20813909 = fieldWeight in 4924, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.0625 = fieldNorm(doc=4924)
          0.14955029 = weight(abstract_txt:adaptive in 4924) [ClassicSimilarity], result of:
            0.14955029 = score(doc=4924,freq=1.0), product of:
              0.34524307 = queryWeight, product of:
                3.6073208 = boost
                6.930783 = idf(docFreq=117, maxDocs=44421)
                0.013808863 = queryNorm
              0.43317392 = fieldWeight in 4924, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.930783 = idf(docFreq=117, maxDocs=44421)
                0.0625 = fieldNorm(doc=4924)
          0.3485234 = weight(abstract_txt:fusion in 4924) [ClassicSimilarity], result of:
            0.3485234 = score(doc=4924,freq=1.0), product of:
              0.7195114 = queryWeight, product of:
                6.723037 = boost
                7.750224 = idf(docFreq=51, maxDocs=44421)
                0.013808863 = queryNorm
              0.484389 = fieldWeight in 4924, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.750224 = idf(docFreq=51, maxDocs=44421)
                0.0625 = fieldNorm(doc=4924)
        0.24 = coord(6/25)
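
The content-similarity scores combine several per-term scores and then scale the sum by a coordination factor. As a hedged illustration, the sketch below recomputes the total for the first content match (doc 3502), assuming the classic Lucene TF-IDF formulas: queryWeight = boost * idf * queryNorm, fieldWeight = sqrt(freq) * idf * fieldNorm, and coord = matching terms / query terms. The constant and function names are made up for this example. The boosts, frequencies, and norms are copied from the listing, not derived.

```python
import math

MAX_DOCS = 44421
QUERY_NORM = 0.013808863
FIELD_NORM = 0.078125      # fieldNorm(doc=3502)
QUERY_TERM_COUNT = 25      # denominator of coord(6/25)

# (term, termFreq in doc 3502, query boost, docFreq), copied from the listing.
TERMS = [
    ("prior", 1.0, 1.0614234, 265),
    ("have", 2.0, 1.1101409, 4924),
    ("been", 2.0, 1.2541586, 3251),
    ("retrieval", 8.0, 2.4125896, 3732),
    ("data", 3.0, 3.466619, 4320),
    ("fusion", 4.0, 6.723037, 51),
]

def idf(doc_freq, max_docs=MAX_DOCS):
    # Classic inverse document frequency: 1 + ln(maxDocs / (docFreq + 1)).
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def term_score(freq, boost, doc_freq):
    # Per-term score = queryWeight * fieldWeight.
    query_weight = boost * idf(doc_freq) * QUERY_NORM
    field_weight = math.sqrt(freq) * idf(doc_freq) * FIELD_NORM
    return query_weight * field_weight

raw_sum = sum(term_score(freq, boost, df) for _, freq, boost, df in TERMS)
coord = len(TERMS) / QUERY_TERM_COUNT   # 6 of 25 query terms matched
total = raw_sum * coord
print(f"sum={raw_sum:.6f} coord={coord:.2f} total={total:.6f}")
```

The term "fusion" dominates the sum because it has both the highest boost and the highest idf (it occurs in only 51 of 44421 documents). The coord factor then penalizes the document for matching only 6 of the 25 query terms.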