Document (#29116)

Liu, X.
Croft, W.B.
Cluster-based retrieval using language models
SIGIR'04: Proceedings of the 27th Annual International ACM-SIGIR Conference on Research and Development in Information Retrieval. Ed.: K. Järvelin et al.
New York, NY : ACM Press

Similar documents (author)

  1. Croft, W.B.: Approaches to intelligent information retrieval (1987) 5.02
    5.023691 = sum of:
      5.023691 = weight(author_txt:croft in 1093) [ClassicSimilarity], result of:
        5.023691 = fieldWeight in 1093, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.037906 = idf(docFreq=38, maxDocs=44421)
          0.625 = fieldNorm(doc=1093)
  2. Croft, W.B.: Clustering large files of documents using the single link method (1977) 5.02
    5.023691 = sum of:
      5.023691 = weight(author_txt:croft in 5488) [ClassicSimilarity], result of:
        5.023691 = fieldWeight in 5488, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.037906 = idf(docFreq=38, maxDocs=44421)
          0.625 = fieldNorm(doc=5488)
  3. Croft, W.B.: Knowledge-based and statistical approaches to text retrieval (1993) 5.02
    5.023691 = sum of:
      5.023691 = weight(author_txt:croft in 7862) [ClassicSimilarity], result of:
        5.023691 = fieldWeight in 7862, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.037906 = idf(docFreq=38, maxDocs=44421)
          0.625 = fieldNorm(doc=7862)
  4. Croft, W.B.: Hypertext and information retrieval : what are the fundamental concepts? (1990) 5.02
    5.023691 = sum of:
      5.023691 = weight(author_txt:croft in 8002) [ClassicSimilarity], result of:
        5.023691 = fieldWeight in 8002, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.037906 = idf(docFreq=38, maxDocs=44421)
          0.625 = fieldNorm(doc=8002)
  5. Croft, W.B.: What do people want from information retrieval? : the top 10 research issues for companies that use and sell IR systems (1995) 5.02
    5.023691 = sum of:
      5.023691 = weight(author_txt:croft in 3470) [ClassicSimilarity], result of:
        5.023691 = fieldWeight in 3470, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.037906 = idf(docFreq=38, maxDocs=44421)
          0.625 = fieldNorm(doc=3470)
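
The author matches above all share the same score because each is a single-term match with identical tf, idf, and fieldNorm. As a minimal sketch, assuming the standard Lucene ClassicSimilarity definitions (tf = sqrt(freq), idf = 1 + ln(maxDocs / (docFreq + 1))), the fieldWeight can be recomputed from the values in the explain output. The function names are illustrative and not part of any library.

```python
import math


def classic_tf(freq: float) -> float:
    """Term-frequency factor: square root of the raw frequency."""
    return math.sqrt(freq)


def classic_idf(doc_freq: int, max_docs: int) -> float:
    """Inverse document frequency as assumed for ClassicSimilarity."""
    return 1.0 + math.log(max_docs / (doc_freq + 1.0))


def field_weight(freq: float, doc_freq: int, max_docs: int, field_norm: float) -> float:
    """fieldWeight = tf * idf * fieldNorm, mirroring the explain tree."""
    return classic_tf(freq) * classic_idf(doc_freq, max_docs) * field_norm


# Values copied from the explain output for author_txt:croft.
author_idf = classic_idf(doc_freq=38, max_docs=44421)
author_score = field_weight(freq=1.0, doc_freq=38, max_docs=44421, field_norm=0.625)

print(f"idf   = {author_idf:.6f}")
print(f"score = {author_score:.6f}")
```

Lucene computes these factors in single precision, so the last digits may differ slightly from the listed 8.037906 and 5.023691.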

Similar documents (content)

  1. Kang, I.-S.; Na, S.-H.; Kim, J.; Lee, J.-H.: Cluster-based patent retrieval (2007) 1.03
    1.026626 = sum of:
      1.026626 = product of:
        1.2319512 = sum of:
          0.08243286 = weight(abstract_txt:based in 1930) [ClassicSimilarity], result of:
            0.08243286 = score(doc=1930,freq=5.0), product of:
              0.18530555 = queryWeight, product of:
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.05821589 = queryNorm
              0.4448483 = fieldWeight in 1930, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.0625 = fieldNorm(doc=1930)
          0.047219794 = weight(abstract_txt:using in 1930) [ClassicSimilarity], result of:
            0.047219794 = score(doc=1930,freq=1.0), product of:
              0.2185551 = queryWeight, product of:
                1.086016 = boost
                3.4568708 = idf(docFreq=3806, maxDocs=44421)
                0.05821589 = queryNorm
              0.21605442 = fieldWeight in 1930, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4568708 = idf(docFreq=3806, maxDocs=44421)
                0.0625 = fieldNorm(doc=1930)
          0.13584584 = weight(abstract_txt:retrieval in 1930) [ClassicSimilarity], result of:
            0.13584584 = score(doc=1930,freq=8.0), product of:
              0.2210442 = queryWeight, product of:
                1.0921829 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.05821589 = queryNorm
              0.6145642 = fieldWeight in 1930, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.0625 = fieldNorm(doc=1930)
          0.11730226 = weight(abstract_txt:language in 1930) [ClassicSimilarity], result of:
            0.11730226 = score(doc=1930,freq=2.0), product of:
              0.3181797 = queryWeight, product of:
                1.3103641 = boost
                4.1709876 = idf(docFreq=1863, maxDocs=44421)
                0.05821589 = queryNorm
              0.3686667 = fieldWeight in 1930, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.1709876 = idf(docFreq=1863, maxDocs=44421)
                0.0625 = fieldNorm(doc=1930)
          0.84915054 = weight(abstract_txt:cluster in 1930) [ClassicSimilarity], result of:
            0.84915054 = score(doc=1930,freq=7.0), product of:
              0.78421533 = queryWeight, product of:
                2.0571854 = boost
                6.548176 = idf(docFreq=172, maxDocs=44421)
                0.05821589 = queryNorm
              1.0828028 = fieldWeight in 1930, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                6.548176 = idf(docFreq=172, maxDocs=44421)
                0.0625 = fieldNorm(doc=1930)
        0.8333333 = coord(5/6)
  2. Na, S.-H.; Kang, I.-S.; Roh, J.-E.; Lee, J.-H.: An empirical study of query expansion and cluster-based retrieval in language modeling approach (2007) 0.88
    0.8804934 = sum of:
      0.8804934 = product of:
        1.3207401 = sum of:
          0.110595286 = weight(abstract_txt:based in 1906) [ClassicSimilarity], result of:
            0.110595286 = score(doc=1906,freq=4.0), product of:
              0.18530555 = queryWeight, product of:
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.05821589 = queryNorm
              0.5968266 = fieldWeight in 1906, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.09375 = fieldNorm(doc=1906)
          0.07082969 = weight(abstract_txt:using in 1906) [ClassicSimilarity], result of:
            0.07082969 = score(doc=1906,freq=1.0), product of:
              0.2185551 = queryWeight, product of:
                1.086016 = boost
                3.4568708 = idf(docFreq=3806, maxDocs=44421)
                0.05821589 = queryNorm
              0.32408163 = fieldWeight in 1906, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4568708 = idf(docFreq=3806, maxDocs=44421)
                0.09375 = fieldNorm(doc=1906)
          0.17646894 = weight(abstract_txt:retrieval in 1906) [ClassicSimilarity], result of:
            0.17646894 = score(doc=1906,freq=6.0), product of:
              0.2210442 = queryWeight, product of:
                1.0921829 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.05821589 = queryNorm
              0.79834235 = fieldWeight in 1906, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.09375 = fieldNorm(doc=1906)
          0.9628462 = weight(abstract_txt:cluster in 1906) [ClassicSimilarity], result of:
            0.9628462 = score(doc=1906,freq=4.0), product of:
              0.78421533 = queryWeight, product of:
                2.0571854 = boost
                6.548176 = idf(docFreq=172, maxDocs=44421)
                0.05821589 = queryNorm
              1.227783 = fieldWeight in 1906, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.548176 = idf(docFreq=172, maxDocs=44421)
                0.09375 = fieldNorm(doc=1906)
        0.6666667 = coord(4/6)
  3. Wolfram, D.; Zhang, J.: An investigation of the influence of indexing exhaustivity and term distributions on a document space (2002) 0.65
    0.6460591 = sum of:
      0.6460591 = product of:
        0.77527094 = sum of:
          0.073730186 = weight(abstract_txt:based in 238) [ClassicSimilarity], result of:
            0.073730186 = score(doc=238,freq=4.0), product of:
              0.18530555 = queryWeight, product of:
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.05821589 = queryNorm
              0.3978844 = fieldWeight in 238, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.0625 = fieldNorm(doc=238)
          0.066778876 = weight(abstract_txt:using in 238) [ClassicSimilarity], result of:
            0.066778876 = score(doc=238,freq=2.0), product of:
              0.2185551 = queryWeight, product of:
                1.086016 = boost
                3.4568708 = idf(docFreq=3806, maxDocs=44421)
                0.05821589 = queryNorm
              0.3055471 = fieldWeight in 238, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4568708 = idf(docFreq=3806, maxDocs=44421)
                0.0625 = fieldNorm(doc=238)
          0.06792292 = weight(abstract_txt:retrieval in 238) [ClassicSimilarity], result of:
            0.06792292 = score(doc=238,freq=2.0), product of:
              0.2210442 = queryWeight, product of:
                1.0921829 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.05821589 = queryNorm
              0.3072821 = fieldWeight in 238, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.0625 = fieldNorm(doc=238)
          0.112948865 = weight(abstract_txt:models in 238) [ClassicSimilarity], result of:
            0.112948865 = score(doc=238,freq=1.0), product of:
              0.3909004 = queryWeight, product of:
                1.4524087 = boost
                4.623126 = idf(docFreq=1185, maxDocs=44421)
                0.05821589 = queryNorm
              0.28894538 = fieldWeight in 238, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.623126 = idf(docFreq=1185, maxDocs=44421)
                0.0625 = fieldNorm(doc=238)
          0.45389006 = weight(abstract_txt:cluster in 238) [ClassicSimilarity], result of:
            0.45389006 = score(doc=238,freq=2.0), product of:
              0.78421533 = queryWeight, product of:
                2.0571854 = boost
                6.548176 = idf(docFreq=172, maxDocs=44421)
                0.05821589 = queryNorm
              0.57878244 = fieldWeight in 238, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.548176 = idf(docFreq=172, maxDocs=44421)
                0.0625 = fieldNorm(doc=238)
        0.8333333 = coord(5/6)
  4. Fujita, S.: Technology survey and invalidity search : a comparative study of different tasks for Japanese patent document retrieval (2007) 0.62
    0.6239599 = sum of:
      0.6239599 = product of:
        0.9359398 = sum of:
          0.07082969 = weight(abstract_txt:using in 1918) [ClassicSimilarity], result of:
            0.07082969 = score(doc=1918,freq=1.0), product of:
              0.2185551 = queryWeight, product of:
                1.086016 = boost
                3.4568708 = idf(docFreq=3806, maxDocs=44421)
                0.05821589 = queryNorm
              0.32408163 = fieldWeight in 1918, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4568708 = idf(docFreq=3806, maxDocs=44421)
                0.09375 = fieldNorm(doc=1918)
          0.14408629 = weight(abstract_txt:retrieval in 1918) [ClassicSimilarity], result of:
            0.14408629 = score(doc=1918,freq=4.0), product of:
              0.2210442 = queryWeight, product of:
                1.0921829 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.05821589 = queryNorm
              0.6518438 = fieldWeight in 1918, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.09375 = fieldNorm(doc=1918)
          0.23960072 = weight(abstract_txt:models in 1918) [ClassicSimilarity], result of:
            0.23960072 = score(doc=1918,freq=2.0), product of:
              0.3909004 = queryWeight, product of:
                1.4524087 = boost
                4.623126 = idf(docFreq=1185, maxDocs=44421)
                0.05821589 = queryNorm
              0.6129457 = fieldWeight in 1918, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.623126 = idf(docFreq=1185, maxDocs=44421)
                0.09375 = fieldNorm(doc=1918)
          0.4814231 = weight(abstract_txt:cluster in 1918) [ClassicSimilarity], result of:
            0.4814231 = score(doc=1918,freq=1.0), product of:
              0.78421533 = queryWeight, product of:
                2.0571854 = boost
                6.548176 = idf(docFreq=172, maxDocs=44421)
                0.05821589 = queryNorm
              0.6138915 = fieldWeight in 1918, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.548176 = idf(docFreq=172, maxDocs=44421)
                0.09375 = fieldNorm(doc=1918)
        0.6666667 = coord(4/6)
  5. Belbachir, F.; Boughanem, M.: Using language models to improve opinion detection (2018) 0.56
    0.5601633 = sum of:
      0.5601633 = product of:
        0.67219603 = sum of:
          0.05587069 = weight(abstract_txt:based in 44) [ClassicSimilarity], result of:
            0.05587069 = score(doc=44,freq=3.0), product of:
              0.18530555 = queryWeight, product of:
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.05821589 = queryNorm
              0.30150574 = fieldWeight in 44, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.0546875 = fieldNorm(doc=44)
          0.0715637 = weight(abstract_txt:using in 44) [ClassicSimilarity], result of:
            0.0715637 = score(doc=44,freq=3.0), product of:
              0.2185551 = queryWeight, product of:
                1.086016 = boost
                3.4568708 = idf(docFreq=3806, maxDocs=44421)
                0.05821589 = queryNorm
              0.32744008 = fieldWeight in 44, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.4568708 = idf(docFreq=3806, maxDocs=44421)
                0.0546875 = fieldNorm(doc=44)
          0.10294022 = weight(abstract_txt:retrieval in 44) [ClassicSimilarity], result of:
            0.10294022 = score(doc=44,freq=6.0), product of:
              0.2210442 = queryWeight, product of:
                1.0921829 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.05821589 = queryNorm
              0.4656997 = fieldWeight in 44, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.0546875 = fieldNorm(doc=44)
          0.16228725 = weight(abstract_txt:language in 44) [ClassicSimilarity], result of:
            0.16228725 = score(doc=44,freq=5.0), product of:
              0.3181797 = queryWeight, product of:
                1.3103641 = boost
                4.1709876 = idf(docFreq=1863, maxDocs=44421)
                0.05821589 = queryNorm
              0.51004905 = fieldWeight in 44, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.1709876 = idf(docFreq=1863, maxDocs=44421)
                0.0546875 = fieldNorm(doc=44)
          0.2795342 = weight(abstract_txt:models in 44) [ClassicSimilarity], result of:
            0.2795342 = score(doc=44,freq=8.0), product of:
              0.3909004 = queryWeight, product of:
                1.4524087 = boost
                4.623126 = idf(docFreq=1185, maxDocs=44421)
                0.05821589 = queryNorm
              0.7151033 = fieldWeight in 44, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                4.623126 = idf(docFreq=1185, maxDocs=44421)
                0.0546875 = fieldNorm(doc=44)
        0.8333333 = coord(5/6)
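
The content scores above combine, per matching term, a queryWeight (boost * idf * queryNorm) and a fieldWeight (tf * idf * fieldNorm), sum the term scores, and scale the sum by coord (matched terms / query terms). The following is a rough sketch that recomputes the score of the first hit (doc 1930) from the numbers in its explain tree, assuming the standard ClassicSimilarity formulas. The helper names are illustrative, and the boost of 1.0 for "based" is an assumption, since the explain output lists no boost for that term.

```python
import math

MAX_DOCS = 44421
QUERY_NORM = 0.05821589  # taken from the explain output


def idf(doc_freq: int) -> float:
    """Inverse document frequency as assumed for ClassicSimilarity."""
    return 1.0 + math.log(MAX_DOCS / (doc_freq + 1.0))


def term_score(freq: float, doc_freq: int, field_norm: float, boost: float = 1.0) -> float:
    """Score of one term: queryWeight * fieldWeight."""
    term_idf = idf(doc_freq)
    query_weight = boost * term_idf * QUERY_NORM
    field_weight = math.sqrt(freq) * term_idf * field_norm
    return query_weight * field_weight


# (term, freq, docFreq, boost) for doc 1930; fieldNorm is 0.0625 for every term.
terms_doc_1930 = [
    ("based", 5.0, 5005, 1.0),  # boost 1.0 assumed (none listed)
    ("using", 1.0, 3806, 1.086016),
    ("retrieval", 8.0, 3732, 1.0921829),
    ("language", 2.0, 1863, 1.3103641),
    ("cluster", 7.0, 172, 2.0571854),
]

term_sum = sum(term_score(f, df, 0.0625, b) for _, f, df, b in terms_doc_1930)
coord = 5 / 6  # five of the six query terms matched
final_score = term_sum * coord

print(f"sum of term scores = {term_sum:.6f}")
print(f"final score        = {final_score:.6f}")
```

The result should agree with the listed 1.2319512 and 1.026626 up to single-precision rounding. The rare term "cluster" (docFreq=172) dominates the sum because its idf enters both the queryWeight and the fieldWeight.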