Document (#40479)

Author
Layfield, C.
Azzopardi, J,
Staff, C.
Title
Experiments with document retrieval from small text collections using Latent Semantic Analysis or term similarity with query coordination and automatic relevance feedback
Source
Semantic keyword-based search on structured data sources: COST Action IC1302. Second International KEYSTONE Conference, IKC 2016, Cluj-Napoca, Romania, September 8-9, 2016, Revised Selected Papers. Eds.: A. Calì, A. et al
Imprint
Springer International Publishing
Year
2017
Pages
S.25-36
Series
Information Systems and Applications, incl. Internet/Web, and HCI; 10151
Abstract
One of the problems faced by users of databases containing textual documents is the difficulty in retrieving relevant results due to the diverse vocabulary used in queries and contained in relevant documents, especially when there are only a small number of relevant documents. This problem is known as the Vocabulary Gap. The PIKES team have constructed a small test collection of 331 articles extracted from a blog and a Gold Standard for 35 queries selected from the blog's search log so the results of different approaches to semantic search can be compared. So far, prior approaches include recognising Named Entities in documents and queries, and relations including temporal relations, and represent them as `semantic layers' in a retrieval system index. In this work, we take two different approaches that do not involve Named Entity Recognition. In the first approach, we process an unannotated version of the PIKES document collection using Latent Semantic Analysis and use a combination of query coordination and automatic relevance feedback with which we outperform prior work. However, this approach is highly dependent on the underlying collection, and is not necessarily scalable to massive collections. In our second approach, we use an LSA Model generated by SEMILAR from a Wikipedia dump to generate a Term Similarity Matrix (TSM). We automatically expand the queries in the PIKES test collection with related terms from the TSM and submit them to a term-by-document matrix derived by indexing the PIKES collection using the Vector Space Model. Coupled with a combination of query coordination and automatic relevance feedback we also outperform prior work with this approach. The advantage of the second approach is that it is independent of the underlying document collection.
Content
Vgl. auch: http://www.keystone-cost.eu/ikc2016/program.php.
Theme
Semantisches Umfeld in Indexierung u. Retrieval
Object
Latent Semantic Analysis

Similar documents (author)

  1. Baillie, M.; Azzopardi, L.; Ruthven, I.: Evaluating epistemic uncertainty under incomplete assessments (2008) 3.61
    3.60826 = sum of:
      3.60826 = weight(author_txt:azzopardi in 3065) [ClassicSimilarity], result of:
        3.60826 = fieldWeight in 3065, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.622026 = idf(docFreq=7, maxDocs=44421)
          0.375 = fieldNorm(doc=3065)
    
  2. Balog, K.; Azzopardi, L.; Rijke, M. de: ¬A language modeling framework for expert finding (2009) 3.61
    3.60826 = sum of:
      3.60826 = weight(author_txt:azzopardi in 3447) [ClassicSimilarity], result of:
        3.60826 = fieldWeight in 3447, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.622026 = idf(docFreq=7, maxDocs=44421)
          0.375 = fieldNorm(doc=3447)
    
  3. Russell-Rose, T.; Chamberlain, J.; Azzopardi, L.: Information retrieval in the workplace : a comparison of professional search practices (2018) 3.61
    3.60826 = sum of:
      3.60826 = weight(author_txt:azzopardi in 48) [ClassicSimilarity], result of:
        3.60826 = fieldWeight in 48, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.622026 = idf(docFreq=7, maxDocs=44421)
          0.375 = fieldNorm(doc=48)
    
  4. Jahani, H.; Azzopardi, L.; Sanderson, M.: Measuring the retrievability of digital library content using analytics data (2024) 3.61
    3.60826 = sum of:
      3.60826 = weight(author_txt:azzopardi in 2386) [ClassicSimilarity], result of:
        3.60826 = fieldWeight in 2386, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.622026 = idf(docFreq=7, maxDocs=44421)
          0.375 = fieldNorm(doc=2386)
    
  5. Azzopardi, J.; Benedetti, F.; Guerra, F.; Lupu, M.: Back to the sketch-board : integrating keyword search, semantics, and information retrieval (2017) 3.01
    3.0068831 = sum of:
      3.0068831 = weight(author_txt:azzopardi in 4484) [ClassicSimilarity], result of:
        3.0068831 = fieldWeight in 4484, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.622026 = idf(docFreq=7, maxDocs=44421)
          0.3125 = fieldNorm(doc=4484)
    

Similar documents (content)

  1. Deerwester, S.C.; Dumais, S.T.; Landauer, T.K.; Furnas, G.W.; Harshman, R.A.: Indexing by latent semantic analysis (1990) 0.44
    0.44358987 = sum of:
      0.44358987 = product of:
        0.9241456 = sum of:
          0.06917649 = weight(abstract_txt:combination in 3399) [ClassicSimilarity], result of:
            0.06917649 = score(doc=3399,freq=1.0), product of:
              0.1529828 = queryWeight, product of:
                1.0182699 = boost
                5.787965 = idf(docFreq=369, maxDocs=44421)
                0.02595696 = queryNorm
              0.45218474 = fieldWeight in 3399, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.787965 = idf(docFreq=369, maxDocs=44421)
                0.078125 = fieldNorm(doc=3399)
          0.16212896 = weight(abstract_txt:matrix in 3399) [ClassicSimilarity], result of:
            0.16212896 = score(doc=3399,freq=2.0), product of:
              0.21424003 = queryWeight, product of:
                1.2050135 = boost
                6.849437 = idf(docFreq=127, maxDocs=44421)
                0.02595696 = queryNorm
              0.7567631 = fieldWeight in 3399, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.849437 = idf(docFreq=127, maxDocs=44421)
                0.078125 = fieldNorm(doc=3399)
          0.026502043 = weight(abstract_txt:from in 3399) [ClassicSimilarity], result of:
            0.026502043 = score(doc=3399,freq=2.0), product of:
              0.08692803 = queryWeight, product of:
                1.2136446 = boost
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.02595696 = queryNorm
              0.30487338 = fieldWeight in 3399, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.078125 = fieldNorm(doc=3399)
          0.053111114 = weight(abstract_txt:relevant in 3399) [ClassicSimilarity], result of:
            0.053111114 = score(doc=3399,freq=1.0), product of:
              0.14683321 = queryWeight, product of:
                1.221798 = boost
                4.6298943 = idf(docFreq=1177, maxDocs=44421)
                0.02595696 = queryNorm
              0.3617105 = fieldWeight in 3399, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6298943 = idf(docFreq=1177, maxDocs=44421)
                0.078125 = fieldNorm(doc=3399)
          0.058987495 = weight(abstract_txt:term in 3399) [ClassicSimilarity], result of:
            0.058987495 = score(doc=3399,freq=1.0), product of:
              0.15747344 = queryWeight, product of:
                1.2652924 = boost
                4.794713 = idf(docFreq=998, maxDocs=44421)
                0.02595696 = queryNorm
              0.37458694 = fieldWeight in 3399, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.794713 = idf(docFreq=998, maxDocs=44421)
                0.078125 = fieldNorm(doc=3399)
          0.023540942 = weight(abstract_txt:with in 3399) [ClassicSimilarity], result of:
            0.023540942 = score(doc=3399,freq=2.0), product of:
              0.08535912 = queryWeight, product of:
                1.317429 = boost
                2.4961398 = idf(docFreq=9949, maxDocs=44421)
                0.02595696 = queryNorm
              0.2757871 = fieldWeight in 3399, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4961398 = idf(docFreq=9949, maxDocs=44421)
                0.078125 = fieldNorm(doc=3399)
          0.106148824 = weight(abstract_txt:automatic in 3399) [ClassicSimilarity], result of:
            0.106148824 = score(doc=3399,freq=2.0), product of:
              0.18491302 = queryWeight, product of:
                1.3711059 = boost
                5.1956835 = idf(docFreq=668, maxDocs=44421)
                0.02595696 = queryNorm
              0.5740473 = fieldWeight in 3399, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.1956835 = idf(docFreq=668, maxDocs=44421)
                0.078125 = fieldNorm(doc=3399)
          0.10004207 = weight(abstract_txt:documents in 3399) [ClassicSimilarity], result of:
            0.10004207 = score(doc=3399,freq=4.0), product of:
              0.15527995 = queryWeight, product of:
                1.4508225 = boost
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.02595696 = queryNorm
              0.64426905 = fieldWeight in 3399, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.078125 = fieldNorm(doc=3399)
          0.07990197 = weight(abstract_txt:document in 3399) [ClassicSimilarity], result of:
            0.07990197 = score(doc=3399,freq=2.0), product of:
              0.16841286 = queryWeight, product of:
                1.5109296 = boost
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.02595696 = queryNorm
              0.47444102 = fieldWeight in 3399, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.078125 = fieldNorm(doc=3399)
          0.06392248 = weight(abstract_txt:semantic in 3399) [ClassicSimilarity], result of:
            0.06392248 = score(doc=3399,freq=1.0), product of:
              0.18285887 = queryWeight, product of:
                1.5743983 = boost
                4.4745317 = idf(docFreq=1375, maxDocs=44421)
                0.02595696 = queryNorm
              0.34957278 = fieldWeight in 3399, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4745317 = idf(docFreq=1375, maxDocs=44421)
                0.078125 = fieldNorm(doc=3399)
          0.046701856 = weight(abstract_txt:approach in 3399) [ClassicSimilarity], result of:
            0.046701856 = score(doc=3399,freq=1.0), product of:
              0.15978636 = queryWeight, product of:
                1.6454377 = boost
                3.741144 = idf(docFreq=2864, maxDocs=44421)
                0.02595696 = queryNorm
              0.29227686 = fieldWeight in 3399, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.741144 = idf(docFreq=2864, maxDocs=44421)
                0.078125 = fieldNorm(doc=3399)
          0.13398136 = weight(abstract_txt:queries in 3399) [ClassicSimilarity], result of:
            0.13398136 = score(doc=3399,freq=2.0), product of:
              0.2377022 = queryWeight, product of:
                1.7950362 = boost
                5.1015973 = idf(docFreq=734, maxDocs=44421)
                0.02595696 = queryNorm
              0.56365216 = fieldWeight in 3399, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.1015973 = idf(docFreq=734, maxDocs=44421)
                0.078125 = fieldNorm(doc=3399)
        0.48 = coord(12/25)
    
  2. Dumais, S.T.: Latent semantic analysis (2003) 0.40
    0.39727053 = sum of:
      0.39727053 = product of:
        0.58422136 = sum of:
          0.011647889 = weight(abstract_txt:work in 3462) [ClassicSimilarity], result of:
            0.011647889 = score(doc=3462,freq=1.0), product of:
              0.09836159 = queryWeight, product of:
                3.7894108 = idf(docFreq=2729, maxDocs=44421)
                0.02595696 = queryNorm
              0.11841909 = fieldWeight in 3462, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.7894108 = idf(docFreq=2729, maxDocs=44421)
                0.03125 = fieldNorm(doc=3462)
          0.027670596 = weight(abstract_txt:combination in 3462) [ClassicSimilarity], result of:
            0.027670596 = score(doc=3462,freq=1.0), product of:
              0.1529828 = queryWeight, product of:
                1.0182699 = boost
                5.787965 = idf(docFreq=369, maxDocs=44421)
                0.02595696 = queryNorm
              0.1808739 = fieldWeight in 3462, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.787965 = idf(docFreq=369, maxDocs=44421)
                0.03125 = fieldNorm(doc=3462)
          0.028105712 = weight(abstract_txt:similarity in 3462) [ClassicSimilarity], result of:
            0.028105712 = score(doc=3462,freq=1.0), product of:
              0.15458238 = queryWeight, product of:
                1.0235796 = boost
                5.8181453 = idf(docFreq=358, maxDocs=44421)
                0.02595696 = queryNorm
              0.18181704 = fieldWeight in 3462, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8181453 = idf(docFreq=358, maxDocs=44421)
                0.03125 = fieldNorm(doc=3462)
          0.04585699 = weight(abstract_txt:matrix in 3462) [ClassicSimilarity], result of:
            0.04585699 = score(doc=3462,freq=1.0), product of:
              0.21424003 = queryWeight, product of:
                1.2050135 = boost
                6.849437 = idf(docFreq=127, maxDocs=44421)
                0.02595696 = queryNorm
              0.21404491 = fieldWeight in 3462, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.849437 = idf(docFreq=127, maxDocs=44421)
                0.03125 = fieldNorm(doc=3462)
          0.036044657 = weight(abstract_txt:approaches in 3462) [ClassicSimilarity], result of:
            0.036044657 = score(doc=3462,freq=3.0), product of:
              0.14482634 = queryWeight, product of:
                1.2134196 = boost
                4.5981455 = idf(docFreq=1215, maxDocs=44421)
                0.02595696 = queryNorm
              0.24888192 = fieldWeight in 3462, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.5981455 = idf(docFreq=1215, maxDocs=44421)
                0.03125 = fieldNorm(doc=3462)
          0.00749591 = weight(abstract_txt:from in 3462) [ClassicSimilarity], result of:
            0.00749591 = score(doc=3462,freq=1.0), product of:
              0.08692803 = queryWeight, product of:
                1.2136446 = boost
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.02595696 = queryNorm
              0.08623122 = fieldWeight in 3462, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.03125 = fieldNorm(doc=3462)
          0.036796458 = weight(abstract_txt:relevant in 3462) [ClassicSimilarity], result of:
            0.036796458 = score(doc=3462,freq=3.0), product of:
              0.14683321 = queryWeight, product of:
                1.221798 = boost
                4.6298943 = idf(docFreq=1177, maxDocs=44421)
                0.02595696 = queryNorm
              0.25060037 = fieldWeight in 3462, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.6298943 = idf(docFreq=1177, maxDocs=44421)
                0.03125 = fieldNorm(doc=3462)
          0.04877907 = weight(abstract_txt:latent in 3462) [ClassicSimilarity], result of:
            0.04877907 = score(doc=3462,freq=1.0), product of:
              0.22324717 = queryWeight, product of:
                1.2300835 = boost
                6.9919376 = idf(docFreq=110, maxDocs=44421)
                0.02595696 = queryNorm
              0.21849805 = fieldWeight in 3462, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9919376 = idf(docFreq=110, maxDocs=44421)
                0.03125 = fieldNorm(doc=3462)
          0.05144338 = weight(abstract_txt:query in 3462) [ClassicSimilarity], result of:
            0.05144338 = score(doc=3462,freq=5.0), product of:
              0.15484257 = queryWeight, product of:
                1.2546784 = boost
                4.754492 = idf(docFreq=1039, maxDocs=44421)
                0.02595696 = queryNorm
              0.3322302 = fieldWeight in 3462, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.754492 = idf(docFreq=1039, maxDocs=44421)
                0.03125 = fieldNorm(doc=3462)
          0.023595 = weight(abstract_txt:term in 3462) [ClassicSimilarity], result of:
            0.023595 = score(doc=3462,freq=1.0), product of:
              0.15747344 = queryWeight, product of:
                1.2652924 = boost
                4.794713 = idf(docFreq=998, maxDocs=44421)
                0.02595696 = queryNorm
              0.14983478 = fieldWeight in 3462, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.794713 = idf(docFreq=998, maxDocs=44421)
                0.03125 = fieldNorm(doc=3462)
          0.009416377 = weight(abstract_txt:with in 3462) [ClassicSimilarity], result of:
            0.009416377 = score(doc=3462,freq=2.0), product of:
              0.08535912 = queryWeight, product of:
                1.317429 = boost
                2.4961398 = idf(docFreq=9949, maxDocs=44421)
                0.02595696 = queryNorm
              0.11031483 = fieldWeight in 3462, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4961398 = idf(docFreq=9949, maxDocs=44421)
                0.03125 = fieldNorm(doc=3462)
          0.042459533 = weight(abstract_txt:automatic in 3462) [ClassicSimilarity], result of:
            0.042459533 = score(doc=3462,freq=2.0), product of:
              0.18491302 = queryWeight, product of:
                1.3711059 = boost
                5.1956835 = idf(docFreq=668, maxDocs=44421)
                0.02595696 = queryNorm
              0.22961894 = fieldWeight in 3462, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.1956835 = idf(docFreq=668, maxDocs=44421)
                0.03125 = fieldNorm(doc=3462)
          0.06327216 = weight(abstract_txt:documents in 3462) [ClassicSimilarity], result of:
            0.06327216 = score(doc=3462,freq=10.0), product of:
              0.15527995 = queryWeight, product of:
                1.4508225 = boost
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.02595696 = queryNorm
              0.40747154 = fieldWeight in 3462, product of:
                3.1622777 = tf(freq=10.0), with freq of:
                  10.0 = termFreq=10.0
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.03125 = fieldNorm(doc=3462)
          0.03196079 = weight(abstract_txt:document in 3462) [ClassicSimilarity], result of:
            0.03196079 = score(doc=3462,freq=2.0), product of:
              0.16841286 = queryWeight, product of:
                1.5109296 = boost
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.02595696 = queryNorm
              0.1897764 = fieldWeight in 3462, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.03125 = fieldNorm(doc=3462)
          0.044286795 = weight(abstract_txt:semantic in 3462) [ClassicSimilarity], result of:
            0.044286795 = score(doc=3462,freq=3.0), product of:
              0.18285887 = queryWeight, product of:
                1.5743983 = boost
                4.4745317 = idf(docFreq=1375, maxDocs=44421)
                0.02595696 = queryNorm
              0.24219112 = fieldWeight in 3462, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.4745317 = idf(docFreq=1375, maxDocs=44421)
                0.03125 = fieldNorm(doc=3462)
          0.032355994 = weight(abstract_txt:approach in 3462) [ClassicSimilarity], result of:
            0.032355994 = score(doc=3462,freq=3.0), product of:
              0.15978636 = queryWeight, product of:
                1.6454377 = boost
                3.741144 = idf(docFreq=2864, maxDocs=44421)
                0.02595696 = queryNorm
              0.20249535 = fieldWeight in 3462, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.741144 = idf(docFreq=2864, maxDocs=44421)
                0.03125 = fieldNorm(doc=3462)
          0.043034054 = weight(abstract_txt:collection in 3462) [ClassicSimilarity], result of:
            0.043034054 = score(doc=3462,freq=1.0), product of:
              0.29617304 = queryWeight, product of:
                2.4540024 = boost
                4.649612 = idf(docFreq=1154, maxDocs=44421)
                0.02595696 = queryNorm
              0.14530037 = fieldWeight in 3462, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.649612 = idf(docFreq=1154, maxDocs=44421)
                0.03125 = fieldNorm(doc=3462)
        0.68 = coord(17/25)
    
  3. Crouch, C.J.; Crouch, D.B.; Chen, Q.; Holtz, S.J.: Improving the retrieval effectiveness of very short queries (2002) 0.38
    0.37952062 = sum of:
      0.37952062 = product of:
        0.790668 = sum of:
          0.023295779 = weight(abstract_txt:work in 3572) [ClassicSimilarity], result of:
            0.023295779 = score(doc=3572,freq=1.0), product of:
              0.09836159 = queryWeight, product of:
                3.7894108 = idf(docFreq=2729, maxDocs=44421)
                0.02595696 = queryNorm
              0.23683818 = fieldWeight in 3572, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.7894108 = idf(docFreq=2729, maxDocs=44421)
                0.0625 = fieldNorm(doc=3572)
          0.021201635 = weight(abstract_txt:from in 3572) [ClassicSimilarity], result of:
            0.021201635 = score(doc=3572,freq=2.0), product of:
              0.08692803 = queryWeight, product of:
                1.2136446 = boost
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.02595696 = queryNorm
              0.2438987 = fieldWeight in 3572, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.0625 = fieldNorm(doc=3572)
          0.073592916 = weight(abstract_txt:relevant in 3572) [ClassicSimilarity], result of:
            0.073592916 = score(doc=3572,freq=3.0), product of:
              0.14683321 = queryWeight, product of:
                1.221798 = boost
                4.6298943 = idf(docFreq=1177, maxDocs=44421)
                0.02595696 = queryNorm
              0.50120074 = fieldWeight in 3572, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.6298943 = idf(docFreq=1177, maxDocs=44421)
                0.0625 = fieldNorm(doc=3572)
          0.0650713 = weight(abstract_txt:query in 3572) [ClassicSimilarity], result of:
            0.0650713 = score(doc=3572,freq=2.0), product of:
              0.15484257 = queryWeight, product of:
                1.2546784 = boost
                4.754492 = idf(docFreq=1039, maxDocs=44421)
                0.02595696 = queryNorm
              0.42024165 = fieldWeight in 3572, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.754492 = idf(docFreq=1039, maxDocs=44421)
                0.0625 = fieldNorm(doc=3572)
          0.013316768 = weight(abstract_txt:with in 3572) [ClassicSimilarity], result of:
            0.013316768 = score(doc=3572,freq=1.0), product of:
              0.08535912 = queryWeight, product of:
                1.317429 = boost
                2.4961398 = idf(docFreq=9949, maxDocs=44421)
                0.02595696 = queryNorm
              0.15600874 = fieldWeight in 3572, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4961398 = idf(docFreq=9949, maxDocs=44421)
                0.0625 = fieldNorm(doc=3572)
          0.060046848 = weight(abstract_txt:automatic in 3572) [ClassicSimilarity], result of:
            0.060046848 = score(doc=3572,freq=1.0), product of:
              0.18491302 = queryWeight, product of:
                1.3711059 = boost
                5.1956835 = idf(docFreq=668, maxDocs=44421)
                0.02595696 = queryNorm
              0.32473022 = fieldWeight in 3572, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1956835 = idf(docFreq=668, maxDocs=44421)
                0.0625 = fieldNorm(doc=3572)
          0.08003365 = weight(abstract_txt:documents in 3572) [ClassicSimilarity], result of:
            0.08003365 = score(doc=3572,freq=4.0), product of:
              0.15527995 = queryWeight, product of:
                1.4508225 = boost
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.02595696 = queryNorm
              0.51541525 = fieldWeight in 3572, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.0625 = fieldNorm(doc=3572)
          0.04519938 = weight(abstract_txt:document in 3572) [ClassicSimilarity], result of:
            0.04519938 = score(doc=3572,freq=1.0), product of:
              0.16841286 = queryWeight, product of:
                1.5109296 = boost
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.02595696 = queryNorm
              0.26838437 = fieldWeight in 3572, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.0625 = fieldNorm(doc=3572)
          0.12685525 = weight(abstract_txt:feedback in 3572) [ClassicSimilarity], result of:
            0.12685525 = score(doc=3572,freq=2.0), product of:
              0.24164048 = queryWeight, product of:
                1.567372 = boost
                5.9394164 = idf(docFreq=317, maxDocs=44421)
                0.02595696 = queryNorm
              0.5249752 = fieldWeight in 3572, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.9394164 = idf(docFreq=317, maxDocs=44421)
                0.0625 = fieldNorm(doc=3572)
          0.06471199 = weight(abstract_txt:approach in 3572) [ClassicSimilarity], result of:
            0.06471199 = score(doc=3572,freq=3.0), product of:
              0.15978636 = queryWeight, product of:
                1.6454377 = boost
                3.741144 = idf(docFreq=2864, maxDocs=44421)
                0.02595696 = queryNorm
              0.4049907 = fieldWeight in 3572, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.741144 = idf(docFreq=2864, maxDocs=44421)
                0.0625 = fieldNorm(doc=3572)
          0.13127439 = weight(abstract_txt:queries in 3572) [ClassicSimilarity], result of:
            0.13127439 = score(doc=3572,freq=3.0), product of:
              0.2377022 = queryWeight, product of:
                1.7950362 = boost
                5.1015973 = idf(docFreq=734, maxDocs=44421)
                0.02595696 = queryNorm
              0.5522641 = fieldWeight in 3572, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.1015973 = idf(docFreq=734, maxDocs=44421)
                0.0625 = fieldNorm(doc=3572)
          0.08606811 = weight(abstract_txt:collection in 3572) [ClassicSimilarity], result of:
            0.08606811 = score(doc=3572,freq=1.0), product of:
              0.29617304 = queryWeight, product of:
                2.4540024 = boost
                4.649612 = idf(docFreq=1154, maxDocs=44421)
                0.02595696 = queryNorm
              0.29060075 = fieldWeight in 3572, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.649612 = idf(docFreq=1154, maxDocs=44421)
                0.0625 = fieldNorm(doc=3572)
        0.48 = coord(12/25)
    
  4. Cai, F.; Wang, S.; Rijke, M.de: Behavior-based personalization in web search (2017) 0.37
    0.36782357 = sum of:
      0.36782357 = product of:
        0.7662991 = sum of:
          0.023295779 = weight(abstract_txt:work in 4527) [ClassicSimilarity], result of:
            0.023295779 = score(doc=4527,freq=1.0), product of:
              0.09836159 = queryWeight, product of:
                3.7894108 = idf(docFreq=2729, maxDocs=44421)
                0.02595696 = queryNorm
              0.23683818 = fieldWeight in 4527, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.7894108 = idf(docFreq=2729, maxDocs=44421)
                0.0625 = fieldNorm(doc=4527)
          0.05534119 = weight(abstract_txt:combination in 4527) [ClassicSimilarity], result of:
            0.05534119 = score(doc=4527,freq=1.0), product of:
              0.1529828 = queryWeight, product of:
                1.0182699 = boost
                5.787965 = idf(docFreq=369, maxDocs=44421)
                0.02595696 = queryNorm
              0.3617478 = fieldWeight in 4527, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.787965 = idf(docFreq=369, maxDocs=44421)
                0.0625 = fieldNorm(doc=4527)
          0.09171398 = weight(abstract_txt:matrix in 4527) [ClassicSimilarity], result of:
            0.09171398 = score(doc=4527,freq=1.0), product of:
              0.21424003 = queryWeight, product of:
                1.2050135 = boost
                6.849437 = idf(docFreq=127, maxDocs=44421)
                0.02595696 = queryNorm
              0.42808983 = fieldWeight in 4527, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.849437 = idf(docFreq=127, maxDocs=44421)
                0.0625 = fieldNorm(doc=4527)
          0.041620787 = weight(abstract_txt:approaches in 4527) [ClassicSimilarity], result of:
            0.041620787 = score(doc=4527,freq=1.0), product of:
              0.14482634 = queryWeight, product of:
                1.2134196 = boost
                4.5981455 = idf(docFreq=1215, maxDocs=44421)
                0.02595696 = queryNorm
              0.2873841 = fieldWeight in 4527, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5981455 = idf(docFreq=1215, maxDocs=44421)
                0.0625 = fieldNorm(doc=4527)
          0.025966592 = weight(abstract_txt:from in 4527) [ClassicSimilarity], result of:
            0.025966592 = score(doc=4527,freq=3.0), product of:
              0.08692803 = queryWeight, product of:
                1.2136446 = boost
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.02595696 = queryNorm
              0.29871368 = fieldWeight in 4527, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.0625 = fieldNorm(doc=4527)
          0.04248889 = weight(abstract_txt:relevant in 4527) [ClassicSimilarity], result of:
            0.04248889 = score(doc=4527,freq=1.0), product of:
              0.14683321 = queryWeight, product of:
                1.221798 = boost
                4.6298943 = idf(docFreq=1177, maxDocs=44421)
                0.02595696 = queryNorm
              0.2893684 = fieldWeight in 4527, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6298943 = idf(docFreq=1177, maxDocs=44421)
                0.0625 = fieldNorm(doc=4527)
          0.1127068 = weight(abstract_txt:query in 4527) [ClassicSimilarity], result of:
            0.1127068 = score(doc=4527,freq=6.0), product of:
              0.15484257 = queryWeight, product of:
                1.2546784 = boost
                4.754492 = idf(docFreq=1039, maxDocs=44421)
                0.02595696 = queryNorm
              0.72787994 = fieldWeight in 4527, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                4.754492 = idf(docFreq=1039, maxDocs=44421)
                0.0625 = fieldNorm(doc=4527)
          0.08882595 = weight(abstract_txt:relevance in 4527) [ClassicSimilarity], result of:
            0.08882595 = score(doc=4527,freq=3.0), product of:
              0.16645373 = queryWeight, product of:
                1.3008703 = boost
                4.929532 = idf(docFreq=872, maxDocs=44421)
                0.02595696 = queryNorm
              0.53363746 = fieldWeight in 4527, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.929532 = idf(docFreq=872, maxDocs=44421)
                0.0625 = fieldNorm(doc=4527)
          0.018832754 = weight(abstract_txt:with in 4527) [ClassicSimilarity], result of:
            0.018832754 = score(doc=4527,freq=2.0), product of:
              0.08535912 = queryWeight, product of:
                1.317429 = boost
                2.4961398 = idf(docFreq=9949, maxDocs=44421)
                0.02595696 = queryNorm
              0.22062966 = fieldWeight in 4527, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4961398 = idf(docFreq=9949, maxDocs=44421)
                0.0625 = fieldNorm(doc=4527)
          0.08003365 = weight(abstract_txt:documents in 4527) [ClassicSimilarity], result of:
            0.08003365 = score(doc=4527,freq=4.0), product of:
              0.15527995 = queryWeight, product of:
                1.4508225 = boost
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.02595696 = queryNorm
              0.51541525 = fieldWeight in 4527, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.0625 = fieldNorm(doc=4527)
          0.07828762 = weight(abstract_txt:document in 4527) [ClassicSimilarity], result of:
            0.07828762 = score(doc=4527,freq=3.0), product of:
              0.16841286 = queryWeight, product of:
                1.5109296 = boost
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.02595696 = queryNorm
              0.46485534 = fieldWeight in 4527, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.0625 = fieldNorm(doc=4527)
          0.107185096 = weight(abstract_txt:queries in 4527) [ClassicSimilarity], result of:
            0.107185096 = score(doc=4527,freq=2.0), product of:
              0.2377022 = queryWeight, product of:
                1.7950362 = boost
                5.1015973 = idf(docFreq=734, maxDocs=44421)
                0.02595696 = queryNorm
              0.45092174 = fieldWeight in 4527, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.1015973 = idf(docFreq=734, maxDocs=44421)
                0.0625 = fieldNorm(doc=4527)
        0.48 = coord(12/25)
    
  5. Talvensaari, T.; Laurikkala, J.; Järvelin, K.; Juhola, M.: ¬A study on automatic creation of a comparable document collection in cross-language information retrieval (2006) 0.36
    0.35614112 = sum of:
      0.35614112 = product of:
        0.7419607 = sum of:
          0.056211423 = weight(abstract_txt:similarity in 601) [ClassicSimilarity], result of:
            0.056211423 = score(doc=601,freq=1.0), product of:
              0.15458238 = queryWeight, product of:
                1.0235796 = boost
                5.8181453 = idf(docFreq=358, maxDocs=44421)
                0.02595696 = queryNorm
              0.36363408 = fieldWeight in 601, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8181453 = idf(docFreq=358, maxDocs=44421)
                0.0625 = fieldNorm(doc=601)
          0.021201635 = weight(abstract_txt:from in 601) [ClassicSimilarity], result of:
            0.021201635 = score(doc=601,freq=2.0), product of:
              0.08692803 = queryWeight, product of:
                1.2136446 = boost
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.02595696 = queryNorm
              0.2438987 = fieldWeight in 601, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.0625 = fieldNorm(doc=601)
          0.0650713 = weight(abstract_txt:query in 601) [ClassicSimilarity], result of:
            0.0650713 = score(doc=601,freq=2.0), product of:
              0.15484257 = queryWeight, product of:
                1.2546784 = boost
                4.754492 = idf(docFreq=1039, maxDocs=44421)
                0.02595696 = queryNorm
              0.42024165 = fieldWeight in 601, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.754492 = idf(docFreq=1039, maxDocs=44421)
                0.0625 = fieldNorm(doc=601)
          0.04719 = weight(abstract_txt:term in 601) [ClassicSimilarity], result of:
            0.04719 = score(doc=601,freq=1.0), product of:
              0.15747344 = queryWeight, product of:
                1.2652924 = boost
                4.794713 = idf(docFreq=998, maxDocs=44421)
                0.02595696 = queryNorm
              0.29966956 = fieldWeight in 601, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.794713 = idf(docFreq=998, maxDocs=44421)
                0.0625 = fieldNorm(doc=601)
          0.051283687 = weight(abstract_txt:relevance in 601) [ClassicSimilarity], result of:
            0.051283687 = score(doc=601,freq=1.0), product of:
              0.16645373 = queryWeight, product of:
                1.3008703 = boost
                4.929532 = idf(docFreq=872, maxDocs=44421)
                0.02595696 = queryNorm
              0.30809575 = fieldWeight in 601, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.929532 = idf(docFreq=872, maxDocs=44421)
                0.0625 = fieldNorm(doc=601)
          0.032619286 = weight(abstract_txt:with in 601) [ClassicSimilarity], result of:
            0.032619286 = score(doc=601,freq=6.0), product of:
              0.08535912 = queryWeight, product of:
                1.317429 = boost
                2.4961398 = idf(docFreq=9949, maxDocs=44421)
                0.02595696 = queryNorm
              0.3821418 = fieldWeight in 601, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                2.4961398 = idf(docFreq=9949, maxDocs=44421)
                0.0625 = fieldNorm(doc=601)
          0.06507057 = weight(abstract_txt:small in 601) [ClassicSimilarity], result of:
            0.06507057 = score(doc=601,freq=1.0), product of:
              0.19508795 = queryWeight, product of:
                1.4083236 = boost
                5.3367167 = idf(docFreq=580, maxDocs=44421)
                0.02595696 = queryNorm
              0.3335448 = fieldWeight in 601, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.3367167 = idf(docFreq=580, maxDocs=44421)
                0.0625 = fieldNorm(doc=601)
          0.040016826 = weight(abstract_txt:documents in 601) [ClassicSimilarity], result of:
            0.040016826 = score(doc=601,freq=1.0), product of:
              0.15527995 = queryWeight, product of:
                1.4508225 = boost
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.02595696 = queryNorm
              0.25770763 = fieldWeight in 601, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.0625 = fieldNorm(doc=601)
          0.101068884 = weight(abstract_txt:document in 601) [ClassicSimilarity], result of:
            0.101068884 = score(doc=601,freq=5.0), product of:
              0.16841286 = queryWeight, product of:
                1.5109296 = boost
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.02595696 = queryNorm
              0.6001257 = fieldWeight in 601, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.0625 = fieldNorm(doc=601)
          0.037361484 = weight(abstract_txt:approach in 601) [ClassicSimilarity], result of:
            0.037361484 = score(doc=601,freq=1.0), product of:
              0.15978636 = queryWeight, product of:
                1.6454377 = boost
                3.741144 = idf(docFreq=2864, maxDocs=44421)
                0.02595696 = queryNorm
              0.2338215 = fieldWeight in 601, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.741144 = idf(docFreq=2864, maxDocs=44421)
                0.0625 = fieldNorm(doc=601)
          0.07579131 = weight(abstract_txt:queries in 601) [ClassicSimilarity], result of:
            0.07579131 = score(doc=601,freq=1.0), product of:
              0.2377022 = queryWeight, product of:
                1.7950362 = boost
                5.1015973 = idf(docFreq=734, maxDocs=44421)
                0.02595696 = queryNorm
              0.31884983 = fieldWeight in 601, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1015973 = idf(docFreq=734, maxDocs=44421)
                0.0625 = fieldNorm(doc=601)
          0.14907433 = weight(abstract_txt:collection in 601) [ClassicSimilarity], result of:
            0.14907433 = score(doc=601,freq=3.0), product of:
              0.29617304 = queryWeight, product of:
                2.4540024 = boost
                4.649612 = idf(docFreq=1154, maxDocs=44421)
                0.02595696 = queryNorm
              0.50333524 = fieldWeight in 601, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.649612 = idf(docFreq=1154, maxDocs=44421)
                0.0625 = fieldNorm(doc=601)
        0.48 = coord(12/25)