Document (#2300)

Author
Al-Hawamdeh, S.
Smith, G.
Willett, P.
Vere, R. de
Title
Using nearest-neighbour searching techniques to access full-text documents
Source
Online review. 15(1991) nos.3/4, S.173-190
Year
1991
Abstract
Summarises the results to date of a continuing programme of research at Sheffield Univ. to investigate the use of nearest-neighbour retrieval algorithms for full text searching. Given a natural language query statement, the research methods result in a ranking of the paragraphs comprising a full text document in order of decreasing similarity with the query, where the similarity for each paragraph is determined by the number of keyword stems that it has in common with the query
Theme
Retrievalalgorithmen

Similar documents (author)

  1. Al-Hawamdeh, S.; Smith, G.; Willett, P.: Paragraph-based access to full-text documents using a hypertext system (1991) 5.23
    5.2322407 = sum of:
      5.2322407 = sum of:
        0.83791935 = weight(author_txt:smith in 7503) [ClassicSimilarity], result of:
          0.83791935 = score(doc=7503,freq=1.0), product of:
            0.3461881 = queryWeight, product of:
              6.4544435 = idf(docFreq=189, maxDocs=44421)
              0.053635623 = queryNorm
            2.4204164 = fieldWeight in 7503, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              6.4544435 = idf(docFreq=189, maxDocs=44421)
              0.375 = fieldNorm(doc=7503)
        1.618283 = weight(author_txt:willett in 7503) [ClassicSimilarity], result of:
          1.618283 = score(doc=7503,freq=1.0), product of:
            0.53688383 = queryWeight, product of:
              1.245329 = boost
              8.037906 = idf(docFreq=38, maxDocs=44421)
              0.053635623 = queryNorm
            3.0142145 = fieldWeight in 7503, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.037906 = idf(docFreq=38, maxDocs=44421)
              0.375 = fieldNorm(doc=7503)
        2.7760384 = weight(author_txt:hawamdeh in 7503) [ClassicSimilarity], result of:
          2.7760384 = score(doc=7503,freq=1.0), product of:
            0.76935655 = queryWeight, product of:
              1.4907601 = boost
              9.622026 = idf(docFreq=7, maxDocs=44421)
              0.053635623 = queryNorm
            3.60826 = fieldWeight in 7503, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              9.622026 = idf(docFreq=7, maxDocs=44421)
              0.375 = fieldNorm(doc=7503)
    
  2. Hawamdeh, S.: Knowledge management : cultivating knowledge professionals (2003) 1.54
    1.5422435 = sum of:
      1.5422435 = product of:
        4.6267304 = sum of:
          4.6267304 = weight(author_txt:hawamdeh in 2465) [ClassicSimilarity], result of:
            4.6267304 = score(doc=2465,freq=1.0), product of:
              0.76935655 = queryWeight, product of:
                1.4907601 = boost
                9.622026 = idf(docFreq=7, maxDocs=44421)
                0.053635623 = queryNorm
              6.0137663 = fieldWeight in 2465, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.622026 = idf(docFreq=7, maxDocs=44421)
                0.625 = fieldNorm(doc=2465)
        0.33333334 = coord(1/3)
    
  3. AI-Hawamdeh, S.: Knowledge Management in Asia : introduction to the special topic section (2005) 1.23
    1.2337949 = sum of:
      1.2337949 = product of:
        3.7013845 = sum of:
          3.7013845 = weight(author_txt:hawamdeh in 5231) [ClassicSimilarity], result of:
            3.7013845 = score(doc=5231,freq=1.0), product of:
              0.76935655 = queryWeight, product of:
                1.4907601 = boost
                9.622026 = idf(docFreq=7, maxDocs=44421)
                0.053635623 = queryNorm
              4.811013 = fieldWeight in 5231, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.622026 = idf(docFreq=7, maxDocs=44421)
                0.5 = fieldNorm(doc=5231)
        0.33333334 = coord(1/3)
    
  4. AI-Hawamdeh, S.: Designing an interdisciplinary graduate program in knowledge management (2005) 1.23
    1.2337949 = sum of:
      1.2337949 = product of:
        3.7013845 = sum of:
          3.7013845 = weight(author_txt:hawamdeh in 5236) [ClassicSimilarity], result of:
            3.7013845 = score(doc=5236,freq=1.0), product of:
              0.76935655 = queryWeight, product of:
                1.4907601 = boost
                9.622026 = idf(docFreq=7, maxDocs=44421)
                0.053635623 = queryNorm
              4.811013 = fieldWeight in 5236, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.622026 = idf(docFreq=7, maxDocs=44421)
                0.5 = fieldNorm(doc=5236)
        0.33333334 = coord(1/3)
    
  5. Teng, S.; Hawamdeh, S.: Knowledge management in public libraries (2002) 1.23
    1.2337949 = sum of:
      1.2337949 = product of:
        3.7013845 = sum of:
          3.7013845 = weight(author_txt:hawamdeh in 807) [ClassicSimilarity], result of:
            3.7013845 = score(doc=807,freq=1.0), product of:
              0.76935655 = queryWeight, product of:
                1.4907601 = boost
                9.622026 = idf(docFreq=7, maxDocs=44421)
                0.053635623 = queryNorm
              4.811013 = fieldWeight in 807, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.622026 = idf(docFreq=7, maxDocs=44421)
                0.5 = fieldNorm(doc=807)
        0.33333334 = coord(1/3)
    

Similar documents (content)

  1. Mohan, K.C.: Boolean and nearest neighbour text searching in a multi-strategy retrieval system (1996) 0.16
    0.16239604 = sum of:
      0.16239604 = product of:
        1.0149753 = sum of:
          0.051123574 = weight(abstract_txt:searching in 324) [ClassicSimilarity], result of:
            0.051123574 = score(doc=324,freq=1.0), product of:
              0.10904891 = queryWeight, product of:
                1.5916098 = boost
                4.2862926 = idf(docFreq=1660, maxDocs=44421)
                0.015984641 = queryNorm
              0.46881324 = fieldWeight in 324, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2862926 = idf(docFreq=1660, maxDocs=44421)
                0.109375 = fieldNorm(doc=324)
          0.104659654 = weight(abstract_txt:query in 324) [ClassicSimilarity], result of:
            0.104659654 = score(doc=324,freq=1.0), product of:
              0.20125985 = queryWeight, product of:
                2.6481962 = boost
                4.754492 = idf(docFreq=1039, maxDocs=44421)
                0.015984641 = queryNorm
              0.5200225 = fieldWeight in 324, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.754492 = idf(docFreq=1039, maxDocs=44421)
                0.109375 = fieldNorm(doc=324)
          0.35093606 = weight(abstract_txt:nearest in 324) [ClassicSimilarity], result of:
            0.35093606 = score(doc=324,freq=1.0), product of:
              0.3938757 = queryWeight, product of:
                3.0248618 = boost
                8.146119 = idf(docFreq=34, maxDocs=44421)
                0.015984641 = queryNorm
              0.8909818 = fieldWeight in 324, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.146119 = idf(docFreq=34, maxDocs=44421)
                0.109375 = fieldNorm(doc=324)
          0.508256 = weight(abstract_txt:neighbour in 324) [ClassicSimilarity], result of:
            0.508256 = score(doc=324,freq=1.0), product of:
              0.5041915 = queryWeight, product of:
                3.4223444 = boost
                9.216561 = idf(docFreq=11, maxDocs=44421)
                0.015984641 = queryNorm
              1.0080614 = fieldWeight in 324, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.216561 = idf(docFreq=11, maxDocs=44421)
                0.109375 = fieldNorm(doc=324)
        0.16 = coord(4/25)
    
  2. Pirkola, A.; Jarvelin, K.: ¬The effect of anaphor and ellipsis resolution on proximity searching in a text database (1995) 0.15
    0.14624433 = sum of:
      0.14624433 = product of:
        0.6093514 = sum of:
          0.091323264 = weight(abstract_txt:keyword in 4156) [ClassicSimilarity], result of:
            0.091323264 = score(doc=4156,freq=5.0), product of:
              0.10821484 = queryWeight, product of:
                1.1211258 = boost
                6.038507 = idf(docFreq=287, maxDocs=44421)
                0.015984641 = queryNorm
              0.843907 = fieldWeight in 4156, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                6.038507 = idf(docFreq=287, maxDocs=44421)
                0.0625 = fieldNorm(doc=4156)
          0.114854604 = weight(abstract_txt:paragraphs in 4156) [ClassicSimilarity], result of:
            0.114854604 = score(doc=4156,freq=1.0), product of:
              0.21560301 = queryWeight, product of:
                1.5824804 = boost
                8.523414 = idf(docFreq=23, maxDocs=44421)
                0.015984641 = queryNorm
              0.53271335 = fieldWeight in 4156, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.523414 = idf(docFreq=23, maxDocs=44421)
                0.0625 = fieldNorm(doc=4156)
          0.029213471 = weight(abstract_txt:searching in 4156) [ClassicSimilarity], result of:
            0.029213471 = score(doc=4156,freq=1.0), product of:
              0.10904891 = queryWeight, product of:
                1.5916098 = boost
                4.2862926 = idf(docFreq=1660, maxDocs=44421)
                0.015984641 = queryNorm
              0.26789328 = fieldWeight in 4156, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2862926 = idf(docFreq=1660, maxDocs=44421)
                0.0625 = fieldNorm(doc=4156)
          0.19523329 = weight(abstract_txt:paragraph in 4156) [ClassicSimilarity], result of:
            0.19523329 = score(doc=4156,freq=2.0), product of:
              0.24373345 = queryWeight, product of:
                1.6825521 = boost
                9.06241 = idf(docFreq=13, maxDocs=44421)
                0.015984641 = queryNorm
              0.80101144 = fieldWeight in 4156, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.06241 = idf(docFreq=13, maxDocs=44421)
                0.0625 = fieldNorm(doc=4156)
          0.06359429 = weight(abstract_txt:text in 4156) [ClassicSimilarity], result of:
            0.06359429 = score(doc=4156,freq=3.0), product of:
              0.14537887 = queryWeight, product of:
                2.2507238 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.015984641 = queryNorm
              0.4374383 = fieldWeight in 4156, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.0625 = fieldNorm(doc=4156)
          0.11513248 = weight(abstract_txt:full in 4156) [ClassicSimilarity], result of:
            0.11513248 = score(doc=4156,freq=3.0), product of:
              0.21595062 = queryWeight, product of:
                2.7431452 = boost
                4.9249606 = idf(docFreq=876, maxDocs=44421)
                0.015984641 = queryNorm
              0.5331426 = fieldWeight in 4156, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.9249606 = idf(docFreq=876, maxDocs=44421)
                0.0625 = fieldNorm(doc=4156)
        0.24 = coord(6/25)
    
  3. Savoy, J.: ¬An extended vector-processing scheme for searching information in hypertext systems (1996) 0.13
    0.13060959 = sum of:
      0.13060959 = product of:
        0.6530479 = sum of:
          0.029213471 = weight(abstract_txt:searching in 4104) [ClassicSimilarity], result of:
            0.029213471 = score(doc=4104,freq=1.0), product of:
              0.10904891 = queryWeight, product of:
                1.5916098 = boost
                4.2862926 = idf(docFreq=1660, maxDocs=44421)
                0.015984641 = queryNorm
              0.26789328 = fieldWeight in 4104, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2862926 = idf(docFreq=1660, maxDocs=44421)
                0.0625 = fieldNorm(doc=4104)
          0.07306198 = weight(abstract_txt:similarity in 4104) [ClassicSimilarity], result of:
            0.07306198 = score(doc=4104,freq=1.0), product of:
              0.2009217 = queryWeight, product of:
                2.160426 = boost
                5.8181453 = idf(docFreq=358, maxDocs=44421)
                0.015984641 = queryNorm
              0.36363408 = fieldWeight in 4104, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8181453 = idf(docFreq=358, maxDocs=44421)
                0.0625 = fieldNorm(doc=4104)
          0.05980552 = weight(abstract_txt:query in 4104) [ClassicSimilarity], result of:
            0.05980552 = score(doc=4104,freq=1.0), product of:
              0.20125985 = queryWeight, product of:
                2.6481962 = boost
                4.754492 = idf(docFreq=1039, maxDocs=44421)
                0.015984641 = queryNorm
              0.29715574 = fieldWeight in 4104, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.754492 = idf(docFreq=1039, maxDocs=44421)
                0.0625 = fieldNorm(doc=4104)
          0.2005349 = weight(abstract_txt:nearest in 4104) [ClassicSimilarity], result of:
            0.2005349 = score(doc=4104,freq=1.0), product of:
              0.3938757 = queryWeight, product of:
                3.0248618 = boost
                8.146119 = idf(docFreq=34, maxDocs=44421)
                0.015984641 = queryNorm
              0.50913244 = fieldWeight in 4104, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.146119 = idf(docFreq=34, maxDocs=44421)
                0.0625 = fieldNorm(doc=4104)
          0.290432 = weight(abstract_txt:neighbour in 4104) [ClassicSimilarity], result of:
            0.290432 = score(doc=4104,freq=1.0), product of:
              0.5041915 = queryWeight, product of:
                3.4223444 = boost
                9.216561 = idf(docFreq=11, maxDocs=44421)
                0.015984641 = queryNorm
              0.5760351 = fieldWeight in 4104, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.216561 = idf(docFreq=11, maxDocs=44421)
                0.0625 = fieldNorm(doc=4104)
        0.2 = coord(5/25)
    
  4. Loughran, H.: ¬A review of nearest neighbour information retrieval (1994) 0.12
    0.12484328 = sum of:
      0.12484328 = product of:
        1.0403607 = sum of:
          0.058426943 = weight(abstract_txt:searching in 684) [ClassicSimilarity], result of:
            0.058426943 = score(doc=684,freq=1.0), product of:
              0.10904891 = queryWeight, product of:
                1.5916098 = boost
                4.2862926 = idf(docFreq=1660, maxDocs=44421)
                0.015984641 = queryNorm
              0.53578657 = fieldWeight in 684, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2862926 = idf(docFreq=1660, maxDocs=44421)
                0.125 = fieldNorm(doc=684)
          0.4010698 = weight(abstract_txt:nearest in 684) [ClassicSimilarity], result of:
            0.4010698 = score(doc=684,freq=1.0), product of:
              0.3938757 = queryWeight, product of:
                3.0248618 = boost
                8.146119 = idf(docFreq=34, maxDocs=44421)
                0.015984641 = queryNorm
              1.0182649 = fieldWeight in 684, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.146119 = idf(docFreq=34, maxDocs=44421)
                0.125 = fieldNorm(doc=684)
          0.580864 = weight(abstract_txt:neighbour in 684) [ClassicSimilarity], result of:
            0.580864 = score(doc=684,freq=1.0), product of:
              0.5041915 = queryWeight, product of:
                3.4223444 = boost
                9.216561 = idf(docFreq=11, maxDocs=44421)
                0.015984641 = queryNorm
              1.1520702 = fieldWeight in 684, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.216561 = idf(docFreq=11, maxDocs=44421)
                0.125 = fieldNorm(doc=684)
        0.12 = coord(3/25)
    
  5. Cribbin, T.: Discovering latent topical structure by second-order similarity analysis (2011) 0.12
    0.11532962 = sum of:
      0.11532962 = product of:
        0.7208102 = sum of:
          0.16337155 = weight(abstract_txt:similarity in 470) [ClassicSimilarity], result of:
            0.16337155 = score(doc=470,freq=5.0), product of:
              0.2009217 = queryWeight, product of:
                2.160426 = boost
                5.8181453 = idf(docFreq=358, maxDocs=44421)
                0.015984641 = queryNorm
              0.81311053 = fieldWeight in 470, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.8181453 = idf(docFreq=358, maxDocs=44421)
                0.0625 = fieldNorm(doc=470)
          0.06647177 = weight(abstract_txt:full in 470) [ClassicSimilarity], result of:
            0.06647177 = score(doc=470,freq=1.0), product of:
              0.21595062 = queryWeight, product of:
                2.7431452 = boost
                4.9249606 = idf(docFreq=876, maxDocs=44421)
                0.015984641 = queryNorm
              0.30781004 = fieldWeight in 470, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.9249606 = idf(docFreq=876, maxDocs=44421)
                0.0625 = fieldNorm(doc=470)
          0.2005349 = weight(abstract_txt:nearest in 470) [ClassicSimilarity], result of:
            0.2005349 = score(doc=470,freq=1.0), product of:
              0.3938757 = queryWeight, product of:
                3.0248618 = boost
                8.146119 = idf(docFreq=34, maxDocs=44421)
                0.015984641 = queryNorm
              0.50913244 = fieldWeight in 470, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.146119 = idf(docFreq=34, maxDocs=44421)
                0.0625 = fieldNorm(doc=470)
          0.290432 = weight(abstract_txt:neighbour in 470) [ClassicSimilarity], result of:
            0.290432 = score(doc=470,freq=1.0), product of:
              0.5041915 = queryWeight, product of:
                3.4223444 = boost
                9.216561 = idf(docFreq=11, maxDocs=44421)
                0.015984641 = queryNorm
              0.5760351 = fieldWeight in 470, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.216561 = idf(docFreq=11, maxDocs=44421)
                0.0625 = fieldNorm(doc=470)
        0.16 = coord(4/25)