Document (#3629)

Author
Johnson, B.
Peterson, E.
Title
Reviewing initial stopword selection
Source
Information technology and libraries. 11(1992) no.2, p.136-139
Year
1992
Abstract
Five years after a stopword list was drawn up for the online catalogue at Montana State University of Bozeman Libraries, it was found that the database had changed sufficiently to require a reevaluation of the stopword list. Tables present the original soft stopwords; the original soft stopword occurrences; all stopword occurrences over 4,000 times; and the increase in stopword occurrence over the 5 years.

Similar documents (author)

  1. Peterson, I.C.: Effective question negotiation in the reference interview (1997) 2.32
    2.3183877 = sum of:
      2.3183877 = product of:
        4.6367755 = sum of:
          4.6367755 = weight(author_txt:peterson in 2613) [ClassicSimilarity], result of:
            4.6367755 = score(doc=2613,freq=1.0), product of:
              0.83088154 = queryWeight, product of:
                1.2219592 = boost
                8.928879 = idf(docFreq=15, maxDocs=44421)
                0.07615273 = queryNorm
              5.5805492 = fieldWeight in 2613, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.928879 = idf(docFreq=15, maxDocs=44421)
                0.625 = fieldNorm(doc=2613)
        0.5 = coord(1/2)
    
  2. Peterson, R.E.: Eight Internet search engines compared (1996) 2.32
    2.3183877 = sum of:
      2.3183877 = product of:
        4.6367755 = sum of:
          4.6367755 = weight(author_txt:peterson in 3328) [ClassicSimilarity], result of:
            4.6367755 = score(doc=3328,freq=1.0), product of:
              0.83088154 = queryWeight, product of:
                1.2219592 = boost
                8.928879 = idf(docFreq=15, maxDocs=44421)
                0.07615273 = queryNorm
              5.5805492 = fieldWeight in 3328, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.928879 = idf(docFreq=15, maxDocs=44421)
                0.625 = fieldNorm(doc=3328)
        0.5 = coord(1/2)
    
  3. Peterson, B.J.: Knowledge management : 3M perspective (1997) 2.32
    2.3183877 = sum of:
      2.3183877 = product of:
        4.6367755 = sum of:
          4.6367755 = weight(author_txt:peterson in 4121) [ClassicSimilarity], result of:
            4.6367755 = score(doc=4121,freq=1.0), product of:
              0.83088154 = queryWeight, product of:
                1.2219592 = boost
                8.928879 = idf(docFreq=15, maxDocs=44421)
                0.07615273 = queryNorm
              5.5805492 = fieldWeight in 4121, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.928879 = idf(docFreq=15, maxDocs=44421)
                0.625 = fieldNorm(doc=4121)
        0.5 = coord(1/2)
    
  4. Peterson, E.: Parallel systems : the coexistence of subject cataloging and folksonomy (2008) 2.32
    2.3183877 = sum of:
      2.3183877 = product of:
        4.6367755 = sum of:
          4.6367755 = weight(author_txt:peterson in 1251) [ClassicSimilarity], result of:
            4.6367755 = score(doc=1251,freq=1.0), product of:
              0.83088154 = queryWeight, product of:
                1.2219592 = boost
                8.928879 = idf(docFreq=15, maxDocs=44421)
                0.07615273 = queryNorm
              5.5805492 = fieldWeight in 1251, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.928879 = idf(docFreq=15, maxDocs=44421)
                0.625 = fieldNorm(doc=1251)
        0.5 = coord(1/2)
    
  5. Peterson, G.M.: Characteristics of retracted open access biomedical literature : a bibliographic analysis (2013) 2.32
    2.3183877 = sum of:
      2.3183877 = product of:
        4.6367755 = sum of:
          4.6367755 = weight(author_txt:peterson in 2138) [ClassicSimilarity], result of:
            4.6367755 = score(doc=2138,freq=1.0), product of:
              0.83088154 = queryWeight, product of:
                1.2219592 = boost
                8.928879 = idf(docFreq=15, maxDocs=44421)
                0.07615273 = queryNorm
              5.5805492 = fieldWeight in 2138, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.928879 = idf(docFreq=15, maxDocs=44421)
                0.625 = fieldNorm(doc=2138)
        0.5 = coord(1/2)
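
The arithmetic behind the author matches above can be checked by hand. The following is a minimal sketch, assuming the usual Lucene ClassicSimilarity definitions idf = 1 + ln(maxDocs / (docFreq + 1)) and tf = sqrt(freq); the function names are hypothetical, and boost, queryNorm, fieldNorm and coord are simply copied from the explain output rather than derived.

```python
import math


def classic_idf(doc_freq: int, max_docs: int) -> float:
    # Assumed ClassicSimilarity idf: 1 + ln(maxDocs / (docFreq + 1)).
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def classic_tf(freq: float) -> float:
    # Assumed ClassicSimilarity tf: square root of the term frequency.
    return math.sqrt(freq)


def term_score(freq, doc_freq, max_docs, boost, query_norm, field_norm):
    # score = queryWeight * fieldWeight, as laid out in the explain tree.
    idf = classic_idf(doc_freq, max_docs)
    query_weight = boost * idf * query_norm
    field_weight = classic_tf(freq) * idf * field_norm
    return query_weight * field_weight


# Values copied from the explain output for author_txt:peterson.
raw = term_score(
    freq=1.0,
    doc_freq=15,
    max_docs=44421,
    boost=1.2219592,
    query_norm=0.07615273,
    field_norm=0.625,
)
final = raw * 0.5  # coord(1/2): one of the two query author terms matched

print(f"idf   = {classic_idf(15, 44421):.6f}")
print(f"raw   = {raw:.6f}")
print(f"final = {final:.6f}")
```

All five author hits share the same term, frequency and field norm, which is why they all receive the same score of about 2.32.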
    

Similar documents (content)

  1. Dolamic, L.; Savoy, J.: When stopword lists make the difference (2009) 0.21
    0.20929334 = sum of:
      0.20929334 = product of:
        1.3080834 = sum of:
          0.029998826 = weight(abstract_txt:drawn in 306) [ClassicSimilarity], result of:
            0.029998826 = score(doc=306,freq=1.0), product of:
              0.06168429 = queryWeight, product of:
                1.4324096 = boost
                6.225004 = idf(docFreq=238, maxDocs=44421)
                0.0069177956 = queryNorm
              0.48632845 = fieldWeight in 306, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.225004 = idf(docFreq=238, maxDocs=44421)
                0.078125 = fieldNorm(doc=306)
          0.038941894 = weight(abstract_txt:list in 306) [ClassicSimilarity], result of:
            0.038941894 = score(doc=306,freq=1.0), product of:
              0.09248255 = queryWeight, product of:
                2.4804177 = boost
                5.389733 = idf(docFreq=550, maxDocs=44421)
                0.0069177956 = queryNorm
              0.42107287 = fieldWeight in 306, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.389733 = idf(docFreq=550, maxDocs=44421)
                0.078125 = fieldNorm(doc=306)
          0.039219685 = weight(abstract_txt:original in 306) [ClassicSimilarity], result of:
            0.039219685 = score(doc=306,freq=1.0), product of:
              0.092921846 = queryWeight, product of:
                2.486302 = boost
                5.4025183 = idf(docFreq=543, maxDocs=44421)
                0.0069177956 = queryNorm
              0.42207175 = fieldWeight in 306, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4025183 = idf(docFreq=543, maxDocs=44421)
                0.078125 = fieldNorm(doc=306)
          1.199923 = weight(abstract_txt:stopword in 306) [ClassicSimilarity], result of:
            1.199923 = score(doc=306,freq=3.0), product of:
              0.90897244 = queryWeight, product of:
                13.468863 = boost
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.0069177956 = queryNorm
              1.3200874 = fieldWeight in 306, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.078125 = fieldNorm(doc=306)
        0.16 = coord(4/25)
    
  2. Lazarinis, F.: Engineering and utilizing a stopword list in Greek Web retrieval (2007) 0.20
    0.20328054 = sum of:
      0.20328054 = product of:
        1.6940045 = sum of:
          0.18801013 = weight(abstract_txt:stopwords in 1587) [ClassicSimilarity], result of:
            0.18801013 = score(doc=1587,freq=2.0), product of:
              0.14737657 = queryWeight, product of:
                2.2140844 = boost
                9.622026 = idf(docFreq=7, maxDocs=44421)
                0.0069177956 = queryNorm
              1.2757125 = fieldWeight in 1587, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.622026 = idf(docFreq=7, maxDocs=44421)
                0.09375 = fieldNorm(doc=1587)
          0.06608658 = weight(abstract_txt:list in 1587) [ClassicSimilarity], result of:
            0.06608658 = score(doc=1587,freq=2.0), product of:
              0.09248255 = queryWeight, product of:
                2.4804177 = boost
                5.389733 = idf(docFreq=550, maxDocs=44421)
                0.0069177956 = queryNorm
              0.71458435 = fieldWeight in 1587, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.389733 = idf(docFreq=550, maxDocs=44421)
                0.09375 = fieldNorm(doc=1587)
          1.4399078 = weight(abstract_txt:stopword in 1587) [ClassicSimilarity], result of:
            1.4399078 = score(doc=1587,freq=3.0), product of:
              0.90897244 = queryWeight, product of:
                13.468863 = boost
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.0069177956 = queryNorm
              1.584105 = fieldWeight in 1587, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.09375 = fieldNorm(doc=1587)
        0.12 = coord(3/25)
    
  3. Stamatatos, E.: Plagiarism detection using stopword n-grams (2011) 0.14
    0.14107578 = sum of:
      0.14107578 = product of:
        0.8817236 = sum of:
          0.11078603 = weight(abstract_txt:stopwords in 955) [ClassicSimilarity], result of:
            0.11078603 = score(doc=955,freq=1.0), product of:
              0.14737657 = queryWeight, product of:
                2.2140844 = boost
                9.622026 = idf(docFreq=7, maxDocs=44421)
                0.0069177956 = queryNorm
              0.7517208 = fieldWeight in 955, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.622026 = idf(docFreq=7, maxDocs=44421)
                0.078125 = fieldNorm(doc=955)
          0.038941894 = weight(abstract_txt:list in 955) [ClassicSimilarity], result of:
            0.038941894 = score(doc=955,freq=1.0), product of:
              0.09248255 = queryWeight, product of:
                2.4804177 = boost
                5.389733 = idf(docFreq=550, maxDocs=44421)
                0.0069177956 = queryNorm
              0.42107287 = fieldWeight in 955, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.389733 = idf(docFreq=550, maxDocs=44421)
                0.078125 = fieldNorm(doc=955)
          0.039219685 = weight(abstract_txt:original in 955) [ClassicSimilarity], result of:
            0.039219685 = score(doc=955,freq=1.0), product of:
              0.092921846 = queryWeight, product of:
                2.486302 = boost
                5.4025183 = idf(docFreq=543, maxDocs=44421)
                0.0069177956 = queryNorm
              0.42207175 = fieldWeight in 955, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4025183 = idf(docFreq=543, maxDocs=44421)
                0.078125 = fieldNorm(doc=955)
          0.69277596 = weight(abstract_txt:stopword in 955) [ClassicSimilarity], result of:
            0.69277596 = score(doc=955,freq=1.0), product of:
              0.90897244 = queryWeight, product of:
                13.468863 = boost
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.0069177956 = queryNorm
              0.7621529 = fieldWeight in 955, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.078125 = fieldNorm(doc=955)
        0.16 = coord(4/25)
    
  4. Ekmekcioglu, F.C.; Lynch, M.F.; Willet, P.: Development and evaluation of conflation techniques for the implementation of a document retrieval system for Turkish text databases (1995) 0.07
    0.07024491 = sum of:
      0.07024491 = product of:
        0.8780614 = sum of:
          0.046730276 = weight(abstract_txt:list in 5865) [ClassicSimilarity], result of:
            0.046730276 = score(doc=5865,freq=1.0), product of:
              0.09248255 = queryWeight, product of:
                2.4804177 = boost
                5.389733 = idf(docFreq=550, maxDocs=44421)
                0.0069177956 = queryNorm
              0.50528747 = fieldWeight in 5865, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.389733 = idf(docFreq=550, maxDocs=44421)
                0.09375 = fieldNorm(doc=5865)
          0.83133113 = weight(abstract_txt:stopword in 5865) [ClassicSimilarity], result of:
            0.83133113 = score(doc=5865,freq=1.0), product of:
              0.90897244 = queryWeight, product of:
                13.468863 = boost
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.0069177956 = queryNorm
              0.91458344 = fieldWeight in 5865, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.09375 = fieldNorm(doc=5865)
        0.08 = coord(2/25)
    
  5. Can, F.; Kocberber, S.; Balcik, E.; Kaynak, C.; Ocalan, H.C.: Information retrieval on Turkish texts (2008) 0.07
    0.07024491 = sum of:
      0.07024491 = product of:
        0.8780614 = sum of:
          0.046730276 = weight(abstract_txt:list in 2373) [ClassicSimilarity], result of:
            0.046730276 = score(doc=2373,freq=1.0), product of:
              0.09248255 = queryWeight, product of:
                2.4804177 = boost
                5.389733 = idf(docFreq=550, maxDocs=44421)
                0.0069177956 = queryNorm
              0.50528747 = fieldWeight in 2373, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.389733 = idf(docFreq=550, maxDocs=44421)
                0.09375 = fieldNorm(doc=2373)
          0.83133113 = weight(abstract_txt:stopword in 2373) [ClassicSimilarity], result of:
            0.83133113 = score(doc=2373,freq=1.0), product of:
              0.90897244 = queryWeight, product of:
                13.468863 = boost
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.0069177956 = queryNorm
              0.91458344 = fieldWeight in 2373, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.09375 = fieldNorm(doc=2373)
        0.08 = coord(2/25)
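
For the content matches, the per-term scores are summed and then scaled by the coord factor (matched query terms over the 25 query terms). The sketch below recomputes the first hit (doc 306) under the same assumed ClassicSimilarity formulas; the helper name and the dictionary layout are hypothetical, and every number is copied from the explain output above.

```python
import math

MAX_DOCS = 44421
QUERY_NORM = 0.0069177956
FIELD_NORM = 0.078125  # fieldNorm(doc=306), taken as-is from the output


def classic_term_score(freq: float, doc_freq: int, boost: float) -> float:
    # Assumed formulas: idf = 1 + ln(maxDocs / (docFreq + 1)), tf = sqrt(freq).
    idf = 1.0 + math.log(MAX_DOCS / (doc_freq + 1))
    query_weight = boost * idf * QUERY_NORM
    field_weight = math.sqrt(freq) * idf * FIELD_NORM
    return query_weight * field_weight


# term -> (freq in doc 306, docFreq, boost)
matched_terms = {
    "drawn": (1.0, 238, 1.4324096),
    "list": (1.0, 550, 2.4804177),
    "original": (1.0, 543, 2.486302),
    "stopword": (3.0, 6, 13.468863),
}

term_scores = {
    term: classic_term_score(*params) for term, params in matched_terms.items()
}
raw_sum = sum(term_scores.values())
coord = len(matched_terms) / 25  # coord(4/25)
final_score = raw_sum * coord

for term, score in term_scores.items():
    print(f"{term:10s} {score:.6f}")
print(f"sum   = {raw_sum:.6f}")
print(f"final = {final_score:.6f}")
```

The rare, heavily boosted term "stopword" contributes most of the sum, which matches the ranking shown: the documents that mention it most often come first.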