Document (#3630)

Author
Johnson, B.
Peterson, E.
Title
Reviewing initial stopword selection
Source
Information technology and libraries. 11(1992) no.2, p.136-139
Year
1992
Abstract
Five years after a stopword list was drawn up for the online catalogue at the Montana State University Libraries in Bozeman, it was found that the database had changed sufficiently to require a reevaluation of the stopword list. Tables present the original soft stopwords, the original soft stopword occurrences, all stopwords occurring more than 4,000 times, and the increase in stopword occurrence over the 5 years.

Similar documents (author)

  1. Peterson, I.C.: Effective question negotiation in the reference interview (1997) 2.37
    2.3709164 = sum of:
      2.3709164 = product of:
        4.7418327 = sum of:
          4.7418327 = weight(author_txt:peterson in 1613) [ClassicSimilarity], result of:
            4.7418327 = score(doc=1613,freq=1.0), product of:
              0.83761036 = queryWeight, product of:
                1.2382778 = boost
                9.05783 = idf(docFreq=13, maxDocs=44218)
                0.074679226 = queryNorm
              5.661144 = fieldWeight in 1613, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.05783 = idf(docFreq=13, maxDocs=44218)
                0.625 = fieldNorm(doc=1613)
        0.5 = coord(1/2)
    
  2. Peterson, R.E.: Eight Internet search engines compared (1996) 2.37
    2.3709164 = sum of:
      2.3709164 = product of:
        4.7418327 = sum of:
          4.7418327 = weight(author_txt:peterson in 2328) [ClassicSimilarity], result of:
            4.7418327 = score(doc=2328,freq=1.0), product of:
              0.83761036 = queryWeight, product of:
                1.2382778 = boost
                9.05783 = idf(docFreq=13, maxDocs=44218)
                0.074679226 = queryNorm
              5.661144 = fieldWeight in 2328, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.05783 = idf(docFreq=13, maxDocs=44218)
                0.625 = fieldNorm(doc=2328)
        0.5 = coord(1/2)
    
  3. Peterson, B.J.: Knowledge management : 3M perspective (1997) 2.37
    2.3709164 = sum of:
      2.3709164 = product of:
        4.7418327 = sum of:
          4.7418327 = weight(author_txt:peterson in 3121) [ClassicSimilarity], result of:
            4.7418327 = score(doc=3121,freq=1.0), product of:
              0.83761036 = queryWeight, product of:
                1.2382778 = boost
                9.05783 = idf(docFreq=13, maxDocs=44218)
                0.074679226 = queryNorm
              5.661144 = fieldWeight in 3121, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.05783 = idf(docFreq=13, maxDocs=44218)
                0.625 = fieldNorm(doc=3121)
        0.5 = coord(1/2)
    
  4. Peterson, E.: Parallel systems : the coexistence of subject cataloging and folksonomy (2008) 2.37
    2.3709164 = sum of:
      2.3709164 = product of:
        4.7418327 = sum of:
          4.7418327 = weight(author_txt:peterson in 251) [ClassicSimilarity], result of:
            4.7418327 = score(doc=251,freq=1.0), product of:
              0.83761036 = queryWeight, product of:
                1.2382778 = boost
                9.05783 = idf(docFreq=13, maxDocs=44218)
                0.074679226 = queryNorm
              5.661144 = fieldWeight in 251, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.05783 = idf(docFreq=13, maxDocs=44218)
                0.625 = fieldNorm(doc=251)
        0.5 = coord(1/2)
    
  5. Peterson, G.M.: Characteristics of retracted open access biomedical literature : a bibliographic analysis (2013) 2.37
    2.3709164 = sum of:
      2.3709164 = product of:
        4.7418327 = sum of:
          4.7418327 = weight(author_txt:peterson in 1138) [ClassicSimilarity], result of:
            4.7418327 = score(doc=1138,freq=1.0), product of:
              0.83761036 = queryWeight, product of:
                1.2382778 = boost
                9.05783 = idf(docFreq=13, maxDocs=44218)
                0.074679226 = queryNorm
              5.661144 = fieldWeight in 1138, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.05783 = idf(docFreq=13, maxDocs=44218)
                0.625 = fieldNorm(doc=1138)
        0.5 = coord(1/2)
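
The five author matches above share one score tree, so a single worked calculation covers all of them. The sketch below is a minimal reconstruction in Python, not the search engine's own code. The formulas idf = 1 + ln(maxDocs / (docFreq + 1)) and tf = sqrt(freq) are assumptions inferred from the numbers in the listing; boost, queryNorm and fieldNorm are copied from the entry for doc 1613 rather than derived. The function and variable names are hypothetical.

```python
import math


def classic_idf(doc_freq: int, max_docs: int) -> float:
    """Inverse document frequency, assumed form: 1 + ln(maxDocs / (docFreq + 1))."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def classic_tf(freq: float) -> float:
    """Term frequency factor, assumed form: square root of the raw frequency."""
    return math.sqrt(freq)


def term_score(boost, query_norm, doc_freq, max_docs, freq, field_norm):
    """Score of one term in one document: queryWeight * fieldWeight."""
    idf = classic_idf(doc_freq, max_docs)
    query_weight = boost * idf * query_norm           # listed as 0.83761036
    field_weight = classic_tf(freq) * idf * field_norm  # listed as 5.661144
    return query_weight * field_weight


# Values copied from the tree for author_txt:peterson in doc 1613.
raw = term_score(
    boost=1.2382778,
    query_norm=0.074679226,
    doc_freq=13,
    max_docs=44218,
    freq=1.0,
    field_norm=0.625,
)

# coord(1/2): one of the two author clauses (Johnson, Peterson) matched.
coord = 1 / 2
final_score = raw * coord

print(f"raw={raw:.5f} final={final_score:.5f}")
```

Because this runs in double precision while the listing shows single-precision floats, the results should agree with the listed 4.7418327 and 2.3709164 only up to rounding in the last digits.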
    

Similar documents (content)

  1. Dolamic, L.; Savoy, J.: When stopword lists make the difference (2009) 0.21
    0.20916994 = sum of:
      0.20916994 = product of:
        1.3073121 = sum of:
          0.030136583 = weight(abstract_txt:drawn in 3319) [ClassicSimilarity], result of:
            0.030136583 = score(doc=3319,freq=1.0), product of:
              0.061887506 = queryWeight, product of:
                1.4332346 = boost
                6.2330556 = idf(docFreq=235, maxDocs=44218)
                0.0069276304 = queryNorm
              0.48695746 = fieldWeight in 3319, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2330556 = idf(docFreq=235, maxDocs=44218)
                0.078125 = fieldNorm(doc=3319)
          0.038870037 = weight(abstract_txt:list in 3319) [ClassicSimilarity], result of:
            0.038870037 = score(doc=3319,freq=1.0), product of:
              0.09239042 = queryWeight, product of:
                2.4765337 = boost
                5.3851523 = idf(docFreq=550, maxDocs=44218)
                0.0069276304 = queryNorm
              0.42071503 = fieldWeight in 3319, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.3851523 = idf(docFreq=550, maxDocs=44218)
                0.078125 = fieldNorm(doc=3319)
          0.03922775 = weight(abstract_txt:original in 3319) [ClassicSimilarity], result of:
            0.03922775 = score(doc=3319,freq=1.0), product of:
              0.09295639 = queryWeight, product of:
                2.4841075 = boost
                5.4016213 = idf(docFreq=541, maxDocs=44218)
                0.0069276304 = queryNorm
              0.42200166 = fieldWeight in 3319, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4016213 = idf(docFreq=541, maxDocs=44218)
                0.078125 = fieldNorm(doc=3319)
          1.1990777 = weight(abstract_txt:stopword in 3319) [ClassicSimilarity], result of:
            1.1990777 = score(doc=3319,freq=3.0), product of:
              0.9087586 = queryWeight, product of:
                13.452892 = boost
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.0069276304 = queryNorm
              1.3194679 = fieldWeight in 3319, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.078125 = fieldNorm(doc=3319)
        0.16 = coord(4/25)
    
  2. Lazarinis, F.: Engineering and utilizing a stopword list in Greek Web retrieval (2007) 0.20
    0.2031278 = sum of:
      0.2031278 = product of:
        1.6927317 = sum of:
          0.18787393 = weight(abstract_txt:stopwords in 587) [ClassicSimilarity], result of:
            0.18787393 = score(doc=587,freq=2.0), product of:
              0.14733994 = queryWeight, product of:
                2.2114444 = boost
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.0069276304 = queryNorm
              1.2751052 = fieldWeight in 587, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.09375 = fieldNorm(doc=587)
          0.06596464 = weight(abstract_txt:list in 587) [ClassicSimilarity], result of:
            0.06596464 = score(doc=587,freq=2.0), product of:
              0.09239042 = queryWeight, product of:
                2.4765337 = boost
                5.3851523 = idf(docFreq=550, maxDocs=44218)
                0.0069276304 = queryNorm
              0.7139771 = fieldWeight in 587, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.3851523 = idf(docFreq=550, maxDocs=44218)
                0.09375 = fieldNorm(doc=587)
          1.4388932 = weight(abstract_txt:stopword in 587) [ClassicSimilarity], result of:
            1.4388932 = score(doc=587,freq=3.0), product of:
              0.9087586 = queryWeight, product of:
                13.452892 = boost
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.0069276304 = queryNorm
              1.5833614 = fieldWeight in 587, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.09375 = fieldNorm(doc=587)
        0.12 = coord(3/25)
    
  3. Stamatatos, E.: Plagiarism detection using stopword n-grams (2011) 0.14
    0.14097463 = sum of:
      0.14097463 = product of:
        0.8810914 = sum of:
          0.11070578 = weight(abstract_txt:stopwords in 4955) [ClassicSimilarity], result of:
            0.11070578 = score(doc=4955,freq=1.0), product of:
              0.14733994 = queryWeight, product of:
                2.2114444 = boost
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.0069276304 = queryNorm
              0.751363 = fieldWeight in 4955, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.078125 = fieldNorm(doc=4955)
          0.038870037 = weight(abstract_txt:list in 4955) [ClassicSimilarity], result of:
            0.038870037 = score(doc=4955,freq=1.0), product of:
              0.09239042 = queryWeight, product of:
                2.4765337 = boost
                5.3851523 = idf(docFreq=550, maxDocs=44218)
                0.0069276304 = queryNorm
              0.42071503 = fieldWeight in 4955, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.3851523 = idf(docFreq=550, maxDocs=44218)
                0.078125 = fieldNorm(doc=4955)
          0.03922775 = weight(abstract_txt:original in 4955) [ClassicSimilarity], result of:
            0.03922775 = score(doc=4955,freq=1.0), product of:
              0.09295639 = queryWeight, product of:
                2.4841075 = boost
                5.4016213 = idf(docFreq=541, maxDocs=44218)
                0.0069276304 = queryNorm
              0.42200166 = fieldWeight in 4955, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4016213 = idf(docFreq=541, maxDocs=44218)
                0.078125 = fieldNorm(doc=4955)
          0.69228786 = weight(abstract_txt:stopword in 4955) [ClassicSimilarity], result of:
            0.69228786 = score(doc=4955,freq=1.0), product of:
              0.9087586 = queryWeight, product of:
                13.452892 = boost
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.0069276304 = queryNorm
              0.7617951 = fieldWeight in 4955, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.078125 = fieldNorm(doc=4955)
        0.16 = coord(4/25)
    
  4. Ekmekcioglu, F.C.; Lynch, M.F.; Willet, P.: Development and evaluation of conflation techniques for the implementation of a document retrieval system for Turkish text databases (1995) 0.07
    0.07019116 = sum of:
      0.07019116 = product of:
        0.8773895 = sum of:
          0.046644043 = weight(abstract_txt:list in 5797) [ClassicSimilarity], result of:
            0.046644043 = score(doc=5797,freq=1.0), product of:
              0.09239042 = queryWeight, product of:
                2.4765337 = boost
                5.3851523 = idf(docFreq=550, maxDocs=44218)
                0.0069276304 = queryNorm
              0.504858 = fieldWeight in 5797, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.3851523 = idf(docFreq=550, maxDocs=44218)
                0.09375 = fieldNorm(doc=5797)
          0.83074546 = weight(abstract_txt:stopword in 5797) [ClassicSimilarity], result of:
            0.83074546 = score(doc=5797,freq=1.0), product of:
              0.9087586 = queryWeight, product of:
                13.452892 = boost
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.0069276304 = queryNorm
              0.9141542 = fieldWeight in 5797, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.09375 = fieldNorm(doc=5797)
        0.08 = coord(2/25)
    
  5. Can, F.; Kocberber, S.; Balcik, E.; Kaynak, C.; Ocalan, H.C.: Information retrieval on Turkish texts (2008) 0.07
    0.07019116 = sum of:
      0.07019116 = product of:
        0.8773895 = sum of:
          0.046644043 = weight(abstract_txt:list in 1373) [ClassicSimilarity], result of:
            0.046644043 = score(doc=1373,freq=1.0), product of:
              0.09239042 = queryWeight, product of:
                2.4765337 = boost
                5.3851523 = idf(docFreq=550, maxDocs=44218)
                0.0069276304 = queryNorm
              0.504858 = fieldWeight in 1373, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.3851523 = idf(docFreq=550, maxDocs=44218)
                0.09375 = fieldNorm(doc=1373)
          0.83074546 = weight(abstract_txt:stopword in 1373) [ClassicSimilarity], result of:
            0.83074546 = score(doc=1373,freq=1.0), product of:
              0.9087586 = queryWeight, product of:
                13.452892 = boost
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.0069276304 = queryNorm
              0.9141542 = fieldWeight in 1373, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.09375 = fieldNorm(doc=1373)
        0.08 = coord(2/25)
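
The content matches combine several term scores and then scale the sum by the coord factor. As a rough check, the sketch below recomputes the top hit (doc 3319) from the values in its tree. It assumes the same idf and tf forms that the listed numbers imply, namely idf = 1 + ln(maxDocs / (docFreq + 1)) and tf = sqrt(freq), and it takes the boosts, queryNorm, fieldNorm and the clause count of 25 directly from the listing. All names are hypothetical.

```python
import math

MAX_DOCS = 44218            # maxDocs, as listed
QUERY_NORM = 0.0069276304   # queryNorm shared by all abstract_txt terms
FIELD_NORM = 0.078125       # fieldNorm(doc=3319)
QUERY_CLAUSES = 25          # denominator of coord(4/25)


def term_score(boost: float, doc_freq: int, freq: float) -> float:
    """queryWeight * fieldWeight for one term, under the assumed tf/idf forms."""
    idf = 1.0 + math.log(MAX_DOCS / (doc_freq + 1))
    query_weight = boost * idf * QUERY_NORM
    field_weight = math.sqrt(freq) * idf * FIELD_NORM
    return query_weight * field_weight


# term -> (boost, docFreq, termFreq), copied from the tree for doc 3319
matched_terms = {
    "drawn": (1.4332346, 235, 1.0),
    "list": (2.4765337, 550, 1.0),
    "original": (2.4841075, 541, 1.0),
    "stopword": (13.452892, 6, 3.0),
}

partial = {term: term_score(*args) for term, args in matched_terms.items()}

# coord rewards documents that match more of the query clauses.
coord = len(partial) / QUERY_CLAUSES
total = sum(partial.values()) * coord

for term, score in partial.items():
    print(f"{term:10s} {score:.6f}")
print(f"coord={coord:.2f} total={total:.6f}")
```

The term "stopword" dominates the sum because it has both the largest boost and the highest idf, which is why all five content matches are ranked mainly by how that one term scores.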