Document (#38163)

Author
Landauer, T.K.
Foltz, P.W.
Laham, D.
Title
¬An introduction to Latent Semantic Analysis
Source
Discourse Processes. 25(1998), S.259-284. [http://lsa.colorado.edu/papers/dp1.LSAintro.pdf]
Year
1998
Abstract
Latent Semantic Analysis (LSA) is a theory and method for extracting and representing the contextual-usage meaning of words by statistical computations applied to a large corpus of text (Landauer and Dumais, 1997). The underlying idea is that the aggregate of all the word contexts in which a given word does and does not appear provides a set of mutual constraints that largely determines the similarity of meaning of words and sets of words to each other. The adequacy of LSA's reflection of human knowledge has been established in a variety of ways. For example, its scores overlap those of humans on standard vocabulary and subject matter tests; it mimics human word sorting and category judgments; it simulates word-word and passage-word lexical priming data; and as reported in 3 following articles in this issue, it accurately estimates passage coherence, learnability of passages by individual students, and the quality and quantity of knowledge contained in an essay.
Theme
Semantisches Umfeld in Indexierung u. Retrieval
Object
Latent Semantic Indexing

Similar documents (author)

  1. Furnas, G.W.; Landauer, T.K.: Describing categories of objects for menu retrieval systems (1984) 4.88
    4.8777785 = sum of:
      4.8777785 = weight(author_txt:landauer in 6575) [ClassicSimilarity], result of:
        4.8777785 = fieldWeight in 6575, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.755557 = idf(docFreq=6, maxDocs=44421)
          0.5 = fieldNorm(doc=6575)
    
  2. Gomez, L.; Lochbaum, C.C.; Landauer, T.K.: All the right words: finding what you want as an function of richness of indexing vocabulary (1990) 3.66
    3.6583338 = sum of:
      3.6583338 = weight(author_txt:landauer in 154) [ClassicSimilarity], result of:
        3.6583338 = fieldWeight in 154, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.755557 = idf(docFreq=6, maxDocs=44421)
          0.375 = fieldNorm(doc=154)
    
  3. Furnas, G.W.; Landauer, T.K.; Gomez, L.M.; Dumais, S.T.: ¬The vocabulary problem in human-system communication (1987) 3.05
    3.0486116 = sum of:
      3.0486116 = weight(author_txt:landauer in 7628) [ClassicSimilarity], result of:
        3.0486116 = fieldWeight in 7628, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.755557 = idf(docFreq=6, maxDocs=44421)
          0.3125 = fieldNorm(doc=7628)
    
  4. Deerwester, S.; Dumais, S.; Landauer, T.; Furnass, G.; Beck, L.: Improving information retrieval with latent semantic indexing (1988) 3.05
    3.0486116 = sum of:
      3.0486116 = weight(author_txt:landauer in 3396) [ClassicSimilarity], result of:
        3.0486116 = fieldWeight in 3396, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.755557 = idf(docFreq=6, maxDocs=44421)
          0.3125 = fieldNorm(doc=3396)
    
  5. Deerwester, S.C.; Dumais, S.T.; Landauer, T.K.; Furnas, G.W.; Harshman, R.A.: Indexing by latent semantic analysis (1990) 3.05
    3.0486116 = sum of:
      3.0486116 = weight(author_txt:landauer in 3399) [ClassicSimilarity], result of:
        3.0486116 = fieldWeight in 3399, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.755557 = idf(docFreq=6, maxDocs=44421)
          0.3125 = fieldNorm(doc=3399)
    

Similar documents (content)

  1. Jorge-Botana, G.; León, J.A.; Olmos, R.; Hassan-Montero, Y.: Visualizing polysemy using LSA and the predication algorithm (2010) 0.19
    0.19159411 = sum of:
      0.19159411 = product of:
        0.68426466 = sum of:
          0.022832979 = weight(abstract_txt:analysis in 683) [ClassicSimilarity], result of:
            0.022832979 = score(doc=683,freq=2.0), product of:
              0.07085811 = queryWeight, product of:
                1.0494356 = boost
                3.6456752 = idf(docFreq=3151, maxDocs=44421)
                0.018520633 = queryNorm
              0.3222352 = fieldWeight in 683, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.6456752 = idf(docFreq=3151, maxDocs=44421)
                0.0625 = fieldNorm(doc=683)
          0.029850831 = weight(abstract_txt:semantic in 683) [ClassicSimilarity], result of:
            0.029850831 = score(doc=683,freq=1.0), product of:
              0.1067404 = queryWeight, product of:
                1.2880284 = boost
                4.4745317 = idf(docFreq=1375, maxDocs=44421)
                0.018520633 = queryNorm
              0.27965823 = fieldWeight in 683, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4745317 = idf(docFreq=1375, maxDocs=44421)
                0.0625 = fieldNorm(doc=683)
          0.04834167 = weight(abstract_txt:human in 683) [ClassicSimilarity], result of:
            0.04834167 = score(doc=683,freq=2.0), product of:
              0.11683214 = queryWeight, product of:
                1.3475416 = boost
                4.681277 = idf(docFreq=1118, maxDocs=44421)
                0.018520633 = queryNorm
              0.41377032 = fieldWeight in 683, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.681277 = idf(docFreq=1118, maxDocs=44421)
                0.0625 = fieldNorm(doc=683)
          0.08231218 = weight(abstract_txt:meaning in 683) [ClassicSimilarity], result of:
            0.08231218 = score(doc=683,freq=2.0), product of:
              0.16659321 = queryWeight, product of:
                1.6091245 = boost
                5.59 = idf(docFreq=450, maxDocs=44421)
                0.018520633 = queryNorm
              0.49409086 = fieldWeight in 683, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.59 = idf(docFreq=450, maxDocs=44421)
                0.0625 = fieldNorm(doc=683)
          0.113895446 = weight(abstract_txt:latent in 683) [ClassicSimilarity], result of:
            0.113895446 = score(doc=683,freq=1.0), product of:
              0.26063263 = queryWeight, product of:
                2.0126832 = boost
                6.9919376 = idf(docFreq=110, maxDocs=44421)
                0.018520633 = queryNorm
              0.4369961 = fieldWeight in 683, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9919376 = idf(docFreq=110, maxDocs=44421)
                0.0625 = fieldNorm(doc=683)
          0.10859269 = weight(abstract_txt:words in 683) [ClassicSimilarity], result of:
            0.10859269 = score(doc=683,freq=2.0), product of:
              0.22939223 = queryWeight, product of:
                2.3125758 = boost
                5.355831 = idf(docFreq=569, maxDocs=44421)
                0.018520633 = queryNorm
              0.47339305 = fieldWeight in 683, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.355831 = idf(docFreq=569, maxDocs=44421)
                0.0625 = fieldNorm(doc=683)
          0.27843887 = weight(abstract_txt:word in 683) [ClassicSimilarity], result of:
            0.27843887 = score(doc=683,freq=3.0), product of:
              0.47298184 = queryWeight, product of:
                4.696171 = boost
                5.4380693 = idf(docFreq=524, maxDocs=44421)
                0.018520633 = queryNorm
              0.58868825 = fieldWeight in 683, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.4380693 = idf(docFreq=524, maxDocs=44421)
                0.0625 = fieldNorm(doc=683)
        0.28 = coord(7/25)
    
  2. Dumais, S.T.: Latent semantic analysis (2003) 0.18
    0.17706898 = sum of:
      0.17706898 = product of:
        0.49185827 = sum of:
          0.010508945 = weight(abstract_txt:knowledge in 3462) [ClassicSimilarity], result of:
            0.010508945 = score(doc=3462,freq=2.0), product of:
              0.067051314 = queryWeight, product of:
                1.0208564 = boost
                3.5463927 = idf(docFreq=3480, maxDocs=44421)
                0.018520633 = queryNorm
              0.15672989 = fieldWeight in 3462, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5463927 = idf(docFreq=3480, maxDocs=44421)
                0.03125 = fieldNorm(doc=3462)
          0.016145354 = weight(abstract_txt:analysis in 3462) [ClassicSimilarity], result of:
            0.016145354 = score(doc=3462,freq=4.0), product of:
              0.07085811 = queryWeight, product of:
                1.0494356 = boost
                3.6456752 = idf(docFreq=3151, maxDocs=44421)
                0.018520633 = queryNorm
              0.2278547 = fieldWeight in 3462, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.6456752 = idf(docFreq=3151, maxDocs=44421)
                0.03125 = fieldNorm(doc=3462)
          0.047073156 = weight(abstract_txt:passages in 3462) [ClassicSimilarity], result of:
            0.047073156 = score(doc=3462,freq=1.0), product of:
              0.18220071 = queryWeight, product of:
                1.189929 = boost
                8.267481 = idf(docFreq=30, maxDocs=44421)
                0.018520633 = queryNorm
              0.25835878 = fieldWeight in 3462, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.267481 = idf(docFreq=30, maxDocs=44421)
                0.03125 = fieldNorm(doc=3462)
          0.025851578 = weight(abstract_txt:semantic in 3462) [ClassicSimilarity], result of:
            0.025851578 = score(doc=3462,freq=3.0), product of:
              0.1067404 = queryWeight, product of:
                1.2880284 = boost
                4.4745317 = idf(docFreq=1375, maxDocs=44421)
                0.018520633 = queryNorm
              0.24219112 = fieldWeight in 3462, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.4745317 = idf(docFreq=1375, maxDocs=44421)
                0.03125 = fieldNorm(doc=3462)
          0.029603105 = weight(abstract_txt:human in 3462) [ClassicSimilarity], result of:
            0.029603105 = score(doc=3462,freq=3.0), product of:
              0.11683214 = queryWeight, product of:
                1.3475416 = boost
                4.681277 = idf(docFreq=1118, maxDocs=44421)
                0.018520633 = queryNorm
              0.25338152 = fieldWeight in 3462, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.681277 = idf(docFreq=1118, maxDocs=44421)
                0.03125 = fieldNorm(doc=3462)
          0.033510655 = weight(abstract_txt:does in 3462) [ClassicSimilarity], result of:
            0.033510655 = score(doc=3462,freq=2.0), product of:
              0.1452635 = queryWeight, product of:
                1.5025856 = boost
                5.2198906 = idf(docFreq=652, maxDocs=44421)
                0.018520633 = queryNorm
              0.23068875 = fieldWeight in 3462, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.2198906 = idf(docFreq=652, maxDocs=44421)
                0.03125 = fieldNorm(doc=3462)
          0.056947723 = weight(abstract_txt:latent in 3462) [ClassicSimilarity], result of:
            0.056947723 = score(doc=3462,freq=1.0), product of:
              0.26063263 = queryWeight, product of:
                2.0126832 = boost
                6.9919376 = idf(docFreq=110, maxDocs=44421)
                0.018520633 = queryNorm
              0.21849805 = fieldWeight in 3462, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9919376 = idf(docFreq=110, maxDocs=44421)
                0.03125 = fieldNorm(doc=3462)
          0.13299833 = weight(abstract_txt:words in 3462) [ClassicSimilarity], result of:
            0.13299833 = score(doc=3462,freq=12.0), product of:
              0.22939223 = queryWeight, product of:
                2.3125758 = boost
                5.355831 = idf(docFreq=569, maxDocs=44421)
                0.018520633 = queryNorm
              0.5797857 = fieldWeight in 3462, product of:
                3.4641016 = tf(freq=12.0), with freq of:
                  12.0 = termFreq=12.0
                5.355831 = idf(docFreq=569, maxDocs=44421)
                0.03125 = fieldNorm(doc=3462)
          0.13921943 = weight(abstract_txt:word in 3462) [ClassicSimilarity], result of:
            0.13921943 = score(doc=3462,freq=3.0), product of:
              0.47298184 = queryWeight, product of:
                4.696171 = boost
                5.4380693 = idf(docFreq=524, maxDocs=44421)
                0.018520633 = queryNorm
              0.29434413 = fieldWeight in 3462, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.4380693 = idf(docFreq=524, maxDocs=44421)
                0.03125 = fieldNorm(doc=3462)
        0.36 = coord(9/25)
    
  3. Leydesdorff, L.; Zhou, P.: Co-word analysis using the Chinese character set (2008) 0.15
    0.15107875 = sum of:
      0.15107875 = product of:
        0.75539374 = sum of:
          0.02421803 = weight(abstract_txt:analysis in 2970) [ClassicSimilarity], result of:
            0.02421803 = score(doc=2970,freq=1.0), product of:
              0.07085811 = queryWeight, product of:
                1.0494356 = boost
                3.6456752 = idf(docFreq=3151, maxDocs=44421)
                0.018520633 = queryNorm
              0.34178203 = fieldWeight in 2970, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6456752 = idf(docFreq=3151, maxDocs=44421)
                0.09375 = fieldNorm(doc=2970)
          0.06332318 = weight(abstract_txt:semantic in 2970) [ClassicSimilarity], result of:
            0.06332318 = score(doc=2970,freq=2.0), product of:
              0.1067404 = queryWeight, product of:
                1.2880284 = boost
                4.4745317 = idf(docFreq=1375, maxDocs=44421)
                0.018520633 = queryNorm
              0.5932447 = fieldWeight in 2970, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.4745317 = idf(docFreq=1375, maxDocs=44421)
                0.09375 = fieldNorm(doc=2970)
          0.087305255 = weight(abstract_txt:meaning in 2970) [ClassicSimilarity], result of:
            0.087305255 = score(doc=2970,freq=1.0), product of:
              0.16659321 = queryWeight, product of:
                1.6091245 = boost
                5.59 = idf(docFreq=450, maxDocs=44421)
                0.018520633 = queryNorm
              0.5240625 = fieldWeight in 2970, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.59 = idf(docFreq=450, maxDocs=44421)
                0.09375 = fieldNorm(doc=2970)
          0.16288903 = weight(abstract_txt:words in 2970) [ClassicSimilarity], result of:
            0.16288903 = score(doc=2970,freq=2.0), product of:
              0.22939223 = queryWeight, product of:
                2.3125758 = boost
                5.355831 = idf(docFreq=569, maxDocs=44421)
                0.018520633 = queryNorm
              0.71008956 = fieldWeight in 2970, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.355831 = idf(docFreq=569, maxDocs=44421)
                0.09375 = fieldNorm(doc=2970)
          0.41765827 = weight(abstract_txt:word in 2970) [ClassicSimilarity], result of:
            0.41765827 = score(doc=2970,freq=3.0), product of:
              0.47298184 = queryWeight, product of:
                4.696171 = boost
                5.4380693 = idf(docFreq=524, maxDocs=44421)
                0.018520633 = queryNorm
              0.8830324 = fieldWeight in 2970, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.4380693 = idf(docFreq=524, maxDocs=44421)
                0.09375 = fieldNorm(doc=2970)
        0.2 = coord(5/25)
    
  4. Rishel, T.; Perkins, L.A.; Yenduri, S.; Zand, F.: Determining the context of text using augmented latent semantic indexing (2007) 0.14
    0.1364476 = sum of:
      0.1364476 = product of:
        0.6822379 = sum of:
          0.040363386 = weight(abstract_txt:analysis in 2316) [ClassicSimilarity], result of:
            0.040363386 = score(doc=2316,freq=4.0), product of:
              0.07085811 = queryWeight, product of:
                1.0494356 = boost
                3.6456752 = idf(docFreq=3151, maxDocs=44421)
                0.018520633 = queryNorm
              0.56963676 = fieldWeight in 2316, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.6456752 = idf(docFreq=3151, maxDocs=44421)
                0.078125 = fieldNorm(doc=2316)
          0.08343561 = weight(abstract_txt:semantic in 2316) [ClassicSimilarity], result of:
            0.08343561 = score(doc=2316,freq=5.0), product of:
              0.1067404 = queryWeight, product of:
                1.2880284 = boost
                4.4745317 = idf(docFreq=1375, maxDocs=44421)
                0.018520633 = queryNorm
              0.7816685 = fieldWeight in 2316, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.4745317 = idf(docFreq=1375, maxDocs=44421)
                0.078125 = fieldNorm(doc=2316)
          0.07275438 = weight(abstract_txt:meaning in 2316) [ClassicSimilarity], result of:
            0.07275438 = score(doc=2316,freq=1.0), product of:
              0.16659321 = queryWeight, product of:
                1.6091245 = boost
                5.59 = idf(docFreq=450, maxDocs=44421)
                0.018520633 = queryNorm
              0.43671876 = fieldWeight in 2316, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.59 = idf(docFreq=450, maxDocs=44421)
                0.078125 = fieldNorm(doc=2316)
          0.2847386 = weight(abstract_txt:latent in 2316) [ClassicSimilarity], result of:
            0.2847386 = score(doc=2316,freq=4.0), product of:
              0.26063263 = queryWeight, product of:
                2.0126832 = boost
                6.9919376 = idf(docFreq=110, maxDocs=44421)
                0.018520633 = queryNorm
              1.0924902 = fieldWeight in 2316, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.9919376 = idf(docFreq=110, maxDocs=44421)
                0.078125 = fieldNorm(doc=2316)
          0.20094593 = weight(abstract_txt:word in 2316) [ClassicSimilarity], result of:
            0.20094593 = score(doc=2316,freq=1.0), product of:
              0.47298184 = queryWeight, product of:
                4.696171 = boost
                5.4380693 = idf(docFreq=524, maxDocs=44421)
                0.018520633 = queryNorm
              0.42484915 = fieldWeight in 2316, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4380693 = idf(docFreq=524, maxDocs=44421)
                0.078125 = fieldNorm(doc=2316)
        0.2 = coord(5/25)
    
  5. Hou, Y.; Pascale, A.; Carnerero-Cano, J.; Sattigeri, P.; Tchrakian, T.; Marinescu, R.; Daly, E.; Padhi, I.: WikiContradict : a benchmark for evaluating LLMs on real-world knowledge conflicts from Wikipedia (2024) 0.13
    0.13274233 = sum of:
      0.13274233 = product of:
        0.5530931 = sum of:
          0.018390654 = weight(abstract_txt:knowledge in 2368) [ClassicSimilarity], result of:
            0.018390654 = score(doc=2368,freq=2.0), product of:
              0.067051314 = queryWeight, product of:
                1.0208564 = boost
                3.5463927 = idf(docFreq=3480, maxDocs=44421)
                0.018520633 = queryNorm
              0.2742773 = fieldWeight in 2368, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5463927 = idf(docFreq=3480, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2368)
          0.052767627 = weight(abstract_txt:accurately in 2368) [ClassicSimilarity], result of:
            0.052767627 = score(doc=2368,freq=1.0), product of:
              0.1353903 = queryWeight, product of:
                1.0257459 = boost
                7.1267567 = idf(docFreq=96, maxDocs=44421)
                0.018520633 = queryNorm
              0.38974452 = fieldWeight in 2368, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.1267567 = idf(docFreq=96, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2368)
          0.06548 = weight(abstract_txt:estimates in 2368) [ClassicSimilarity], result of:
            0.06548 = score(doc=2368,freq=1.0), product of:
              0.15634415 = queryWeight, product of:
                1.102267 = boost
                7.6584163 = idf(docFreq=56, maxDocs=44421)
                0.018520633 = queryNorm
              0.41881964 = fieldWeight in 2368, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.6584163 = idf(docFreq=56, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2368)
          0.20178412 = weight(abstract_txt:passages in 2368) [ClassicSimilarity], result of:
            0.20178412 = score(doc=2368,freq=6.0), product of:
              0.18220071 = queryWeight, product of:
                1.189929 = boost
                8.267481 = idf(docFreq=30, maxDocs=44421)
                0.018520633 = queryNorm
              1.1074826 = fieldWeight in 2368, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                8.267481 = idf(docFreq=30, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2368)
          0.051805433 = weight(abstract_txt:human in 2368) [ClassicSimilarity], result of:
            0.051805433 = score(doc=2368,freq=3.0), product of:
              0.11683214 = queryWeight, product of:
                1.3475416 = boost
                4.681277 = idf(docFreq=1118, maxDocs=44421)
                0.018520633 = queryNorm
              0.44341767 = fieldWeight in 2368, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.681277 = idf(docFreq=1118, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2368)
          0.16286522 = weight(abstract_txt:passage in 2368) [ClassicSimilarity], result of:
            0.16286522 = score(doc=2368,freq=1.0), product of:
              0.36160806 = queryWeight, product of:
                2.370719 = boost
                8.235732 = idf(docFreq=31, maxDocs=44421)
                0.018520633 = queryNorm
              0.4503916 = fieldWeight in 2368, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.235732 = idf(docFreq=31, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2368)
        0.24 = coord(6/25)