Document (#21545)

Author
Lindsay, R.K.
Gordon, M.D.
Title
Literature-based discovery by lexical statistics
Source
Journal of the American Society for Information Science. 50(1999) no.7, p.574-587
Year
1999
Abstract
We report experiments that use lexical statistics, such as word frequency counts, to discover hidden connections in the medical literature. Hidden connections are those that are unlikely to be found by examination of bibliographic citations or the use of standard indexing methods and yet establish a relationship between topics that might profitably be explored by scientific research. Our experiments were conducted with the MEDLINE medical literature database and follow and extend the work of Swanson.
Theme
Informetrics
Field
Medicine
Object
Medline

Similar documents (author)

  1. Gordon, J.A.: Training in indexing : some recent development (1981) 5.66
    5.664006 = sum of:
      5.664006 = weight(author_txt:gordon in 6174) [ClassicSimilarity], result of:
        5.664006 = fieldWeight in 6174, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.06241 = idf(docFreq=13, maxDocs=44421)
          0.625 = fieldNorm(doc=6174)
    
  2. Gordon, M.: Training for indexing : a teacher's view (1987) 5.66
    5.664006 = sum of:
      5.664006 = weight(author_txt:gordon in 6175) [ClassicSimilarity], result of:
        5.664006 = fieldWeight in 6175, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.06241 = idf(docFreq=13, maxDocs=44421)
          0.625 = fieldNorm(doc=6175)
    
  3. Gordon, S.: Museums and the information superhighway (1995) 5.66
    5.664006 = sum of:
      5.664006 = weight(author_txt:gordon in 3912) [ClassicSimilarity], result of:
        5.664006 = fieldWeight in 3912, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.06241 = idf(docFreq=13, maxDocs=44421)
          0.625 = fieldNorm(doc=3912)
    
  4. Gordon, A.S.: Browsing image collections with representations of common-sense activities (2001) 5.66
    5.664006 = sum of:
      5.664006 = weight(author_txt:gordon in 530) [ClassicSimilarity], result of:
        5.664006 = fieldWeight in 530, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.06241 = idf(docFreq=13, maxDocs=44421)
          0.625 = fieldNorm(doc=530)
    
  5. Gordon, A.: The invisibility of science publications in Hebrew : a comparative database study (2012) 5.66
    5.664006 = sum of:
      5.664006 = weight(author_txt:gordon in 1079) [ClassicSimilarity], result of:
        5.664006 = fieldWeight in 1079, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.06241 = idf(docFreq=13, maxDocs=44421)
          0.625 = fieldNorm(doc=1079)
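
The author-match scores above all reduce to one product: tf × idf × fieldNorm. The following is a minimal sketch of that arithmetic, assuming the usual ClassicSimilarity definitions (tf as the square root of the term frequency, idf as 1 + ln(maxDocs / (docFreq + 1))). The function names are illustrative, not part of any library, and the fieldNorm value is copied from the listing rather than derived.

```python
import math


def classic_idf(doc_freq: int, max_docs: int) -> float:
    """Inverse document frequency, assuming the ClassicSimilarity formula."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def classic_tf(freq: float) -> float:
    """Term-frequency factor: square root of the raw frequency."""
    return math.sqrt(freq)


def field_weight(freq: float, doc_freq: int, max_docs: int, field_norm: float) -> float:
    """fieldWeight = tf * idf * fieldNorm, as shown in the explain tree."""
    return classic_tf(freq) * classic_idf(doc_freq, max_docs) * field_norm


# Values taken from the author_txt:gordon entries in the listing.
print(round(classic_idf(13, 44421), 4))               # 9.0624
print(round(field_weight(1.0, 13, 44421, 0.625), 4))  # 5.664
```

Because every listed document matches the single term "gordon" once and has the same fieldNorm, all five receive the same score, which is why the ranking among them is arbitrary.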
    

Similar documents (content)

  1. Srinivasan, P.: Text mining : generating hypotheses from MEDLINE (2004) 0.18
    0.181825 = sum of:
      0.181825 = product of:
        0.64937496 = sum of:
          0.043301687 = weight(abstract_txt:report in 3225) [ClassicSimilarity], result of:
            0.043301687 = score(doc=3225,freq=1.0), product of:
              0.10255826 = queryWeight, product of:
                1.0194576 = boost
                5.4043584 = idf(docFreq=542, maxDocs=44421)
                0.018614756 = queryNorm
              0.4222155 = fieldWeight in 3225, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4043584 = idf(docFreq=542, maxDocs=44421)
                0.078125 = fieldNorm(doc=3225)
          0.04861589 = weight(abstract_txt:discovery in 3225) [ClassicSimilarity], result of:
            0.04861589 = score(doc=3225,freq=1.0), product of:
              0.11078635 = queryWeight, product of:
                1.0595634 = boost
                5.616968 = idf(docFreq=438, maxDocs=44421)
                0.018614756 = queryNorm
              0.43882564 = fieldWeight in 3225, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.616968 = idf(docFreq=438, maxDocs=44421)
                0.078125 = fieldNorm(doc=3225)
          0.086396076 = weight(abstract_txt:medline in 3225) [ClassicSimilarity], result of:
            0.086396076 = score(doc=3225,freq=1.0), product of:
              0.1625412 = queryWeight, product of:
                1.2834104 = boost
                6.803628 = idf(docFreq=133, maxDocs=44421)
                0.018614756 = queryNorm
              0.5315334 = fieldWeight in 3225, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.803628 = idf(docFreq=133, maxDocs=44421)
                0.078125 = fieldNorm(doc=3225)
          0.015394528 = weight(abstract_txt:that in 3225) [ClassicSimilarity], result of:
            0.015394528 = score(doc=3225,freq=2.0), product of:
              0.0589172 = queryWeight, product of:
                1.3383371 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.018614756 = queryNorm
              0.2612909 = fieldWeight in 3225, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.078125 = fieldNorm(doc=3225)
          0.22777238 = weight(abstract_txt:swanson in 3225) [ClassicSimilarity], result of:
            0.22777238 = score(doc=3225,freq=1.0), product of:
              0.310195 = queryWeight, product of:
                1.7729694 = boost
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.018614756 = queryNorm
              0.73428774 = fieldWeight in 3225, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.078125 = fieldNorm(doc=3225)
          0.082831934 = weight(abstract_txt:experiments in 3225) [ClassicSimilarity], result of:
            0.082831934 = score(doc=3225,freq=1.0), product of:
              0.19911744 = queryWeight, product of:
                2.0088775 = boost
                5.324741 = idf(docFreq=587, maxDocs=44421)
                0.018614756 = queryNorm
              0.4159954 = fieldWeight in 3225, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.324741 = idf(docFreq=587, maxDocs=44421)
                0.078125 = fieldNorm(doc=3225)
          0.14506249 = weight(abstract_txt:connections in 3225) [ClassicSimilarity], result of:
            0.14506249 = score(doc=3225,freq=1.0), product of:
              0.2892994 = queryWeight, product of:
                2.4214337 = boost
                6.418264 = idf(docFreq=196, maxDocs=44421)
                0.018614756 = queryNorm
              0.5014269 = fieldWeight in 3225, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.418264 = idf(docFreq=196, maxDocs=44421)
                0.078125 = fieldNorm(doc=3225)
        0.28 = coord(7/25)
    
  2. Weeber, M.; Klein, H.; Jong-van den Berg, L.T.W. de; Vos, R.: Using concepts in literature-based discovery : simulating Swanson's Raynaud-Fish Oil and Migraine-Magnesium discoveries (2001) 0.13
    0.12535758 = sum of:
      0.12535758 = product of:
        0.6267879 = sum of:
          0.0825039 = weight(abstract_txt:discovery in 6910) [ClassicSimilarity], result of:
            0.0825039 = score(doc=6910,freq=2.0), product of:
              0.11078635 = queryWeight, product of:
                1.0595634 = boost
                5.616968 = idf(docFreq=438, maxDocs=44421)
                0.018614756 = queryNorm
              0.7447118 = fieldWeight in 6910, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.616968 = idf(docFreq=438, maxDocs=44421)
                0.09375 = fieldNorm(doc=6910)
          0.02262524 = weight(abstract_txt:that in 6910) [ClassicSimilarity], result of:
            0.02262524 = score(doc=6910,freq=3.0), product of:
              0.0589172 = queryWeight, product of:
                1.3383371 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.018614756 = queryNorm
              0.3840176 = fieldWeight in 6910, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.09375 = fieldNorm(doc=6910)
          0.27332684 = weight(abstract_txt:swanson in 6910) [ClassicSimilarity], result of:
            0.27332684 = score(doc=6910,freq=1.0), product of:
              0.310195 = queryWeight, product of:
                1.7729694 = boost
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.018614756 = queryNorm
              0.88114524 = fieldWeight in 6910, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.09375 = fieldNorm(doc=6910)
          0.12911373 = weight(abstract_txt:medical in 6910) [ClassicSimilarity], result of:
            0.12911373 = score(doc=6910,freq=1.0), product of:
              0.23704906 = queryWeight, product of:
                2.191886 = boost
                5.8098235 = idf(docFreq=361, maxDocs=44421)
                0.018614756 = queryNorm
              0.54467094 = fieldWeight in 6910, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8098235 = idf(docFreq=361, maxDocs=44421)
                0.09375 = fieldNorm(doc=6910)
          0.11921818 = weight(abstract_txt:literature in 6910) [ClassicSimilarity], result of:
            0.11921818 = score(doc=6910,freq=2.0), product of:
              0.20422332 = queryWeight, product of:
                2.4917078 = boost
                4.4030223 = idf(docFreq=1477, maxDocs=44421)
                0.018614756 = queryNorm
              0.5837638 = fieldWeight in 6910, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.4030223 = idf(docFreq=1477, maxDocs=44421)
                0.09375 = fieldNorm(doc=6910)
        0.2 = coord(5/25)
    
  3. Sebastian, Y.: Literature-based discovery by learning heterogeneous bibliographic information networks (2017) 0.12
    0.11826598 = sum of:
      0.11826598 = product of:
        0.49277493 = sum of:
          0.030881947 = weight(abstract_txt:word in 1536) [ClassicSimilarity], result of:
            0.030881947 = score(doc=1536,freq=1.0), product of:
              0.103841715 = queryWeight, product of:
                1.0258167 = boost
                5.4380693 = idf(docFreq=524, maxDocs=44421)
                0.018614756 = queryNorm
              0.29739442 = fieldWeight in 1536, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4380693 = idf(docFreq=524, maxDocs=44421)
                0.0546875 = fieldNorm(doc=1536)
          0.048127275 = weight(abstract_txt:discovery in 1536) [ClassicSimilarity], result of:
            0.048127275 = score(doc=1536,freq=2.0), product of:
              0.11078635 = queryWeight, product of:
                1.0595634 = boost
                5.616968 = idf(docFreq=438, maxDocs=44421)
                0.018614756 = queryNorm
              0.43441522 = fieldWeight in 1536, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.616968 = idf(docFreq=438, maxDocs=44421)
                0.0546875 = fieldNorm(doc=1536)
          0.017038621 = weight(abstract_txt:that in 1536) [ClassicSimilarity], result of:
            0.017038621 = score(doc=1536,freq=5.0), product of:
              0.0589172 = queryWeight, product of:
                1.3383371 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.018614756 = queryNorm
              0.28919604 = fieldWeight in 1536, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.0546875 = fieldNorm(doc=1536)
          0.17587893 = weight(abstract_txt:connections in 1536) [ClassicSimilarity], result of:
            0.17587893 = score(doc=1536,freq=3.0), product of:
              0.2892994 = queryWeight, product of:
                2.4214337 = boost
                6.418264 = idf(docFreq=196, maxDocs=44421)
                0.018614756 = queryNorm
              0.60794777 = fieldWeight in 1536, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.418264 = idf(docFreq=196, maxDocs=44421)
                0.0546875 = fieldNorm(doc=1536)
          0.15130424 = weight(abstract_txt:lexical in 1536) [ClassicSimilarity], result of:
            0.15130424 = score(doc=1536,freq=2.0), product of:
              0.29955012 = queryWeight, product of:
                2.4639595 = boost
                6.5309834 = idf(docFreq=175, maxDocs=44421)
                0.018614756 = queryNorm
              0.50510496 = fieldWeight in 1536, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.5309834 = idf(docFreq=175, maxDocs=44421)
                0.0546875 = fieldNorm(doc=1536)
          0.069543935 = weight(abstract_txt:literature in 1536) [ClassicSimilarity], result of:
            0.069543935 = score(doc=1536,freq=2.0), product of:
              0.20422332 = queryWeight, product of:
                2.4917078 = boost
                4.4030223 = idf(docFreq=1477, maxDocs=44421)
                0.018614756 = queryNorm
              0.34052888 = fieldWeight in 1536, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.4030223 = idf(docFreq=1477, maxDocs=44421)
                0.0546875 = fieldNorm(doc=1536)
        0.24 = coord(6/25)
    
  4. Mohammadi, E.; Thelwall, M.; Haustein, S.; Larivière, V.: Who reads research articles? : an altmetrics analysis of Mendeley user categories (2015) 0.12
    0.1160293 = sum of:
      0.1160293 = product of:
        0.48345542 = sum of:
          0.03326036 = weight(abstract_txt:citations in 3162) [ClassicSimilarity], result of:
            0.03326036 = score(doc=3162,freq=1.0), product of:
              0.09981414 = queryWeight, product of:
                1.0057265 = boost
                5.331567 = idf(docFreq=583, maxDocs=44421)
                0.018614756 = queryNorm
              0.33322293 = fieldWeight in 3162, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.331567 = idf(docFreq=583, maxDocs=44421)
                0.0625 = fieldNorm(doc=3162)
          0.09133937 = weight(abstract_txt:counts in 3162) [ClassicSimilarity], result of:
            0.09133937 = score(doc=3162,freq=2.0), product of:
              0.1553589 = queryWeight, product of:
                1.2547346 = boost
                6.651612 = idf(docFreq=155, maxDocs=44421)
                0.018614756 = queryNorm
              0.58792496 = fieldWeight in 3162, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.651612 = idf(docFreq=155, maxDocs=44421)
                0.0625 = fieldNorm(doc=3162)
          0.008708459 = weight(abstract_txt:that in 3162) [ClassicSimilarity], result of:
            0.008708459 = score(doc=3162,freq=1.0), product of:
              0.0589172 = queryWeight, product of:
                1.3383371 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.018614756 = queryNorm
              0.14780845 = fieldWeight in 3162, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.0625 = fieldNorm(doc=3162)
          0.08607583 = weight(abstract_txt:medical in 3162) [ClassicSimilarity], result of:
            0.08607583 = score(doc=3162,freq=1.0), product of:
              0.23704906 = queryWeight, product of:
                2.191886 = boost
                5.8098235 = idf(docFreq=361, maxDocs=44421)
                0.018614756 = queryNorm
              0.36311397 = fieldWeight in 3162, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8098235 = idf(docFreq=361, maxDocs=44421)
                0.0625 = fieldNorm(doc=3162)
          0.108529694 = weight(abstract_txt:statistics in 3162) [ClassicSimilarity], result of:
            0.108529694 = score(doc=3162,freq=1.0), product of:
              0.2766622 = queryWeight, product of:
                2.3679566 = boost
                6.2765174 = idf(docFreq=226, maxDocs=44421)
                0.018614756 = queryNorm
              0.39228234 = fieldWeight in 3162, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2765174 = idf(docFreq=226, maxDocs=44421)
                0.0625 = fieldNorm(doc=3162)
          0.15554173 = weight(abstract_txt:hidden in 3162) [ClassicSimilarity], result of:
            0.15554173 = score(doc=3162,freq=1.0), product of:
              0.35168085 = queryWeight, product of:
                2.669766 = boost
                7.0764947 = idf(docFreq=101, maxDocs=44421)
                0.018614756 = queryNorm
              0.44228092 = fieldWeight in 3162, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.0764947 = idf(docFreq=101, maxDocs=44421)
                0.0625 = fieldNorm(doc=3162)
        0.24 = coord(6/25)
    
  5. Bruhns, S.: Bibliografisk sogning som forskning : Don R. Swansons projekt (1995) 0.11
    0.11086374 = sum of:
      0.11086374 = product of:
        0.6928984 = sum of:
          0.013062689 = weight(abstract_txt:that in 4480) [ClassicSimilarity], result of:
            0.013062689 = score(doc=4480,freq=1.0), product of:
              0.0589172 = queryWeight, product of:
                1.3383371 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.018614756 = queryNorm
              0.22171268 = fieldWeight in 4480, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.09375 = fieldNorm(doc=4480)
          0.38654256 = weight(abstract_txt:swanson in 4480) [ClassicSimilarity], result of:
            0.38654256 = score(doc=4480,freq=2.0), product of:
              0.310195 = queryWeight, product of:
                1.7729694 = boost
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.018614756 = queryNorm
              1.2461276 = fieldWeight in 4480, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.09375 = fieldNorm(doc=4480)
          0.17407499 = weight(abstract_txt:connections in 4480) [ClassicSimilarity], result of:
            0.17407499 = score(doc=4480,freq=1.0), product of:
              0.2892994 = queryWeight, product of:
                2.4214337 = boost
                6.418264 = idf(docFreq=196, maxDocs=44421)
                0.018614756 = queryNorm
              0.6017122 = fieldWeight in 4480, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.418264 = idf(docFreq=196, maxDocs=44421)
                0.09375 = fieldNorm(doc=4480)
          0.11921818 = weight(abstract_txt:literature in 4480) [ClassicSimilarity], result of:
            0.11921818 = score(doc=4480,freq=2.0), product of:
              0.20422332 = queryWeight, product of:
                2.4917078 = boost
                4.4030223 = idf(docFreq=1477, maxDocs=44421)
                0.018614756 = queryNorm
              0.5837638 = fieldWeight in 4480, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.4030223 = idf(docFreq=1477, maxDocs=44421)
                0.09375 = fieldNorm(doc=4480)
        0.16 = coord(4/25)
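
The content-similarity scores combine several matching terms. As a rough sketch of how the tree for entry 5 (doc 4480) adds up, the code below assumes the ClassicSimilarity formulas: each term contributes queryWeight × fieldWeight, with queryWeight = boost × idf × queryNorm and fieldWeight = tf × idf × fieldNorm, and the sum is scaled by coord = matching terms / query terms. The boost, queryNorm and fieldNorm values are copied from the listing, not derived; the function and variable names are illustrative only.

```python
import math

MAX_DOCS = 44421          # maxDocs, from the listing
QUERY_NORM = 0.018614756  # queryNorm, copied from the listing


def idf(doc_freq: int, max_docs: int = MAX_DOCS) -> float:
    """Inverse document frequency, assuming the ClassicSimilarity formula."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(boost: float, doc_freq: int, freq: float, field_norm: float) -> float:
    """Score of one matching term: queryWeight * fieldWeight."""
    term_idf = idf(doc_freq)
    query_weight = boost * term_idf * QUERY_NORM
    field_weight = math.sqrt(freq) * term_idf * field_norm
    return query_weight * field_weight


def document_score(terms, field_norm: float, query_terms: int) -> float:
    """Sum of term scores, scaled by coord = matched terms / query terms."""
    raw = sum(term_score(boost, df, freq, field_norm) for _, boost, df, freq in terms)
    coord = len(terms) / query_terms
    return raw * coord


# (term, boost, docFreq, termFreq) for doc 4480, taken from the listing.
DOC_4480_TERMS = [
    ("that", 1.3383371, 11344, 1.0),
    ("swanson", 1.7729694, 9, 2.0),
    ("connections", 2.4214337, 196, 1.0),
    ("literature", 2.4917078, 1477, 2.0),
]

score = document_score(DOC_4480_TERMS, field_norm=0.09375, query_terms=25)
print(round(score, 4))  # 0.1109
```

The same functions can be applied to the other entries by substituting their term lists and fieldNorm values. The coord factor explains why a document matching many moderately rare terms can outrank one matching a few very rare ones.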