Document (#21916)

Author
Zhang, J.
Korfhage, R.R.
Title
¬A distance and angle similarity measure method
Source
Journal of the American Society for Information Science. 50(1999) no.9, pp.772-778
Year
1999
Abstract
This article presents a distance and angle similarity measure. The integrated similarity measure takes the strengths of both the distance and direction of measured documents into account. The article analyzes the features of the similarity measure by comparing it with the traditional distance-based similarity measure and the cosine measure, providing the iso-similarity contour, and investigating the impacts of the parameters and variables on the new similarity measure. It also identifies issues for further research on the topic.

Similar documents (author)

  1. Zhang, M.; Zhang, Y.: Professional organizations in Twittersphere : an empirical study of U.S. library and information science professional organizations-related Tweets (2020) 4.53
    4.5277104 = sum of:
      4.5277104 = weight(author_txt:zhang in 775) [ClassicSimilarity], result of:
        4.5277104 = score(doc=775,freq=2.0), product of:
          0.99999994 = queryWeight, product of:
            6.40315 = idf(docFreq=199, maxDocs=44421)
            0.15617312 = queryNorm
          4.527711 = fieldWeight in 775, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            6.40315 = idf(docFreq=199, maxDocs=44421)
            0.5 = fieldNorm(doc=775)
    
  2. Zhang, Y.; Zhang, C.: Enhancing keyphrase extraction from microblogs using human reading time (2021) 4.53
    4.5277104 = sum of:
      4.5277104 = weight(author_txt:zhang in 1238) [ClassicSimilarity], result of:
        4.5277104 = score(doc=1238,freq=2.0), product of:
          0.99999994 = queryWeight, product of:
            6.40315 = idf(docFreq=199, maxDocs=44421)
            0.15617312 = queryNorm
          4.527711 = fieldWeight in 1238, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            6.40315 = idf(docFreq=199, maxDocs=44421)
            0.5 = fieldNorm(doc=1238)
    
  3. Zhang, J.: TOFIR: A tool of facilitating information retrieval : introduce a visual retrieval model (2001) 4.00
    4.0019684 = sum of:
      4.0019684 = weight(author_txt:zhang in 7710) [ClassicSimilarity], result of:
        4.0019684 = score(doc=7710,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            6.40315 = idf(docFreq=199, maxDocs=44421)
            0.15617312 = queryNorm
          4.001969 = fieldWeight in 7710, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            6.40315 = idf(docFreq=199, maxDocs=44421)
            0.625 = fieldNorm(doc=7710)
    
  4. Zhang, A.: Multimedia file formats on the Internet : a beginner's guide for PC users (1995) 4.00
    4.0019684 = sum of:
      4.0019684 = weight(author_txt:zhang in 3280) [ClassicSimilarity], result of:
        4.0019684 = score(doc=3280,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            6.40315 = idf(docFreq=199, maxDocs=44421)
            0.15617312 = queryNorm
          4.001969 = fieldWeight in 3280, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            6.40315 = idf(docFreq=199, maxDocs=44421)
            0.625 = fieldNorm(doc=3280)
    
  5. Zhang, J.: ¬A representational analysis of relational information displays (1996) 4.00
    4.0019684 = sum of:
      4.0019684 = weight(author_txt:zhang in 6471) [ClassicSimilarity], result of:
        4.0019684 = score(doc=6471,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            6.40315 = idf(docFreq=199, maxDocs=44421)
            0.15617312 = queryNorm
          4.001969 = fieldWeight in 6471, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            6.40315 = idf(docFreq=199, maxDocs=44421)
            0.625 = fieldNorm(doc=6471)
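
The author-match scores above can be recomputed by hand. The following is a minimal sketch, assuming the classic Lucene TF-IDF formulas (tf = sqrt(freq), idf = 1 + ln(maxDocs / (docFreq + 1))). The function names are hypothetical and chosen for illustration; the numeric inputs are copied from the score trees above.

```python
import math


def idf(doc_freq: int, max_docs: int) -> float:
    """Inverse document frequency as assumed for ClassicSimilarity."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def tf(freq: float) -> float:
    """Term frequency factor: square root of the raw frequency."""
    return math.sqrt(freq)


def term_score(freq, doc_freq, max_docs, query_norm, field_norm, boost=1.0):
    """Score of one term in one document: queryWeight * fieldWeight."""
    term_idf = idf(doc_freq, max_docs)
    query_weight = boost * term_idf * query_norm      # "queryWeight" in the tree
    field_weight = tf(freq) * term_idf * field_norm   # "fieldWeight" in the tree
    return query_weight * field_weight


# Inputs taken from the entries for doc 775 (freq=2.0) and doc 7710 (freq=1.0).
score_775 = term_score(2.0, 199, 44421, 0.15617312, 0.5)
score_7710 = term_score(1.0, 199, 44421, 0.15617312, 0.625)
print(score_775, score_7710)
```

The results should agree with the listed 4.5277104 and 4.0019684 up to single-precision rounding, which suggests that the two-author entries rank higher only because the surname occurs twice in the author field.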
    

Similar documents (content)

  1. Zhang, J.; Korfhage, R.R.: DARE: Distance and Angle Retrieval Environment : A tale of the two measures (1999) 0.32
    0.31951115 = sum of:
      0.31951115 = product of:
        1.3312964 = sum of:
          0.008013987 = weight(abstract_txt:this in 4916) [ClassicSimilarity], result of:
            0.008013987 = score(doc=4916,freq=1.0), product of:
              0.026644073 = queryWeight, product of:
                1.0329002 = boost
                2.4062347 = idf(docFreq=10885, maxDocs=44421)
                0.010720233 = queryNorm
              0.30077934 = fieldWeight in 4916, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4062347 = idf(docFreq=10885, maxDocs=44421)
                0.125 = fieldNorm(doc=4916)
          0.09404305 = weight(abstract_txt:direction in 4916) [ClassicSimilarity], result of:
            0.09404305 = score(doc=4916,freq=1.0), product of:
              0.10920504 = queryWeight, product of:
                1.4786466 = boost
                6.889283 = idf(docFreq=122, maxDocs=44421)
                0.010720233 = queryNorm
              0.8611604 = fieldWeight in 4916, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.889283 = idf(docFreq=122, maxDocs=44421)
                0.125 = fieldNorm(doc=4916)
          0.031547323 = weight(abstract_txt:article in 4916) [ClassicSimilarity], result of:
            0.031547323 = score(doc=4916,freq=1.0), product of:
              0.06642678 = queryWeight, product of:
                1.6309087 = boost
                3.79935 = idf(docFreq=2702, maxDocs=44421)
                0.010720233 = queryNorm
              0.47491875 = fieldWeight in 4916, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.79935 = idf(docFreq=2702, maxDocs=44421)
                0.125 = fieldNorm(doc=4916)
          0.361547 = weight(abstract_txt:angle in 4916) [ClassicSimilarity], result of:
            0.361547 = score(doc=4916,freq=1.0), product of:
              0.33765876 = queryWeight, product of:
                3.6770291 = boost
                8.565973 = idf(docFreq=22, maxDocs=44421)
                0.010720233 = queryNorm
              1.0707467 = fieldWeight in 4916, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.565973 = idf(docFreq=22, maxDocs=44421)
                0.125 = fieldNorm(doc=4916)
          0.43963304 = weight(abstract_txt:distance in 4916) [ClassicSimilarity], result of:
            0.43963304 = score(doc=4916,freq=2.0), product of:
              0.384676 = queryWeight, product of:
                5.550353 = boost
                6.4650254 = idf(docFreq=187, maxDocs=44421)
                0.010720233 = queryNorm
              1.1428658 = fieldWeight in 4916, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.4650254 = idf(docFreq=187, maxDocs=44421)
                0.125 = fieldNorm(doc=4916)
          0.396512 = weight(abstract_txt:similarity in 4916) [ClassicSimilarity], result of:
            0.396512 = score(doc=4916,freq=1.0), product of:
              0.54520744 = queryWeight, product of:
                8.741239 = boost
                5.8181453 = idf(docFreq=358, maxDocs=44421)
                0.010720233 = queryNorm
              0.72726816 = fieldWeight in 4916, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8181453 = idf(docFreq=358, maxDocs=44421)
                0.125 = fieldNorm(doc=4916)
        0.24 = coord(6/25)
    
  2. Zhang, J.; Wolfram, D.: Visualization of term discrimination analysis (2001) 0.30
    0.29944643 = sum of:
      0.29944643 = product of:
        1.2476935 = sum of:
          0.031884354 = weight(abstract_txt:comparing in 210) [ClassicSimilarity], result of:
            0.031884354 = score(doc=210,freq=1.0), product of:
              0.08428752 = queryWeight, product of:
                1.2990465 = boost
                6.0524936 = idf(docFreq=283, maxDocs=44421)
                0.010720233 = queryNorm
              0.37828085 = fieldWeight in 210, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.0524936 = idf(docFreq=283, maxDocs=44421)
                0.0625 = fieldNorm(doc=210)
          0.067449324 = weight(abstract_txt:cosine in 210) [ClassicSimilarity], result of:
            0.067449324 = score(doc=210,freq=1.0), product of:
              0.1388982 = queryWeight, product of:
                1.667598 = boost
                7.769642 = idf(docFreq=50, maxDocs=44421)
                0.010720233 = queryNorm
              0.48560262 = fieldWeight in 210, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.769642 = idf(docFreq=50, maxDocs=44421)
                0.0625 = fieldNorm(doc=210)
          0.25565234 = weight(abstract_txt:angle in 210) [ClassicSimilarity], result of:
            0.25565234 = score(doc=210,freq=2.0), product of:
              0.33765876 = queryWeight, product of:
                3.6770291 = boost
                8.565973 = idf(docFreq=22, maxDocs=44421)
                0.010720233 = queryNorm
              0.75713223 = fieldWeight in 210, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.565973 = idf(docFreq=22, maxDocs=44421)
                0.0625 = fieldNorm(doc=210)
          0.26921916 = weight(abstract_txt:distance in 210) [ClassicSimilarity], result of:
            0.26921916 = score(doc=210,freq=3.0), product of:
              0.384676 = queryWeight, product of:
                5.550353 = boost
                6.4650254 = idf(docFreq=187, maxDocs=44421)
                0.010720233 = queryNorm
              0.6998595 = fieldWeight in 210, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.4650254 = idf(docFreq=187, maxDocs=44421)
                0.0625 = fieldNorm(doc=210)
          0.28009894 = weight(abstract_txt:measure in 210) [ClassicSimilarity], result of:
            0.28009894 = score(doc=210,freq=3.0), product of:
              0.4759684 = queryWeight, product of:
                8.16735 = boost
                5.4361663 = idf(docFreq=525, maxDocs=44421)
                0.010720233 = queryNorm
              0.58848226 = fieldWeight in 210, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.4361663 = idf(docFreq=525, maxDocs=44421)
                0.0625 = fieldNorm(doc=210)
          0.34338945 = weight(abstract_txt:similarity in 210) [ClassicSimilarity], result of:
            0.34338945 = score(doc=210,freq=3.0), product of:
              0.54520744 = queryWeight, product of:
                8.741239 = boost
                5.8181453 = idf(docFreq=358, maxDocs=44421)
                0.010720233 = queryNorm
              0.6298327 = fieldWeight in 210, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.8181453 = idf(docFreq=358, maxDocs=44421)
                0.0625 = fieldNorm(doc=210)
        0.24 = coord(6/25)
    
  3. Tudhope, D.; Taylor, C.: Navigation via similarity (1997) 0.25
    0.25423965 = sum of:
      0.25423965 = product of:
        1.0593319 = sum of:
          0.028144037 = weight(abstract_txt:integrated in 1155) [ClassicSimilarity], result of:
            0.028144037 = score(doc=1155,freq=1.0), product of:
              0.066838875 = queryWeight, product of:
                1.1567982 = boost
                5.389733 = idf(docFreq=550, maxDocs=44421)
                0.010720233 = queryNorm
              0.42107287 = fieldWeight in 1155, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.389733 = idf(docFreq=550, maxDocs=44421)
                0.078125 = fieldNorm(doc=1155)
          0.040753786 = weight(abstract_txt:account in 1155) [ClassicSimilarity], result of:
            0.040753786 = score(doc=1155,freq=2.0), product of:
              0.06790058 = queryWeight, product of:
                1.1659497 = boost
                5.432371 = idf(docFreq=527, maxDocs=44421)
                0.010720233 = queryNorm
              0.60019785 = fieldWeight in 1155, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.432371 = idf(docFreq=527, maxDocs=44421)
                0.078125 = fieldNorm(doc=1155)
          0.039855443 = weight(abstract_txt:takes in 1155) [ClassicSimilarity], result of:
            0.039855443 = score(doc=1155,freq=1.0), product of:
              0.08428752 = queryWeight, product of:
                1.2990465 = boost
                6.0524936 = idf(docFreq=283, maxDocs=44421)
                0.010720233 = queryNorm
              0.47285107 = fieldWeight in 1155, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.0524936 = idf(docFreq=283, maxDocs=44421)
                0.078125 = fieldNorm(doc=1155)
          0.1942922 = weight(abstract_txt:distance in 1155) [ClassicSimilarity], result of:
            0.1942922 = score(doc=1155,freq=1.0), product of:
              0.384676 = queryWeight, product of:
                5.550353 = boost
                6.4650254 = idf(docFreq=187, maxDocs=44421)
                0.010720233 = queryNorm
              0.5050801 = fieldWeight in 1155, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.4650254 = idf(docFreq=187, maxDocs=44421)
                0.078125 = fieldNorm(doc=1155)
          0.20214401 = weight(abstract_txt:measure in 1155) [ClassicSimilarity], result of:
            0.20214401 = score(doc=1155,freq=1.0), product of:
              0.4759684 = queryWeight, product of:
                8.16735 = boost
                5.4361663 = idf(docFreq=525, maxDocs=44421)
                0.010720233 = queryNorm
              0.4247005 = fieldWeight in 1155, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4361663 = idf(docFreq=525, maxDocs=44421)
                0.078125 = fieldNorm(doc=1155)
          0.5541424 = weight(abstract_txt:similarity in 1155) [ClassicSimilarity], result of:
            0.5541424 = score(doc=1155,freq=5.0), product of:
              0.54520744 = queryWeight, product of:
                8.741239 = boost
                5.8181453 = idf(docFreq=358, maxDocs=44421)
                0.010720233 = queryNorm
              1.0163882 = fieldWeight in 1155, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.8181453 = idf(docFreq=358, maxDocs=44421)
                0.078125 = fieldNorm(doc=1155)
        0.24 = coord(6/25)
    
  4. Wolfram, D.; Zhang, J.: ¬An investigation of the influence of indexing exhaustivity and term distributions on a document space (2002) 0.23
    0.22529504 = sum of:
      0.22529504 = product of:
        0.93872935 = sum of:
          0.017154584 = weight(abstract_txt:providing in 238) [ClassicSimilarity], result of:
            0.017154584 = score(doc=238,freq=1.0), product of:
              0.055756863 = queryWeight, product of:
                1.0565553 = boost
                4.922683 = idf(docFreq=878, maxDocs=44421)
                0.010720233 = queryNorm
              0.30766767 = fieldWeight in 238, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.922683 = idf(docFreq=878, maxDocs=44421)
                0.0625 = fieldNorm(doc=238)
          0.03673205 = weight(abstract_txt:measured in 238) [ClassicSimilarity], result of:
            0.03673205 = score(doc=238,freq=1.0), product of:
              0.09262786 = queryWeight, product of:
                1.3618017 = boost
                6.3448815 = idf(docFreq=211, maxDocs=44421)
                0.010720233 = queryNorm
              0.3965551 = fieldWeight in 238, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.3448815 = idf(docFreq=211, maxDocs=44421)
                0.0625 = fieldNorm(doc=238)
          0.25565234 = weight(abstract_txt:angle in 238) [ClassicSimilarity], result of:
            0.25565234 = score(doc=238,freq=2.0), product of:
              0.33765876 = queryWeight, product of:
                3.6770291 = boost
                8.565973 = idf(docFreq=22, maxDocs=44421)
                0.010720233 = queryNorm
              0.75713223 = fieldWeight in 238, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.565973 = idf(docFreq=22, maxDocs=44421)
                0.0625 = fieldNorm(doc=238)
          0.26921916 = weight(abstract_txt:distance in 238) [ClassicSimilarity], result of:
            0.26921916 = score(doc=238,freq=3.0), product of:
              0.384676 = queryWeight, product of:
                5.550353 = boost
                6.4650254 = idf(docFreq=187, maxDocs=44421)
                0.010720233 = queryNorm
              0.6998595 = fieldWeight in 238, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.4650254 = idf(docFreq=187, maxDocs=44421)
                0.0625 = fieldNorm(doc=238)
          0.16171521 = weight(abstract_txt:measure in 238) [ClassicSimilarity], result of:
            0.16171521 = score(doc=238,freq=1.0), product of:
              0.4759684 = queryWeight, product of:
                8.16735 = boost
                5.4361663 = idf(docFreq=525, maxDocs=44421)
                0.010720233 = queryNorm
              0.3397604 = fieldWeight in 238, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4361663 = idf(docFreq=525, maxDocs=44421)
                0.0625 = fieldNorm(doc=238)
          0.198256 = weight(abstract_txt:similarity in 238) [ClassicSimilarity], result of:
            0.198256 = score(doc=238,freq=1.0), product of:
              0.54520744 = queryWeight, product of:
                8.741239 = boost
                5.8181453 = idf(docFreq=358, maxDocs=44421)
                0.010720233 = queryNorm
              0.36363408 = fieldWeight in 238, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8181453 = idf(docFreq=358, maxDocs=44421)
                0.0625 = fieldNorm(doc=238)
        0.24 = coord(6/25)
    
  5. Shibata, N.; Kajikawa, Y.; Sakata, I.: Measuring relatedness between communities in a citation network (2011) 0.22
    0.22475436 = sum of:
      0.22475436 = product of:
        0.80269414 = sum of:
          0.0050087417 = weight(abstract_txt:this in 484) [ClassicSimilarity], result of:
            0.0050087417 = score(doc=484,freq=1.0), product of:
              0.026644073 = queryWeight, product of:
                1.0329002 = boost
                2.4062347 = idf(docFreq=10885, maxDocs=44421)
                0.010720233 = queryNorm
              0.18798709 = fieldWeight in 484, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4062347 = idf(docFreq=10885, maxDocs=44421)
                0.078125 = fieldNorm(doc=484)
          0.023202427 = weight(abstract_txt:topic in 484) [ClassicSimilarity], result of:
            0.023202427 = score(doc=484,freq=1.0), product of:
              0.058766134 = queryWeight, product of:
                1.0846925 = boost
                5.053779 = idf(docFreq=770, maxDocs=44421)
                0.010720233 = queryNorm
              0.3948265 = fieldWeight in 484, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.053779 = idf(docFreq=770, maxDocs=44421)
                0.078125 = fieldNorm(doc=484)
          0.045915063 = weight(abstract_txt:measured in 484) [ClassicSimilarity], result of:
            0.045915063 = score(doc=484,freq=1.0), product of:
              0.09262786 = queryWeight, product of:
                1.3618017 = boost
                6.3448815 = idf(docFreq=211, maxDocs=44421)
                0.010720233 = queryNorm
              0.49569386 = fieldWeight in 484, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.3448815 = idf(docFreq=211, maxDocs=44421)
                0.078125 = fieldNorm(doc=484)
          0.08431166 = weight(abstract_txt:cosine in 484) [ClassicSimilarity], result of:
            0.08431166 = score(doc=484,freq=1.0), product of:
              0.1388982 = queryWeight, product of:
                1.667598 = boost
                7.769642 = idf(docFreq=50, maxDocs=44421)
                0.010720233 = queryNorm
              0.6070033 = fieldWeight in 484, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.769642 = idf(docFreq=50, maxDocs=44421)
                0.078125 = fieldNorm(doc=484)
          0.1942922 = weight(abstract_txt:distance in 484) [ClassicSimilarity], result of:
            0.1942922 = score(doc=484,freq=1.0), product of:
              0.384676 = queryWeight, product of:
                5.550353 = boost
                6.4650254 = idf(docFreq=187, maxDocs=44421)
                0.010720233 = queryNorm
              0.5050801 = fieldWeight in 484, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.4650254 = idf(docFreq=187, maxDocs=44421)
                0.078125 = fieldNorm(doc=484)
          0.20214401 = weight(abstract_txt:measure in 484) [ClassicSimilarity], result of:
            0.20214401 = score(doc=484,freq=1.0), product of:
              0.4759684 = queryWeight, product of:
                8.16735 = boost
                5.4361663 = idf(docFreq=525, maxDocs=44421)
                0.010720233 = queryNorm
              0.4247005 = fieldWeight in 484, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4361663 = idf(docFreq=525, maxDocs=44421)
                0.078125 = fieldNorm(doc=484)
          0.24782 = weight(abstract_txt:similarity in 484) [ClassicSimilarity], result of:
            0.24782 = score(doc=484,freq=1.0), product of:
              0.54520744 = queryWeight, product of:
                8.741239 = boost
                5.8181453 = idf(docFreq=358, maxDocs=44421)
                0.010720233 = queryNorm
              0.4545426 = fieldWeight in 484, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8181453 = idf(docFreq=358, maxDocs=44421)
                0.078125 = fieldNorm(doc=484)
        0.28 = coord(7/25)
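
The content-match totals combine the per-term weights with a coordination factor. The following is a minimal sketch, assuming the total is the sum of the matching term scores multiplied by coord = matched terms / query terms, as the trees above indicate. The function name `combined_score` is hypothetical; the term scores are copied from the entry for doc 4916.

```python
def combined_score(term_scores, total_query_terms):
    """Sum the matching term scores and scale by the coord factor."""
    coord = len(term_scores) / total_query_terms  # e.g. 6/25 = 0.24
    return sum(term_scores) * coord


# Per-term weights for doc 4916: this, direction, article, angle, distance, similarity.
scores_4916 = [
    0.008013987,
    0.09404305,
    0.031547323,
    0.361547,
    0.43963304,
    0.396512,
]

# 25 is the number of query terms shown in coord(6/25).
total_4916 = combined_score(scores_4916, 25)
print(total_4916)
```

Under this reading, the last entry (doc 484) matches seven of the 25 query terms and so receives the larger factor 0.28, yet it still ranks lowest because its individual term weights are smaller.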