Document (#28876)

Author
Breitzman, A.
Title
Automated identification of technologically similar organizations
Source
Journal of the American Society for Information Science and Technology. 56(2005) no.10, p.1015-1023
Year
2005
Abstract
This article introduces and validates a method for identifying technologically similar organizations, industries, or regions by applying the techniques from information science for term similarity to international patent classifications. Several applications of the method are explored, including identifying hidden competitive threats, finding potential acquisition targets, locating university expertise within a technology, identifying competitor strategy shifts, and more. One advantage of the method is that it is size invariant, meaning, for example, that it is possible for a huge corporation to identify smaller firms in its space before they become significant competitors. Another advantage is that technologically similar organizations can be identified on a large scale without any particular knowledge of the technology or business of either source organizations or target organizations.
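
The abstract does not give the formula, so the following is only an illustrative sketch of one common term-similarity technique, cosine similarity, applied to patent counts per International Patent Classification (IPC) subclass. The organization names and counts are hypothetical, and the choice of cosine similarity is an assumption rather than the article's confirmed method. It does show the size-invariance property described above: scaling a portfolio up or down leaves the score unchanged.

```python
import math

def cosine_similarity(a, b):
    """Cosine similarity between two {ipc_class: patent_count} profiles."""
    shared = set(a) & set(b)
    dot = sum(a[c] * b[c] for c in shared)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # an empty profile is similar to nothing
    return dot / (norm_a * norm_b)

# Hypothetical portfolios (counts invented for illustration only).
big_corp = {"H04L": 500, "G06F": 300, "H04W": 200}
small_firm = {"H04L": 5, "G06F": 3, "H04W": 2}   # same mix, 1/100 the size
pharma_firm = {"A61K": 40, "C07D": 25}           # no classes in common

sim_small = cosine_similarity(big_corp, small_firm)
sim_pharma = cosine_similarity(big_corp, pharma_firm)
print(sim_small, sim_pharma)
```

Because the measure depends only on the direction of each profile vector, a small firm with the same technology mix as a large corporation scores as highly as a firm of equal size would.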

Similar documents (content)

  1. Liu, D.-R.; Shih, M.-J.: Hybrid-patent classification based on patent-network analysis (2011) 0.12
    0.1203893 = sum of:
      0.1203893 = product of:
        0.5016221 = sum of:
          0.18930657 = weight(abstract_txt:patent in 189) [ClassicSimilarity], result of:
            0.18930657 = score(doc=189,freq=14.0), product of:
              0.116799064 = queryWeight, product of:
                6.930783 = idf(docFreq=117, maxDocs=44421)
                0.016852219 = queryNorm
              1.6207885 = fieldWeight in 189, product of:
                3.7416575 = tf(freq=14.0), with freq of:
                  14.0 = termFreq=14.0
                6.930783 = idf(docFreq=117, maxDocs=44421)
                0.0625 = fieldNorm(doc=189)
          0.050969604 = weight(abstract_txt:competitive in 189) [ClassicSimilarity], result of:
            0.050969604 = score(doc=189,freq=1.0), product of:
              0.11737594 = queryWeight, product of:
                1.0024664 = boost
                6.9478774 = idf(docFreq=115, maxDocs=44421)
                0.016852219 = queryNorm
              0.43424234 = fieldWeight in 189, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9478774 = idf(docFreq=115, maxDocs=44421)
                0.0625 = fieldNorm(doc=189)
          0.012060459 = weight(abstract_txt:that in 189) [ClassicSimilarity], result of:
            0.012060459 = score(doc=189,freq=4.0), product of:
              0.040797595 = queryWeight, product of:
                1.0236659 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.016852219 = queryNorm
              0.2956169 = fieldWeight in 189, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.0625 = fieldNorm(doc=189)
          0.06601588 = weight(abstract_txt:advantage in 189) [ClassicSimilarity], result of:
            0.06601588 = score(doc=189,freq=1.0), product of:
              0.17571703 = queryWeight, product of:
                1.7346115 = boost
                6.011108 = idf(docFreq=295, maxDocs=44421)
                0.016852219 = queryNorm
              0.37569425 = fieldWeight in 189, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.011108 = idf(docFreq=295, maxDocs=44421)
                0.0625 = fieldNorm(doc=189)
          0.058706064 = weight(abstract_txt:method in 189) [ClassicSimilarity], result of:
            0.058706064 = score(doc=189,freq=2.0), product of:
              0.14763544 = queryWeight, product of:
                1.9473153 = boost
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.016852219 = queryNorm
              0.39764208 = fieldWeight in 189, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.0625 = fieldNorm(doc=189)
          0.124563515 = weight(abstract_txt:organizations in 189) [ClassicSimilarity], result of:
            0.124563515 = score(doc=189,freq=1.0), product of:
              0.36415714 = queryWeight, product of:
                3.9482963 = boost
                5.4729567 = idf(docFreq=506, maxDocs=44421)
                0.016852219 = queryNorm
              0.3420598 = fieldWeight in 189, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4729567 = idf(docFreq=506, maxDocs=44421)
                0.0625 = fieldNorm(doc=189)
        0.24 = coord(6/25)
    
  2. Kay, L.; Newman, N.; Youtie, J.; Porter, A.L.; Rafols, I.: Patent overlay mapping : visualizing technological distance (2014) 0.11
    0.112488635 = sum of:
      0.112488635 = product of:
        0.46870264 = sum of:
          0.17526382 = weight(abstract_txt:patent in 2543) [ClassicSimilarity], result of:
            0.17526382 = score(doc=2543,freq=12.0), product of:
              0.116799064 = queryWeight, product of:
                6.930783 = idf(docFreq=117, maxDocs=44421)
                0.016852219 = queryNorm
              1.5005585 = fieldWeight in 2543, product of:
                3.4641016 = tf(freq=12.0), with freq of:
                  12.0 = termFreq=12.0
                6.930783 = idf(docFreq=117, maxDocs=44421)
                0.0625 = fieldNorm(doc=2543)
          0.050969604 = weight(abstract_txt:competitive in 2543) [ClassicSimilarity], result of:
            0.050969604 = score(doc=2543,freq=1.0), product of:
              0.11737594 = queryWeight, product of:
                1.0024664 = boost
                6.9478774 = idf(docFreq=115, maxDocs=44421)
                0.016852219 = queryNorm
              0.43424234 = fieldWeight in 2543, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9478774 = idf(docFreq=115, maxDocs=44421)
                0.0625 = fieldNorm(doc=2543)
          0.012060459 = weight(abstract_txt:that in 2543) [ClassicSimilarity], result of:
            0.012060459 = score(doc=2543,freq=4.0), product of:
              0.040797595 = queryWeight, product of:
                1.0236659 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.016852219 = queryNorm
              0.2956169 = fieldWeight in 2543, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.0625 = fieldNorm(doc=2543)
          0.041511457 = weight(abstract_txt:method in 2543) [ClassicSimilarity], result of:
            0.041511457 = score(doc=2543,freq=1.0), product of:
              0.14763544 = queryWeight, product of:
                1.9473153 = boost
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.016852219 = queryNorm
              0.2811754 = fieldWeight in 2543, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.0625 = fieldNorm(doc=2543)
          0.06433379 = weight(abstract_txt:similar in 2543) [ClassicSimilarity], result of:
            0.06433379 = score(doc=2543,freq=1.0), product of:
              0.1977143 = queryWeight, product of:
                2.2535126 = boost
                5.206202 = idf(docFreq=661, maxDocs=44421)
                0.016852219 = queryNorm
              0.32538763 = fieldWeight in 2543, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.206202 = idf(docFreq=661, maxDocs=44421)
                0.0625 = fieldNorm(doc=2543)
          0.124563515 = weight(abstract_txt:organizations in 2543) [ClassicSimilarity], result of:
            0.124563515 = score(doc=2543,freq=1.0), product of:
              0.36415714 = queryWeight, product of:
                3.9482963 = boost
                5.4729567 = idf(docFreq=506, maxDocs=44421)
                0.016852219 = queryNorm
              0.3420598 = fieldWeight in 2543, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4729567 = idf(docFreq=506, maxDocs=44421)
                0.0625 = fieldNorm(doc=2543)
        0.24 = coord(6/25)
    
  3. Pan, S.; Pan, G.; Hsieh, M.H.: A dual-level analysis of the capability development process : a case study of TT&T (2006) 0.10
    0.09563694 = sum of:
      0.09563694 = product of:
        0.39848727 = sum of:
          0.050969604 = weight(abstract_txt:competitive in 337) [ClassicSimilarity], result of:
            0.050969604 = score(doc=337,freq=1.0), product of:
              0.11737594 = queryWeight, product of:
                1.0024664 = boost
                6.9478774 = idf(docFreq=115, maxDocs=44421)
                0.016852219 = queryNorm
              0.43424234 = fieldWeight in 337, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9478774 = idf(docFreq=115, maxDocs=44421)
                0.0625 = fieldNorm(doc=337)
          0.008528032 = weight(abstract_txt:that in 337) [ClassicSimilarity], result of:
            0.008528032 = score(doc=337,freq=2.0), product of:
              0.040797595 = queryWeight, product of:
                1.0236659 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.016852219 = queryNorm
              0.20903271 = fieldWeight in 337, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.0625 = fieldNorm(doc=337)
          0.07295994 = weight(abstract_txt:firms in 337) [ClassicSimilarity], result of:
            0.07295994 = score(doc=337,freq=1.0), product of:
              0.14908291 = queryWeight, product of:
                1.129781 = boost
                7.8302665 = idf(docFreq=47, maxDocs=44421)
                0.016852219 = queryNorm
              0.48939165 = fieldWeight in 337, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.8302665 = idf(docFreq=47, maxDocs=44421)
                0.0625 = fieldNorm(doc=337)
          0.023854416 = weight(abstract_txt:technology in 337) [ClassicSimilarity], result of:
            0.023854416 = score(doc=337,freq=1.0), product of:
              0.08914441 = queryWeight, product of:
                1.2354989 = boost
                4.2814875 = idf(docFreq=1668, maxDocs=44421)
                0.016852219 = queryNorm
              0.26759297 = fieldWeight in 337, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2814875 = idf(docFreq=1668, maxDocs=44421)
                0.0625 = fieldNorm(doc=337)
          0.06601588 = weight(abstract_txt:advantage in 337) [ClassicSimilarity], result of:
            0.06601588 = score(doc=337,freq=1.0), product of:
              0.17571703 = queryWeight, product of:
                1.7346115 = boost
                6.011108 = idf(docFreq=295, maxDocs=44421)
                0.016852219 = queryNorm
              0.37569425 = fieldWeight in 337, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.011108 = idf(docFreq=295, maxDocs=44421)
                0.0625 = fieldNorm(doc=337)
          0.17615941 = weight(abstract_txt:organizations in 337) [ClassicSimilarity], result of:
            0.17615941 = score(doc=337,freq=2.0), product of:
              0.36415714 = queryWeight, product of:
                3.9482963 = boost
                5.4729567 = idf(docFreq=506, maxDocs=44421)
                0.016852219 = queryNorm
              0.48374557 = fieldWeight in 337, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.4729567 = idf(docFreq=506, maxDocs=44421)
                0.0625 = fieldNorm(doc=337)
        0.24 = coord(6/25)
    
  4. Allen, C.: Information challenges in the global marketplace (1994) 0.09
    0.0877214 = sum of:
      0.0877214 = product of:
        0.5482588 = sum of:
          0.0764544 = weight(abstract_txt:competitive in 605) [ClassicSimilarity], result of:
            0.0764544 = score(doc=605,freq=1.0), product of:
              0.11737594 = queryWeight, product of:
                1.0024664 = boost
                6.9478774 = idf(docFreq=115, maxDocs=44421)
                0.016852219 = queryNorm
              0.6513635 = fieldWeight in 605, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9478774 = idf(docFreq=115, maxDocs=44421)
                0.09375 = fieldNorm(doc=605)
          0.009045344 = weight(abstract_txt:that in 605) [ClassicSimilarity], result of:
            0.009045344 = score(doc=605,freq=1.0), product of:
              0.040797595 = queryWeight, product of:
                1.0236659 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.016852219 = queryNorm
              0.22171268 = fieldWeight in 605, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.09375 = fieldNorm(doc=605)
          0.13913351 = weight(abstract_txt:competitors in 605) [ClassicSimilarity], result of:
            0.13913351 = score(doc=605,freq=1.0), product of:
              0.17495723 = queryWeight, product of:
                1.2239009 = boost
                8.482592 = idf(docFreq=24, maxDocs=44421)
                0.016852219 = queryNorm
              0.79524297 = fieldWeight in 605, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.482592 = idf(docFreq=24, maxDocs=44421)
                0.09375 = fieldNorm(doc=605)
          0.3236255 = weight(abstract_txt:organizations in 605) [ClassicSimilarity], result of:
            0.3236255 = score(doc=605,freq=3.0), product of:
              0.36415714 = queryWeight, product of:
                3.9482963 = boost
                5.4729567 = idf(docFreq=506, maxDocs=44421)
                0.016852219 = queryNorm
              0.8886974 = fieldWeight in 605, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.4729567 = idf(docFreq=506, maxDocs=44421)
                0.09375 = fieldNorm(doc=605)
        0.16 = coord(4/25)
    
  5. Yan, B.; Luo, J.: Filtering patent maps for visualization of diversification paths of inventors and organizations (2017) 0.08
    0.083714105 = sum of:
      0.083714105 = product of:
        0.41857052 = sum of:
          0.13385996 = weight(abstract_txt:patent in 4651) [ClassicSimilarity], result of:
            0.13385996 = score(doc=4651,freq=7.0), product of:
              0.116799064 = queryWeight, product of:
                6.930783 = idf(docFreq=117, maxDocs=44421)
                0.016852219 = queryNorm
              1.1460705 = fieldWeight in 4651, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                6.930783 = idf(docFreq=117, maxDocs=44421)
                0.0625 = fieldNorm(doc=4651)
          0.008528032 = weight(abstract_txt:that in 4651) [ClassicSimilarity], result of:
            0.008528032 = score(doc=4651,freq=2.0), product of:
              0.040797595 = queryWeight, product of:
                1.0236659 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.016852219 = queryNorm
              0.20903271 = fieldWeight in 4651, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.0625 = fieldNorm(doc=4651)
          0.04131706 = weight(abstract_txt:technology in 4651) [ClassicSimilarity], result of:
            0.04131706 = score(doc=4651,freq=3.0), product of:
              0.08914441 = queryWeight, product of:
                1.2354989 = boost
                4.2814875 = idf(docFreq=1668, maxDocs=44421)
                0.016852219 = queryNorm
              0.46348462 = fieldWeight in 4651, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.2814875 = idf(docFreq=1668, maxDocs=44421)
                0.0625 = fieldNorm(doc=4651)
          0.058706064 = weight(abstract_txt:method in 4651) [ClassicSimilarity], result of:
            0.058706064 = score(doc=4651,freq=2.0), product of:
              0.14763544 = queryWeight, product of:
                1.9473153 = boost
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.016852219 = queryNorm
              0.39764208 = fieldWeight in 4651, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.0625 = fieldNorm(doc=4651)
          0.17615941 = weight(abstract_txt:organizations in 4651) [ClassicSimilarity], result of:
            0.17615941 = score(doc=4651,freq=2.0), product of:
              0.36415714 = queryWeight, product of:
                3.9482963 = boost
                5.4729567 = idf(docFreq=506, maxDocs=44421)
                0.016852219 = queryNorm
              0.48374557 = fieldWeight in 4651, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.4729567 = idf(docFreq=506, maxDocs=44421)
                0.0625 = fieldNorm(doc=4651)
        0.2 = coord(5/25)
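
The score breakdowns above follow the Lucene ClassicSimilarity (TF-IDF) pattern: each matching term contributes queryWeight × fieldWeight, and the sum is multiplied by the coord factor. As a rough check, the sketch below recomputes the score of the first hit (doc 189) from the frequencies, document frequencies, and boosts printed in its tree. The formulas tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)) are the ones the printed values are consistent with; the boost of 1.0 for "patent" is an assumption, since the tree shows no boost line for that term, and the function names are illustrative.

```python
import math

MAX_DOCS = 44421          # maxDocs from the explain output
QUERY_NORM = 0.016852219  # queryNorm from the explain output
FIELD_NORM = 0.0625       # fieldNorm(doc=189)

def idf(doc_freq, max_docs=MAX_DOCS):
    """Inverse document frequency as used by ClassicSimilarity."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def term_score(freq, doc_freq, boost=1.0, field_norm=FIELD_NORM):
    """queryWeight * fieldWeight for a single matching term."""
    i = idf(doc_freq)
    query_weight = boost * i * QUERY_NORM
    field_weight = math.sqrt(freq) * i * field_norm
    return query_weight * field_weight

# (term, freq in doc 189, docFreq, boost) copied from the first tree.
# The boost for "patent" is assumed to be 1.0 (not printed in the tree).
TERMS_DOC_189 = [
    ("patent", 14, 117, 1.0),
    ("competitive", 1, 115, 1.0024664),
    ("that", 4, 11344, 1.0236659),
    ("advantage", 1, 295, 1.7346115),
    ("method", 2, 1342, 1.9473153),
    ("organizations", 1, 506, 3.9482963),
]

def document_score(terms, query_term_count=25):
    """Sum of the term scores, scaled by coord = matched / total query terms."""
    total = sum(term_score(freq, df, boost) for _, freq, df, boost in terms)
    coord = len(terms) / query_term_count
    return total * coord

score_189 = document_score(TERMS_DOC_189)
print(round(score_189, 4))
```

The result should land close to the 0.1203893 reported for the first hit; small differences in the last digits are expected because Lucene computes with 32-bit floats.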