Document (#43922)

Author
Li, G.
Siddharth, L.
Luo, J.
Title
Embedding knowledge graph of patent metadata to measure knowledge proximity
Source
Journal of the Association for Information Science and Technology. 74(2023) no.4, pp. 476-490
Year
2023
Abstract
Knowledge proximity refers to the strength of association between any two entities in a structural form that embodies certain aspects of a knowledge base. In this work, we operationalize knowledge proximity within the context of the US Patent Database (knowledge base) using a knowledge graph (structural form) named "PatNet" built using patent metadata, including citations, inventors, assignees, and domain classifications. We train various graph embedding models using PatNet to obtain the embeddings of entities and relations. The cosine similarity between the corresponding (or transformed) embeddings of entities denotes the knowledge proximity between these. We compare the embedding models in terms of their performances in predicting target entities and explaining domain expansion profiles of inventors and assignees. We then apply the embeddings of the best-preferred model to associate homogeneous (e.g., patent-patent) and heterogeneous (e.g., inventor-assignee) pairs of entities.
Content
Cf.: https://asistdl.onlinelibrary.wiley.com/doi/10.1002/asi.24736.
Field
Patent information
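
The abstract above defines knowledge proximity as the cosine similarity between the corresponding (or transformed) embeddings of two entities. A minimal Python sketch of that measure follows. The vectors, the entity names, and the `translate` helper (a translation-style shift by a relation vector) are illustrative assumptions, not values or methods taken from the paper.

```python
import math


def cosine_similarity(u, v):
    """Return the cosine of the angle between two equal-length vectors."""
    if len(u) != len(v):
        raise ValueError("vectors must have the same length")
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        raise ValueError("cosine similarity is undefined for a zero vector")
    return dot / (norm_u * norm_v)


def translate(entity, relation):
    """Hypothetical 'transformed' embedding: shift an entity vector by a
    relation vector. This is an assumption for illustration only."""
    return [e + r for e, r in zip(entity, relation)]


# Made-up 3-dimensional embeddings (real models use far more dimensions).
inventor = [0.2, 0.7, 0.1]
assignee = [0.3, 0.6, 0.2]
relation = [0.1, -0.1, 0.1]  # hypothetical inventor-to-assignee relation

# Proximity between the raw embeddings of a heterogeneous pair.
proximity = cosine_similarity(inventor, assignee)

# Proximity after transforming the inventor embedding by the relation.
proximity_transformed = cosine_similarity(translate(inventor, relation), assignee)
```

Values close to 1 indicate strongly associated entities, and values near 0 indicate unrelated ones. In this toy data the relation vector was chosen so that the transformed inventor lands on the assignee, which is why the second value is higher than the first.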

Similar documents (content)

  1. Yan, B.; Luo, J.: Measuring technological distance for patent mapping (2017) 0.30
    0.29845 = sum of:
      0.29845 = product of:
        0.9326562 = sum of:
          0.07469033 = weight(abstract_txt:inventor in 4351) [ClassicSimilarity], result of:
            0.07469033 = score(doc=4351,freq=1.0), product of:
              0.13384046 = queryWeight, product of:
                1.165891 = boost
                8.928879 = idf(docFreq=15, maxDocs=44421)
                0.012856789 = queryNorm
              0.5580549 = fieldWeight in 4351, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.928879 = idf(docFreq=15, maxDocs=44421)
                0.0625 = fieldNorm(doc=4351)
          0.018389048 = weight(abstract_txt:using in 4351) [ClassicSimilarity], result of:
            0.018389048 = score(doc=4351,freq=2.0), product of:
              0.060184006 = queryWeight, product of:
                1.3541458 = boost
                3.4568708 = idf(docFreq=3806, maxDocs=44421)
                0.012856789 = queryNorm
              0.3055471 = fieldWeight in 4351, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4568708 = idf(docFreq=3806, maxDocs=44421)
                0.0625 = fieldNorm(doc=4351)
          0.013008951 = weight(abstract_txt:between in 4351) [ClassicSimilarity], result of:
            0.013008951 = score(doc=4351,freq=1.0), product of:
              0.060202304 = queryWeight, product of:
                1.3543516 = boost
                3.4573963 = idf(docFreq=3804, maxDocs=44421)
                0.012856789 = queryNorm
              0.21608727 = fieldWeight in 4351, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4573963 = idf(docFreq=3804, maxDocs=44421)
                0.0625 = fieldNorm(doc=4351)
          0.03750874 = weight(abstract_txt:base in 4351) [ClassicSimilarity], result of:
            0.03750874 = score(doc=4351,freq=1.0), product of:
              0.10653921 = queryWeight, product of:
                1.4710723 = boost
                5.633042 = idf(docFreq=431, maxDocs=44421)
                0.012856789 = queryNorm
              0.35206512 = fieldWeight in 4351, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.633042 = idf(docFreq=431, maxDocs=44421)
                0.0625 = fieldNorm(doc=4351)
          0.041811347 = weight(abstract_txt:structural in 4351) [ClassicSimilarity], result of:
            0.041811347 = score(doc=4351,freq=1.0), product of:
              0.11453827 = queryWeight, product of:
                1.5252975 = boost
                5.8406816 = idf(docFreq=350, maxDocs=44421)
                0.012856789 = queryNorm
              0.3650426 = fieldWeight in 4351, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8406816 = idf(docFreq=350, maxDocs=44421)
                0.0625 = fieldNorm(doc=4351)
          0.03743898 = weight(abstract_txt:knowledge in 4351) [ClassicSimilarity], result of:
            0.03743898 = score(doc=4351,freq=1.0), product of:
              0.16891071 = queryWeight, product of:
                3.7045703 = boost
                3.5463927 = idf(docFreq=3480, maxDocs=44421)
                0.012856789 = queryNorm
              0.22164954 = fieldWeight in 4351, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.5463927 = idf(docFreq=3480, maxDocs=44421)
                0.0625 = fieldNorm(doc=4351)
          0.15748923 = weight(abstract_txt:proximity in 4351) [ClassicSimilarity], result of:
            0.15748923 = score(doc=4351,freq=1.0), product of:
              0.34935346 = queryWeight, product of:
                3.7672703 = boost
                7.212831 = idf(docFreq=88, maxDocs=44421)
                0.012856789 = queryNorm
              0.45080194 = fieldWeight in 4351, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.212831 = idf(docFreq=88, maxDocs=44421)
                0.0625 = fieldNorm(doc=4351)
          0.5523196 = weight(abstract_txt:patent in 4351) [ClassicSimilarity], result of:
            0.5523196 = score(doc=4351,freq=10.0), product of:
              0.40320706 = queryWeight, product of:
                4.5249453 = boost
                6.930783 = idf(docFreq=117, maxDocs=44421)
                0.012856789 = queryNorm
              1.3698162 = fieldWeight in 4351, product of:
                3.1622777 = tf(freq=10.0), with freq of:
                  10.0 = termFreq=10.0
                6.930783 = idf(docFreq=117, maxDocs=44421)
                0.0625 = fieldNorm(doc=4351)
        0.32 = coord(8/25)
    
  2. Jiang, S.; Gao, Q.; Chen, H.; Roco, M.C.: The roles of sharing, transfer, and public funding in nanotechnology knowledge-diffusion networks (2015) 0.23
    0.23295112 = sum of:
      0.23295112 = product of:
        0.9706297 = sum of:
          0.093362905 = weight(abstract_txt:inventor in 2823) [ClassicSimilarity], result of:
            0.093362905 = score(doc=2823,freq=1.0), product of:
              0.13384046 = queryWeight, product of:
                1.165891 = boost
                8.928879 = idf(docFreq=15, maxDocs=44421)
                0.012856789 = queryNorm
              0.69756866 = fieldWeight in 2823, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.928879 = idf(docFreq=15, maxDocs=44421)
                0.078125 = fieldNorm(doc=2823)
          0.025919152 = weight(abstract_txt:models in 2823) [ClassicSimilarity], result of:
            0.025919152 = score(doc=2823,freq=1.0), product of:
              0.07176208 = queryWeight, product of:
                1.2073321 = boost
                4.623126 = idf(docFreq=1185, maxDocs=44421)
                0.012856789 = queryNorm
              0.36118174 = fieldWeight in 2823, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.623126 = idf(docFreq=1185, maxDocs=44421)
                0.078125 = fieldNorm(doc=2823)
          0.30800465 = weight(abstract_txt:inventors in 2823) [ClassicSimilarity], result of:
            0.30800465 = score(doc=2823,freq=2.0), product of:
              0.29660332 = queryWeight, product of:
                2.4545238 = boost
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.012856789 = queryNorm
              1.0384396 = fieldWeight in 2823, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.078125 = fieldNorm(doc=2823)
          0.110769145 = weight(abstract_txt:graph in 2823) [ClassicSimilarity], result of:
            0.110769145 = score(doc=2823,freq=1.0), product of:
              0.21633366 = queryWeight, product of:
                2.5673609 = boost
                6.553973 = idf(docFreq=171, maxDocs=44421)
                0.012856789 = queryNorm
              0.5120292 = fieldWeight in 2823, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.553973 = idf(docFreq=171, maxDocs=44421)
                0.078125 = fieldNorm(doc=2823)
          0.123817794 = weight(abstract_txt:knowledge in 2823) [ClassicSimilarity], result of:
            0.123817794 = score(doc=2823,freq=7.0), product of:
              0.16891071 = queryWeight, product of:
                3.7045703 = boost
                3.5463927 = idf(docFreq=3480, maxDocs=44421)
                0.012856789 = queryNorm
              0.73303694 = fieldWeight in 2823, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                3.5463927 = idf(docFreq=3480, maxDocs=44421)
                0.078125 = fieldNorm(doc=2823)
          0.30875602 = weight(abstract_txt:patent in 2823) [ClassicSimilarity], result of:
            0.30875602 = score(doc=2823,freq=2.0), product of:
              0.40320706 = queryWeight, product of:
                4.5249453 = boost
                6.930783 = idf(docFreq=117, maxDocs=44421)
                0.012856789 = queryNorm
              0.7657505 = fieldWeight in 2823, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.930783 = idf(docFreq=117, maxDocs=44421)
                0.078125 = fieldNorm(doc=2823)
        0.24 = coord(6/25)
    
  3. Zhu, Y.; Quan, L.; Chen, P.-Y.; Kim, M.C.; Che, C.: Predicting coauthorship using bibliographic network embedding (2023) 0.18
    0.17599645 = sum of:
      0.17599645 = product of:
        0.87998223 = sum of:
          0.013003021 = weight(abstract_txt:using in 1918) [ClassicSimilarity], result of:
            0.013003021 = score(doc=1918,freq=1.0), product of:
              0.060184006 = queryWeight, product of:
                1.3541458 = boost
                3.4568708 = idf(docFreq=3806, maxDocs=44421)
                0.012856789 = queryNorm
              0.21605442 = fieldWeight in 1918, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4568708 = idf(docFreq=3806, maxDocs=44421)
                0.0625 = fieldNorm(doc=1918)
          0.08861531 = weight(abstract_txt:graph in 1918) [ClassicSimilarity], result of:
            0.08861531 = score(doc=1918,freq=1.0), product of:
              0.21633366 = queryWeight, product of:
                2.5673609 = boost
                6.553973 = idf(docFreq=171, maxDocs=44421)
                0.012856789 = queryNorm
              0.40962332 = fieldWeight in 1918, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.553973 = idf(docFreq=171, maxDocs=44421)
                0.0625 = fieldNorm(doc=1918)
          0.30468622 = weight(abstract_txt:embedding in 1918) [ClassicSimilarity], result of:
            0.30468622 = score(doc=1918,freq=4.0), product of:
              0.31045607 = queryWeight, product of:
                3.0755653 = boost
                7.85132 = idf(docFreq=46, maxDocs=44421)
                0.012856789 = queryNorm
              0.981415 = fieldWeight in 1918, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.85132 = idf(docFreq=46, maxDocs=44421)
                0.0625 = fieldNorm(doc=1918)
          0.3696056 = weight(abstract_txt:embeddings in 1918) [ClassicSimilarity], result of:
            0.3696056 = score(doc=1918,freq=2.0), product of:
              0.444905 = queryWeight, product of:
                3.6817858 = boost
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.012856789 = queryNorm
              0.8307517 = fieldWeight in 1918, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.0625 = fieldNorm(doc=1918)
          0.10407208 = weight(abstract_txt:entities in 1918) [ClassicSimilarity], result of:
            0.10407208 = score(doc=1918,freq=1.0), product of:
              0.28551176 = queryWeight, product of:
                3.807687 = boost
                5.8321705 = idf(docFreq=353, maxDocs=44421)
                0.012856789 = queryNorm
              0.36451066 = fieldWeight in 1918, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8321705 = idf(docFreq=353, maxDocs=44421)
                0.0625 = fieldNorm(doc=1918)
        0.2 = coord(5/25)
    
  4. Li, R.; Chambers, T.; Ding, Y.; Zhang, G.; Meng, L.: Patent citation analysis : calculating science linkage based on citing motivation (2014) 0.15
    0.14738825 = sum of:
      0.14738825 = product of:
        0.92117655 = sum of:
          0.14938065 = weight(abstract_txt:inventor in 2257) [ClassicSimilarity], result of:
            0.14938065 = score(doc=2257,freq=4.0), product of:
              0.13384046 = queryWeight, product of:
                1.165891 = boost
                8.928879 = idf(docFreq=15, maxDocs=44421)
                0.012856789 = queryNorm
              1.1161098 = fieldWeight in 2257, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                8.928879 = idf(docFreq=15, maxDocs=44421)
                0.0625 = fieldNorm(doc=2257)
          0.031382553 = weight(abstract_txt:domain in 2257) [ClassicSimilarity], result of:
            0.031382553 = score(doc=2257,freq=2.0), product of:
              0.075082146 = queryWeight, product of:
                1.2349449 = boost
                4.7288613 = idf(docFreq=1066, maxDocs=44421)
                0.012856789 = queryNorm
              0.41797623 = fieldWeight in 2257, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.7288613 = idf(docFreq=1066, maxDocs=44421)
                0.0625 = fieldNorm(doc=2257)
          0.24640372 = weight(abstract_txt:inventors in 2257) [ClassicSimilarity], result of:
            0.24640372 = score(doc=2257,freq=2.0), product of:
              0.29660332 = queryWeight, product of:
                2.4545238 = boost
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.012856789 = queryNorm
              0.8307517 = fieldWeight in 2257, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.0625 = fieldNorm(doc=2257)
          0.49400964 = weight(abstract_txt:patent in 2257) [ClassicSimilarity], result of:
            0.49400964 = score(doc=2257,freq=8.0), product of:
              0.40320706 = queryWeight, product of:
                4.5249453 = boost
                6.930783 = idf(docFreq=117, maxDocs=44421)
                0.012856789 = queryNorm
              1.2252009 = fieldWeight in 2257, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                6.930783 = idf(docFreq=117, maxDocs=44421)
                0.0625 = fieldNorm(doc=2257)
        0.16 = coord(4/25)
    
  5. Yan, B.; Luo, J.: Filtering patent maps for visualization of diversification paths of inventors and organizations (2017) 0.13
    0.13320526 = sum of:
      0.13320526 = product of:
        0.83253294 = sum of:
          0.10562807 = weight(abstract_txt:inventor in 4651) [ClassicSimilarity], result of:
            0.10562807 = score(doc=4651,freq=2.0), product of:
              0.13384046 = queryWeight, product of:
                1.165891 = boost
                8.928879 = idf(docFreq=15, maxDocs=44421)
                0.012856789 = queryNorm
              0.7892088 = fieldWeight in 4651, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.928879 = idf(docFreq=15, maxDocs=44421)
                0.0625 = fieldNorm(doc=4651)
          0.018397436 = weight(abstract_txt:between in 4651) [ClassicSimilarity], result of:
            0.018397436 = score(doc=4651,freq=2.0), product of:
              0.060202304 = queryWeight, product of:
                1.3543516 = boost
                3.4573963 = idf(docFreq=3804, maxDocs=44421)
                0.012856789 = queryNorm
              0.30559355 = fieldWeight in 4651, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4573963 = idf(docFreq=3804, maxDocs=44421)
                0.0625 = fieldNorm(doc=4651)
          0.24640372 = weight(abstract_txt:inventors in 4651) [ClassicSimilarity], result of:
            0.24640372 = score(doc=4651,freq=2.0), product of:
              0.29660332 = queryWeight, product of:
                2.4545238 = boost
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.012856789 = queryNorm
              0.8307517 = fieldWeight in 4651, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.0625 = fieldNorm(doc=4651)
          0.46210372 = weight(abstract_txt:patent in 4651) [ClassicSimilarity], result of:
            0.46210372 = score(doc=4651,freq=7.0), product of:
              0.40320706 = queryWeight, product of:
                4.5249453 = boost
                6.930783 = idf(docFreq=117, maxDocs=44421)
                0.012856789 = queryNorm
              1.1460705 = fieldWeight in 4651, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                6.930783 = idf(docFreq=117, maxDocs=44421)
                0.0625 = fieldNorm(doc=4651)
        0.16 = coord(4/25)
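
The score explanations above follow Lucene's ClassicSimilarity (TF-IDF) arithmetic. Each term score is queryWeight (boost × idf × queryNorm) multiplied by fieldWeight (tf × idf × fieldNorm, where tf is the square root of the term frequency), and the document score is the sum of the term scores multiplied by the coord factor. As a check, the Python sketch below recomputes the score of the first hit (doc 4351) from the values listed in its explanation. The function and variable names are illustrative, and small differences in the last digits are to be expected because Lucene computes in single precision.

```python
import math

QUERY_NORM = 0.012856789  # queryNorm, shared by all terms of the query


def term_score(freq, boost, idf, field_norm, query_norm=QUERY_NORM):
    """Score of one query term in one document (ClassicSimilarity)."""
    query_weight = boost * idf * query_norm
    field_weight = math.sqrt(freq) * idf * field_norm
    return query_weight * field_weight


def document_score(terms, field_norm, matched, total_terms):
    """Sum the term scores and scale by coord = matched / total_terms."""
    coord = matched / total_terms
    return coord * sum(
        term_score(freq, boost, idf, field_norm) for freq, boost, idf in terms
    )


# (freq, boost, idf) for the eight matching terms of doc 4351,
# copied from the explanation of hit 1.
TERMS_DOC_4351 = [
    (1.0, 1.165891, 8.928879),    # inventor
    (2.0, 1.3541458, 3.4568708),  # using
    (1.0, 1.3543516, 3.4573963),  # between
    (1.0, 1.4710723, 5.633042),   # base
    (1.0, 1.5252975, 5.8406816),  # structural
    (1.0, 3.7045703, 3.5463927),  # knowledge
    (1.0, 3.7672703, 7.212831),   # proximity
    (10.0, 4.5249453, 6.930783),  # patent
]

# fieldNorm(doc=4351) = 0.0625; coord(8/25) as listed for hit 1.
score = document_score(TERMS_DOC_4351, field_norm=0.0625, matched=8, total_terms=25)
print(round(score, 5))  # the explanation for hit 1 lists 0.29845
```

The same two functions apply to the other hits once their own (freq, boost, idf) triples, fieldNorm, and coord values are substituted.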