Document (#44056)

Author
Safder, I.
Ali, M.
Aljohani, N.R.
Nawaz, R.
Hassan, S.-U.
Title
Neural machine translation for in-text citation classification
Source
Journal of the Association for Information Science and Technology. 74(2023) no.10, S.1229-1240
Year
2023
Abstract
The quality of scientific publications can be measured by quantitative indices such as the h-index, Source Normalized Impact per Paper, or g-index. However, these measures lack to explain the function or reasons for citations and the context of citations from citing publication to cited publication. We argue that citation context may be considered while calculating the impact of research work. However, mining citation context from unstructured full-text publications is a challenging task. In this paper, we compiled a data set comprising 9,518 citations context. We developed a deep learning-based architecture for citation context classification. Unlike feature-based state-of-the-art models, our proposed focal-loss and class-weight-aware BiLSTM model with pretrained GloVe embedding vectors use citation context as input to outperform them in multiclass citation context classification tasks. Our model improves on the baseline state-of-the-art by achieving an F1 score of 0.80 with an accuracy of 0.81 for citation context classification. Moreover, we delve into the effects of using different word embeddings on the performance of the classification model and draw a comparison between fastText, GloVe, and spaCy pretrained word embeddings.
Content
Vgl.: https://asistdl.onlinelibrary.wiley.com/doi/10.1002/asi.24817.
Theme
Citation indexing

Similar documents (author)

  1. Hassan, E.: Simultaneous mapping of interactions between scientific and technological knowledge bases : the case of space communications (2003) 5.94
    5.9401517 = sum of:
      5.9401517 = weight(author_txt:hassan in 2472) [ClassicSimilarity], result of:
        5.9401517 = fieldWeight in 2472, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.504243 = idf(docFreq=8, maxDocs=44421)
          0.625 = fieldNorm(doc=2472)
    
  2. Ibrahim, N. Hassan -> Hassan Ibrahim, N.: 5.04
    5.0403857 = sum of:
      5.0403857 = weight(author_txt:hassan in 1270) [ClassicSimilarity], result of:
        5.0403857 = fieldWeight in 1270, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          9.504243 = idf(docFreq=8, maxDocs=44421)
          0.375 = fieldNorm(doc=1270)
    
  3. Hassan, N.R.; Serenko, A.: Patterns of citations for the growth of knowledge : a Foucauldian perspective (2019) 4.75
    4.7521214 = sum of:
      4.7521214 = weight(author_txt:hassan in 284) [ClassicSimilarity], result of:
        4.7521214 = fieldWeight in 284, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.504243 = idf(docFreq=8, maxDocs=44421)
          0.5 = fieldNorm(doc=284)
    
  4. Mat-Hassan, M.; Levene, M.: Associating search and navigation behavior through log analysis (2005) 4.16
    4.1581063 = sum of:
      4.1581063 = weight(author_txt:hassan in 4681) [ClassicSimilarity], result of:
        4.1581063 = fieldWeight in 4681, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.504243 = idf(docFreq=8, maxDocs=44421)
          0.4375 = fieldNorm(doc=4681)
    
  5. Hassan Ibrahim, N.; Allen, D.: Information sharing and trust during major incidents : findings from the oil industry (2012) 4.16
    4.1581063 = sum of:
      4.1581063 = weight(author_txt:hassan in 1450) [ClassicSimilarity], result of:
        4.1581063 = fieldWeight in 1450, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.504243 = idf(docFreq=8, maxDocs=44421)
          0.4375 = fieldNorm(doc=1450)
    

Similar documents (content)

  1. Järvelin, K.; Persson, O.: ¬The DCI index : discounted cumulated impact-based research evaluation (2008) 0.18
    0.18078189 = sum of:
      0.18078189 = product of:
        0.6456496 = sum of:
          0.057199124 = weight(abstract_txt:weight in 3694) [ClassicSimilarity], result of:
            0.057199124 = score(doc=3694,freq=1.0), product of:
              0.12371721 = queryWeight, product of:
                1.0106512 = boost
                7.3974023 = idf(docFreq=73, maxDocs=44421)
                0.016548155 = queryNorm
              0.46233764 = fieldWeight in 3694, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.3974023 = idf(docFreq=73, maxDocs=44421)
                0.0625 = fieldNorm(doc=3694)
          0.0209932 = weight(abstract_txt:however in 3694) [ClassicSimilarity], result of:
            0.0209932 = score(doc=3694,freq=1.0), product of:
              0.07990359 = queryWeight, product of:
                1.1486412 = boost
                4.203706 = idf(docFreq=1803, maxDocs=44421)
                0.016548155 = queryNorm
              0.2627316 = fieldWeight in 3694, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.203706 = idf(docFreq=1803, maxDocs=44421)
                0.0625 = fieldNorm(doc=3694)
          0.02702572 = weight(abstract_txt:impact in 3694) [ClassicSimilarity], result of:
            0.02702572 = score(doc=3694,freq=1.0), product of:
              0.0945581 = queryWeight, product of:
                1.2495413 = boost
                4.572972 = idf(docFreq=1246, maxDocs=44421)
                0.016548155 = queryNorm
              0.28581074 = fieldWeight in 3694, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.572972 = idf(docFreq=1246, maxDocs=44421)
                0.0625 = fieldNorm(doc=3694)
          0.06771174 = weight(abstract_txt:index in 3694) [ClassicSimilarity], result of:
            0.06771174 = score(doc=3694,freq=5.0), product of:
              0.102007754 = queryWeight, product of:
                1.2978301 = boost
                4.7496953 = idf(docFreq=1044, maxDocs=44421)
                0.016548155 = queryNorm
              0.6637901 = fieldWeight in 3694, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.7496953 = idf(docFreq=1044, maxDocs=44421)
                0.0625 = fieldNorm(doc=3694)
          0.11288237 = weight(abstract_txt:publication in 3694) [ClassicSimilarity], result of:
            0.11288237 = score(doc=3694,freq=7.0), product of:
              0.12820312 = queryWeight, product of:
                1.4549583 = boost
                5.324741 = idf(docFreq=587, maxDocs=44421)
                0.016548155 = queryNorm
              0.88049626 = fieldWeight in 3694, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                5.324741 = idf(docFreq=587, maxDocs=44421)
                0.0625 = fieldNorm(doc=3694)
          0.12848946 = weight(abstract_txt:citations in 3694) [ClassicSimilarity], result of:
            0.12848946 = score(doc=3694,freq=4.0), product of:
              0.19279805 = queryWeight, product of:
                2.1852353 = boost
                5.331567 = idf(docFreq=583, maxDocs=44421)
                0.016548155 = queryNorm
              0.66644585 = fieldWeight in 3694, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.331567 = idf(docFreq=583, maxDocs=44421)
                0.0625 = fieldNorm(doc=3694)
          0.23134798 = weight(abstract_txt:citation in 3694) [ClassicSimilarity], result of:
            0.23134798 = score(doc=3694,freq=4.0), product of:
              0.37846613 = queryWeight, product of:
                4.6768 = boost
                4.890223 = idf(docFreq=907, maxDocs=44421)
                0.016548155 = queryNorm
              0.6112779 = fieldWeight in 3694, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.890223 = idf(docFreq=907, maxDocs=44421)
                0.0625 = fieldNorm(doc=3694)
        0.28 = coord(7/25)
    
  2. Soni, S.; Lerman, K.; Eisenstein, J.: Follow the leader : documents on the leading edge of semantic change get more citations (2021) 0.17
    0.17462407 = sum of:
      0.17462407 = product of:
        0.72760034 = sum of:
          0.018647054 = weight(abstract_txt:text in 1170) [ClassicSimilarity], result of:
            0.018647054 = score(doc=1170,freq=1.0), product of:
              0.0738336 = queryWeight, product of:
                1.1041505 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.016548155 = queryNorm
              0.25255513 = fieldWeight in 1170, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.0625 = fieldNorm(doc=1170)
          0.0209932 = weight(abstract_txt:however in 1170) [ClassicSimilarity], result of:
            0.0209932 = score(doc=1170,freq=1.0), product of:
              0.07990359 = queryWeight, product of:
                1.1486412 = boost
                4.203706 = idf(docFreq=1803, maxDocs=44421)
                0.016548155 = queryNorm
              0.2627316 = fieldWeight in 1170, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.203706 = idf(docFreq=1803, maxDocs=44421)
                0.0625 = fieldNorm(doc=1170)
          0.10162509 = weight(abstract_txt:word in 1170) [ClassicSimilarity], result of:
            0.10162509 = score(doc=1170,freq=5.0), product of:
              0.1337184 = queryWeight, product of:
                1.4859248 = boost
                5.4380693 = idf(docFreq=524, maxDocs=44421)
                0.016548155 = queryNorm
              0.7599933 = fieldWeight in 1170, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.4380693 = idf(docFreq=524, maxDocs=44421)
                0.0625 = fieldNorm(doc=1170)
          0.06424473 = weight(abstract_txt:citations in 1170) [ClassicSimilarity], result of:
            0.06424473 = score(doc=1170,freq=1.0), product of:
              0.19279805 = queryWeight, product of:
                2.1852353 = boost
                5.331567 = idf(docFreq=583, maxDocs=44421)
                0.016548155 = queryNorm
              0.33322293 = fieldWeight in 1170, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.331567 = idf(docFreq=583, maxDocs=44421)
                0.0625 = fieldNorm(doc=1170)
          0.40641624 = weight(abstract_txt:embeddings in 1170) [ClassicSimilarity], result of:
            0.40641624 = score(doc=1170,freq=3.0), product of:
              0.3994424 = queryWeight, product of:
                2.568197 = boost
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.016548155 = queryNorm
              1.0174589 = fieldWeight in 1170, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.0625 = fieldNorm(doc=1170)
          0.11567399 = weight(abstract_txt:citation in 1170) [ClassicSimilarity], result of:
            0.11567399 = score(doc=1170,freq=1.0), product of:
              0.37846613 = queryWeight, product of:
                4.6768 = boost
                4.890223 = idf(docFreq=907, maxDocs=44421)
                0.016548155 = queryNorm
              0.30563894 = fieldWeight in 1170, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.890223 = idf(docFreq=907, maxDocs=44421)
                0.0625 = fieldNorm(doc=1170)
        0.24 = coord(6/25)
    
  3. Li, G.; Siddharth, L.; Luo, J.: Embedding knowledge graph of patent metadata to measure knowledge proximity (2023) 0.17
    0.17153189 = sum of:
      0.17153189 = product of:
        0.8576594 = sum of:
          0.12089399 = weight(abstract_txt:embedding in 1921) [ClassicSimilarity], result of:
            0.12089399 = score(doc=1921,freq=2.0), product of:
              0.13936606 = queryWeight, product of:
                1.0726666 = boost
                7.85132 = idf(docFreq=46, maxDocs=44421)
                0.016548155 = queryNorm
              0.8674565 = fieldWeight in 1921, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.85132 = idf(docFreq=46, maxDocs=44421)
                0.078125 = fieldNorm(doc=1921)
          0.033476695 = weight(abstract_txt:model in 1921) [ClassicSimilarity], result of:
            0.033476695 = score(doc=1921,freq=1.0), product of:
              0.107588544 = queryWeight, product of:
                1.6324124 = boost
                3.9827821 = idf(docFreq=2249, maxDocs=44421)
                0.016548155 = queryNorm
              0.31115484 = fieldWeight in 1921, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9827821 = idf(docFreq=2249, maxDocs=44421)
                0.078125 = fieldNorm(doc=1921)
          0.08030591 = weight(abstract_txt:citations in 1921) [ClassicSimilarity], result of:
            0.08030591 = score(doc=1921,freq=1.0), product of:
              0.19279805 = queryWeight, product of:
                2.1852353 = boost
                5.331567 = idf(docFreq=583, maxDocs=44421)
                0.016548155 = queryNorm
              0.41652864 = fieldWeight in 1921, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.331567 = idf(docFreq=583, maxDocs=44421)
                0.078125 = fieldNorm(doc=1921)
          0.5080203 = weight(abstract_txt:embeddings in 1921) [ClassicSimilarity], result of:
            0.5080203 = score(doc=1921,freq=3.0), product of:
              0.3994424 = queryWeight, product of:
                2.568197 = boost
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.016548155 = queryNorm
              1.2718236 = fieldWeight in 1921, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.078125 = fieldNorm(doc=1921)
          0.1149625 = weight(abstract_txt:context in 1921) [ClassicSimilarity], result of:
            0.1149625 = score(doc=1921,freq=1.0), product of:
              0.33959764 = queryWeight, product of:
                4.7360206 = boost
                4.333128 = idf(docFreq=1584, maxDocs=44421)
                0.016548155 = queryNorm
              0.33852562 = fieldWeight in 1921, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.333128 = idf(docFreq=1584, maxDocs=44421)
                0.078125 = fieldNorm(doc=1921)
        0.2 = coord(5/25)
    
  4. Gorraiz, J.; Purnell, P.J.; Glänzel, W.: Opportunities for and limitations of the Book Citation Index (2013) 0.17
    0.16744125 = sum of:
      0.16744125 = product of:
        0.59800446 = sum of:
          0.0209932 = weight(abstract_txt:however in 1966) [ClassicSimilarity], result of:
            0.0209932 = score(doc=1966,freq=1.0), product of:
              0.07990359 = queryWeight, product of:
                1.1486412 = boost
                4.203706 = idf(docFreq=1803, maxDocs=44421)
                0.016548155 = queryNorm
              0.2627316 = fieldWeight in 1966, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.203706 = idf(docFreq=1803, maxDocs=44421)
                0.0625 = fieldNorm(doc=1966)
          0.042824663 = weight(abstract_txt:index in 1966) [ClassicSimilarity], result of:
            0.042824663 = score(doc=1966,freq=2.0), product of:
              0.102007754 = queryWeight, product of:
                1.2978301 = boost
                4.7496953 = idf(docFreq=1044, maxDocs=44421)
                0.016548155 = queryNorm
              0.41981772 = fieldWeight in 1966, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.7496953 = idf(docFreq=1044, maxDocs=44421)
                0.0625 = fieldNorm(doc=1966)
          0.07125717 = weight(abstract_txt:publications in 1966) [ClassicSimilarity], result of:
            0.07125717 = score(doc=1966,freq=3.0), product of:
              0.12512934 = queryWeight, product of:
                1.4374106 = boost
                5.260521 = idf(docFreq=626, maxDocs=44421)
                0.016548155 = queryNorm
              0.5694681 = fieldWeight in 1966, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.260521 = idf(docFreq=626, maxDocs=44421)
                0.0625 = fieldNorm(doc=1966)
          0.042665526 = weight(abstract_txt:publication in 1966) [ClassicSimilarity], result of:
            0.042665526 = score(doc=1966,freq=1.0), product of:
              0.12820312 = queryWeight, product of:
                1.4549583 = boost
                5.324741 = idf(docFreq=587, maxDocs=44421)
                0.016548155 = queryNorm
              0.3327963 = fieldWeight in 1966, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.324741 = idf(docFreq=587, maxDocs=44421)
                0.0625 = fieldNorm(doc=1966)
          0.04495161 = weight(abstract_txt:classification in 1966) [ClassicSimilarity], result of:
            0.04495161 = score(doc=1966,freq=1.0), product of:
              0.18015958 = queryWeight, product of:
                2.727093 = boost
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.016548155 = queryNorm
              0.24950996 = fieldWeight in 1966, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.0625 = fieldNorm(doc=1966)
          0.28334227 = weight(abstract_txt:citation in 1966) [ClassicSimilarity], result of:
            0.28334227 = score(doc=1966,freq=6.0), product of:
              0.37846613 = queryWeight, product of:
                4.6768 = boost
                4.890223 = idf(docFreq=907, maxDocs=44421)
                0.016548155 = queryNorm
              0.7486595 = fieldWeight in 1966, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                4.890223 = idf(docFreq=907, maxDocs=44421)
                0.0625 = fieldNorm(doc=1966)
          0.091970004 = weight(abstract_txt:context in 1966) [ClassicSimilarity], result of:
            0.091970004 = score(doc=1966,freq=1.0), product of:
              0.33959764 = queryWeight, product of:
                4.7360206 = boost
                4.333128 = idf(docFreq=1584, maxDocs=44421)
                0.016548155 = queryNorm
              0.2708205 = fieldWeight in 1966, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.333128 = idf(docFreq=1584, maxDocs=44421)
                0.0625 = fieldNorm(doc=1966)
        0.28 = coord(7/25)
    
  5. Hooydonk, G. Van: Standardizing relative impacts : estimating the quality of research from citation counts (1998) 0.17
    0.16688266 = sum of:
      0.16688266 = product of:
        0.69534445 = sum of:
          0.093539655 = weight(abstract_txt:calculating in 2791) [ClassicSimilarity], result of:
            0.093539655 = score(doc=2791,freq=1.0), product of:
              0.14798841 = queryWeight, product of:
                1.1053507 = boost
                8.090549 = idf(docFreq=36, maxDocs=44421)
                0.016548155 = queryNorm
              0.6320742 = fieldWeight in 2791, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.090549 = idf(docFreq=36, maxDocs=44421)
                0.078125 = fieldNorm(doc=2791)
          0.04777517 = weight(abstract_txt:impact in 2791) [ClassicSimilarity], result of:
            0.04777517 = score(doc=2791,freq=2.0), product of:
              0.0945581 = queryWeight, product of:
                1.2495413 = boost
                4.572972 = idf(docFreq=1246, maxDocs=44421)
                0.016548155 = queryNorm
              0.50524676 = fieldWeight in 2791, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.572972 = idf(docFreq=1246, maxDocs=44421)
                0.078125 = fieldNorm(doc=2791)
          0.08907145 = weight(abstract_txt:publications in 2791) [ClassicSimilarity], result of:
            0.08907145 = score(doc=2791,freq=3.0), product of:
              0.12512934 = queryWeight, product of:
                1.4374106 = boost
                5.260521 = idf(docFreq=626, maxDocs=44421)
                0.016548155 = queryNorm
              0.7118351 = fieldWeight in 2791, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.260521 = idf(docFreq=626, maxDocs=44421)
                0.078125 = fieldNorm(doc=2791)
          0.075422704 = weight(abstract_txt:publication in 2791) [ClassicSimilarity], result of:
            0.075422704 = score(doc=2791,freq=2.0), product of:
              0.12820312 = queryWeight, product of:
                1.4549583 = boost
                5.324741 = idf(docFreq=587, maxDocs=44421)
                0.016548155 = queryNorm
              0.5883063 = fieldWeight in 2791, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.324741 = idf(docFreq=587, maxDocs=44421)
                0.078125 = fieldNorm(doc=2791)
          0.13909392 = weight(abstract_txt:citations in 2791) [ClassicSimilarity], result of:
            0.13909392 = score(doc=2791,freq=3.0), product of:
              0.19279805 = queryWeight, product of:
                2.1852353 = boost
                5.331567 = idf(docFreq=583, maxDocs=44421)
                0.016548155 = queryNorm
              0.7214488 = fieldWeight in 2791, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.331567 = idf(docFreq=583, maxDocs=44421)
                0.078125 = fieldNorm(doc=2791)
          0.25044152 = weight(abstract_txt:citation in 2791) [ClassicSimilarity], result of:
            0.25044152 = score(doc=2791,freq=3.0), product of:
              0.37846613 = queryWeight, product of:
                4.6768 = boost
                4.890223 = idf(docFreq=907, maxDocs=44421)
                0.016548155 = queryNorm
              0.6617277 = fieldWeight in 2791, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.890223 = idf(docFreq=907, maxDocs=44421)
                0.078125 = fieldNorm(doc=2791)
        0.24 = coord(6/25)