Document (#41576)

Author
Li, J.
Sun, A.
Xing, Z.
Title
To do or not to do : distill crowdsourced negative caveats to augment api documentation
Source
Journal of the Association for Information Science and Technology. 69(2018) no.12, S.1460-1475
Year
2018
Abstract
Negative caveats of application programming interfaces (APIs) are about "how not to use an API," which are often absent from the official API documentation. When these caveats are overlooked, programming errors may emerge from misusing APIs, leading to heavy discussions on Q&A websites like Stack Overflow. If the overlooked caveats could be mined from these discussions, they would be beneficial for programmers to avoid misuse of APIs. However, it is challenging because the discussions are informal, redundant, and diverse. For this, for example, we propose Disca, a novel approach for automatically Distilling desirable API negative caveats from unstructured Q&A discussions. Through sentence selection and prominent term clustering, Disca ensures that distilled caveats are context-independent, prominent, semantically diverse, and nonredundant. Quantitative evaluation in our experiments shows that the proposed Disca significantly outperforms four text-summarization techniques. We also show that the distilled API negative caveats could greatly augment API documentation through qualitative analysis.
Content
https://onlinelibrary.wiley.com/doi/10.1002/asi.24067.

Similar documents (content)

  1. Tang, L.; Hu, G.; Liu, W.: Funding acknowledgment analysis : queries and caveats (2017) 0.24
    0.23607752 = sum of:
      0.23607752 = product of:
        2.950969 = sum of:
          0.011088319 = weight(abstract_txt:from in 4442) [ClassicSimilarity], result of:
            0.011088319 = score(doc=4442,freq=1.0), product of:
              0.036739495 = queryWeight, product of:
                1.5053964 = boost
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.008844389 = queryNorm
              0.30180925 = fieldWeight in 4442, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.109375 = fieldNorm(doc=4442)
          2.9398806 = weight(title_txt:caveats in 4442) [ClassicSimilarity], result of:
            2.9398806 = score(doc=4442,freq=1.0), product of:
              0.80361193 = queryWeight, product of:
                9.313791 = boost
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.008844389 = queryNorm
              3.6583338 = fieldWeight in 4442, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.375 = fieldNorm(doc=4442)
        0.08 = coord(2/25)
    
  2. Leydesdorff, L.: Caveats for the use of citation indicators in research and journal evaluations (2008) 0.16
    0.15742727 = sum of:
      0.15742727 = product of:
        1.9678408 = sum of:
          0.007920229 = weight(abstract_txt:from in 2361) [ClassicSimilarity], result of:
            0.007920229 = score(doc=2361,freq=1.0), product of:
              0.036739495 = queryWeight, product of:
                1.5053964 = boost
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.008844389 = queryNorm
              0.21557805 = fieldWeight in 2361, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.078125 = fieldNorm(doc=2361)
          1.9599205 = weight(title_txt:caveats in 2361) [ClassicSimilarity], result of:
            1.9599205 = score(doc=2361,freq=1.0), product of:
              0.80361193 = queryWeight, product of:
                9.313791 = boost
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.008844389 = queryNorm
              2.4388893 = fieldWeight in 2361, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.25 = fieldNorm(doc=2361)
        0.08 = coord(2/25)
    
  3. Curran, G.L.: Inmagic: Kudos and caveats (1986) 0.16
    0.15679364 = sum of:
      0.15679364 = product of:
        3.919841 = sum of:
          3.919841 = weight(title_txt:caveats in 5600) [ClassicSimilarity], result of:
            3.919841 = score(doc=5600,freq=1.0), product of:
              0.80361193 = queryWeight, product of:
                9.313791 = boost
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.008844389 = queryNorm
              4.8777785 = fieldWeight in 5600, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.5 = fieldNorm(doc=5600)
        0.04 = coord(1/25)
    
  4. Schneider, K.G.: Cataloging Internet resources : concerns and caveats (1997) 0.12
    0.11759522 = sum of:
      0.11759522 = product of:
        2.9398806 = sum of:
          2.9398806 = weight(title_txt:caveats in 973) [ClassicSimilarity], result of:
            2.9398806 = score(doc=973,freq=1.0), product of:
              0.80361193 = queryWeight, product of:
                9.313791 = boost
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.008844389 = queryNorm
              3.6583338 = fieldWeight in 973, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.375 = fieldNorm(doc=973)
        0.04 = coord(1/25)
    
  5. McCain, K.W.: Assessing obliteration by incorporation : issues and caveats (2012) 0.12
    0.11759522 = sum of:
      0.11759522 = product of:
        2.9398806 = sum of:
          2.9398806 = weight(title_txt:caveats in 1485) [ClassicSimilarity], result of:
            2.9398806 = score(doc=1485,freq=1.0), product of:
              0.80361193 = queryWeight, product of:
                9.313791 = boost
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.008844389 = queryNorm
              3.6583338 = fieldWeight in 1485, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.375 = fieldNorm(doc=1485)
        0.04 = coord(1/25)