What Information about Code Snippets Is Available in Different Software-Related Documents? An Exploratory Study
Author : Chatterjee, Preetha; Nishi, Manziba Akanda; Damevski, Kostadin; Augustine, Vinay; Pollock, Lori; Kraft, Nicholas A.
Booktitle : IEEE 24th International Conference on Software Analysis, Evolution and Reengineering
Date : 2017
Publisher : SANER
Project :
Keywords:
Abstract :
A large corpus of software-related documents is available on the Web, and these documents offer the unique opportunity to learn from what developers are saying or asking about the code snippets that they are discussing. For example, the natural language in a bug report provides information about what is not functioning properly in a particular code snippet. Previous research has mined information about code snippets from bug reports, emails, and Q&A forums. This paper describes an exploratory study into the kinds of information that are embedded in different software-related documents. The goal of the study is to gain insight into the potential value and difficulty of mining the natural language text associated with the code snippets found in a variety of software-related documents, including blog posts, API documentation, code reviews, and public chats.