Improving Source Code Search with Natural Language Phrasal Representations of Method Signatures

Author : Hill, Emily; Pollock, Lori; Vijay-Shanker, K.
Date : Nov 2011
Publisher : IEEE
Journal : Proceedings of the 26th IEEE International Conference on Automated Software Engineering (ASE’11) (short paper)
Pages : (short paper)
Keyword(s) : code search, natural language program analysis, SWUM, concern location
Document Type : Article
BibTeX Entry : (show)

Abstract :

As software continues to grow, locating code for maintenance tasks becomes increasingly difficult. Software search tools help developers find source code relevant to their maintenance tasks. One major challenge to successful search tools is locating relevant code when the user’s query contains words with multiple meanings or words that occur frequently throughout the program. Traditional search techniques, which treat each word individually, are unable to distinguish relevant and irrelevant methods under these conditions. In this paper, we present a novel search technique that uses information such as the position of the query word and its semantic role to calculate relevance. Our evaluation shows that this approach is more consistently effective than three other state of the art search techniques.

Paper Link