publication . Conference object . Part of book or chapter of book . 2011

Improving the tokenisation of identifier names

Butler, Simon; Wermelinger, Michel; Yu, Yijun; Sharp, Helen
Open Access English
  • Published: 01 Jan 2011
  • Publisher: Springer Verlag
Abstract
Identifier names are the main vehicle for semantic information during program comprehension. For tool-supported program comprehension tasks, including concept location and requirements traceability, identifier names need to be tokenised into their semantic constituents. In this paper we present an approach to the automated tokenisation of identifier names that improves on existing techniques in two ways. First, it improves the tokenisation accuracy for single-case identifier names and for identifier names containing digits, which existing techniques largely ignore. Second, performance gains over existing techniques are achieved using smaller oracles, making the ...
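To make the problem concrete, the following is a minimal sketch of a conventional tokeniser that relies only on separator characters and case boundaries. It is not the approach presented in the paper; the function name `naive_tokenise` and the boundary rules are assumptions chosen for illustration. It shows why the two cases highlighted in the abstract, single-case identifier names and names containing digits, are not handled by such rules alone.

```python
import re
from typing import List

# Hypothetical boundary rules (an assumption for illustration, not the paper's method).
_BOUNDARY = re.compile(
    r"[_$]+"                      # explicit separators: max_value -> max | value
    r"|(?<=[a-z0-9])(?=[A-Z])"    # lower case or digit before upper case: fooBar -> foo | Bar
    r"|(?<=[A-Z])(?=[A-Z][a-z])"  # acronym before a capitalised word: HTMLParser -> HTML | Parser
)


def naive_tokenise(identifier: str) -> List[str]:
    """Split an identifier on separators and case boundaries, then lower-case the tokens."""
    return [token.lower() for token in _BOUNDARY.split(identifier) if token]


if __name__ == "__main__":
    for name in ("getHTMLParser", "max_value", "outputfilename", "int2string"):
        print(name, "->", naive_tokenise(name))
```

With these rules, `getHTMLParser` and `max_value` are split as expected, but `outputfilename` is returned as a single token because it contains no boundary markers, and `int2string` is left unsplit because the rules cannot tell whether the digit is a separator, part of a token, or a token of its own. Resolving such cases generally requires additional knowledge, such as a word list, which is where the oracles mentioned in the abstract come in.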
http://oro.open.ac.uk/25656/5/...
Part of book or chapter of book
Provider: UnpayWall
http://link.springer.com/conte...
Part of book or chapter of book . 2011
Provider: Crossref

