
arXiv: 1908.11685
Recurrent neural networks (RNNs), specifically long short-term memory (LSTM) networks, can model natural language effectively. This research investigates the ability of these same LSTMs to perform next-"word" prediction on the Java programming language. Java source code from four different repositories undergoes a transformation that preserves the logical structure of the code while removing specifics such as variable names and literal values. These datasets, along with an additional English-language corpus, are used to train and test standard LSTMs on predicting the next element in a sequence. Results suggest that LSTMs can effectively model Java code, achieving perplexities under 22 and accuracies above 0.47, an improvement over their performance on English, which yielded a perplexity of 85 and an accuracy of 0.27. This research has potential applications in other areas such as syntactic template suggestion and automated bug patching.
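The abstract describes two steps: abstracting Java source into structure-preserving token sequences (identifiers and literals replaced by placeholders) and training a standard LSTM for next-token prediction, evaluated by perplexity and accuracy. The sketch below is a minimal illustration under assumed choices, not the authors' implementation: the regex tokenizer, the keyword subset, and the names `abstract_tokens` and `NextTokenLSTM`, as well as all hyperparameters, are hypothetical, and PyTorch is used only as a convenient LSTM implementation.

```python
# Minimal sketch (assumptions, not the paper's code): abstract Java tokens,
# then train an LSTM next-token model and report perplexity and accuracy.
import math
import re
import torch
import torch.nn as nn

# Illustrative subset of Java keywords, not the full grammar.
KEYWORDS = {"public", "class", "static", "void", "int", "return",
            "if", "else", "for", "while", "new"}

def abstract_tokens(java_source: str):
    """Replace identifiers with <ID> and literals with <LIT>, keep structure."""
    raw = re.findall(r'"[^"]*"|\d+|\w+|[^\s\w]', java_source)
    out = []
    for tok in raw:
        if tok.startswith('"') or tok[0].isdigit():
            out.append("<LIT>")          # string or numeric literal
        elif tok in KEYWORDS or (not tok[0].isalpha() and tok[0] != "_"):
            out.append(tok)              # keywords and punctuation kept as-is
        else:
            out.append("<ID>")           # variable, method, or type name
    return out

class NextTokenLSTM(nn.Module):
    def __init__(self, vocab_size, embed_dim=64, hidden_dim=128):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, embed_dim)
        self.lstm = nn.LSTM(embed_dim, hidden_dim, batch_first=True)
        self.proj = nn.Linear(hidden_dim, vocab_size)

    def forward(self, x):
        h, _ = self.lstm(self.embed(x))
        return self.proj(h)              # logits for every position

# Toy corpus: one abstracted statement repeated, just to exercise the loop.
tokens = abstract_tokens('int total = items + 1 ; return total ;') * 50
vocab = {t: i for i, t in enumerate(sorted(set(tokens)))}
ids = torch.tensor([[vocab[t] for t in tokens]])

model = NextTokenLSTM(len(vocab))
optim = torch.optim.Adam(model.parameters(), lr=1e-2)
loss_fn = nn.CrossEntropyLoss()

for _ in range(30):                      # brief training loop
    logits = model(ids[:, :-1])
    loss = loss_fn(logits.reshape(-1, len(vocab)), ids[:, 1:].reshape(-1))
    optim.zero_grad(); loss.backward(); optim.step()

with torch.no_grad():
    logits = model(ids[:, :-1])
    loss = loss_fn(logits.reshape(-1, len(vocab)), ids[:, 1:].reshape(-1))
    acc = (logits.argmax(-1) == ids[:, 1:]).float().mean()
print(f"perplexity={math.exp(loss.item()):.1f}  accuracy={acc.item():.2f}")
```

Perplexity here is the exponential of the average cross-entropy over next-token predictions, matching the metrics quoted in the abstract; on this toy corpus the numbers are meaningless and serve only to show the evaluation mechanics.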
9 pages, 2 figures
Software Engineering (cs.SE), Computation and Language (cs.CL), Programming Languages (cs.PL), FOS: Computer and information sciences
| Indicator | Description | Value |
| --- | --- | --- |
| Selected citations | Citations derived from selected sources; an alternative to the "Influence" indicator. | 2 |
| Popularity | Reflects the "current" impact/attention (the "hype") of the article in the research community at large, based on the underlying citation network. | Average |
| Influence | Reflects the overall/total impact of the article in the research community at large, based on the underlying citation network (diachronically). | Average |
| Impulse | Reflects the initial momentum of the article directly after its publication, based on the underlying citation network. | Average |
