Optimizing Dense Feed-Forward Neural Networks

descriptionPublicationkeyboard_double_arrow_right Article , Preprint 01 Jan 2023Embargo end date: 01 Jan 2023 Spain Publisher:Elsevier BVJournal:Neural Networks, volume 171, pages 229-241 (issn: 0893-6080,

Copyright policy )

Authors: Luis Balderas; Miguel Lastra; José Manuel Benítez 0001;

doi: 10.2139/ssrn.4481770 , 10.1016/j.neunet.2023.12.015 , 10.48550/arxiv.2312.10560

pmid: 38101291

arXiv: 2312.10560

handle: 10481/98920

Optimizing Dense Feed-Forward Neural Networks

- Summary
- Subjects
- Metrics

Abstract

Deep learning models have been widely used during the last decade due to their outstanding learning and abstraction capacities. However, one of the main challenges any scientist has to face using deep learning models is to establish the network's architecture. Due to this difficulty, data scientists usually build over complex models and, as a result, most of them result computationally intensive and impose a large memory footprint, generating huge costs, contributing to climate change and hindering their use in computational-limited devices. In this paper, we propose a novel feed-forward neural network constructing method based on pruning and transfer learning. Its performance has been thoroughly assessed in classification and regression problems. Without any accuracy loss, our approach can compress the number of parameters by more than 70%. Even further, choosing the pruning parameter carefully, most of the refined models outperform original ones. We also evaluate the transfer learning level comparing the refined model and the original one training from scratch a neural network with the same hyper parameters as the optimized model. The results obtained show that our constructing method not only helps in the design of more efficient models but also more effective ones.

Country

Spain

Related Organizations

University of Granada
Spain

Keywords

FOS: Computer and information sciences, Computer Science - Machine Learning, Knowledge, Artificial Intelligence (cs.AI), Computer Science - Artificial Intelligence, Climate Change, Concept Formation, Neural Networks, Computer, Machine Learning (cs.LG)

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	7
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Top 10%
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Top 10%

Found an issue? Give us feedback

7

Top 10%

Average

Top 10%

Green

Related to Research communities

Energy Research