Downloads provided by UsageCounts
doi: 10.1109/tcsi.2022.3142525 , 10.5281/zenodo.5960705 , 10.5281/zenodo.5960706 , 10.48550/arxiv.2201.03386
arXiv: 2201.03386
handle: 11585/904854 , 11582/330206
doi: 10.1109/tcsi.2022.3142525 , 10.5281/zenodo.5960705 , 10.5281/zenodo.5960706 , 10.48550/arxiv.2201.03386
arXiv: 2201.03386
handle: 11585/904854 , 11582/330206
Keyword spotting (KWS) is a crucial function enabling the interaction with the many ubiquitous smart devices in our surroundings, either activating them through wake-word or directly as a human-computer interface. For many applications, KWS is the entry point for our interactions with the device and, thus, an always-on workload. Many smart devices are mobile and their battery lifetime is heavily impacted by continuously running services. KWS and similar always-on services are thus the focus when optimizing the overall power consumption. This work addresses KWS energy-efficiency on low-cost microcontroller units (MCUs). We combine analog binary feature extraction with binary neural networks. By replacing the digital preprocessing with the proposed analog front-end, we show that the energy required for data acquisition and preprocessing can be reduced by 29x, cutting its share from a dominating 85% to a mere 16% of the overall energy consumption for our reference KWS application. Experimental evaluations on the Speech Commands Dataset show that the proposed system outperforms state-of-the-art accuracy and energy efficiency, respectively, by 1% and 4.3x on a 10-class dataset while providing a compelling accuracy-energy trade-off including a 2% accuracy drop for a 71x energy reduction.
FOS: Computer and information sciences, Sound (cs.SD), 000, Computer Science - Artificial Intelligence, feature extraction, deep learning, binary neural networks, keyword spotting, Feature extraction; Neural networks; Quantization (signal); Task analysis; Power demand; Computational modeling; Memory management; Keyword spotting; quantization; binary neural networks; deep learning; feature extraction, Computer Science - Sound, 004, Artificial Intelligence (cs.AI), Audio and Speech Processing (eess.AS), Hardware Architecture (cs.AR), FOS: Electrical engineering, electronic engineering, information engineering, quantization, Computer Science - Hardware Architecture, Electrical Engineering and Systems Science - Audio and Speech Processing
FOS: Computer and information sciences, Sound (cs.SD), 000, Computer Science - Artificial Intelligence, feature extraction, deep learning, binary neural networks, keyword spotting, Feature extraction; Neural networks; Quantization (signal); Task analysis; Power demand; Computational modeling; Memory management; Keyword spotting; quantization; binary neural networks; deep learning; feature extraction, Computer Science - Sound, 004, Artificial Intelligence (cs.AI), Audio and Speech Processing (eess.AS), Hardware Architecture (cs.AR), FOS: Electrical engineering, electronic engineering, information engineering, quantization, Computer Science - Hardware Architecture, Electrical Engineering and Systems Science - Audio and Speech Processing
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 15 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 10% | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Top 10% | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Top 10% |
| views | 7 | |
| downloads | 25 |

Views provided by UsageCounts
Downloads provided by UsageCounts