State Space Models for Event Cameras

Name: State Space Models for Event Cameras
Keywords: 1712 Software, FOS: Computer and information sciences, Computer Science - Machine Learning, 1707 Computer Vision and Pattern Recognition, 10009 Department of Informatics, Computer Vision and Pattern Recognition (cs.CV), Computer Science - Computer Vision and Pattern Recognition, 000 Computer science, knowledge & systems, Machine Learning (cs.LG)

Zubić, Nikola; Gehrig, Mathias; Scaramuzza, Davide

Found an issue? Give us feedback

arXiv.org e-Print Ar...arrow_drop_down

arXiv.org e-Print Archive

Preprint . 2024

Data sources: arXiv.org e-Print Archive

Zurich Open Repository and Archive

Conference object . 2024

License: CC BY

Data sources: Zurich Open Repository and Archive

https://doi.org/10.1109/cvpr52...

Article . 2024 . Peer-reviewed

License: STM Policy #29

Data sources: Crossref

https://dx.doi.org/10.48550/ar...

Article . 2024

License: CC BY

Data sources: Datacite

https://dx.doi.org/10.5167/uzh...

Other literature type . 2024

Data sources: Datacite

DBLP

Conference object

Data sources: DBLP

DBLP

Article

Data sources: DBLP

http://dx.doi.org/10.1109/cvpr...

Conference object . 2024

Data sources: European Union Open Data Portal

State Space Models for Event Cameras

descriptionPublicationkeyboard_double_arrow_right Article , Other literature type , Preprint , Conference object 16 Jun 2024Embargo end date: 01 Jan 2024 Switzerland Publisher:IEEEJournal:2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)Funded by:SNSF | NCCR Robotics: Intelligen..., EC | AGILEFLIGHT

Authors: Zubić, Nikola; Gehrig, Mathias; Scaramuzza, Davide;

doi: 10.1109/cvpr52733.2024.00556 , 10.48550/arxiv.2402.15584 , 10.5167/uzh-264837

arXiv: 2402.15584

State Space Models for Event Cameras

- Summary
- Subjects
- Metrics

Abstract

Today, state-of-the-art deep neural networks that process event-camera data first convert a temporal window of events into dense, grid-like input representations. As such, they exhibit poor generalizability when deployed at higher inference frequencies (i.e., smaller temporal windows) than the ones they were trained on. We address this challenge by introducing state-space models (SSMs) with learnable timescale parameters to event-based vision. This design adapts to varying frequencies without the need to retrain the network at different frequencies. Additionally, we investigate two strategies to counteract aliasing effects when deploying the model at higher frequencies. We comprehensively evaluate our approach against existing methods based on RNN and Transformer architectures across various benchmarks, including Gen1 and 1 Mpx event camera datasets. Our results demonstrate that SSM-based models train 33% faster and also exhibit minimal performance degradation when tested at higher frequencies than the training input. Traditional RNN and Transformer models exhibit performance drops of more than 20 mAP, with SSMs having a drop of 3.76 mAP, highlighting the effectiveness of SSMs in event-based vision tasks.

18 pages, 5 figures, 6 tables, CVPR 2024 Camera Ready paper

Country

Switzerland

Related Organizations

Keywords

1712 Software, FOS: Computer and information sciences, Computer Science - Machine Learning, 1707 Computer Vision and Pattern Recognition, 10009 Department of Informatics, Computer Vision and Pattern Recognition (cs.CV), Computer Science - Computer Vision and Pattern Recognition, 000 Computer science, knowledge & systems, Machine Learning (cs.LG)

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	4
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Top 10%
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average

Found an issue? Give us feedback

4

Top 10%

Average

Green

Funded by

SNSF| NCCR Robotics: Intelligent Robots for Improving the Quality of Life (phase III), EC| AGILEFLIGHT