Downloads provided by UsageCounts
handle: 10261/351706 , 10553/130228 , 2117/410288
While original Neural Radiance Fields (NeRF) have shown impressive results in modeling the appearance of a scene with compact MLP architectures, they are not able to achieve real-time rendering. This has been recently addressed by either baking the outputs of NeRF into a data structure or arranging trainable parameters in an explicit feature grid. These strategies, however, significantly increase the memory footprint of the model which prevents their deployment on bandwidth-constrained applications. In this paper, we extend the grid-based approach to achieve real-time view synthesis at more than 150 FPS using a lightweight model. Our main contribution is a novel architecture in which the density field of NeRF-based representations is split into N regions and the density is modeled using N different decoders which reuse the same feature grid. This results in a smaller grid where each feature is located in more than one spatial position, forcing them to learn a compact representation that is valid for different parts of the scene. We further reduce the size of the final model by disposing of the features symmetrically on each region, which favors feature pruning after training while also allowing smooth gradient transitions between neighboring voxels. An exhaustive evaluation demonstrates that our method achieves real-time performance and quality metrics on a pair with state-of-the-art with an improvement of more than 2× in the FPS/MB ratio.
This work is partially supported by the European Union under the project FlexiGroBots (DOI: 10.3030/101017111) and by the Spanish government under projects MoHuCo (PID2020-120049RB-I00) and SmartWEED-DARWEEM (PID2020-113229RB-C43). Adrian Penate-Sanchez is supported by a Beatriz Galindo grant. The authors would like to thank Marcos Barrios for his support in the graphic design of the figures and videos included in the paper
Trabajo presentado en la Conference on Computer Vision and Pattern Recognition (CVPR), celebrada en Vancuver (Canadá), del 17 al 24 de junio de 2023
Peer reviewed
Measurement, Metaverse, Data structures, Videojocs, Social networking (online), Àrees temàtiques de la UPC::Informàtica::Infografia, Three dimensional imaging, Video games, Training, 3D from multi-view and sensors, Games, Rendering (computer graphics), 1203 Ciencia de los ordenadores, Imatgeria tridimensional
Measurement, Metaverse, Data structures, Videojocs, Social networking (online), Àrees temàtiques de la UPC::Informàtica::Infografia, Three dimensional imaging, Video games, Training, 3D from multi-view and sensors, Games, Rendering (computer graphics), 1203 Ciencia de los ordenadores, Imatgeria tridimensional
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 3 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 10% | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
| views | 88 | |
| downloads | 87 |

Views provided by UsageCounts
Downloads provided by UsageCounts