
arXiv: 2506.22593
handle: 11589/289040
Autonomous robots are increasingly playing key roles as support platforms for human operators in high-risk, dangerous applications. To accomplish challenging tasks, efficient human-robot cooperation and understanding are required. While robotic planning typically leverages 3D geometric information, human operators are accustomed to a high-level, compact representation of the environment, such as top-down 2D maps representing the Building Information Model (BIM). 3D scene graphs have emerged as a powerful tool to bridge the gap between human-readable 2D BIMs and the robot's 3D maps. In this work, we introduce Pixels-to-Graph (Pix2G), a novel lightweight method to generate structured scene graphs from image pixels and LiDAR maps in real-time for the autonomous exploration of unknown environments on resource-constrained robot platforms. To satisfy onboard compute constraints, the framework is designed to perform all operations on CPU only. The method outputs a de-noised 2D top-down environment map and a structure-segmented 3D pointcloud, which are seamlessly connected using a multi-layer graph abstracting information from the object level up to the building level. The proposed method is quantitatively and qualitatively evaluated in real-world experiments performed using the NASA JPL NeBula-Spot legged robot to autonomously explore and map cluttered garage and urban office-like environments in real-time.
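The multi-layer graph described above, connecting object-level detections up to the building level, can be pictured with a minimal hierarchy sketch. This is an illustrative toy structure only, assuming three layers ("building", "room", "object"); the node names and layers are hypothetical and not taken from the Pix2G implementation.

```python
# Hypothetical sketch of a multi-layer scene graph: a tree whose nodes
# belong to abstraction layers, from objects up to the whole building.
# All names here are illustrative, not from the paper's codebase.
from dataclasses import dataclass, field


@dataclass
class Node:
    layer: str                      # "building", "room", or "object"
    label: str                      # human-readable name of the entity
    children: list = field(default_factory=list)

    def add(self, child: "Node") -> "Node":
        """Attach a lower-layer node and return it for chaining."""
        self.children.append(child)
        return child


def count_layer(root: Node, layer: str) -> int:
    """Count nodes at a given abstraction layer via depth-first walk."""
    total = int(root.layer == layer)
    for c in root.children:
        total += count_layer(c, layer)
    return total


# Build a tiny example: one garage building, one room, two objects.
building = Node("building", "garage")
room = building.add(Node("room", "bay_1"))
room.add(Node("object", "forklift"))
room.add(Node("object", "pallet"))

print(count_layer(building, "object"))  # → 2
```

Queries against such a hierarchy let a planner reason at whichever abstraction level the task requires, e.g. navigating between rooms versus manipulating a specific object.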
Paper accepted to 2025 IEEE International Conference on Automation Science and Engineering (CASE)
FOS: Computer and information sciences; Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
| Indicator | Description | Value |
| --- | --- | --- |
| selected citations | Citations derived from selected sources; an alternative to the "influence" indicator, which reflects the overall/total impact of an article based on the underlying citation network (diachronically). | 0 |
| popularity | Reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average |
| influence | Reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average |
| impulse | Reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
