Abstract: Accurate and robust long-term global localization is a critical challenge for autonomous vehicles operating in complex urban transportation systems, where ...
MMGDreamer is a dual-branch diffusion model for scene generation that incorporates a novel Mixed-Modality Graph, visual enhancement module, and relation predictor. Feel free to contact Zhifei Yang ...
Abstract: The 3D visual grounding task aims to establish correspondences between the 3D physical world and textual descriptions. Despite significant progress having been made, it still suffers from ...