Struct2d_cvinw25
Our paper Struct2D: A Perception-Guided Framework for Spatial Reasoning in Large Multimodal Models was selected as a spotlight presentation at the CVPR 2025 Workshop on Computer Vision in the Wild (CVinW)! I will be presenting it in Nashville on June 11th!