Abstract: Integrating Large Multimodal Models (LMMs) into robotics enables unprecedented capabilities in understanding complex commands and interacting with unstructured environments. However, ...