SpatialRGPT is a powerful vision-language model adept at understanding both 2D and 3D spatial arrangements. It can process any region proposal, such as boxes or masks, and provide answers to complex ...
Results that may be inaccessible to you are currently showing.
Hide inaccessible results