SpatialRGPT is a powerful vision-language model adept at understanding both 2D and 3D spatial arrangements. It can process any region proposal, such as boxes or masks, and provide answers to complex ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results