Abstract: In recent years, various neural network architectures for computer vision have been devised, such as the visual transformer and multilayer perceptron (MLP). A transformer based on an ...
TL;DR: GigaPose is a "hybrid" template-patch correspondence approach for estimating the 6D pose of novel objects in RGB images: GigaPose first uses templates, rendered images of the CAD models, to recover ...
Abstract: Issues such as insufficient feature representation and background confusion make detecting small objects in remote sensing arduous. In particular, when the algorithm will be ...