NVIDIA Open-Sources LocateAnything-3B for Faster Visual Localization
LocateAnything-3B uses parallel decoding to predict bounding boxes simultaneously, improving detection in dense scenes and supporting UI, OCR, and document use cases.