Publications

GLIPv2: Unifying Localization and Vision-Language Understanding
GLIP: Grounded Language-Image Pre-training
Eye in the sky: Drone-based object tracking and 3D localization
Exploit the connectivity: Multi-object tracking with trackletnet