CIRCO Dataset
Val
Test
Powered by SEARLE
Reference Image
Relative caption
is held in front of the camera by a person that is facing the camera
Retrieved Images