CIRCO Dataset
Splits: Val, Test
Powered by SEARLE
Reference Image
Relative caption: is taken from a closer distance and there are more people
Retrieved Images