CIRCO Dataset
Powered by SEARLE
Reference Image
Relative caption: is shot from the same angle and has a single bike instead of TVs
Retrieved Images