CIRCO Dataset
Powered by SEARLE
Reference Image
Relative Caption: has only one person in the foreground and they are making pizzas
Retrieved Images