CIRCO Dataset
Val
Test
Powered by SEARLE
Reference Image
Relative caption
has more people dressed similarly and they are sitting at a dining table
Retrieved Images