CIRCO Dataset
Val
Test
Powered by SEARLE
Choose or insert the relative caption
Reference Image
Dataset captions
has only one person in the foreground and they are making pizzas
Try with a custom caption