CLIP Interrogator

Model Card:

The CLIP Interrogator uses OpenAI's CLIP models to score a given image against lists of artists, mediums, and styles, which shows how the different models interpret the image's content. It then combines the best matches with a BLIP caption to suggest a text prompt for generating more images similar to the input.
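The matching step described above can be sketched in a few lines. This is only an illustrative sketch, not the tool's actual implementation: the 3-dimensional vectors are hypothetical stand-ins for real CLIP embeddings, the caption is a hypothetical stand-in for BLIP output, and the helper names (`cosine`, `best_match`, `build_prompt`) are invented for this example. It assumes the best candidate in each category is the one with the highest cosine similarity to the image embedding.

```python
import math

def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

def best_match(image_vec, candidates):
    """Return the label whose embedding is most similar to the image embedding."""
    return max(candidates, key=lambda label: cosine(image_vec, candidates[label]))

def build_prompt(caption, image_vec, categories):
    """Append the top-scoring label from each category to the caption."""
    parts = [caption]
    for candidates in categories.values():
        parts.append(best_match(image_vec, candidates))
    return ", ".join(parts)

# Hypothetical toy data: real CLIP embeddings have hundreds of dimensions.
image_vec = [0.9, 0.1, 0.3]
categories = {
    "mediums": {"oil painting": [0.8, 0.2, 0.3], "pencil sketch": [0.1, 0.9, 0.2]},
    "artists": {"by Vincent van Gogh": [0.9, 0.0, 0.4], "by Hokusai": [0.0, 0.3, 0.9]},
    "styles": {"impressionism": [0.7, 0.1, 0.2], "ukiyo-e": [0.1, 0.2, 0.9]},
}
caption = "a field of wheat under a swirling sky"  # stand-in for a BLIP caption

prompt = build_prompt(caption, image_vec, categories)
print(prompt)
```

In a real run, the image and every candidate phrase would be embedded by a CLIP model, and the caption would come from BLIP; only the ranking-and-joining logic is shown here.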

Recommended GPU: NVIDIA T4

Inference Time: 94 seconds

Use OpenAI's CLIP and Salesforce's BLIP to optimize prompts for image-text matching and creating art with text-to-image models.
