
OpenAI’s CLIP is a breakthrough in the computer vision. While OpenAI’s DALL-E creates images from text captions for a wide range of concepts expressible in natural language and OpenAI’s CLIP efficiently learns visual concepts from natural language supervision, CLIPPO is a quantum leap for Multimodal Learning in the field of computer vision.