OpenAI releases CLIP, a model trained on 400M text-image pairs that learns joint embeddings. CLIP underpins Stable Diffusion and the modern multimodal revolution. PrevMain BlogNext