Papers tagged “multimodal”
4 papers
BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models
2023
ICML
Flamingo: a Visual Language Model for Few-Shot Learning
2022
NeurIPS
Learning Transferable Visual Models from Natural Language Supervision
2021
ICML
Zero-Shot Text-to-Image Generation
2021
ICML