Papers tagged “multimodal”

7 papers · All papers →

Chameleon: Mixed-Modal Early-Fusion Foundation Models

2024 arXiv
Unified-IO 2: Scaling Autoregressive Multimodal Models with Vision, Language, Audio, and Action

2024 CVPR
BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models

2023 ICML
NExT-GPT: Any-to-Any Multimodal LLM

2023 ICML
Flamingo: a Visual Language Model for Few-Shot Learning

2022 NeurIPS
Learning Transferable Visual Models from Natural Language Supervision

2021 ICML
Zero-Shot Text-to-Image Generation

2021 ICML

© 2026 Taeung Jeong

Main Explorer Papers