CMU -> Google DeepMind | Multimodal Understanding & Generation
- Google DeepMind
- Kirkland, WA
- https://lxa9867.github.io/
Stars: 2 (written in HTML)
🔥🔥🔥 A curated list of papers on LLM-based multimodal generation (image, video, 3D, and audio).
[ICML'24] TroVE: Inducing Verifiable and Efficient Toolboxes for Solving Programmatic Tasks