LLaMA-VID LLaMA-VID checkpoints. Please refer to project page for more detail: https://llama-vid.github.io/ Text Generation • Updated Dec 3, 2023 • 8 Text Generation • Updated Dec 3, 2023 • 5 • 2 Text Generation • Updated Dec 3, 2023 • 1 Text Generation • Updated Dec 3, 2023 • 2
MGM-Data Official data collection for the paper "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models" Viewer • Updated Apr 21, 2024 • 1.27M • 31 • 16 Updated Apr 21, 2024 • 26 • 17
MGM Official model collection for the paper "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models" Text Generation • 7B • Updated Apr 21, 2024 • 10 • 31 Text Generation • 14B • Updated Apr 21, 2024 • 8 • 1 Text Generation • 4B • Updated Apr 21, 2024 • 42 • 21 Text Generation • 35B • Updated Apr 21, 2024 • 9 • 9
LLaMA-VID LLaMA-VID checkpoints. Please refer to project page for more detail: https://llama-vid.github.io/ Text Generation • Updated Dec 3, 2023 • 8 Text Generation • Updated Dec 3, 2023 • 5 • 2 Text Generation • Updated Dec 3, 2023 • 1 Text Generation • Updated Dec 3, 2023 • 2
MGM Official model collection for the paper "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models" Text Generation • 7B • Updated Apr 21, 2024 • 10 • 31 Text Generation • 14B • Updated Apr 21, 2024 • 8 • 1 Text Generation • 4B • Updated Apr 21, 2024 • 42 • 21 Text Generation • 35B • Updated Apr 21, 2024 • 9 • 9
MGM-Data Official data collection for the paper "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models" Viewer • Updated Apr 21, 2024 • 1.27M • 31 • 16 Updated Apr 21, 2024 • 26 • 17