[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Get up and running with Llama 3.3, Mistral, Gemma 2, and other large language models.