MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone
OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340