A Gemini 2.5 Flash Level MLLM for Vision, Speech, and Full-Duplex Multimodal Live Streaming on Your Phone
OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340