A high-throughput and memory-efficient inference and serving engine for LLMs
Get up and running with OpenAI gpt-oss, DeepSeek-R1, Gemma 3 and other models.
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
⏩ Ship faster with Continuous AI. Open-source CLI that can be used in TUI mode as a coding agent or Headless mode to run background agents
SD.Next: All-in-one WebUI for AI generative image and video creation