A high-throughput and memory-efficient inference and serving engine for LLMs
Stable Diffusion web UI
ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
Industrial-grade speech recognition toolkit: 170x realtime, 50+ languages, speaker diarization, emotion detection, streaming, and OpenAI-compatible API.
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
π€ Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.
πΈπ¬ - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Ultralytics YOLO π
We write your reusable computer vision tools. π
SoftVC VITS Singing Voice Conversion
Open standard for machine learning interoperability
π€ Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
SD.Next: All-in-one WebUI for AI generative image and video creation, captioning and processing
Easy Docker setup for Stable Diffusion with user-friendly UI