A high-throughput and memory-efficient inference and serving engine for LLMs
πΈπ¬ - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
π€ Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
Stable Diffusion web UI
Ultralytics YOLO11 π
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
Open standard for machine learning interoperability
SoftVC VITS Singing Voice Conversion
Bringing Old Photo Back to Life (CVPR 2020 oral)
We write your reusable computer vision tools. π
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
Easy Docker setup for Stable Diffusion with user-friendly UI
ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
Samples and Tools for Windows ML.
This repository provides code for machine learning algorithms for edge devices developed at Microsoft Research India.
Run the official Stable Diffusion releases in a Docker container with txt2img, img2img, depth2img, pix2pix, upscale4x, and inpaint.