🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Faster Whisper transcription with CTranslate2
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and Flax.
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
An Open Source Machine Learning Framework for Everyone
Gorgonia is a library that helps facilitate machine learning in Go.
Stable Diffusion web UI
Ultralytics YOLO11 🚀
Translate manga/image: one-click translation of text in all kinds of images https://cotrans.touhou.ai/
Open Source Computer Vision Library
12 Weeks, 24 Lessons, AI for All!
Open standard for machine learning interoperability
(⌐■_■) - Deep Reinforcement Learning instrumenting bettercap for WiFi pwning.
SoftVC VITS Singing Voice Conversion
100 Days of ML Coding
We write your reusable computer vision tools. 💜
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
High-performance In-browser LLM Inference Engine
A curated list of awesome things related to artificial intelligence tools
ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
OpenVINO™ is an open source toolkit for optimizing and deploying AI inference
A desktop application that extracts YouTube playlist transcripts and enhances them using Google’s Gemini AI models. The output is a book in any language you want.
Code and documentation to train Stanford’s Alpaca models, and generate the data.
Interactive Image Generation via Generative Adversarial Networks
Run modern deep learning models in the browser.
Detect the programming language of source code
Samples and Tools for Windows ML.
This repository provides code for machine learning algorithms for edge devices developed at Microsoft Research India.