🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.
Stable Diffusion web UI
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
Open Source Computer Vision Library
ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
Ultralytics YOLO 🚀
SoftVC VITS Singing Voice Conversion
12 Weeks, 24 Lessons, AI for All!
RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs
High-performance In-browser LLM Inference Engine
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
Faster Whisper transcription with CTranslate2
We write your reusable computer vision tools. 💜
OpenVINO™ is an open source toolkit for optimizing and deploying AI inference
Code and documentation to train Stanford’s Alpaca models, and generate the data.
Translate manga/image 一键翻译各类图片内文字 https://cotrans.touhou.ai/ (no longer working)
100 Days of ML Coding
Open standard for machine learning interoperability
Samples and Tools for Windows ML.
(⌐■_■) - Deep Reinforcement Learning instrumenting bettercap for WiFi pwning.
Gorgonia is a library that helps facilitate machine learning in Go.
A curated list of awesome things related to artificial intelligence tools
A desktop application that extracts YouTube playlist transcripts and enhances them using Google’s Gemini AI models. The output is a book in any language you want.
Detect the programming language of a source code
Run modern deep learning models in the browser.