🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
Open Source Computer Vision Library
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
OpenVINO™ is an open source toolkit for optimizing and deploying AI inference
RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
Translate manga/image 一键翻译各类图片内文字 https://cotrans.touhou.ai/ (no longer working)
Stable Diffusion web UI
We write your reusable computer vision tools. 💜
Ultralytics YOLO 🚀
12 Weeks, 24 Lessons, AI for All!
100 Days of ML Coding
High-performance In-browser LLM Inference Engine
ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
Faster Whisper transcription with CTranslate2
(⌐■_■) - Deep Reinforcement Learning instrumenting bettercap for WiFi pwning.
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.
SoftVC VITS Singing Voice Conversion
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
Code and documentation to train Stanford’s Alpaca models, and generate the data.
Fast and accurate AI powered file content types detection
Open standard for machine learning interoperability
A curated list of awesome things related to artificial intelligence tools
A desktop application that extracts YouTube playlist transcripts and enhances them using Google’s Gemini AI models. The output is a book in any language you want.
Run modern deep learning models in the browser.