Open Source Computer Vision Library
Ultralytics YOLO11 🚀
Stable Diffusion web UI
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
100 Days of ML Coding
Faster Whisper transcription with CTranslate2
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
12 Weeks, 24 Lessons, AI for All!
We write your reusable computer vision tools. 💜
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.
OpenVINO™ is an open source toolkit for optimizing and deploying AI inference
ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
High-performance In-browser LLM Inference Engine
SoftVC VITS Singing Voice Conversion
This repository provides code for machine learning algorithms for edge devices developed at Microsoft Research India.
Open standard for machine learning interoperability
Gorgonia is a library that helps facilitate machine learning in Go.
Translate manga/image 一键翻译各类图片内文字 https://cotrans.touhou.ai/
A curated list of awesome things related to artificial intelligence tools
Code and documentation to train Stanford’s Alpaca models, and generate the data.
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
(⌐■_■) - Deep Reinforcement Learning instrumenting bettercap for WiFi pwning.
Samples and Tools for Windows ML.
Run modern deep learning models in the browser.
Detect the programming language of a source code
A desktop application that extracts YouTube playlist transcripts and enhances them using Google’s Gemini AI models. The output is a book in any language you want.