Download your Spotify playlists and songs along with album art and metadata (from YouTube if a match is found).
The official repo for βDolphin: Document Image Parsing via Heterogeneous Anchor Promptingβ, ACL, 2025.
π€ Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
A feature-rich command-line audio/video downloader
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
An extremely fast Python package and project manager, written in Rust.
πΈπ¬ - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Collection of awesome LLM apps with AI Agents and RAG using OpenAI, Anthropic, Gemini and opensource models.
π₯π₯π₯ Open-source Jira, Linear, Monday, and ClickUp alternative. Plane is a modern project management platform to manage tasks, sprints, docs, and triage.
AI conversations that actually remember. Never re-explain your project to your AI again. Join our Discord: https://discord.gg/tyvKNccgqN
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
π¦π The platform for reliable agents.
π Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more…
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
Financial data platform for analysts, quants and AI agents.
Multi-Platform Package Manager for Stable Diffusion
A generative speech model for daily dialogue.
A keyboard-driven, vim-like browser based on Python and Qt.
A hackable shell for Hyprland, powered by Fabric.
We write your reusable computer vision tools. π
Ultralytics YOLO π
An LLM agent that conducts deep research (local and web) on any given topic and generates a long report with citations.
Open Source AI Platform - AI Chat with advanced features that works with every LLM
A lightweight yet powerful audio-to-MIDI converter with pitch bend detection
Iconic font aggregator, collection, & patcher. 3,600+ icons, 50+ patched fonts: Hack, Source Code Pro, more. Glyph collections: Font Awesome, Material Design Icons, Octicons, & more
100 Days of ML Coding
Apache CloudStack is an opensource Infrastructure as a Service (IaaS) cloud computing platform
Free and Open Source Enterprise Resource Planning (ERP)
music library manager and MusicBrainz tagger
Speech To Speech: an effort for an open-sourced and modular GPT4-o
A user friendly TUI for SQL databases. Written in python. Supports SQL server, Mysql, PostreSQL, SQLite, Turso and more.
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
A terminal spreadsheet multitool for discovering and arranging data
TTS with kokoro and onnx runtime
Generate audiobooks from e-books
A Terminal Client for MySQL with AutoCompletion and Syntax Highlighting.
Qtcord is a Discord client built with Qt aiming to bring a lightweight, native experience.
Zulip server and web application. Open-source team chat that helps teams stay productive and focused.
Better GitHub statistics images for your profile, with stats from private repos too
A legible monospace font… the very typeface youβve been trained to recognize since childhood
MetaCall: The ultimate polyglot programming experience.
Run compilers interactively from your web browser and interact with the assembly
Easily run old versions of UNIX for PDP-11 on modern hardware
DevOps Guide - Development to Production all configurations with basic notes to debug efficiently.
πΆ Cross-platform music player
Read/sync your IMAP mailboxes (python2) [LEGACY: move to offlineimap3]
Linux Show Player - Cue player designed for stage productions
A desktop application that extracts YouTube playlist transcripts and enhances them using Google’s Gemini AI models. The output is a book in any language you want.
A lightweight alternative frontend for Reddit.
π Synchronize calendars and contacts.
a bouncer-style Matrix IRC bridge
BrachioGraph is an ultra-cheap (total cost of materials: β¬14) plotter that can be built with minimal skills.
FFmpeg based audio splitter for CDDA images associated with .cue files
🐌 Tool to determine what GCC or (experimental!) Clang flags -march=native would resolve into
A speech to text IBus engine using VOSK
A curated list of the latest breakthroughs in AI (in 2023) by release date with a clear video explanation, link to a more in-depth article, and code.
A Matrix-Google Chat puppeting bridge
Naive performance comparison of a few programming languages (JavaScript, Kotlin, Rust, Swift, Nim, Python, Go, Haskell, D, C++, Java, C#, Object Pascal, Ada, Lua, Ruby)
A set of tools for automatically managing bitrot and format in large quantities of media