Python inference and LoRA trainer package for the LTX-2 audio–video
Official inference repo for FLUX.1 models
Robust Speech Recognition via Large-Scale Weak Supervision
Stable Diffusion web UI
A high-throughput and memory-efficient inference and serving engine
Awesome multilingual OCR toolkits based on PaddlePaddle
Deepfakes Software For All
A simple, high-quality voice conversion tool focused on ease of use
Public repository for Agent Skills
Image polygonal annotation with Python
Personal AI, On Personal Devices
OCRmyPDF adds an OCR text layer to scanned PDF files
1 min voice data can also be used to train a good TTS model
The official Python client for the Huggingface Hub
Video-based AI memory library. Store millions of text chunks in MP4
Style-Bert-VITS2: Bert-VITS2 with more controllable voice styles
Code for running inference and finetuning with SAM 3 model
MCP Server for IDA Pro
From Images to High-Fidelity 3D Assets
Python Client for Supabase. Query Postgres from Flask, Django
Reverse-engineered Python API for Google Gemini web app
The most powerful and modular diffusion model GUI, api and backend
3D reconstruction software
An Async Bot/API wrapper for Twitch made in Python
Ready-to-use OCR with 80+ supported languages