A Lightweight Face Recognition and Facial Attribute Analysis
Open-source, code-first Python toolkit for building, evaluating, etc.
A lightweight audio-to-MIDI converter with pitch bend detection
Offline Text To Speech synthesis for python
Use Microsoft Edge's online text-to-speech service from Python
The agent that grows with you
Python scraper based on AI
Source code of PyGAD, Python 3 library for building genetic algorithms
Video-based AI memory library. Store millions of text chunks in MP4
A high-throughput and memory-efficient inference and serving engine
Image polygonal annotation with Python
Stable Diffusion web UI
Python Optimal Transport
Python inference and LoRA trainer package for the LTX-2 audio–video
Robust Speech Recognition via Large-Scale Weak Supervision
Seamlessly integrate LLMs as Python functions
Official inference repo for FLUX.1 models
The official Python client for the Huggingface Hub
OCRmyPDF adds an OCR text layer to scanned PDF files
1 min voice data can also be used to train a good TTS model
Style-Bert-VITS2: Bert-VITS2 with more controllable voice styles
Awesome multilingual OCR toolkits based on PaddlePaddle
Personal AI, On Personal Devices
Public repository for Agent Skills
From Images to High-Fidelity 3D Assets