Datasets, transforms and models specific to Computer Vision
A Simple and Universal Swarm Intelligence Engine
State-of-the-art 2D and 3D Face Analysis Project
A simple, high-quality voice conversion tool focused on ease of use
AI agent harness for AI coding agents
Industry leading face manipulation platform
The highest-scoring AI memory system ever benchmarked
AI video generator optimized for low VRAM and older GPUs use
Wan2.1: Open and Advanced Large-Scale Video Generative Model
Run Local LLMs on Any Device. Open-source
From Images to High-Fidelity 3D Assets
Official Python inference and LoRA trainer package
Awesome multilingual OCR toolkits based on PaddlePaddle
The most powerful and modular diffusion model GUI, api and backend
Open-source, high-performance AI model with advanced reasoning
TTS with kokoro and onnx runtime
The most powerful local music generation model
[NeurIPS 2023 Spotlight] LightZero
3D reconstruction software
Wan2.2: Open and Advanced Large-Scale Video Generative Model
1 min voice data can also be used to train a good TTS model
Improve your Baduk skills by training with KataGo
Agentic, Reasoning, and Coding (ARC) foundation models
Video-based AI memory library. Store millions of text chunks in MP4
Modular AI image and video generation web UI with extensible tools