Video Object and Interaction Deletion
Official Python inference and LoRA trainer package
Wan2.1: Open and Advanced Large-Scale Video Generative Model
Awesome multilingual OCR toolkits based on PaddlePaddle
Wan2.2: Open and Advanced Large-Scale Video Generative Model
Open-source, high-performance AI model with advanced reasoning
From Images to High-Fidelity 3D Assets
The most powerful local music generation model
Advanced language and coding AI model
Agentic, Reasoning, and Coding (ARC) foundation models
Fast Stable Diffusion on CPU and AI PC
AlphaFold 3 inference pipeline
Generating Immersive, Explorable, and Interactive 3D Worlds
Official inference repo for FLUX.1 models
An experimental version of DeepSeek model
Open-source multi-speaker long-form text-to-speech model
State-of-the-art TTS model under 25MB
A Family of Open Sourced Music Foundation Models
Towards Real-World Vision-Language Understanding
LTX-Video Support for ComfyUI
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training
Ultra-Efficient LLMs on End Device
ChatGLM-6B: An Open Bilingual Dialogue Language Model
Controllable & emotion-expressive zero-shot TTS
An Efficient Agentic Model for Computer Use