Agent-ready RPA suite with visual workflow automation tools engine
The most powerful and modular diffusion model GUI, api and backend
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
Wan2.2: Open and Advanced Large-Scale Video Generative Model
TTS with kokoro and onnx runtime
Wan2.1: Open and Advanced Large-Scale Video Generative Model
OCRmyPDF adds an OCR text layer to scanned PDF files
1 min voice data can also be used to train a good TTS model
Official Python inference and LoRA trainer package
3D reconstruction software
The highest-scoring AI memory system ever benchmarked
A simple, high-quality voice conversion tool focused on ease of use
The most powerful local music generation model
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
Image inpainting tool powered by SOTA AI Model
Enterprise platform for building and orchestrating AI agent workflows
Containerized automation engine for programmable CI/CD workflows
OCR software, free and offline
Robust Speech Recognition via Large-Scale Weak Supervision
Open-source, high-performance AI model with advanced reasoning
Awesome multilingual OCR toolkits based on PaddlePaddle
A modular, primitive-first, python-first PyTorch library
Python inference and LoRA trainer package for the LTX-2 audio–video
Python tool for converting files and office documents to Markdown
World's first open-source, agentic video production system