A lightweight audio-to-MIDI converter with pitch bend detection
A high-throughput and memory-efficient inference and serving engine
NVR with realtime local object detection for IP cameras
Interact with your documents using the power of GPT
Generate audiobooks from e-books, voice cloning & 1107+ languages
OCR software, free and offline
TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
A command-line productivity tool powered by AI large language models
gpt-4o for windows, macos and linux
Minimal CLI coding agent by Mistral
Generate short videos with one click using AI LLM
Use Microsoft Edge's online text-to-speech service from Python
From Images to High-Fidelity 3D Assets
Kimi Code CLI is your next CLI agent
Implementation of TurboQuant (ICLR 2026)
Open-source AI agent framework
High-Resolution 3D Assets Generation with Large Scale Diffusion Models
Label Studio is a multi-type data labeling and annotation tool
Offline Text To Speech synthesis for python
Python Optimal Transport
Autonomous research from idea to paper. Chat an Idea. Get a Paper 🦞
Qwen3 is the large language model series developed by Qwen team
Official inference repo for FLUX.1 models
An experimental version of DeepSeek model
🐈 nanobot: The Ultra-Lightweight Clawdbot / OpenClaw