TTS with kokoro and onnx runtime
Use Microsoft Edge's online text-to-speech service from Python
Public repository for Agent Skills
Models for object and human mesh reconstruction
AI framework for automated short video creation and editing tools
Code for running inference with the SAM 3D Body Model 3DB
The most powerful Android RPA agent framework
Quick illustration of how one can easily read books together with LLMs
One-click deployment (including offline integration package)
The best ChatGPT that $100 can buy
Generate Any 3D Scene in Seconds
AI bridge enabling assistants to control and automate Unity Editor
Custom Home Assistant configuration with automations and scripts setup
Virtual AI anchor that combines state-of-the-art technology
An agentic Machine Learning Engineer
Multilingual Document Layout Parsing in a Single Vision-Language Model
Ultimate meta-skill for generating best-in-class Claude Code skills
Generate high-definition story short videos with one click using AI
Digital Life Kazik Open Source AI Skills Collection
Play couplet with seq2seq model
Inference script for Oasis 500M
Code for the paper Language Models are Unsupervised Multitask Learners
Chinese text-to-speech engine
Learning to Act by Watching Unlabeled Online Videos
WaveRNN Vocoder + TTS