Qwen3-omni is a natively end-to-end, omni-modal LLM
Complete Two-Factor Authentication for Django
Deep learning library
A security scanner for custom LLM applications
Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model
Controllable and fast Text-to-Speech for over 7000 languages
An AI-powered security review GitHub Action using Claude
Renderer for the harmony response format to be used with gpt-oss
Data Lake for Deep Learning. Build, manage, and query datasets
Making ALL Software Agent-Native
Machine Learning Engineering Open Book
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
State-of-the-art (SoTA) text-to-video pre-trained model
Open-source framework for intelligent speech interaction
A simple yet powerful agent framework that delivers with models
Block Diffusion for Ultra-Fast Speculative Decoding
RAG Search API
Configuration UI for Home Assistant
GEO-first SEO skill for Claude Code
Open source AI model for generating full songs from lyrics prompts
Qwen-Image-Layered: Layered Decomposition for Inherent Editablity
Towards Human-Sounding Speech
When LLM Meets Domain Experts
Jupyter notebook tutorials for OpenVINO
The Standard Webhooks specification