Open-weight, large-scale hybrid-attention reasoning model
Build production-ready AI agents in both Python and Typescript
Our first fully AI generated deep learning system
Jittor is a high-performance deep learning framework
Chat & pretrained large audio language model proposed by Alibaba Cloud
Qwen3-omni is a natively end-to-end, omni-modal LLM
AI tool converting video/audio into structured documents instantly
A security scanner for custom LLM applications
Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model
Controllable and fast Text-to-Speech for over 7000 languages
Plug-and-play library to enable agents to call MCP and UTCP tools
An AI-powered security review GitHub Action using Claude
Renderer for the harmony response format to be used with gpt-oss
Data Lake for Deep Learning. Build, manage, and query datasets
Deep learning library
Machine Learning Engineering Open Book
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
State-of-the-art (SoTA) text-to-video pre-trained model
A simple yet powerful agent framework that delivers with models
This repository contains code released by Google Research
Spark-TTS Inference Code
Block Diffusion for Ultra-Fast Speculative Decoding
A long-running autonomous coding agent powered by the Claude Agent
Open source no-code system for text annotation and building of text
RAG Search API