ChatGLM-6B: An Open Bilingual Dialogue Language Model
Uncommon Objects in 3D dataset
Generating Immersive, Explorable, and Interactive 3D Worlds
Open-Source Financial Large Language Models
Programmatic access to the AlphaGenome model
Qwen3-Coder is the code version of Qwen3
An Efficient Agentic Model for Computer Use
Achieving 3+ generation speedup on reasoning tasks
OpenTinker is an RL-as-a-Service infrastructure for foundation models
Video Object and Interaction Deletion
A Powerful Native Multimodal Model for Image Generation
Audio foundation model excelling in audio understanding
Hackable and optimized Transformers building blocks
Long-form streaming TTS system for multi-speaker dialogue generation
LTX-Video Support for ComfyUI
Multimodal Diffusion with Representation Alignment
Project Lyra: Open Generative 3D World Models
Industrial-level controllable zero-shot text-to-speech system
Repo for SeedVR2 & SeedVR
Qwen-Image is a powerful image generation foundation model
Sharp Monocular Metric Depth in Less Than a Second
Mixture-of-Experts Vision-Language Models for Advanced Multimodal
Inference code for scalable emulation of protein equilibrium ensembles
Foundation model for image generation
Block Diffusion for Ultra-Fast Speculative Decoding