OCR expert VLM powered by Hunyuan's native multimodal architecture
Designed for text embedding and ranking tasks
Helping you get the most out of AWS, wherever you use MCP
State-of-the-art diffusion models for image and audio generation
Trainable, memory-efficient, and GPU-friendly PyTorch reproduction
Lightweight Python library for adding real-time multi-object tracking
Large-Scale Agentic RL for High-Performance CUDA Kernel Generation
Definitions for AI/ML tasks like dataset creation
Long-form streaming TTS system for multi-speaker dialogue generation
LLM training in simple, raw C/CUDA
Volcano Engine Reinforcement Learning for LLMs
AI discovers 520,000 stable inorganic crystal structures for research
The most intuitive, flexible way for researchers to build models
Library for serving Transformers models on Amazon SageMaker
A collection of open-source skills for AI coding agents
Containerized automation engine for programmable CI/CD workflows
LLM-powered fuzzing via OSS-Fuzz
Repository for the Qwen2-Audio chat and pretrained large audio language models
The leading agent orchestration platform for Claude
Making ALL Software Agent-Native
The official PyTorch implementation of Google's Gemma models
An opinionated CLI to transcribe audio files with Whisper on-device
Open platform for building, deploying, and managing LLM agents
Bringing BERT into modernity via both architecture changes and scaling
A lightweight framework for building LLM-based agents