Generate Any 3D Scene in Seconds
MCP server enabling AI agents to control and automate Windows OS
Library for OCR-related tasks powered by Deep Learning
OpenTinker is an RL-as-a-Service infrastructure for foundation models
AI-ready web crawler that extracts and structures website content
OCR expert VLM powered by Hunyuan's native multimodal architecture
This repository contains code released by Google Research
Intelligent automation and multi-agent orchestration for Claude Code
Run a full local LLM stack with one command using Docker
Large-Scale Agentic RL for High-Performance CUDA Kernel Generation
A Personalized LLM-powered Agent Framework
Framework for building AI agents that automate complex web tasks
Quickly get started with AI theory and practical applications
A collection of scientific methods, processes, algorithms
Faster and easier training and deployments
This repository is a curated collection of links to various courses
Designed for training LLM/VLM agents via RL
E2M converts various file types (doc, docx, epub, html, htm, url
The Cradle framework is a first attempt at General Computer Control
CV, NLP, LLM project applications, and advanced engineering deployment
Open Source Deep Research Alternative to Reason and Search
List of independent blogs in Chinese
A book-in-progress about the Linux kernel and its insides
Set of tools to assess and improve LLM security
Pruna is a model optimization framework built for developers