OpenDAN is an open source Personal AI OS
Build AI-powered semantic search applications
Data loaders and abstractions for text and NLP
TensorRT LLM provides users with an easy-to-use Python API
LLM framework for document understanding and semantic retrieval
Low-latency REST API for serving text-embeddings
BISHENG is an open LLM devops platform for next generation apps
The Multi-Agent Framework
Interpretable prompting and models for NLP
Fast and memory-efficient exact attention
Easiest and laziest way for building multi-agent LLMs applications
An AI-powered file management tool that ensures privacy
Reflexion: Language Agents with Verbal Reinforcement Learning
Test-Time Reinforcement Learning
The official implementation of RAPTOR
NeurIPS2025 Spotlight] Quantized Attention
A frontier, first-principles handbook
A New Axis of Sparsity for Large Language Models
"Big Model" trains a visual multimodal VLM with 26M parameters
Gorilla: An API store for LLMs
Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge
Bring the notion of Model-as-a-Service to life
A unified library of SOTA model optimization techniques
Official inference repo for FLUX.2 models
Framework for building realtime multimodal voice AI agents apps