A Pythonic framework to simplify AI service building
A unified framework for scalable computing
GPU environment management and cluster orchestration
LLM training code for MosaicML foundation models
A library for accelerating Transformer models on NVIDIA GPUs
PyTorch library of curated Transformer models and their components
A high-performance ML model serving framework, offers dynamic batching
Replace OpenAI GPT with another LLM in your app
Powering Amazon custom machine learning chips
Fast inference engine for Transformer models
State-of-the-art diffusion models for image and audio generation
Lightweight Python library for adding real-time multi-object tracking
Trainable, memory-efficient, and GPU-friendly PyTorch reproduction
Library for serving Transformers models on Amazon SageMaker
Open-Source AI Camera. Empower any camera/CCTV
Unified Model Serving Framework
The unofficial python package that returns response of Google Bard
Open platform for training, serving, and evaluating language models
A graphical manager for ollama that can manage your LLMs
High quality, fast, modular reference implementation of SSD in PyTorch
Database system for building simpler and faster AI-powered application
Serve machine learning models within a Docker container
Implementation of "Tree of Thoughts
Guide to deploying deep-learning inference networks
Deploy a ML inference service on a budget in 10 lines of code