Large Multimodal Models for Video Understanding and Editing
The repository provides code for running inference with SAM 2
The most intuitive, flexible, way for researchers to build models
Containerized automation engine for programmable CI/CD workflows
Focus on creating classic Python small examples and cases
ContextGem: Effortless LLM extraction from documents
An MCP server for interacting with Google Colab
Local RAG engine for private multimodal knowledge search on devices
Collection of Kaggle Solutions and Ideas
An agentless approach to automatically solve software development
Build a large language model from 0 only with Python foundation
Collection of Gemma 3 variants that are trained for performance
AI multi-agent platform for automated code security auditing system
SOTA Open Source TTS
A fast library for AutoML and tuning
A lightweight framework for building LLM-based agents
A reactive notebook for Python
Automatically Visualize any dataset, any size
machine learning tutorials (mainly in Python3)
4M: Massively Multimodal Masked Modeling
PyTorch code and models for V-JEPA self-supervised learning from video
The Unified Machine Learning Framework
The first real AI developer
Open deep learning compiler stack for cpu, gpu, etc.
The official PyTorch implementation of Google's Gemma models