LISA: Reasoning Segmentation via Large Language Model
InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System
Skywork-R1V is an advanced multimodal AI model series
A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
Code and models for ICML 2024 paper, NExT-GPT
CV, NLP, LLM project applications, and advanced engineering deployment
Multimodal Agents as Smartphone Users, an LLM-based multimodal agent
ImageBind: One Embedding Space to Bind Them All
An industrial-grade federated learning framework
Self-hosted & open-source anonymous 360 review software
A Powerful Native Multimodal Model for Image Generation
Achieving 3x+ generation speedup on reasoning tasks
MineContext is your proactive context-aware AI partner
Open-Source Financial Large Language Models
Easily compute CLIP embeddings and build a CLIP retrieval system
Python toolkit for OSINT and reconnaissance with 135+ modules
Run PyTorch LLMs locally on servers, desktop and mobile
One-stop solution for creating your digital avatar from chat history
An Efficient Agentic Model for Computer Use
An anomaly detection library comprising state-of-the-art algorithms
Generic templated configuration management for Kubernetes
JAX-based neural network library
A powerful and artistic UI library based on PyQt5
Code for Cicero, an AI agent that plays the game of Diplomacy
Bring the notion of Model-as-a-Service to life