No-code LLM Platform to launch APIs and ETL Pipelines
Official Repo For Sa2VA: Marrying SAM2 with LLaVA
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
"Big Model" trains a visual multimodal VLM with 26M parameters
Elyra extends JupyterLab with an AI-centric approach
OCR expert VLM powered by Hunyuan's native multimodal architecture
Inference script for Oasis 500M
Handwritten Text Recognition (HTR) system implemented with TensorFlow
Fast, powerful, git-native ticket tracking in a single bash script
ICLR 2024 Spotlight: curation/training code, metadata, distribution
Flexible Photo Recrafting While Preserving Your Identity
Python package for AutoML on Tabular Data with Feature Engineering
Guiding Instruction-based Image Editing via Multimodal Large Language
Official code for Style Aligned Image Generation via Shared Attention
Creation of a Taylorplot for several machine learning models
Code release for ConvNeXt model
Machine learning glossary
GLIDE: a diffusion-based text-conditional image synthesis model
Generative Adversarial Transformers
All-in-one web-based IDE specialized for machine learning
A real-time approach for mapping all human pixels of 2D RGB images
Visual tracking library based on PyTorch
Convolutional Neural Networks to predict aesthetic quality of images
Cross Audio-Visual Recognition using 3D Architectures