The Iris Book: Addition, Subtraction, Multiplication, and Division
State-of-the-art 2D and 3D Face Analysis Project
A Lightweight Face Recognition and Facial Attribute Analysis
Awesome multilingual OCR toolkits based on PaddlePaddle
Robust Speech Recognition via Large-Scale Weak Supervision
Offline speech recognition API for Android, iOS, Raspberry Pi
Face recognition with deep neural networks
Speech recognition module for Python
NLP Cloud serves high performance pre-trained or custom models for NER
Multilingual speech recognition and audio understanding model
Speech-to-text, text-to-speech, and speaker recognition
Contexts Optical Compression
OCR software, free and offline
Audio foundation model excelling in audio understanding
Handwritten Text Recognition (HTR) system implemented with TensorFlow
High-Performance Face Recognition Library on PaddlePaddle & PyTorch
Open-source industrial-grade ASR models
kaldi-asr/kaldi is the official location of the Kaldi project
A ranked list of awesome machine learning Python libraries
Underthesea - Vietnamese NLP Toolkit
Image polygonal annotation with Python
A GUI tool for extracting hard-coded subtitle (hardsub) from videos
Uses Qwen3-ASR, local LLM, Whisper, TEN-VAD
Book_4_Matrix Power | The Iris Book: From Addition, Subtraction
Open-Source Python3 tool for recognizing layouts, tables, and math