A natural language interface for computers
OpenAI-style API for open large language models
Industrial-strength Natural Language Processing (NLP)
Build AI-powered semantic search applications
Semantic search and workflows for medical/scientific papers
Stanford NLP Python library for many human languages
Data and tools for generating and inspecting OLMo pre-training data
ExtractThinker is a Document Intelligence library for LLMs
The no-nonsense RAG chunking library
An LLM-powered knowledge curation system that researches topics
ReFT: Representation Finetuning for Language Models
Haystack is an open-source NLP framework to interact with your data
The Classical Language Toolkit
Efficient Retrieval Augmentation and Generation Framework
Han Language Processing
Large Language Model Text Generation Inference
The library to build & auto-optimize LLM applications
Bring the notion of Model-as-a-Service to life
Trained models & code to predict toxic comments
A Heterogeneous Benchmark for Information Retrieval
Data processing for and with foundation models
Module for automatic summarization of text documents and HTML pages
A Repo For Document AI
Hub of ready-to-use datasets for ML models
Extract schema, statistics and entities from datasets