Functional Machine Learning
Open-source data observability for analytics engineers
Toloka-Kit is a Python library for working with Toloka API
Data science on data without acquiring a copy
SQL-native memory layer enabling persistent context for AI agents
Dataset Management Framework, a Python library and a CLI tool to build
OpenRecall is a fully open-source, privacy-first alternative
A new Minecraft world editor and converter
Recap tracks and transform schemas across your whole application
Ralph is the CMDB / Asset Management system for data center
A Python utility / library to sort imports
Advanced LLM-powered brute-force tool combining AI intelligence
Open Security Controls Assessment Language (OSCAL)
A terminal spreadsheet multitool for discovering and arranging data
Training data (data labeling, annotation, workflow) for all data types
Fast and well tested serialization library on top of dataclasses
Always know what to expect from your data
Daily updated lists of cloud, bot, and service IP ranges
Easy integration with Athena, Glue, Redshift, Timestream, Neptune
OSINT tool for locating WiFi networks using BSSID or SSID data
A Python Automated Machine Learning tool that optimizes ML
A Python framework for creating reproducible, maintainable code
Language-model investigation agent with a terminal UI
Superduper: Integrate AI models and machine learning workflows
Python tool for browser-based interactive data apps in one file