Great Expectations Airflow operator
WebGL-based viewer for volumetric data
Synthetic data generators for structured and unstructured text
Detecting silent model failure. NannyML estimates performance
Data science on data without acquiring a copy
Dataset Management Framework, a Python library and a CLI tool to build
Open-source data observability for analytics engineers
Recap tracks and transforms schemas across your whole application
Training data (data labeling, annotation, workflow) for all data types
Easy integration with Athena, Glue, Redshift, Timestream, Neptune
Always know what to expect from your data
The power of Chart.js with Python
The toolkit to test, validate, and evaluate your models and surface
Benchmarking synthetic data generation methods
Make your own running home page
A tool for semi-automatic cell type classification, harmonization
A curated list of data mining papers about fraud detection
A more accurate representation of Jupyter notebooks
AutoGluon: AutoML for Image, Text, and Tabular Data
Integrate multiple high-dimensional datasets with fuzzy k-means
BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)
Python module that helps you build complex pipelines of batch jobs
Automatically find issues in image datasets
Python scripts for ETL (extract, transform and load) jobs for Ethereum
High-Performance Symbolic Regression in Python and Julia