Easy integration with Athena, Glue, Redshift, Timestream, Neptune
Data science on data without acquiring a copy
A curated list of data mining papers about fraud detection
Synthetic data generators for structured and unstructured text
Benchmarking synthetic data generation methods
An interactive Formula 1 race visualisation and data analysis tool
Pandas on AWS, easy integration with Athena, Glue, Redshift, etc.
BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)
Create HTML profiling reports from pandas DataFrame objects
A real-time visualisation of the CO2 emissions of electricity
Project structure for doing and sharing data science work
Python implementation of global optimization with gaussian processes
Make your own running home page
Clean Jupyter notebooks of outputs, metadata, and empty cells
Making DAG construction easier
Streamline your ML workflow
The toolkit to test, validate, and evaluate your models and surface
Diagram generation for understanding codebases and system architecture
The open standard for data logging
Training data (data labeling, annotation, workflow) for all data types
AutoGluon: AutoML for Image, Text, and Tabular Data
airda(Air Data Agent
Collaborative forensic timeline analysis
The open-source tool for building high-quality datasets
Always know what to expect from your data