A curated list of data mining papers about fraud detection
Mie scattering of light by perfect spheres
A tool for semi-automatic cell type classification, harmonization
Synthetic data generators for structured and unstructured text
An AI Hedge Fund Team
Self-hosted platform to unify wearable health data
Integrate multiple high-dimensional datasets with fuzzy k-means
Easy integration with Athena, Glue, Redshift, Timestream, Neptune
Benchmarking synthetic data generation methods
Always know what to expect from your data
Python module that helps you build complex pipelines of batch jobs
A curated list of insanely awesome libraries, packages and resources
Monitor the stability of a Pandas or Spark dataframe
Open-source data observability for analytics engineers
Dataset Management Framework, a Python library and a CLI tool to build
Data Preprocessing Automation: A GUI for easy data cleaning & visualiz
Tool for producing high quality forecasts for time series data
High-Performance Symbolic Regression in Python and Julia
The standard data-centric AI package for data quality and ML
Recap tracks and transform schemas across your whole application
A python wrapper for Alpha Vantage API for financial data.
Great Expectations Airflow operator
Data science on data without acquiring a copy
Make your own running home page
Automatically find issues in image datasets