Apache InLong - a one-stop integration framework for massive data
A graph database that supports more than 100+ billion data
Big Data Stream Analytics Framework.
Distributed messaging and streaming platform with low latency
Unified metadata lake for data & AI assets.
World's first open source data quality & data preparation project
MapReduce-based tool to remove duplicate DNA reads
DSTK - DataScience ToolKit for All of Us
sparse and dense matrix, linear algebra, visualization, big data
Log-linear analysis (data modelling) for high-dimensional data
giServer the easy to use and extensible batch and integration server
Workflow Designer, Hive Editor, Pig Editor, File System Browser