Browse free open source Data Science tools and projects for Linux below. Use the toggles on the left to filter open source Data Science tools by OS, license, language, programming language, and project status.
RStudio is an integrated development environment (IDE) for R
An implementation of the Grammar of Graphics in R
Positron, a next-generation data science IDE
Scalable and Flexible Gradient Boosting
Vector database for scalable similarity search and AI applications
High-Performance Serverless event and data processing platform
Data science spreadsheet with Python & SQL
Project structure for doing and sharing data science work
Graphical User Interface Toolkit for Python with minimal dependencies
A framework for real-life data science
Train machine learning models within Docker containers
Automatic extraction of relevant features from time series
The Go kernel for Jupyter notebooks and nteract
Parallel computing with task scheduling
Always know what to expect from your data
Detecting silent model failure. NannyML estimates performance
Data science on data without acquiring a copy
GPU DataFrame Library
Build data pipelines, the easy way
A reactive notebook for Python
The data science OS
For building machine learning (ML) workflows and pipelines on AWS
Course materials for the Data Science Specialization on Coursera
Library providing end-to-end GPU-accelerated recommender systems
MCPower — simple Monte Carlo power analysis for complex models