Showing 780 open source projects for "python data analysis"

View related business solutions
  • Failed Payment Recovery for Subscription Businesses Icon
    Failed Payment Recovery for Subscription Businesses

    For subscription companies searching for a failed payment recovery solution to grow revenue, and retain customers.

    FlexPay’s innovative platform uses multiple technologies to achieve the highest number of retained customers, resulting in reduced involuntary churn, longer life span after recovery, and higher revenue. Leading brands like LegalZoom, Hooked on Phonics, and ClinicSense trust FlexPay to recover failed payments, reduce churn, and increase customer lifetime value.
    Learn More
  • Premier Construction Software Icon
    Premier Construction Software

    Premier is a global leader in financial construction ERP software.

    Rated #1 Construction Accounting Software by Forbes Advisor in 2022 & 2023. Our modern SAAS solution is designed to meet the needs of General Contractors, Developers/Owners, Homebuilders & Specialty Contractors.
    Learn More
  • 1
    CNN for Image Retrieval
    cnn-for-image-retrieval is a research-oriented project that demonstrates the use of convolutional neural networks (CNNs) for image retrieval tasks. The repository provides implementations of CNN-based methods to extract feature representations from images and use them for similarity-based retrieval. It focuses on applying deep learning techniques to improve upon traditional handcrafted descriptors by learning features directly from data. The code includes training and evaluation scripts that...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 2
    fastNLP

    fastNLP

    fastNLP: A Modularized and Extensible NLP Framework

    fastNLP is a lightweight framework for natural language processing (NLP), the goal is to quickly implement NLP tasks and build complex models. A unified Tabular data container simplifies the data preprocessing process. Built-in Loader and Pipe for multiple datasets, eliminating the need for preprocessing code. Various convenient NLP tools, such as Embedding loading (including ELMo and BERT), intermediate data cache, etc.. Provide a variety of neural network components and recurrence models (covering tasks such as Chinese word segmentation, named entity recognition, syntactic analysis, text classification, text matching, metaphor resolution, summarization, etc.). ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    OpenFrames

    OpenFrames

    Real-time interactive 3D graphics API for scientific simulations

    OpenFrames has moved its primary development repository to GitHub! Everything else will follow. Get it at https://github.com/ravidavi/OpenFrames/wiki OpenFrames is an Application Programming Interface (API) that allows developers to provides the ability to add interactive 3D graphics to any scientific simulation. A simulation developer can use OpenFrames to specify what they want to visualize, without having to know any details of computer graphics programming. OpenFrames is currently...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    Neural Networks Collection

    Neural Networks Collection

    Neural Networks Collection

    This project implements in C++ a bunch of known Neural Networks. So far the project implements: LVQ in several variants, SOM in several variants, Hopfield network and Perceptron. Other neural network types are planned, but not implemented yet. The project can run in two modes: command line tool and Python 7.2 extension. Currently, Python version appears more functional, as it allows easy interaction with algorithms developed by other people.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Simplify Purchasing For Your Business Icon
    Simplify Purchasing For Your Business

    Manage what you buy and how you buy it with Order.co, so you have control over your time and money spent.

    Simplify every aspect of buying for your business in Order.co. From sourcing products to scaling purchasing across locations to automating your AP and approvals workstreams, Order.co is the platform of choice for growing businesses.
    Learn More
  • 5
    NLP Architect

    NLP Architect

    A model library for exploring state-of-the-art deep learning

    NLP Architect is an open-source Python library for exploring state-of-the-art deep learning topologies and techniques for optimizing Natural Language Processing and Natural Language Understanding neural networks. The library includes our past and ongoing NLP research and development efforts as part of Intel AI Lab. NLP Architect is designed to be flexible for adding new models, neural network components, data handling methods, and for easy training and running models. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    gradslam

    gradslam

    gradslam is an open source differentiable dense SLAM library

    gradslam is an open-source framework providing differentiable building blocks for simultaneous localization and mapping (SLAM) systems. We enable the usage of dense SLAM subsystems from the comfort of PyTorch. The question of “representation” is central in the context of dense simultaneous localization and mapping (SLAM). Newer learning-based approaches have the potential to leverage data or task performance to directly inform the choice of representation. However, learning representations...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 7
    Zipline

    Zipline

    Zipline, a Pythonic algorithmic trading library

    ...Zipline is currently used in production as the backtesting and live-trading engine powering Quantopian -- a free, community-centered, hosted platform for building and executing trading strategies. Quantopian also offers a fully managed service for professionals that includes Zipline, Alphalens, Pyfolio, FactSet data, and more. Installing Zipline is slightly more involved than the average Python package. For a development installation (used to develop Zipline itself), create and activate a virtualenv, then run the etc/dev-install script. Please note that Zipline is not a community-led project. Zipline is maintained by the Quantopian engineering team, and we are quite small and often busy.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    bioweb

    bioweb

    polyglot language framework to analyze genetic data

    polyglot framework using Python/C++/JavaScript to fast develop applications to analyze biological sequences
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    Makani

    Makani

    Makani was developed a commercial-scale airborne wind turbine

    ...This open-source repository contains the complete software stack that powered Makani’s research and flight systems, including the flight simulator, autopilot controller, avionics firmware, visualization tools, and ground control software. The software enables simulation, control, and analysis of the Makani M600 turbine system, designed to operate offshore and autonomously manage complex aerodynamic behaviors. Built primarily in C++ and Python, the codebase integrates real-time flight control, sensor fusion, aerodynamic modeling, and telemetry visualization. The project also provides comprehensive simulation environments for studying airborne wind power systems and flight dynamics. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Rezku Point of Sale Icon
    Rezku Point of Sale

    Designed for Real-World Restaurant Operations

    Rezku is an all-inclusive ordering platform and management solution for all types of restaurant and bar concepts. You can now get a fully custom branded downloadable smartphone ordering app for your restaurant exclusively from Rezku.
    Learn More
  • 10
    Metrix++

    Metrix++

    Management of source code quality is possible.

    The project has been moved to https://github.com/metrixplusplus/metrixplusplus ______________________ Metrix++ is an extendable tool to collect and analyse code metrics. - Multiple languages supported - Multiple metrics available - Configurable. Every metric has got 'turn-on' and other configuration options. There are no predefined thresholds for metrics or rules. You can choose and configure any limit you want. - High-performance. Processes thousands of files per minutes. -...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    Pretty Damn Quick (PDQ) analytically solves queueing network models of computer and manufacturing systems, data networks, etc., written in conventional programming languages. Generic or customized reports of predicted performance measures are output.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 12
    AWS IoT Greengrass Core SDK

    AWS IoT Greengrass Core SDK

    SDK to use with functions running on Greengrass Core using Python

    The AWS IoT Greengrass Core SDK is meant to be used by AWS Lambda functions running on an AWS IoT Greengrass Core. It will enable Lambda functions to invoke other Lambda functions deployed to the Greengrass Core, publish messages to the Greengrass Core and work with the local Shadow service. To use the AWS IoT Greengrass Core SDK, you must first import the AWS IoT Greengrass Core SDK in your Lambda function as you would with any other external libraries. You then need to create a client for...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    Frontend Regression Validator (FRED)

    Frontend Regression Validator (FRED)

    Visual regression tool used to compare baseline and updated instances

    Visual regression tool used to compare baseline and updated instances of a website in a deployment pipeline. FRED is an opensource visual regression tool used to compare two instances of a website. FRED is responsible for automatic visual regression testing, with the purpose of ensuring that functionality is not broken by comparing a current(baseline) and an updated version of a website. The visual analysis computes the Normalized Mean Squared error and the Structural Similarity Index on the...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    lzhw

    lzhw

    LZHW Windows command line lossless compression tool for tabular files

    LZHW Command Line Lossless Compression Tool is a Windows command line tool used to compress and decompress files from and to any form, csv, excel etc without any dependencies or installations. Using an optimized algorithm (LZHW) developed from Lempel-Ziv, Huffman and LZ-Welch algorithms. The tool can work in parallel and most of its code is written in Cython, so it is pretty fast. It is based on python lzhw library. Full tool documentation can be found at:...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Flasgger

    Flasgger

    Easy OpenAPI specs and Swagger UI for your Flask API

    ...Flasgger also comes with SwaggerUI embedded so you can access it and visualize and interact with your API resources. Flasgger also provides validation of the incoming data, using the same specification it can validate if the data received as a POST, PUT, PATCH is valid against the schema defined using YAML, Python dictionaries or Marshmallow Schemas. Flasgger can work with simple function views or MethodViews using docstring as specification, or using @swag_from decorator to get specification from YAML or dict and also provides SwaggerView which can use Marshmallow Schemas as specification. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    interactive-coding-challenges

    interactive-coding-challenges

    120+ interactive Python coding interview challenges

    Interactive Coding Challenges is a collection of practice problems designed to strengthen data structures, algorithms, and problem-solving skills. The repository emphasizes a learn-by-doing approach: you read a prompt, attempt a solution, and verify behavior with tests, often within notebooks or scripts. Problems span arrays, strings, stacks, queues, linked lists, trees, graphs, dynamic programming, and more, mirroring common interview themes. Many challenges include hints and reference...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    ebfformat

    ebfformat

    An Efficient Binary data Format

    ...It is also designed to simplify the programming of input output routines in different programming languages. In a nutshell an EBF file is a collection of data objects. Each data object is specified by a unique name and a single file can have multiple data objects. Each data object is preceded by a meta-data or header which describes the binary data associated with it. Among other things, this header allows the files to be portable across systems with different endianess.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 18
    HyperGAN

    HyperGAN

    Composable GAN framework with api and user interface

    A composable GAN built for developers, researchers, and artists. HyperGAN builds generative adversarial networks in PyTorch and makes them easy to train and share. HyperGAN is currently in pre-release and open beta. Everyone will have different goals when using hypergan. HyperGAN is currently beta. We are still searching for a default cross-data-set configuration. Each of the examples supports search. Automated search can help find good configurations. If you are unsure, you can start with...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Forecasting Best Practices

    Forecasting Best Practices

    Time Series Forecasting Best Practices & Examples

    ...Rather than creating implementations from scratch, we draw from existing state-of-the-art libraries and build additional utilities around processing and featuring the data, optimizing and evaluating models, and scaling up to the cloud. The examples and best practices are provided as Python Jupyter notebooks and R markdown files and a library of utility functions.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Tofu

    Tofu

    Tofu is a Python tool for generating synthetic UK Biobank data

    Tofu is a Python library for generating synthetic UK Biobank data. The UK Biobank is a large open-access prospective research cohort study of 500,000 middle-aged participants recruited in England, Scotland and Wales. The study has collected and continues to collect extensive phenotypic and genotypic detail about its participants, including data from questionnaires, physical measures, sample assays, accelerometry, multimodal imaging, genome-wide genotyping and longitudinal follow-up for a wide range of health-related outcomes. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    MMdnn

    MMdnn

    Tools to help users inter-operate among deep learning frameworks

    MMdnn is a set of tools to help users inter-operate among different deep learning frameworks. E.g. model conversion and visualization. Convert models between Caffe, Keras, MXNet, Tensorflow, CNTK, PyTorch Onnx and CoreML. MMdnn is a comprehensive and cross-framework tool to convert, visualize and diagnose deep learning (DL) models. The "MM" stands for model management, and "dnn" is the acronym of deep neural network. We implement a universal converter to convert DL models between frameworks,...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    SageMaker Containers

    SageMaker Containers

    Create SageMaker-compatible Docker containers

    Amazon SageMaker is a fully managed service for data science and machine learning (ML) workflows. You can use Amazon SageMaker to simplify the process of building, training, and deploying ML models. To train a model, you can include your training script and dependencies in a Docker container that runs your training code. A container provides an effectively isolated environment, ensuring a consistent runtime and reliable training process. The SageMaker Training Toolkit can be easily added to...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    PerfKit Benchmarker

    PerfKit Benchmarker

    PerfKit Benchmarker (PKB) contains a set of benchmarks

    PerfKitBenchmarker is an open-source benchmarking framework designed to measure and compare the performance of cloud infrastructure across multiple providers in a consistent and reproducible way. It allows users to evaluate metrics such as latency, throughput, provisioning time, and system performance using a standardized set of benchmarks. The tool supports a wide range of environments, including major cloud platforms, Kubernetes clusters, and even local hardware, making it highly versatile...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 24
    Brand new cheatsheets and handouts

    Brand new cheatsheets and handouts

    Matplotlib 3.1 cheat sheet

    The Brand new cheatsheets and handouts repo is a compact, quick-reference summary of the most commonly used plotting commands and configurations in Matplotlib, intended to serve as a handy reference for experienced users who want to recall syntax or find the right function without digging into full documentation. It lays out common use cases (plot types, styling, figure configuration, saving/exporting, subplot layout, etc.) in a concise and organized format — often serving as a “cheat sheet”...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    Albedo

    Albedo

    A recommender system for discovering GitHub repos

    Albedo is an open-source recommender system aimed at helping developers discover GitHub repositories by learning from activity signals. It treats repositories and developers as a graph of interactions and applies large-scale matrix factorization to model affinities, with Apache Spark providing the distributed data processing. The project focuses on implicit feedback—stars, watches, and other engagement metrics—so it can build useful recommendations without explicit ratings. A reproducible...
    Downloads: 0 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB