Open Source Python Sound/Audio Software

Sort By:

Python Sound/Audio Software

Sound/Audio Python Clear Filters

Browse free open source Python Sound/Audio Software and projects below. Use the toggles on the left to filter open source Python Sound/Audio Software by OS, license, language, programming language, and project status.

AestheticsPro Medical Spa Software
Our new software release will dramatically improve your medspa business performance while enhancing the customer experience

AestheticsPro is the most complete Aesthetics Software on the market today. HIPAA Cloud Compliant with electronic charting, integrated POS, targeted marketing and results driven reporting; AestheticsPro delivers the tools you need to manage your medical spa business. It is our mission To Provide an All-in-One Cutting Edge Software to the Aesthetics Industry.

Learn More
Turn traffic into pipeline and prospects into customers
For account executives and sales engineers looking for a solution to manage their insights and sales data

Docket is an AI-powered sales enablement platform designed to unify go-to-market (GTM) data through its proprietary Sales Knowledge Lake™ and activate it with intelligent AI agents. The platform helps marketing teams increase pipeline generation by 15% by engaging website visitors in human-like conversations and qualifying leads. For sales teams, Docket improves seller efficiency by 33% by providing instant product knowledge, retrieving collateral, and creating personalized documents. Built for GTM teams, Docket integrates with over 100 tools across the revenue tech stack and offers enterprise-grade security with SOC 2 Type II, GDPR, and ISO 27001 compliance. Customers report improved win rates, shorter sales cycles, and dramatically reduced response times. Docket’s scalable, accurate, and fast AI agents deliver reliable answers with confidence scores, empowering teams to close deals faster.

Learn More
1

LAME (Lame Aint an MP3 Encoder)

A high quality MP3 encoder

LAME is an educational tool to be used for learning about MP3 encoding. The goal of the LAME project is to improve the psycho acoustics, quality and speed of MP3 encoding. Note: we provide source code only!

Downloads: 21,741 This Week

Last Update: 2023-12-18
See Project
2

spotDL

Download your Spotify playlists and songs along with album art

spotDL is a command-line tool that allows users to download songs and playlists from Spotify by sourcing the audio from YouTube. Built in Python, it automatically matches Spotify tracks with corresponding videos on YouTube and downloads them with embedded metadata. The tool retrieves important information such as album art, song titles, artist names, and lyrics to organize downloaded files. spotDL is designed to be fast, accurate, and easy to use through a simple command-line interface. It supports downloading individual songs, albums, or entire playlists by providing Spotify URLs or search queries. By combining Spotify metadata with YouTube audio sources, spotDL provides a convenient way to build a local music library.

Downloads: 100 This Week

Last Update: 2025-10-08
See Project
3

savonet

Audio toolkit for streaming, aiming at complex Internet radio systems. Our stream generator is highly extensible and flexible. Script based, it can be a fully automatic daemon streamer, but is also used for live shows with an user-friendly GUI.

7 Reviews

Downloads: 370 This Week

Last Update: 2016-05-24
See Project
4

EasyABC

EasyABC is an open source ABC editor

EasyABC allows the user to create, edit, view, play, convert music written in the ABC music notation language. The program was originally written in Python 2.7 and WxPython by Nils Liberg and runs on Windows, OSX, and Linux. Jan Wybren de Jong has converted to run on Python 3.8 or higher. Frédéric Aupépin has been supporting EasyABC on OSX. EasyABC depends upon other external programs like abc2midi, abcm2ps, fluidsynth. If you install the Windows or Mac executables most of these programs are automatically included.

8 Reviews

Downloads: 250 This Week

Last Update: 2025-01-12
See Project
Award-Winning Medical Office Software Designed for Your Specialty
Succeed and scale your practice with cloud-based, data-backed, AI-powered healthcare software.

RXNT is an ambulatory healthcare technology pioneer that empowers medical practices and healthcare organizations to succeed and scale through innovative, data-backed, AI-powered software.

Learn More
5

Spleeter

Deezer source separation library including pretrained models

Spleeter is the Deezer source separation library with pretrained models written in Python and using Tensorflow. It makes it easy to train music source separation models (assuming you have a dataset of isolated sources), and provides already trained state of the art models for performing various flavours of separation. 2 stems and 4 stems models have state of the art performances on the musdb dataset. Spleeter is also very fast as it can perform separation of audio files to 4 stems 100x faster than real-time when run on a GPU. We designed Spleeter so you can use it straight from command line as well as directly in your own development pipeline as a Python library. It can be installed with Conda, with pip or be used with Docker.

1 Review

Downloads: 39 This Week

Last Update: 2021-09-03
See Project
6

Spatial Media

Specifications and tools for 360º video and spatial audio

spatial-media provides tools for working with spherical video and spatial audio metadata so players and platforms can correctly render immersive media. The utilities inject, inspect, and extract metadata in common container formats (MP4/WebM) to signal 360° projection type, stereoscopy mode, and spatial audio layout. Creators use it to prepare 360/VR180 assets for upload so services know whether a video is monoscopic, top-bottom stereo, or side-by-side, and whether ambisonic audio is present. By separating metadata handling from encoding, the tools slot into existing editing/export workflows without requiring specialized NLE support. They help ensure assets play back with the right projection, field of view, and head-tracked audio in compliant players. For archive and validation use cases, the scripts can also read existing flags to diagnose why a video isn’t being recognized as immersive.

Downloads: 38 This Week

Last Update: 2025-10-10
See Project
7

HuMidi

Play MIDI like a human in ROBLOX with automatic sustain pedals.

HuMidi is a universal piano auto player for ROBLOX. It plays even the most generic MIDI file with great depth, thanks to the humanization and automatic sustain pedal generation algorithms. The pedals are generated with a thorough analysis of the MIDI data, adding more depth to your performance that no other MIDI players could! Personally have been tested in: - Visual Pianos - Starving Pianists - Digital Piano - Piano's Got Talent This tool is universal, as long as the piano in-game allows to be played on a keyboard.

1 Review

Downloads: 306 This Week

Last Update: 2026-03-18
See Project
8

VCClient

Software that uses AI to perform real-time voice conversion

VCClient is a real-time voice conversion system that uses machine learning models to transform a speaker’s voice into another voice with minimal latency. It is designed for live applications such as streaming, gaming, and virtual communication, where immediate feedback is essential. The system supports multiple voice conversion models, including RVC and other neural network-based approaches, allowing users to switch between different voices or customize their output. It provides both a graphical user interface and API access, making it suitable for casual users as well as developers who want to integrate voice transformation into their own applications. The project also supports GPU acceleration, enabling faster inference and smoother real-time performance on compatible hardware. Additionally, it includes tools for training and managing voice models, giving users the ability to create personalized voice profiles.

Downloads: 20 This Week

Last Update: 2026-03-23
See Project
9

DeepSpeech

Open source embedded speech-to-text engine

DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers. DeepSpeech is an open-source Speech-To-Text engine, using a model trained by machine learning techniques based on Baidu's Deep Speech research paper. Project DeepSpeech uses Google's TensorFlow to make the implementation easier. A pre-trained English model is available for use and can be downloaded following the instructions in the usage docs. If you want to use the pre-trained English model for performing speech-to-text, you can download it (along with other important inference material) from the DeepSpeech releases page.

Downloads: 18 This Week

Last Update: 2021-04-08
See Project
The Most Powerful Software Platform for EHSQ and ESG Management
Addresses the needs of small businesses and large global organizations with thousands of users in multiple locations.

Choose from a complete set of software solutions across EHSQ that address all aspects of top performing Environmental, Health and Safety, and Quality management programs.

Learn More
10

SpeechRecognition

Speech recognition module for Python

Library for performing speech recognition, with support for several engines and APIs, online and offline. Recognize speech input from the microphone, transcribe an audio file, save audio data to an audio file. Show extended recognition results, calibrate the recognizer energy threshold for ambient noise levels (see recognizer_instance.energy_threshold for details). Listening to a microphone in the background, various other useful recognizer features. The easiest way to install this is using pip install SpeechRecognition. The first software requirement is Python 2.6, 2.7, or Python 3.3+. This is required to use the library. PyAudio is required if and only if you want to use microphone input (Microphone). PyAudio version 0.2.11+ is required, as earlier versions have known memory management bugs when recording from microphones in certain situations. To hack on this library, first make sure you have all the requirements listed in the "Requirements" section.

Downloads: 17 This Week

Last Update: 2026-04-05
See Project
11

cwtext text to morse code converter

Convert text to International Morse Code. Input is ASCII text. Output can be: - . -..- - on the console, raw 8bit PCM suitable for piping to /dev/audio, .wav files or even (mp3|ogg). Good for headlines on your MP3 player or code practice.

7 Reviews

Downloads: 66 This Week

Last Update: 2013-03-22
See Project
12

GNU Solfege

GNU Solfege is *free* ear training software written in Python 3.4 using the Gtk+ 3 toolkit. The program is designed to be easily extended with lesson files (data files), so the user can create new exercises.

23 Reviews

Downloads: 53 This Week

Last Update: 2016-06-24
See Project
13

Radio Tray

Radio Tray is a streaming player for listening to online radios. Any bug or feature request should be reported in https://bitbucket.org/carlmig/radio-tray/issues

23 Reviews

Downloads: 50 This Week

Last Update: 2013-05-01
See Project
14

Swing Music

Swing Music is a beautiful, self-hosted music player

Swing Music is a beautiful, self-hosted music player and streaming server that lets you bring your personal audio library online with a modern browser-based interface, giving you a private alternative to mainstream streaming services. Designed to be both elegant and powerful, the project scans your local music files (like MP3s or FLACs), organizes metadata, and streams them on-demand to any device with a browser or its Android client. It includes features like folder browsing, playlist creation, related artists/albums exploration, and intelligent metadata normalization so your collection feels clean and connected. The interface handles fuzzy search so typos don’t stop you from finding tracks, supports silence detection and crossfading for a smoother listening experience, and provides statistics about your listening habits to help you understand your personal music library.

Downloads: 11 This Week

Last Update: 2026-02-04
See Project
15

Librosa

Python library for audio and music analysis

Librosa is a powerful Python library for analyzing and processing audio and music signals. Built on top of NumPy, SciPy, and matplotlib, it provides a wide range of tools for feature extraction, time-series manipulation, audio display, and music information retrieval. Whether you're building machine learning models for audio classification or visualizing spectrograms, Librosa is a go-to library for researchers and developers working in audio signal processing.

Downloads: 10 This Week

Last Update: 2025-07-03
See Project
16

Podcastfy.ai

Transforming Multimodal Content into Captivating Multilingual Audio

Podcastfy is an open-source Python package that transforms multi-modal content (text, images) into engaging, multi-lingual audio conversations using GenAI. Input content includes websites, PDFs, youtube videos as well as images. Unlike UI-based tools focused primarily on note-taking or research synthesis (e.g. NotebookLM), Podcastfy focuses on the programmatic and bespoke generation of engaging, conversational transcripts and audio from a multitude of multi-modal sources enabling customization and scale.

Downloads: 10 This Week

Last Update: 2024-11-16
See Project
17

Spotify Music Downloader

Spotify Music Downloader

Download music from Spotify and other music sources.

1 Review

Downloads: 10 This Week

Last Update: 2022-04-27
See Project
18

Tauon

The music player of today

Tauon is a modern, streamlined music player app that's packed with features! An emphasis on playlists and drag-and-drop importing puts you in control of your music library. Faded volume control, 24-bit FLAC support, and gapless playback provide the ultimate listening experience. Excellent CUE sheet support, an original smart playlist system, and network playback from koel or Airsonic servers. Last.fm, Listenbrainz, and Maloja scribbling. MPRIS2 support for desktop integration. Tauon is a playlist-oriented music player that makes the assumption that folders are albums. (You may find the words "album" and "folder" used interchangeably within the interface and documentation) While some features and interface elements may not be immediately intuitive, I hope that once you are familiar, you will find that Tauon is pleasant, fast, and fun to use.

Downloads: 9 This Week

Last Update: 2026-04-05
See Project
19

Curlew Multimedia Converter

Easy to use Multimedia Converter for Linux

8 Reviews

Downloads: 61 This Week

Last Update: 2018-05-26
See Project
20

castero

TUI podcast client for the terminal

castero is a TUI podcast client for the terminal.

Downloads: 8 This Week

Last Update: 2024-09-18
See Project
21

AudioCraft

Audiocraft is a library for audio processing and generation

AudioCraft is a PyTorch library for text-to-audio and text-to-music generation, packaging research models and tooling for training and inference. It includes MusicGen for music generation conditioned on text (and optionally melody) and AudioGen for text-conditioned sound effects and environmental audio. Both models operate over discrete audio tokens produced by a neural codec (EnCodec), which acts like a tokenizer for waveforms and enables efficient sequence modeling. The repo provides inference scripts, checkpoints, and simple Python APIs so you can generate clips from prompts or incorporate the models into applications. It also contains training code and recipes, so researchers can fine-tune on custom data or explore new objectives without building infrastructure from scratch. Example notebooks, CLI tools, and audio utilities help with prompt design, conditioning on reference audio, and post-processing to produce ready-to-share outputs.

Downloads: 7 This Week

Last Update: 2025-10-13
See Project
22

yami

An open-source music player with simple UI

Yami is a lightweight, open-source music player built in Python. It focuses on simplicity and ease of use, providing an intuitive user interface (UI) for users to manage and play their music. Whether you're playing local files or downloading from online sources using spotdl, Yami offers a seamless experience. This project is designed for users who want a minimalistic, cross-platform music player with the ability to integrate external sources like Spotify/YouTube Music.

Downloads: 7 This Week

Last Update: 2025-11-03
See Project
23

PyKaraoke

PyKaraoke is a cross-platform karaoke player. It currently supports CDG (MP3+G, OGG+G, WAV+G), MIDI (.KAR, .MID) and MPEG formats.

5 Reviews

Downloads: 41 This Week

Last Update: 2013-04-25
See Project
24

PyTube Downloader

Let's quickly download YouTube videos & playlists with one click.

PyTube Downloader lets you quickly download YouTube videos and playlists with one click. Choose from 144p to 8K quality and download multiple videos simultaneously. PyTube Downloader 让你轻松一键下载 YouTube 视频和播放列表。支持选择 144p 到 8K 的质量，并支持同时下载多个视频。

1 Review

Downloads: 97 This Week

Last Update: 2026-01-06
See Project
25

StreamTuner2 ♪♬#

Internet radio directory browser

Streamtuner2 is an internet radio station and video browser. It simply lists stations in categories from different directories. Launches your preferred media apps for playback. It's built in Python now, but retains UI similarity with the original StreamTuner 0.99

6 Reviews

Downloads: 49 This Week

Last Update: 2022-02-22
See Project