Search Results for "voice command java"

Sort By:

Showing 3632 open source projects for "voice command java"

View related business solutions

Collect! is a highly configurable debt collection software
Everything that matters to debt collection, all in one solution.

The flexible & scalable debt collection software built to automate your workflow. From startup to enterprise, we have the solution for you.

Learn More
Field Sales+ for MS Dynamics 365 and Salesforce
Maximize your sales performance on the go.

Bring Dynamics 365 and Salesforce wherever you go with Resco’s solution. With powerful offline features and reliable data syncing, your team can access CRM data on mobile devices anytime, anywhere. This saves time, cuts errors, and speeds up customer visits.

Learn More
1

Real-Time Voice Cloning

Clone a voice in 5 seconds to generate arbitrary speech in real-time

...In the first stage, short audio clips are converted into a fixed-dimensional speaker embedding that captures voice characteristics; this embedding is then used by a Tacotron-style synthesizer to generate spectrograms from text, which a WaveRNN-based vocoder finally turns into audio. The repo includes both a command-line demo and a graphical “toolbox” application where you can load reference voices, type text, and hear the synthesized results interactively.

Downloads: 16 This Week

Last Update: 2026-03-09
See Project
2

OmniVoice

High-Quality Voice Cloning TTS for 600+ Languages

The OmniVoice project is a cutting-edge multilingual text-to-speech system designed to generate high-quality speech across more than 600 languages. Built on a diffusion language model-style architecture, it combines scalability with strong performance, enabling both natural-sounding voice synthesis and efficient inference speeds. One of its most notable capabilities is zero-shot voice cloning, allowing users to replicate a speaker’s voice using only a short reference audio clip. In addition, it supports voice design through configurable attributes such as gender, accent, pitch, and speaking style, giving users fine-grained control over generated speech. ...

Downloads: 5 This Week

Last Update: 7 days ago
See Project
3

OpenAI-Compatible Edge-TTS API

Free, high-quality text-to-speech API endpoint to replace OpenAI

...A Docker image is provided for one-command deployment, and environment variables can be used to configure default voice, language, response format, authentication, and logging options.

Downloads: 2 This Week

Last Update: 2025-11-28
See Project
4

FFsubsync

Automagically synchronize subtitles with video

Language-agnostic automatic synchronization of subtitles with video, so that subtitles are aligned to the correct starting point within the video. First, make sure ffmpeg is installed. Make sure ffmpeg is on your path and can be referenced from the command line! Next, grab the script. It should work with both Python 2 and Python 3. There may be occasions where you have a correctly synchronized srt file in a language you are unfamiliar with, as well as an unsynchronized srt file in your native language. In this case, you can use the correctly synchronized srt file directly as a reference for synchronization, instead of using the video as the reference. ffsubsync uses the file extension to decide whether to perform voice activity detection on the audio or to directly extract speech from an srt file. ffsubsync usually finishes in 20 to 30 seconds, depending on the length of the video.

Downloads: 41 This Week

Last Update: 2025-11-24
See Project
Rezku Point of Sale
Designed for Real-World Restaurant Operations

Rezku is an all-inclusive ordering platform and management solution for all types of restaurant and bar concepts. You can now get a fully custom branded downloadable smartphone ordering app for your restaurant exclusively from Rezku.

Learn More
5

SafeClaw

Chat with it via text and voice

SafeClaw is an open-source, entirely local alternative to cloud-based AI assistants like OpenClaw, enabling users to build a personal assistant that runs on their own machine without incurring API usage charges or exposing data to third-party services. It emphasizes privacy and predictability by using traditional programming, rule-based intent parsing, and established machine learning tools rather than large language models, meaning there are no per-token API costs and deterministic...

Downloads: 5 This Week

Last Update: 2026-03-24
See Project
6

peon-ping

Warcraft III Peon voice notifications (+ more!) for Claude Code

Peon-ping is a quirky utility that brings fun and practical voice notifications to your development workflow by using Warcraft III peon-style sound effects whenever significant events occur in your code editor or terminal. The project is built around the idea of reducing cognitive load by audibly alerting you when processes finish, tests fail, or language models complete responses, helping you stay focused without constantly watching the screen.

Downloads: 5 This Week

Last Update: 3 days ago
See Project
7

pyVideoTrans

Translate the video from one language to another and embed dubbing

...The tool supports both command-line and GUI modes, making it accessible to developers and creatives needing batch or automated processing.

Downloads: 12 This Week

Last Update: 2026-04-14
See Project
8

Audiblez

Generate audiobooks from e-books

Audiblez is a tool for generating high-quality .m4b audiobooks directly from .epub e-books using the Kokoro-82M neural text-to-speech model. It focuses on making audiobook creation easy and fast: from a single command, the tool splits an e-book into chapters, synthesizes audio for each section, and then merges the results into a structured audiobook with chapter-based WAV files and a final .m4b container. The Kokoro-82M model it uses is compact (82M parameters) yet natural sounding, trained...

Downloads: 7 This Week

Last Update: 2025-11-30
See Project
9

Harbor LLM

Run a full local LLM stack with one command using Docker

Harbor is an open source, containerized toolkit designed to simplify running local large language model (LLM) environments. It combines a CLI and companion app to launch backends, frontends, and supporting services with minimal setup. With a single command, users can start preconfigured tools like Ollama and Open WebUI, enabling chat, workflows, and integrations immediately. Harbor supports multiple inference engines, including llama.cpp and vLLM, and connects them seamlessly to user interfaces. It also includes tools for web retrieval, image generation, voice interaction, and workflow automation. ...

Downloads: 3 This Week

Last Update: 5 days ago
See Project
Skillfully - The future of skills based hiring
Realistic Workplace Simulations that Show Applicant Skills in Action

Skillfully transforms hiring through AI-powered skill simulations that show you how candidates actually perform before you hire them. Our platform helps companies cut through AI-generated resumes and rehearsed interviews by validating real capabilities in action. Through dynamic job specific simulations and skill-based assessments, companies like Bloomberg and McKinsey have cut screening time by 50% while dramatically improving hire quality.

Learn More
10

JADX

Dex to Java decompiler

Command-line and GUI tools for producing Java source code from Android Dex and apk files. Decompile Dalvik bytecode to java classes from APK, dex, aar, aab and zip files. Decode AndroidManifest.xml and other resources from resources.arsc. Deobfuscator included. Be aware, Android Studio can interfere with jadx debugger, so it is better to close it before attaching to the process in jadx.

Downloads: 1,770 This Week

Last Update: 2026-02-25
See Project
11

sherpa-onnx

Speech-to-text, text-to-speech, and speaker recognition

Speech-to-text, text-to-speech, and speaker recognition using next-gen Kaldi with onnxruntime without an Internet connection. Support embedded systems, Android, iOS, Raspberry Pi, RISC-V, x86_64 servers, websocket server/client, C/C++, Python, Kotlin, C#, Go, NodeJS, Java, Swift, Dart, JavaScript, Flutter.

Downloads: 167 This Week

Last Update: 4 days ago
See Project
12

edge-tts

Use Microsoft Edge's online text-to-speech service from Python

edge-tts is a Python module and command-line tool that gives you direct access to Microsoft Edge’s online text-to-speech service without needing the Edge browser, Windows, or any API key. It wraps the same cloud voices used by Edge, exposing them through a simple CLI (edge-tts, edge-playback) and a Python API, so you can script high-quality speech generation in your own applications.

Downloads: 23 This Week

Last Update: 2025-12-12
See Project
13

Ghidra

Ghidra is a software reverse engineering (SRE) framework

Ghidra is a free and open-source reverse engineering framework developed by the NSA for analyzing compiled software. It supports a wide array of instruction sets and executable formats, offering features such as decompilation, disassembly, scripting, and interactive graphing. Designed for security researchers and analysts, Ghidra provides a robust environment for understanding malware, auditing code, and performing software forensics. It includes both GUI-based and headless analysis modes.

Downloads: 901 This Week

Last Update: 2026-03-04
See Project
14

MLX-Audio

A text-to-speech, speech-to-text and speech-to-speech library

MLX-Audio is a speech library built on Apple’s MLX framework and optimized for Apple Silicon machines (M-series Macs). It focuses on text-to-speech and speech-to-speech workflows, with APIs and a command-line interface that make it easy to generate high-quality audio from text. Because it uses MLX and targets Apple Silicon, inference is fast and can take advantage of hardware acceleration and quantization for efficient on-device performance. The project provides a straightforward CLI (mlx_audio.tts.generate) as well as a Python API for programmatic generation of audio, including parameters for voice choice, speed, language hints, output format, and sample rate. ...

Downloads: 2 This Week

Last Update: 2026-03-30
See Project
15

Remove Windows Ai

Strip Windows 11 of built-in AI features for control and privacy

RemoveWindowsAI is an open source PowerShell-based tool created to help users regain control over their Windows 11 experience by disabling or removing AI-related features that Microsoft has increasingly integrated into the OS. It’s designed to work with currently released, stable versions of Windows 11 and continuously updated to match newly added AI components, especially since the 25H2 major update. The script covers a wide variety of AI surfaces (from core features like Copilot and Recall...

Downloads: 77 This Week

Last Update: 4 days ago
See Project
16

MiniMax-MCP

Official MiniMax Model Context Protocol (MCP) server

MiniMax-MCP is the official Model Context Protocol (MCP) server for accessing MiniMax’s multimodal generative APIs from MCP-compatible clients. It acts as a bridge between tools like Claude Desktop, Cursor, Windsurf, OpenAI Agents, and the MiniMax platform, exposing capabilities such as text-to-speech, voice cloning, image generation, text-to-image, video generation, image-to-video, text-to-video, and music generation. The server is written in Python and distributed under the MIT license,...

Downloads: 2 This Week

Last Update: 2026-04-15
See Project
17

Spring Boot

Easily create Spring-powered, production-grade applications

...By taking an opinionated view of the Spring platform, it enables you to quickly and easily get to the bits you need. Most Spring Boot applications need minimal Spring configuration. You can use it to create a stand-alone Java application or more traditional WAR deployments. A command line tool is also provided that allows you to run spring scripts.

Downloads: 64 This Week

Last Update: 2026-03-26
See Project
18

signal-cli

signal-cli provides a commandline, dbus and JSON-RPC interface

signal-cli is a command line interface for the Signal messenger. It supports registering, verifying, sending and receiving messages. signal-cli uses a patched libsignal-service-java, extracted from the Signal-Android source code. For registering you need a phone number where you can receive SMS or incoming calls. signal-cli is primarily intended to be used on servers to notify admins of important events.

Downloads: 36 This Week

Last Update: 2026-04-04
See Project
19

Flowly AI

Flowly is 100x faster than OpenClaw

...Flowly also includes voice capabilities, enabling real-time phone interactions using speech-to-text and text-to-speech systems. Overall, it provides a powerful, extensible, and privacy-focused alternative to cloud-based AI assistants.

Downloads: 5 This Week

Last Update: 2026-03-29
See Project
20

annyang!

Speech recognition for your site

annyang is a tiny javascript library that lets your visitors control your site with voice commands. annyang supports multiple languages, has no dependencies, weighs just 2kb and is free to use. annyang understands commands with named variables, splats, and optional words. Use named variables for one word arguments in your command. Use splats to capture multi-word text at the end of your command (greedy). Use optional words or phrases to define a part of the command as optional. annyang plays nicely with all browsers, progressively enhancing browsers that support SpeechRecognition, while leaving users with older browsers unaffected. ...

Downloads: 1 This Week

Last Update: 2026-03-11
See Project
21

MCPorter

Call MCPs via TypeScript, masquerading as simple TypeScript API

...Its command-line workflow simplifies what would otherwise be a manual and error-prone process, allowing users to specify source worlds, conversion settings, and target outputs with clarity and control.

Downloads: 4 This Week

Last Update: 4 days ago
See Project
22

picocli

Framework for building GraalVM-enabled command line apps

Picocli is a one-file framework for creating Java command-line applications with almost zero code. It supports a variety of command-line syntax styles including POSIX, GNU, MS-DOS and more. It generates highly customizable usage help messages that use ANSI colors and styles to contrast important elements and reduce the cognitive load on the user. Picocli-based applications can have command line TAB completion showing available options, option parameters, and subcommands, for any level of nested subcommands. ...

Downloads: 2 This Week

Last Update: 2025-04-19
See Project
23

JUnit 5

Programmer-friendly testing framework for Java and the JVM

The 5th major version of the programmer-friendly testing framework for Java and the JVM. JUnit 5 is the next generation of JUnit. The goal is to create an up-to-date foundation for developer-side testing on the JVM. This includes focusing on Java 8 and above, as well as enabling many different styles of testing. Official CI build server for JUnit 5. Used to perform quick checks on submitted pull requests and for build matrices including the latest released OpenJDK and early access builds of the next OpenJDK. ...

Downloads: 105 This Week

Last Update: 2026-02-15
See Project
24

openHAB Distribution

The binary distribution of openHAB

The open Home Automation Bus (openHAB) project aims to provide a universal integration platform for all things around home automation. It is a pure Java solution, fully based on OSGi. It is designed to be vendor-neutral as well as hardware/protocol-agnostic. openHAB brings together different bus systems, hardware devices, and interface protocols by dedicated bindings. These bindings send and receive commands and status updates on the openHAB event bus. This concept allows designing user...

Downloads: 5 This Week

Last Update: 2026-04-14
See Project
25

J2ObjC

A Java to iOS Objective-C translation tool and runtime

J2ObjC is an open-source command-line tool from Google that translates Java source code to Objective-C for the iOS (iPhone/iPad) platform. This tool enables Java source to be part of an iOS application's build, as no editing of the generated files is necessary. The goal is to write an app's non-UI code (such as application logic and data models) in Java, which is then shared by web apps (using GWT), Android apps, and iOS apps.

Downloads: 1 This Week

Last Update: 2025-08-14
See Project