Voice2Input 🎙️ -> ⌨️

Hey there! Welcome to Voice2Input - your new favorite speech-to-text buddy! This app lets you talk instead of type, and it'll automatically paste your words wherever your cursor is. Super handy for when you're feeling lazy or just want to give your fingers a break! 😊

✨ What Makes It Cool

Uses OpenAI's Whisper models to understand what you're saying
Works with tons of languages (it can even guess which one you're speaking!)
Global hotkeys so you can start/stop recording from anywhere
Automatically copies and pastes your words (if you want it to)
Saves everything neatly for later use
Looks pretty sweet with a modern interface

🚀 Getting Started

Make sure you have Python 3.9+ installed
If you're on Linux (which you probably are), grab these:

sudo apt-get install xsel xdotool portaudio19-dev python3-pyaudio

Clone this bad boy:

git clone https://github.com/hosteren/voice2input.git
cd voice2input

Set up your virtual environment (trust me, it's worth it):

python -m venv venv
source venv/bin/activate

Or use conda:

conda create -n v2i python=3.10
conda activate v2i

Install the goodies:

pip install -r requirements.txt

🎮 How to Use It

🔧 Environment Variables

This project uses a .env file to manage sensitive configurations and API credentials. The app loads these variables automatically using python-dotenv. Either create your own .env file or use the .env.example file as a template.

🔧 Running the App

Fire it up:

python app.py

Pick your settings (the gear icon is your friend):
- Choose which mic to use
- Pick a Whisper model (bigger = better but slower)
- Set your language (or let it guess)
- Set up your favorite hotkey combo
Start talking:
- Hit your hotkey (default: Ctrl+Shift+R) or smash that record button
- Say your piece
- Release the hotkey
- Watch the magic happen! ✨

🎯 Pro Tips

The Large-v3 Turbo model is pretty amazing if your GPU can handle it
Keep your recordings short and sweet for best results
The auto-paste feature is super convenient but give it a second to work its magic

🤔 Something Not Working?

Open an issue! I'm still learning how GitHub works, but I'll figure it out! 😅

💝 Credits & Transparency

This app was lovingly crafted with the help of:

Cursor - The AI-powered code editor
Claude 3 Sonnet - Anthropic's amazing AI assistant
A lot of coffee ☕

Big thanks to the AI assistants that helped make this possible while keeping the code clean and maintainable!

📝 License

MIT - Go wild! Just remember where you got it from! 😉

Made with 💖 by a human who talks to computers (literally and figuratively)

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
.env.example		.env.example
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
app.py		app.py
requirements.txt		requirements.txt
v2i_main.png		v2i_main.png
v2i_settings.png		v2i_settings.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Voice2Input 🎙️ -> ⌨️

✨ What Makes It Cool

🚀 Getting Started

🎮 How to Use It

🔧 Environment Variables

🔧 Running the App

🎯 Pro Tips

🤔 Something Not Working?

💝 Credits & Transparency

📝 License

About

Releases

Packages

Languages

License

hosteren/voice2input

Folders and files

Latest commit

History

Repository files navigation

Voice2Input 🎙️ -> ⌨️

✨ What Makes It Cool

🚀 Getting Started

🎮 How to Use It

🔧 Environment Variables

🔧 Running the App

🎯 Pro Tips

🤔 Something Not Working?

💝 Credits & Transparency

📝 License

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages