Forums

Skip to content

Advanced search
  • Quick links
    • Unanswered topics
    • Active topics
    • Search
  • FAQ
  • Login
  • Register
  • Board index Assistance Desktop Environments
  • Search

Speech input and recognition

Problems with GUI applications? Questions about X, KDE, Gnome, Fluxbox, etc.? Come on in. NOTE: For multimedia, go up one forum
Post Reply
Advanced search
3 posts • Page 1 of 1
Author
Message
Massimo B.
Veteran
Veteran
User avatar
Posts: 1940
Joined: Wed Feb 09, 2005 3:05 pm
Location: PB, Germany

Speech input and recognition

  • Quote

Post by Massimo B. » Sat Apr 12, 2025 7:05 pm

Is there anything usable on Linux to have a Speech Input option on any edit field where keyboard typing is possible? Also supporting for more languages beside English, like German?
I only found
app-accessibility/julius (https://github.com/julius-speech/julius)
https://github.com/mkiol/dsnote
And an entry at Wikipedia.

This option today is available on systems like Android and MS Windows, often using a cloud service in the back which might be another issue.
HP ZBook Power G9 i7-12700H,64GB DDR5|HP ProDesk 600 G5 i7-9700,128GB DDR4
Top
undrwater
Guru
Guru
User avatar
Posts: 319
Joined: Tue Jan 28, 2003 2:25 am
Location: Caucasia

  • Quote

Post by undrwater » Sat Jun 14, 2025 11:11 pm

It's be a little while since you posted, but hopefully you find this post somewhat helpful.

I've been on a similar quest, and there are a few projects based on the Whisper-ai project (LLM for speech-to-text), but no ebuilds as of yet. These run in a python venv, so that's how I'm trying them out:

https://github.com/dhruvyad/uttertype
https://github.com/savbell/whisper-writer

Any language that whisper supports should work for these projects.

Additionally, I'm using piper for text-to-speech. Also in a python venv, and I've got a keybinding that reads out highlighted text.

Good hunting!
Open-mindedness is painful...
Top
wildhorse
Apprentice
Apprentice
User avatar
Posts: 185
Joined: Thu Mar 16, 2006 3:59 am
Location: Estados Unidos De América

  • Quote

Post by wildhorse » Tue Jun 17, 2025 2:26 pm

speech to text
https://github.com/mozilla/DeepSpeech

text to speech
https://github.com/mozilla/TTS

text language translation
https://opennmt.net/

You could contact researchers at Fraunhofer IDMT and Fraunhofer IAIS as another source of information.
Top
Post Reply

3 posts • Page 1 of 1

Return to “Desktop Environments”

Jump to
  • Assistance
  • ↳   News & Announcements
  • ↳   Frequently Asked Questions
  • ↳   Installing Gentoo
  • ↳   Multimedia
  • ↳   Desktop Environments
  • ↳   Networking & Security
  • ↳   Kernel & Hardware
  • ↳   Portage & Programming
  • ↳   Gamers & Players
  • ↳   Other Things Gentoo
  • ↳   Unsupported Software
  • Discussion & Documentation
  • ↳   Documentation, Tips & Tricks
  • ↳   Gentoo Chat
  • ↳   Gentoo Forums Feedback
  • ↳   Duplicate Threads
  • International Gentoo Users
  • ↳   中文 (Chinese)
  • ↳   Dutch
  • ↳   Finnish
  • ↳   French
  • ↳   Deutsches Forum (German)
  • ↳   Diskussionsforum
  • ↳   Deutsche Dokumentation
  • ↳   Greek
  • ↳   Forum italiano (Italian)
  • ↳   Forum di discussione italiano
  • ↳   Risorse italiane (documentazione e tools)
  • ↳   Polskie forum (Polish)
  • ↳   Instalacja i sprzęt
  • ↳   Polish OTW
  • ↳   Portuguese
  • ↳   Documentação, Ferramentas e Dicas
  • ↳   Russian
  • ↳   Scandinavian
  • ↳   Spanish
  • ↳   Other Languages
  • Architectures & Platforms
  • ↳   Gentoo on ARM
  • ↳   Gentoo on PPC
  • ↳   Gentoo on Sparc
  • ↳   Gentoo on Alternative Architectures
  • ↳   Gentoo on AMD64
  • ↳   Gentoo for Mac OS X (Portage for Mac OS X)
  • Board index
  • All times are UTC
  • Delete cookies

© 2001–2026 Gentoo Foundation, Inc.

Powered by phpBB® Forum Software © phpBB Limited

Privacy Policy

 

 

magic