I want to build a minimal voice-triggered command system for my Arch setup (Hyprland).
My current goal:
- similar to Bixby/voice assistants, but without any responses/talkback; it just does what the user says
- detect a trigger phrase (something like "hey meow") and execute simple commands like:
  - open the browser (if possible, search "tom and jerry" on YouTube, etc.)
  - suspend / poweroff
  - launch or close applications
  - etc.
So effectively:
continuous listening => detect trigger => parse command => execute shell command
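To make the last two stages concrete, here is a rough sketch of what I have in mind (Python). It assumes the listening/STT part already hands over a plain text transcript. The phrase table and the names `parse_command` and `execute` are hypothetical, and `pkill -x` for "close" is just one possible choice, not a settled design:

```python
import shlex
import subprocess
from urllib.parse import quote_plus

TRIGGER = "hey meow"  # example trigger phrase; pick whatever you like

# Hypothetical table of fixed phrases -> argv lists. Extend as needed.
FIXED_COMMANDS = {
    "suspend": ["systemctl", "suspend"],
    "poweroff": ["systemctl", "poweroff"],
    # xdg-open with a URL opens the default browser; the URL is a placeholder.
    "open browser": ["xdg-open", "https://www.youtube.com"],
}


def parse_command(transcript, trigger=TRIGGER):
    """Return an argv list for the text after the trigger, or None."""
    # Normalise case and whitespace so "Hey  Meow, Suspend" still matches.
    text = " ".join(transcript.lower().split())
    if not text.startswith(trigger):
        return None
    command = text[len(trigger):].strip(" ,.")

    if command in FIXED_COMMANDS:
        return FIXED_COMMANDS[command]

    # "search <query> on youtube" -> open a YouTube results page.
    if command.startswith("search ") and command.endswith(" on youtube"):
        query = command[len("search "):-len(" on youtube")].strip()
        if query:
            url = "https://www.youtube.com/results?search_query=" + quote_plus(query)
            return ["xdg-open", url]

    # "launch <program>" -> run the program name as spoken.
    if command.startswith("launch "):
        return shlex.split(command[len("launch "):])

    # "close <program>" -> assumption: kill by exact process name.
    if command.startswith("close "):
        return ["pkill", "-x", command[len("close "):]]

    return None


def execute(argv):
    """Start the command without a shell and without waiting for it."""
    subprocess.Popen(argv)
```

For example, `parse_command("hey meow, suspend")` would give `["systemctl", "suspend"]`, and anything that does not start with the trigger returns `None` and is ignored. Passing argv lists instead of a shell string means misheard text never goes through shell expansion.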
I'm unsure about a few things:
- Is continuous speech-to-text plus trigger-phrase detection practical on a lightweight system, or too resource-heavy? If it is too heavy, should I run a wake-word detector first and only load STT after the trigger?
- How are always-on audio listeners typically implemented on Linux (daemon, service, etc.)?
- For Wayland/Hyprland, is it better to rely purely on shell commands, or to use compositor IPC for actions?
I have no plans to use heavy assistant frameworks; the idea is KISS (keeping it simple and smart).
Last edited by 5hridhyan (2026-03-27 16:30:23)
---