Plugins API¶

Every plugin is a plain Python module that follows a small contract rather than subclassing a base class. A module is loaded if it exposes a NAME string and a handle(cmd, core) function; the optional setup(core) hook runs once at startup, and COMMANDS/DESCRIPTION feed the help screen.

classDiagram
    class PluginContract {
        <<protocol>>
        +str NAME
        +str DESCRIPTION
        +list COMMANDS
        +setup(core)
        +handle(cmd, core)
    }
    class base
    class sleep
    class system
    class media
    class files
    class apps
    class browser
    class dictation
    class mousegrid
    class headtrack

    PluginContract <|.. base
    PluginContract <|.. sleep
    PluginContract <|.. system
    PluginContract <|.. media
    PluginContract <|.. files
    PluginContract <|.. apps
    PluginContract <|.. browser
    PluginContract <|.. dictation
    PluginContract <|.. mousegrid
    PluginContract <|.. headtrack

    note for base "zz_base.py — loads last; help and exit fallback"
    note for mousegrid "00_mousegrid.py — numeric prefix loads it early"
    note for headtrack "00_eyetrack.py — numeric prefix loads it early"

handle returns True when it consumed the command, False to signal the daemon to exit, or None to pass the command to the next plugin. Load order is alphabetical, so numeric prefixes (00_) load a plugin early and the zz_ prefix loads the base plugin last as the catch-all for help and exit.

base (zz_base)¶

plugins.zz_base ¶

Base Plugin - Help and Exit.

setup ¶

setup(c)

Store the core reference for use by the plugin's handlers.

Source code in src/plugins/zz_base.py

def setup(c):
    """Store the core reference for use by the plugin's handlers."""
    global core
    core = c

handle ¶

handle(cmd, core)

Handle the help and exit commands; return None to pass others through.

Returns False to signal the daemon to exit, True if help was shown, or None when the command is for another plugin.

Source code in src/plugins/zz_base.py

def handle(cmd, core):
    """Handle the help and exit commands; return None to pass others through.

    Returns False to signal the daemon to exit, True if help was shown, or None when the
    command is for another plugin.
    """
    cmd_lower = cmd.lower().strip()

    # Exit - but NOT if it's "quit tracking" etc
    if "tracking" not in cmd_lower:
        if cmd_lower in ["exit", "quit", "goodbye", "bye"]:
            core.speak("Goodbye.")
            return False  # Signal to exit
        # Also match "jarvis quit" etc
        if any(cmd_lower.endswith(x) for x in [" exit", " quit"]):
            core.speak("Goodbye.")
            return False

    # Help
    if "help" in cmd_lower or "what can you do" in cmd_lower:
        show_help(core)
        return True

    return None  # Not handled

show_help ¶

show_help(core)

List every plugin's commands on the terminal.

Printed rather than logged: it is the direct reply to an explicit "help" request (the spoken reply tells the user to read the terminal), so it must appear regardless of the configured log verbosity.

Source code in src/plugins/zz_base.py

def show_help(core):
    """List every plugin's commands on the terminal.

    Printed rather than logged: it is the direct reply to an explicit "help"
    request (the spoken reply tells the user to read the terminal), so it must
    appear regardless of the configured log verbosity.
    """
    print("\n=== Available Commands ===")  # noqa: T201
    for plugin in core.plugins:
        if hasattr(plugin, "COMMANDS"):
            print(f"\n{plugin.NAME}:")  # noqa: T201
            for cmd in plugin.COMMANDS:
                print(f"  • {cmd}")  # noqa: T201
    print()  # noqa: T201
    core.speak("Check the terminal for available commands.")

sleep¶

plugins.sleep ¶

Sleep Plugin - Deactivate the assistant by voice.

"Go to sleep" (or "stop listening") releases the microphone and stops wake-word detection until the user reactivates EasySpeak from the GNOME tray indicator. It pairs with that panel icon, which only appears while the assistant is asleep (the one way back, since voice control is off until reactivated).

handle ¶

handle(cmd, core)

Deactivate the assistant on a sleep phrase; return None otherwise.

Source code in src/plugins/sleep.py

def handle(cmd, core):
    """Deactivate the assistant on a sleep phrase; return None otherwise."""
    cmd_lower = cmd.lower().strip()
    if any(phrase in cmd_lower for phrase in SLEEP_PHRASES):
        # Announce only the attempt. The tray controller confirms once sleep
        # actually engages ("Reactivate me from the tray...") or, if it can't
        # reach the indicator, explains and stays awake — so we never promise a
        # tray that isn't there.
        core.speak("Going to sleep.")
        core.deactivate()
        return True
    return None

system¶

plugins.system ¶

System Plugin - Volume, brightness, do not disturb.

setup ¶

setup(c)

Store the core reference for use by the plugin's handlers.

Source code in src/plugins/system.py

def setup(c):
    """Store the core reference for use by the plugin's handlers."""
    global core
    core = c

volume_up ¶

volume_up(core)

Raise the volume one step.

Source code in src/plugins/system.py

def volume_up(core):
    """Raise the volume one step."""
    _media_key(
        core, KEY_VOLUME_UP, ["wpctl", "set-volume", "@DEFAULT_AUDIO_SINK@", "10%+"]
    )

volume_down ¶

volume_down(core)

Lower the volume one step.

Source code in src/plugins/system.py

def volume_down(core):
    """Lower the volume one step."""
    _media_key(
        core, KEY_VOLUME_DOWN, ["wpctl", "set-volume", "@DEFAULT_AUDIO_SINK@", "10%-"]
    )

volume_mute ¶

volume_mute(core)

Toggle mute on the default audio sink.

Source code in src/plugins/system.py

def volume_mute(core):
    """Toggle mute on the default audio sink."""
    _media_key(core, KEY_MUTE, ["wpctl", "set-mute", "@DEFAULT_AUDIO_SINK@", "toggle"])

volume_max ¶

volume_max(core)

Jump near the top (85%, not a blast); no media key does this.

Source code in src/plugins/system.py

def volume_max(core):
    """Jump near the top (85%, not a blast); no media key does this."""
    core.host_run(["wpctl", "set-volume", "@DEFAULT_AUDIO_SINK@", "85%"])

volume_min ¶

volume_min(core)

Drop low (15%, still audible — not a mute), set directly like volume_max.

Source code in src/plugins/system.py

def volume_min(core):
    """Drop low (15%, still audible — not a mute), set directly like volume_max."""
    core.host_run(["wpctl", "set-volume", "@DEFAULT_AUDIO_SINK@", "15%"])

brightness_up ¶

brightness_up(core)

Raise screen brightness one step.

Source code in src/plugins/system.py

def brightness_up(core):
    """Raise screen brightness one step."""
    _media_key(
        core,
        KEY_BRIGHTNESS_UP,
        [*_POWER_SCREEN, "org.gnome.SettingsDaemon.Power.Screen.StepUp"],
    )

brightness_down ¶

brightness_down(core)

Lower screen brightness one step.

Source code in src/plugins/system.py

def brightness_down(core):
    """Lower screen brightness one step."""
    _media_key(
        core,
        KEY_BRIGHTNESS_DOWN,
        [*_POWER_SCREEN, "org.gnome.SettingsDaemon.Power.Screen.StepDown"],
    )

dnd_on ¶

dnd_on(core)

Enable do-not-disturb by hiding notification banners.

Source code in src/plugins/system.py

def dnd_on(core):
    """Enable do-not-disturb by hiding notification banners."""
    core.host_run(
        ["gsettings", "set", "org.gnome.desktop.notifications", "show-banners", "false"]
    )

dnd_off ¶

dnd_off(core)

Disable do-not-disturb by showing notification banners again.

Source code in src/plugins/system.py

def dnd_off(core):
    """Disable do-not-disturb by showing notification banners again."""
    core.host_run(
        ["gsettings", "set", "org.gnome.desktop.notifications", "show-banners", "true"]
    )

handle ¶

handle(cmd, core)

Route a volume/brightness/DND command; return None if none matched.

Source code in src/plugins/system.py

def handle(cmd, core):
    """Route a volume/brightness/DND command; return None if none matched."""
    # Volume -- "louder"/"quieter" etc. work on their own, without "volume"/"sound".
    # Match whole words so "silent" doesn't fire on "silently", "softer" on "softest"...
    words = cmd.split()
    if "very" in words:
        if "loud" in words or "louder" in words:
            volume_max(core)
            return True
        if any(w in words for w in ("silent", "quiet", "quieter", "soft", "softer")):
            volume_min(core)
            return True
    louder = "louder" in words
    quieter = any(word in words for word in ("quieter", "softer", "silent"))
    # No spoken feedback for volume/mute: GNOME's native OSD and chime already
    # acknowledge the change (and a spoken reply would be inaudible once muted).
    if "volume" in cmd or "sound" in cmd or louder or quieter:
        if "up" in cmd or louder:
            volume_up(core)
            return True
        if "down" in cmd or quieter:
            volume_down(core)
            return True
        if "mute" in cmd or "unmute" in cmd:
            volume_mute(core)
            return True

    if "mute" in cmd:
        volume_mute(core)
        return True

    # Brightness
    if "brightness" in cmd or "screen" in cmd:
        if "up" in cmd or "brighter" in cmd:
            core.speak("Brighter.")
            brightness_up(core)
            return True
        if "down" in cmd or "dimmer" in cmd or "darker" in cmd:
            core.speak("Dimmer.")
            brightness_down(core)
            return True

    # Do Not Disturb
    if "do not disturb" in cmd or "dnd" in cmd or "notifications" in cmd:
        if "on" in cmd or "enable" in cmd:
            core.speak("Do not disturb on.")
            dnd_on(core)
            return True
        if "off" in cmd or "disable" in cmd:
            core.speak("Do not disturb off.")
            dnd_off(core)
            return True

    return None  # Not handled

media¶

plugins.media ¶

Media Plugin - Playback controls via MPRIS.

setup ¶

setup(c)

Store the core reference for use by the plugin's handlers.

Source code in src/plugins/media.py

def setup(c):
    """Store the core reference for use by the plugin's handlers."""
    global core
    core = c

get_media_players ¶

get_media_players(core)

Return the bus names of all running MPRIS media players.

Source code in src/plugins/media.py

def get_media_players(core):
    """Return the bus names of all running MPRIS media players."""
    result = core.host_run(
        [
            "dbus-send",
            "--session",
            "--dest=org.freedesktop.DBus",
            "--type=method_call",
            "--print-reply",
            "/org/freedesktop/DBus",
            "org.freedesktop.DBus.ListNames",
        ]
    )
    return [
        line.split('"')[1]
        for line in result.stdout.split("\n")
        if "org.mpris.MediaPlayer2." in line
    ]

media_control ¶

media_control(action, core)

Send an MPRIS action (play/pause/next/previous) to every running player.

Returns False if no player is running or the action is unknown.

Source code in src/plugins/media.py

def media_control(action, core):
    """Send an MPRIS action (play/pause/next/previous) to every running player.

    Returns False if no player is running or the action is unknown.
    """
    players = get_media_players(core)
    if not players:
        return False

    method = {
        "play": "Play",
        "pause": "Pause",
        "next": "Next",
        "previous": "Previous",
    }.get(action)

    if not method:
        return False

    for player in players:
        core.host_run(
            [
                "dbus-send",
                "--session",
                "--type=method_call",
                f"--dest={player}",
                "/org/mpris/MediaPlayer2",
                f"org.mpris.MediaPlayer2.Player.{method}",
            ]
        )
    return True

handle ¶

handle(cmd, core)

Map a playback command to an MPRIS action; return None if not media.

Source code in src/plugins/media.py

def handle(cmd, core):
    """Map a playback command to an MPRIS action; return None if not media."""
    # "stop the music"/"stop playing" mean pause; bare "stop" isn't a media command.
    # Whole-word match so "stop tracking" (eye tracking) isn't caught by "track".
    words = cmd.split()
    stop_music = "stop" in words and any(
        word in words
        for word in ("music", "song", "playback", "track", "player", "playing", "play")
    )

    if "pause" in cmd or stop_music:
        core.speak("Paused.")
        media_control("pause", core)
        return True

    if "play" in cmd and "pause" not in cmd:
        core.speak("Playing.")
        media_control("play", core)
        return True

    if "next" in cmd or "skip" in cmd:
        core.speak("Next.")
        media_control("next", core)
        return True

    if "previous" in cmd or "back" in cmd:
        core.speak("Previous.")
        media_control("previous", core)
        return True

    return None  # Not handled

files¶

plugins.files ¶

Files Plugin - Open folders in file manager.

setup ¶

setup(c)

Store the core reference for use by the plugin's handlers.

Source code in src/plugins/files.py

def setup(c):
    """Store the core reference for use by the plugin's handlers."""
    global core
    core = c

open_folder ¶

open_folder(path, core)

Open a folder in the first available file manager; False if none found.

Source code in src/plugins/files.py

def open_folder(path, core):
    """Open a folder in the first available file manager; False if none found."""
    expanded = os.path.expanduser(path)

    file_managers = [
        ("nautilus", [expanded]),
        ("dolphin", [expanded]),
        ("thunar", [expanded]),
        ("nemo", [expanded]),
        ("xdg-open", [expanded]),
    ]

    for fm, args in file_managers:
        result = core.host_run(["which", fm])
        if result.returncode == 0:
            core.host_run([fm, *args], background=True)
            return True
    return False

handle ¶

handle(cmd, core)

Open a known folder when the command names one; return None otherwise.

Source code in src/plugins/files.py

def handle(cmd, core):
    """Open a known folder when the command names one; return None otherwise."""
    for folder, path in FOLDERS.items():
        if folder in cmd and (
            "open" in cmd or "go to" in cmd or "show" in cmd or "browse" in cmd
        ):
            if open_folder(path, core):
                core.speak(f"Opening {folder}.")
            else:
                core.speak("No file manager found.")
            return True

    return None  # Not handled

apps¶

plugins.apps ¶

Apps Plugin - Launch and close applications.

setup ¶

setup(c)

Store the core reference and resolve the user's default terminal.

Source code in src/plugins/apps.py

def setup(c):
    """Store the core reference and resolve the user's default terminal."""
    global core
    core = c
    _register_terminal(c)

find_app ¶

find_app(name, core)

Find app - returns (type, id).

Source code in src/plugins/apps.py

def find_app(name, core):
    """Find app - returns (type, id)."""
    if name in FLATPAK_APPS:
        result = core.host_run(["flatpak", "info", FLATPAK_APPS[name]])
        if result.returncode == 0:
            return ("flatpak", FLATPAK_APPS[name])

    if name in LOCAL_APPS:
        return ("local", LOCAL_APPS[name])

    result = core.host_run(["which", name])
    if result.returncode == 0:
        return ("local", name)

    return (None, None)

launch_app ¶

launch_app(name, core)

Launch the named app (flatpak or local binary); False if not found.

Source code in src/plugins/apps.py

def launch_app(name, core):
    """Launch the named app (flatpak or local binary); False if not found."""
    app_type, app_id = find_app(name, core)
    if app_type == "flatpak":
        core.host_run(["flatpak", "run", app_id], background=True)
        return True
    if app_type == "local":
        core.host_run([app_id], background=True)
        return True
    return False

close_app ¶

close_app(name, core)

Close the named app (flatpak kill or pkill); False if not found.

Source code in src/plugins/apps.py

def close_app(name, core):
    """Close the named app (flatpak kill or pkill); False if not found."""
    app_type, app_id = find_app(name, core)
    if app_type == "flatpak":
        core.host_run(["flatpak", "kill", app_id])
        return True
    if app_type == "local":
        core.host_run(["pkill", "-f", app_id])
        return True
    return False

handle ¶

handle(cmd, core)

Open or close an app named in the command; return None if none matched.

Source code in src/plugins/apps.py

def handle(cmd, core):
    """Open or close an app named in the command; return None if none matched."""
    all_apps = list(FLATPAK_APPS.keys()) + list(LOCAL_APPS.keys())

    for app in all_apps:
        if ("open" in cmd or "launch" in cmd) and app in cmd:
            if launch_app(app, core):
                core.speak(f"Opening {app}.")
            else:
                core.speak(f"{app} not installed.")
            return True

        if "close" in cmd and app in cmd:
            close_app(app, core)
            core.speak(f"Closing {app}.")
            return True

    return None  # Not handled

browser¶

plugins.browser ¶

Browser Plugin - Qutebrowser voice control via IPC.

ensure_qutebrowser_config ¶

ensure_qutebrowser_config()

Append any missing required lines to qutebrowser's config.py.

Preserves everything else the user has written. Tolerates read-only configs (e.g. Nix Home-Manager symlinks into /nix/store): on a write failure we emit a polite note telling the user which lines to add themselves, rather than crashing startup.

Source code in src/plugins/browser.py

def ensure_qutebrowser_config():
    """Append any missing required lines to qutebrowser's config.py.

    Preserves everything else the user has written. Tolerates read-only configs (e.g.
    Nix Home-Manager symlinks into /nix/store): on a write failure we emit a polite note
    telling the user which lines to add themselves, rather than crashing startup.
    """
    cfg = Path.home() / ".config" / "qutebrowser" / "config.py"

    try:
        cfg.parent.mkdir(parents=True, exist_ok=True)
    except OSError as e:
        _note_missing_qb_lines(
            cfg,
            REQUIRED_QUTEBROWSER_LINES,
            reason=f"could not create {cfg.parent} ({e})",
        )
        return

    try:
        existing = cfg.read_text() if cfg.exists() else ""
    except OSError as e:
        _note_missing_qb_lines(
            cfg, REQUIRED_QUTEBROWSER_LINES, reason=f"could not read it ({e})"
        )
        return

    missing = [line for line in REQUIRED_QUTEBROWSER_LINES if line not in existing]
    if not missing:
        return

    separator = "" if not existing or existing.endswith("\n") else "\n"
    updated = existing + separator + "\n".join(missing) + "\n"

    try:
        cfg.write_text(updated)
    except OSError as e:
        _note_missing_qb_lines(cfg, missing, reason=f"is read-only ({e})")
        return

    verb = "wrote" if not existing else "updated"
    added = "; ".join(missing)
    logger.debug("%s %s (added: %s)", verb, cfg, added)

setup ¶

setup(c)

Store the core reference and write the required qutebrowser config.

Source code in src/plugins/browser.py

def setup(c):
    """Store the core reference and write the required qutebrowser config."""
    global core
    core = c
    ensure_qutebrowser_config()

qb ¶

qb(command)

Send command to qutebrowser via IPC.

Source code in src/plugins/browser.py

def qb(command):
    """Send command to qutebrowser via IPC."""
    logger.debug("  🌐 qutebrowser :%s", command)
    core.host_run(["qutebrowser", f":{command}"])

qb_open ¶

qb_open(url)

Open URL in qutebrowser.

Source code in src/plugins/browser.py

def qb_open(url):
    """Open URL in qutebrowser."""
    core.host_run(["qutebrowser", url])

parse_hint_numbers ¶

parse_hint_numbers(cmd)

Extract hint numbers from spoken words.

Source code in src/plugins/browser.py

def parse_hint_numbers(cmd):
    """Extract hint numbers from spoken words."""
    clean = re.sub(r"[.,!?\-]", " ", cmd.lower())
    words = clean.split()
    digits = [HINT_NUMBERS[word] for word in words if word in HINT_NUMBERS]
    return "".join(digits)

looks_like_hint ¶

looks_like_hint(cmd)

Check if command looks like a hint number (short, mostly digits/number words).

Source code in src/plugins/browser.py

def looks_like_hint(cmd):
    """Check if command looks like a hint number (short, mostly digits/number words)."""
    clean = re.sub(r"[.,!?\-\s]", "", cmd.lower())
    # Must be short
    if len(clean) > 6:
        return False
    # Direct digits like "02", "92"
    if clean.replace("o", "0").isdigit():
        return True
    # Check if all words are number words
    words = cmd.lower().split()
    return len(words) <= 3 and all(w.strip(".,!?") in HINT_NUMBERS for w in words)

parse_hint_number ¶

parse_hint_number(cmd)

Parse spoken numbers into a hint string ('zero two' -> '02').

Source code in src/plugins/browser.py

def parse_hint_number(cmd):
    """Parse spoken numbers into a hint string ('zero two' -> '02')."""
    # Number word mappings
    NUM_WORDS = {
        "zero": "0",
        "oh": "0",
        "o": "0",
        "one": "1",
        "won": "1",
        "wan": "1",
        "two": "2",
        "to": "2",
        "too": "2",
        "tu": "2",
        "three": "3",
        "tree": "3",
        "free": "3",
        "four": "4",
        "for": "4",
        "fore": "4",
        "five": "5",
        "six": "6",
        "sex": "6",
        "seven": "7",
        "eight": "8",
        "ate": "8",
        "nine": "9",
        "nein": "9",
        # Tens
        "ten": "10",
        "eleven": "11",
        "twelve": "12",
        "thirteen": "13",
        "fourteen": "14",
        "fifteen": "15",
        "sixteen": "16",
        "seventeen": "17",
        "eighteen": "18",
        "nineteen": "19",
        "twenty": "2",
        "thirty": "3",
        "forty": "4",
        "fifty": "5",
        "sixty": "6",
        "seventy": "7",
        "eighty": "8",
        "ninety": "9",
    }

    result = []
    words = re.sub(r"[.,!?\-]", " ", cmd.lower()).split()

    for word in words:
        # Direct digit
        if word.isdigit():
            result.append(word)
        # Word to digit
        elif word in NUM_WORDS:
            result.append(NUM_WORDS[word])

    return "".join(result)

parse_spoken_url ¶

parse_spoken_url(spoken)

Convert spoken URL to actual URL.

'claude dot ai' -> 'https://claude.ai'.

Source code in src/plugins/browser.py

def parse_spoken_url(spoken):
    """Convert spoken URL to actual URL.

    'claude dot ai' -> 'https://claude.ai'.
    """
    url = spoken.lower().strip()

    # Replace spoken elements
    url = url.replace(" dot ", ".")
    url = url.replace(" slash ", "/")
    url = url.replace(" colon ", ":")
    url = url.replace(" dash ", "-")
    url = url.replace(" hyphen ", "-")
    url = url.replace(" underscore ", "_")

    # Remove remaining spaces
    url = url.replace(" ", "")

    # Add https:// if no protocol
    if not url.startswith("http://") and not url.startswith("https://"):
        url = "https://" + url

    return url

listen_for_hint ¶

listen_for_hint(core)

Listen for hint number after showing hints.

Source code in src/plugins/browser.py

def listen_for_hint(core):
    """Listen for hint number after showing hints."""
    logger.info("  🔢 Say hint number (e.g. 'zero two'), 'exit links' to cancel")

    # Small delay to let hints render
    time.sleep(0.3)

    # Clear audio buffer
    with contextlib.suppress(Exception):
        core.stream.read(core.stream.get_read_available(), exception_on_overflow=False)

    # Wait for speech
    first = core.wait_for_speech(timeout=10)
    if not first:
        logger.info("  ⏱ Timeout - hints cancelled")
        qb("mode-leave")
        logger.debug("  [listen_for_hint returning - timeout]")
        return

    audio = first + core.record_until_silence()
    cmd = core.transcribe(
        audio, prompt="zero one two three four five six seven eight nine"
    )

    if not cmd:
        logger.debug("  [listen_for_hint - no transcription, waiting again]")
        # Try one more time
        first = core.wait_for_speech(timeout=5)
        if first:
            audio = first + core.record_until_silence()
            cmd = core.transcribe(
                audio, prompt="zero one two three four five six seven eight nine"
            )
        if not cmd:
            qb("mode-leave")
            logger.debug("  [listen_for_hint returning - no transcription]")
            return

    cmd_lower = cmd.lower().strip(".,!? ")
    logger.debug("  ← %s", cmd_lower)

    # Cancel
    if cmd_lower in [
        "exit links",
        "exit link",
        "cancel",
        "nevermind",
        "stop",
        "close",
        "exit",
    ]:
        qb("mode-leave")
        logger.info("  ✗ Hints cancelled")
        logger.debug("  [listen_for_hint returning - cancelled]")
        return

    # Parse hint number
    hint = parse_hint_number(cmd_lower)

    if hint:
        logger.debug("  🔤 Hint: '%s' → '%s'", cmd_lower, hint)
        qb(f"hint-follow {hint}")
        # Wait for page to load, then clear any stuck state
        time.sleep(1.0)
        qb("fake-key <Escape>")
    else:
        # Try phonetic fallback
        hint = parse_hint_numbers(cmd_lower)
        if hint:
            logger.debug("  🔤 Phonetic: '%s' → '%s'", cmd_lower, hint)
            qb(f"hint-follow {hint}")
            # Wait for page to load, then clear any stuck state
            time.sleep(1.0)
            qb("fake-key <Escape>")
        else:
            # Not a hint - might be a browser command, pass it through
            logger.debug("  ↪ Not a hint, trying as command: '%s'", cmd_lower)
            qb("mode-leave")
            handle_browser_command(cmd_lower, core)

    logger.debug("  [listen_for_hint returning - complete]")

handle ¶

handle(cmd, core)

Enter browser mode on a browser command; return None otherwise.

A matching command launches qutebrowser (if needed) and runs the continuous browser-mode loop; reserved global commands (sleep/quit) are passed through. Outside the explicit "open browser", navigation commands are only acted on when a browser is already running, so ambiguous words can't open one.

Source code in src/plugins/browser.py

def handle(cmd, core):
    """Enter browser mode on a browser command; return None otherwise.

    A matching command launches qutebrowser (if needed) and runs the continuous
    browser-mode loop; reserved global commands (sleep/quit) are passed through. Outside
    the explicit "open browser", navigation commands are only acted on when a browser is
    already running, so ambiguous words can't open one.
    """
    cmd_lower = cmd.lower().strip(".,!? ")

    # Global sleep/quit commands belong to other plugins; if we matched them as
    # browser commands we would open qutebrowser instead. Let them through.
    if _is_reserved_global(cmd_lower):
        return None

    # --- Enter browser mode (explicit) ---
    if cmd_lower in ["browser", "browser mode", "open browser", "launch browser"]:
        core.host_run(["qutebrowser"], background=True)
        browser_mode(core)
        return True

    if not _qutebrowser_running(core):
        return None

    # --- Single browser commands → enters browser mode ---
    try:
        result = handle_browser_command(cmd_lower, core)
        if result:
            logger.info("  → Entering browser mode...")
            browser_mode(core)
            return True
    except Exception:
        logger.exception("  ! Browser error")
        return True

    return None

browser_mode ¶

browser_mode(core)

Continuous listening for browser commands.

Source code in src/plugins/browser.py

def browser_mode(core):
    """Continuous listening for browser commands."""
    core.speak("Browser")
    logger.info("=== BROWSER MODE ACTIVE ===")
    logger.info("Say commands directly. 'exit browser' to leave.")

    while True:
        with contextlib.suppress(Exception):
            core.stream.read(
                core.stream.get_read_available(), exception_on_overflow=False
            )

        first = core.wait_for_speech(timeout=30)
        if not first:
            continue

        audio = first + core.record_until_silence()
        cmd = core.transcribe(audio)

        if not cmd:
            continue

        cmd_lower = cmd.lower().strip(".,!? ")
        logger.debug("  [browser] %s", cmd_lower)

        # Exit browser mode - require explicit phrase
        if cmd_lower in [
            "exit browser",
            "leave browser",
            "stop browser",
            "quit browser",
            "close browser",
        ]:
            logger.info("=== BROWSER MODE EXIT ===")
            return

        # Grid triggers - escape to grid mode
        grid_triggers = {"grid", "grit", "grip", "mouse", "pointer", "cursor"}
        if any(w in cmd_lower for w in grid_triggers):
            logger.info("=== BROWSER MODE EXIT → GRID ===")
            core.route_command(cmd_lower)
            return

        # Handle browser command
        if not handle_browser_command(cmd_lower, core):
            logger.debug("  ? Unknown: %s", cmd_lower)

handle_browser_command ¶

handle_browser_command(cmd_lower, core)

Execute a single in-browser command; None if it isn't recognised.

Source code in src/plugins/browser.py

def handle_browser_command(cmd_lower, core):
    """Execute a single in-browser command; None if it isn't recognised."""
    # --- Hints ---
    if cmd_lower in [
        "numbers",
        "number",
        "hints",
        "hint",
        "show numbers",
        "show hints",
        "links",
        "link",
        "blanks",
        "blinks",
        "lynx",
        "lings",
        "lanes",
        "licks",
        "clicks",
    ]:
        qb("hint")
        listen_for_hint(core)
        return True

    if cmd_lower in [
        "numbers new",
        "number new",
        "hints new",
        "new numbers",
        "links new",
        "link new",
        "blanks new",
        "blinks new",
        "lynx new",
    ]:
        qb("hint links tab")
        listen_for_hint(core)
        return True

    # --- Navigation ---
    if cmd_lower in ["back", "go back", "previous page"]:
        qb("back")
        return True

    if cmd_lower in ["forward", "go forward", "next page"]:
        qb("forward")
        return True

    if cmd_lower in ["reload", "refresh", "reload page"]:
        qb("reload")
        return True

    if cmd_lower in ["stop", "stop loading"]:
        qb("stop")
        return True

    # --- Scrolling ---
    if cmd_lower in ["scroll down", "down"]:
        qb(f"jseval -q {SCROLL_DOWN_JS}")
        return True

    if cmd_lower in ["scroll up", "up"]:
        qb(f"jseval -q {SCROLL_UP_JS}")
        return True

    if "page" in cmd_lower and "down" in cmd_lower:
        qb(f"jseval -q {PAGE_DOWN_JS}")
        return True

    if "page" in cmd_lower and "up" in cmd_lower:
        qb(f"jseval -q {PAGE_UP_JS}")
        return True

    if cmd_lower in ["top", "go to top", "scroll to top"]:
        qb(f"jseval -q {SCROLL_TOP_JS}")
        return True

    if cmd_lower in ["bottom", "go to bottom", "scroll to bottom"]:
        qb(f"jseval -q {SCROLL_BOTTOM_JS}")
        return True

    # --- Tabs ---
    # Switch to specific tab by number
    if cmd_lower.startswith("tab "):
        tab_part = cmd_lower.replace("tab ", "").strip()
        tab_num = parse_hint_number(tab_part)
        if tab_num and tab_num.isdigit():
            qb(f"tab-focus {tab_num}")
            return True

    if cmd_lower in ["new tab", "open tab"]:
        qb("open -t about:blank")
        return True

    if cmd_lower in ["close tab", "close this tab"]:
        qb("tab-close")
        return True

    if cmd_lower in ["next tab", "tab right"]:
        qb("tab-next")
        return True

    if cmd_lower in ["last tab", "previous tab", "tab left"]:
        qb("tab-prev")
        return True

    if cmd_lower in ["undo tab", "restore tab", "reopen tab"]:
        qb("undo")
        return True

    # --- Find ---
    if cmd_lower.startswith("find "):
        query = cmd_lower.replace("find ", "", 1).strip()
        if query:
            qb(f"search {query}")
            return True

    if cmd_lower in ["find next", "next match"]:
        qb("search-next")
        return True

    if cmd_lower in ["find previous", "previous match"]:
        qb("search-prev")
        return True

    # --- Escape ---
    if cmd_lower in ["escape", "cancel", "nevermind"]:
        qb("mode-leave")
        return True

    # --- Bookmarks ---
    # Save current page as quickmark
    if (
        "bookmark this" in cmd_lower or "save this" in cmd_lower
    ) and " as " in cmd_lower:
        name = cmd_lower.split(" as ")[-1].strip()
        if name:
            core.speak(f"Saved as {name}.")
            qb(f"quickmark-save {name}")
            return True

    # Load quickmark (user-saved)
    if cmd_lower.startswith(("go to ", "open ")):
        target = cmd_lower.replace("go to ", "").replace("open ", "").strip()

        # Check predefined bookmarks first
        for site, url in BOOKMARKS.items():
            if site == target:
                core.speak(f"Opening {site}.")
                qb_open(url)
                return True

        # Try as spoken URL (contains "dot")
        if "dot" in target or "." in target:
            url = parse_spoken_url(target)
            core.speak(f"Opening {url}.")
            qb_open(url)
            return True

        # Don't catch generic "open X" - let other plugins handle it
        # Only use quickmark for explicit "go to X"
        if cmd_lower.startswith("go to "):
            qb(f"quickmark-load {target}")
            return True

        return None  # Let another plugin handle "open X"

    # --- Search ---
    if cmd_lower.startswith(("search ", "search for ")):
        query = cmd_lower.replace("search for ", "").replace("search ", "").strip()
        if query:
            url = f"https://duckduckgo.com/?q={query.replace(' ', '+')}"
            core.speak(f"Searching for {query}.")
            qb_open(url)
            return True

    # --- Phonetic hint selection (LAST - only if nothing else matched) ---
    # Only try hint parsing if it actually looks like a hint
    if looks_like_hint(cmd_lower):
        # Direct digit input (e.g., "02", "92", "0-2")
        stripped = re.sub(r"[^0-9a-z]", "", cmd_lower)
        if stripped.replace("o", "0").isdigit():
            hint = stripped.replace("o", "0")
            logger.debug("  🔤 Direct digits: '%s' → '%s'", cmd_lower, hint)
            qb(f"hint-follow {hint}")
            return True

        # Try phonetic parsing
        hint = parse_hint_numbers(cmd_lower)
        if hint and hint.isdigit():
            logger.debug("  🔤 Phonetic parsed: '%s' → '%s'", cmd_lower, hint)
            qb(f"hint-follow {hint}")
            return True

    return None

dictation¶

plugins.dictation ¶

Dictation Plugin - Voice to text via AT-SPI.

ensure_gnome_accessibility ¶

ensure_gnome_accessibility()

Enable GNOME's toolkit-accessibility for the AT-SPI bridge.

Silently skipped if gsettings isn't on PATH or the schema isn't installed (i.e. user isn't on GNOME). Warns if it's present but the flip fails. Tells the user to re-login when newly enabled.

Source code in src/plugins/dictation.py

def ensure_gnome_accessibility():
    """Enable GNOME's toolkit-accessibility for the AT-SPI bridge.

    Silently skipped if gsettings isn't on PATH or the schema isn't installed (i.e. user
    isn't on GNOME). Warns if it's present but the flip fails. Tells the user to
    re-login when newly enabled.
    """
    if shutil.which("gsettings") is None:
        return
    schema, key = "org.gnome.desktop.interface", "toolkit-accessibility"
    try:
        current = subprocess.run(
            ["gsettings", "get", schema, key],
            capture_output=True,
            text=True,
            check=False,
        )
        if current.returncode != 0:
            return  # schema missing — not really on GNOME
        if current.stdout.strip() == "true":
            return
        result = subprocess.run(
            ["gsettings", "set", schema, key, "true"],
            capture_output=True,
            text=True,
            check=False,
        )
        if result.returncode == 0:
            logger.info(
                "enabled GNOME toolkit-accessibility — "
                "log out and back in for dictation to work"
            )
        else:
            logger.warning(
                "could not enable GNOME toolkit-accessibility; dictation will not work"
            )
    except OSError:
        pass

setup ¶

setup(c)

Store the core reference and enable the GNOME accessibility bridge.

Source code in src/plugins/dictation.py

def setup(c):
    """Store the core reference and enable the GNOME accessibility bridge."""
    global core
    core = c
    ensure_gnome_accessibility()
    # Offer dictation to the keyboard (silent) activation path too: core runs
    # this while the hotkey combo is held, with no wake word spoken.
    if hasattr(c, "register_push_to_talk"):
        c.register_push_to_talk(
            lambda should_continue: run_push_to_talk(c, should_continue)
        )

atspi_python ¶

atspi_python()

Path to the interpreter that runs the AT-SPI helper.

The helper needs PyGObject and the AT-SPI typelib, which the app's own venv usually lacks. EASYSPEAK_ATSPI_PYTHON lets the packaging point at an interpreter that has them (the Nix flake sets it); otherwise we fall back to the system python3, where distro packages like python3-gi typically live.

Source code in src/plugins/dictation.py

def atspi_python():
    """Path to the interpreter that runs the AT-SPI helper.

    The helper needs PyGObject and the AT-SPI typelib, which the app's own venv usually
    lacks. `EASYSPEAK_ATSPI_PYTHON` lets the packaging point at an interpreter that has
    them (the Nix flake sets it); otherwise we fall back to the system `python3`, where
    distro packages like `python3-gi` typically live.
    """
    return os.environ.get("EASYSPEAK_ATSPI_PYTHON") or "python3"

insert_text ¶

insert_text(text)

Insert text via AT-SPI.

Returns one of INSERTED, NO_FOCUS or BACKEND_ERROR so the caller can give feedback that matches the real cause instead of always blaming focus.

Source code in src/plugins/dictation.py

def insert_text(text):
    """Insert text via AT-SPI.

    Returns one of INSERTED, NO_FOCUS or BACKEND_ERROR so the caller can give feedback
    that matches the real cause instead of always blaming focus.
    """
    cmd = [atspi_python(), ATSPI_HELPER, text]
    try:
        result = subprocess.run(cmd, capture_output=True, text=True, check=False)
    except OSError as exc:
        logger.warning("dictation backend could not start: %s", exc)
        return BACKEND_ERROR

    if "OK" in result.stdout:
        return INSERTED
    if "NO_BACKEND" in result.stdout or result.returncode != 0:
        logger.warning(
            "dictation backend unavailable (PyGObject/AT-SPI missing): %s",
            result.stderr.strip() or "no detail",
        )
        return BACKEND_ERROR
    return NO_FOCUS

format_text ¶

format_text(text)

Convert spoken punctuation to actual punctuation.

Source code in src/plugins/dictation.py

def format_text(text):
    """Convert spoken punctuation to actual punctuation."""
    text = text.strip()

    # Strip ALL punctuation Whisper auto-adds; only explicit commands add it back
    text = re.sub(r"[.,!?;:]+", "", text)
    text = text.strip()

    # Punctuation replacements - order matters!
    replacements = [
        # Editing commands
        (r"\s*backspace\s*", "\b"),
        (r"\s*back space\s*", "\b"),
        (r"\s*delete\s*", "\b"),
        (r"\s*space\s*", " "),
        (r"\s*tab\s*", "\t"),
        # Sentence breaks - add space after
        (r"\s*,?\s*new sentence\s*", ". "),
        (r"\s*,?\s*next sentence\s*", ". "),
        (r"\s*,?\s*new paragraph\s*", "\n\n"),
        (r"\s*,?\s*next paragraph\s*", "\n\n"),
        (r"\s*,?\s*new para\s*", "\n\n"),
        (r"\s*,?\s*new line\s*", "\n"),
        (r"\s*,?\s*newline\s*", "\n"),
        (r"\s*,?\s*you line\s*", "\n"),
        (r"\s*,?\s*line break\s*", "\n"),
        (r"\s*,?\s*enter\s*", "\n"),
        # Punctuation - include common mishearings
        (r"\s*,?\s*comma\s*", ", "),
        (r"\s*,?\s*karma\s*", ", "),
        (r"\s*,?\s*kama\s*", ", "),
        (r"\s*,?\s*carma\s*", ", "),
        (r"\s*,?\s*calm a\s*", ", "),
        (r"\s*,?\s*calm him\s*", ", "),
        (r"\s*,?\s*calm up\s*", ", "),
        (r"\s*,?\s*come a\s*", ", "),
        (r"\s*,?\s*coma\s*", ", "),
        (r"\s*,?\s*calmer\s*", ", "),
        (r",\s*\.", ","),  # Fix comma followed by period
        (r"\s*,?\s*period\s*", ". "),
        (r"\s*,?\s*full stop\s*", ". "),
        (r"\s*,?\s*\.\s*\.+", "."),  # Multiple periods to one
        (r"\s*,?\s*question mark\s*", "? "),
        (r"\s*,?\s*exclamation mark\s*", "! "),
        (r"\s*,?\s*exclamation point\s*", "! "),
        (r"\s*,?\s*colon\s*", ": "),
        (r"\s*,?\s*semicolon\s*", "; "),
        (r"\s*,?\s*semi colon\s*", "; "),
        (r"\s*,?\s*dash\s*", " - "),
        (r"\s*,?\s*hyphen\s*", "-"),
        (r"\s*,?\s*apostrophe\s*", "'"),
        (r"\s*,?\s*open quote\s*", ' "'),
        (r"\s*,?\s*close quote\s*", '" '),
        (r"\s*,?\s*quote\s*", '"'),
        (r"\s*,?\s*open paren\s*", " ("),
        (r"\s*,?\s*close paren\s*", ") "),
        # Common words/symbols
        (r"\s*,?\s*at sign\s*", "@"),
        (r"\s*,?\s*ampersand\s*", "&"),
        (r"\s*,?\s*dollar sign\s*", "$"),
        (r"\s*,?\s*percent sign\s*", "%"),
        (r"\s*,?\s*percent\s*", "%"),
        (r"\s*,?\s*hashtag\s*", "#"),
        (r"\s*,?\s*hash\s*", "#"),
        (r"\s*,?\s*asterisk\s*", "*"),
        (r"\s*,?\s*star\s*", "*"),
        (r"\s*,?\s*underscore\s*", "_"),
        (r"\s*,?\s*slash\s*", "/"),
        (r"\s*,?\s*backslash\s*", "\\\\"),
    ]

    for pattern, replacement in replacements:
        text = re.sub(pattern, replacement, text, flags=re.IGNORECASE)

    # Capitalize after sentence endings
    def capitalize_after(match):
        return match.group(1) + match.group(2).upper()

    text = re.sub(r"([.!?]\s+)([a-z])", capitalize_after, text)

    # Capitalize first letter
    if text:
        text = text[0].upper() + text[1:]

    # Clean up extra spaces
    text = re.sub(r" +", " ", text)
    text = re.sub(r" ([.,!?:;])", r"\1", text)

    # Fix double periods
    text = re.sub(r"\.+", ".", text)

    return text.strip()

run_push_to_talk ¶

run_push_to_talk(core, should_continue)

Dictate while the activation keys are held (the silent-activation path).

Mirrors the voice notes loop but is gated on should_continue — a predicate that is True while the keys remain held — instead of a spoken "stop notes": each utterance captured from core is formatted and inserted until the keys are released. The capture waits re-check should_continue so releasing stops dictation promptly.

Source code in src/plugins/dictation.py

def run_push_to_talk(core, should_continue):
    """Dictate while the activation keys are held (the silent-activation path).

    Mirrors the voice `notes` loop but is gated on `should_continue` — a predicate that
    is True while the keys remain held — instead of a spoken "stop notes": each
    utterance captured from `core` is formatted and inserted until the keys are
    released. The capture waits re-check `should_continue` so releasing stops dictation
    promptly.
    """
    logger.info("🎙️ Push-to-talk dictation — release to end")
    while should_continue():
        core.flush_stream()
        first = core.wait_for_speech(
            timeout=PTT_LISTEN_TIMEOUT, should_continue=should_continue
        )
        if not first:
            continue
        audio = first + core.record_until_silence(should_continue=should_continue)
        text = core.transcribe(audio, prompt=DICTATION_PROMPT)
        if not text:
            continue
        if _dictate_utterance(core, text.strip().lower()):
            return

handle ¶

handle(cmd, core)

Enter dictation mode on a whole-word "note"/"notes"; return None otherwise.

Matching whole words (not substrings) keeps unrelated words like "notebook" or "noted" from triggering it. While in dictation mode it loops, transcribing speech and inserting it into the focused field until "stop notes" is heard.

Source code in src/plugins/dictation.py

def handle(cmd, core):
    """Enter dictation mode on a whole-word "note"/"notes"; return None otherwise.

    Matching whole words (not substrings) keeps unrelated words like "notebook" or
    "noted" from triggering it. While in dictation mode it loops, transcribing speech
    and inserting it into the focused field until "stop notes" is heard.
    """
    words = cmd.split()
    if ("notes" in words or "note" in words) and "stop" not in words:
        core.speak("Dictation")

        logger.info("🎙️ Dictation mode - say 'stop notes' to end")

        while True:
            # Clear buffer and wait for speech
            core.stream.read(
                core.stream.get_read_available(), exception_on_overflow=False
            )
            first = core.wait_for_speech(timeout=30)

            if not first:
                logger.debug("   (waiting...)")
                continue

            audio = first + core.record_until_silence()
            text = core.transcribe(audio, prompt=DICTATION_PROMPT)

            if not text:
                continue

            text = text.strip().lower()
            logger.debug("   Raw: %s", text)

            # Check for exit - include mishearings
            if any(
                x in text
                for x in [
                    "stop notes",
                    "stop note",
                    "end notes",
                    "exit notes",
                    "stop nurts",
                    "stop nots",
                    "stop nuts",
                    "stopnotes",
                    "done notes",
                    "finish notes",
                    "close notes",
                    "closed notes",
                ]
            ):
                core.speak("Done")
                return True

            # Format and insert
            if _dictate_utterance(core, text):
                return True

    return None

mousegrid (00_mousegrid)¶

plugins.00_mousegrid ¶

Mouse Grid Plugin - Voice-controlled mouse via GNOME Shell D-Bus extension.

Features: - Chained numbers: "3 7 5" zooms three times in one utterance - Repetition: "nudge up 5", "scroll down 3" - Drag support: "mark" to start, "drag" to end