mirror of
https://github.com/maglore9900/max_headroom.git
synced 2025-06-04 10:30:24 +00:00
this is a personal project to create a voice directed digital assistant based on the max headroom character.
written in python, using langchain, langgraph, etc.
written to work on Windows. Agent and logic will run on linux but tools are currently windows only.
it currently will respond as an LLM like usual, but also has the following capabilities:
- can also control spotify
- can open applications on windows
- can change the focused window
- set timer
- coming soon: journalling in markdown, with a save target for obsidian
this is a fun work in progress. if you want to use it and or develop for it be my guest. would love to have more tools designed.
Note:
- this will work with openai or ollama models. you will need to set up the .env for that as well as spotify
- this is designed to pull a custom voice from the alltalk project https://github.com/erew123/alltalk_tts, that is how I am cloning max headroom's voice. You can alter or simply not use this, it will currently fallback to pyttsx3.
- speech-to-text can use google, vosk, or faster-whisper. faster-whisper is currently the default and optimal method.
Will move more variables to the .env folders soon.
.env is under the module folder for now
Description
Languages
Python
100%