Mirror of https://github.com/tcsenpai/ollama.git (synced 2025-06-10 21:17:08 +00:00)
Speech to Text Prototype
To run
make {/path/to/whisper.cpp/server}
Update routes.go
- replace whisperServer with the path to the server
api/generate
Request fields
- speech (required):
  - audio (required): path to audio file
  - model (required): path to whisper model
  - transcribe (optional): if true, transcribes the audio file and returns the transcription
  - keep_alive (optional): sets how long the model is stored in memory (default: 5m)
- prompt (optional): if not null, passed in with the transcribed audio
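The field list above can be sketched as a small request builder. This is a minimal illustration, not part of the prototype: the helper names build_speech and build_generate_body are hypothetical, and the file paths are placeholders. Optional fields are left out when unset so that the server-side defaults apply.

```python
import json


def build_speech(audio, model, transcribe=None, keep_alive=None):
    """Assemble the `speech` object (hypothetical helper).

    `audio` and `model` are the two required paths from the field list.
    """
    if not audio or not model:
        raise ValueError("speech.audio and speech.model are both required")
    speech = {"model": model, "audio": audio}
    # Optional fields are omitted when unset so server defaults apply.
    if transcribe is not None:
        speech["transcribe"] = bool(transcribe)
    if keep_alive is not None:
        speech["keep_alive"] = keep_alive
    return speech


def build_generate_body(speech, model=None, prompt=None, stream=False):
    """Assemble the JSON body for api/generate (hypothetical helper)."""
    body = {"speech": speech, "stream": stream}
    if model is not None:
        body["model"] = model
    if prompt is not None:
        body["prompt"] = prompt
    return body


# Placeholder paths; substitute real locations on your machine.
body = build_generate_body(
    build_speech(
        audio="/path/to/audio.wav",
        model="/path/to/ggml-base.en.bin",
        transcribe=True,
        keep_alive="1m",
    )
)
print(json.dumps(body, indent=2))
```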
Transcription
curl http://localhost:11434/api/generate -d '{
  "speech": {
    "model": "/Users/royhan-ollama/.ollama/whisper/ggml-base.en.bin",
    "audio": "/Users/royhan-ollama/ollama/llm/whisper.cpp/samples/jfk.wav",
    "transcribe": true,
    "keep_alive": "1m"
  },
  "stream": false
}' | jq
Response Generation
curl http://localhost:11434/api/generate -d '{
  "model": "llama3",
  "prompt": "What do you think about this quote?",
  "speech": {
    "model": "/Users/royhan-ollama/.ollama/whisper/ggml-base.en.bin",
    "audio": "/Users/royhan-ollama/ollama/llm/whisper.cpp/samples/jfk.wav",
    "keep_alive": "1m"
  },
  "stream": false
}' | jq
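The same request can be issued from a script. The sketch below uses only the Python standard library and mirrors the curl call above; the function names make_request and send are hypothetical, the paths are placeholders, and it assumes that with "stream" set to false the server replies with a single JSON object. The response fields are not documented here, so the reply is returned unparsed beyond JSON decoding.

```python
import json
import urllib.request

# Host and port are taken from the curl examples in this README.
GENERATE_URL = "http://localhost:11434/api/generate"


def make_request(body, url=GENERATE_URL):
    """Wrap a JSON body in a POST request (hypothetical helper)."""
    data = json.dumps(body).encode("utf-8")
    return urllib.request.Request(
        url,
        data=data,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def send(request, timeout=120):
    """Send the request and decode the reply.

    Assumes a non-streaming reply, i.e. one JSON object in the body.
    """
    with urllib.request.urlopen(request, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))


request = make_request(
    {
        "model": "llama3",
        "prompt": "What do you think about this quote?",
        "speech": {
            # Placeholder paths; substitute real locations.
            "model": "/path/to/ggml-base.en.bin",
            "audio": "/path/to/audio.wav",
            "keep_alive": "1m",
        },
        "stream": False,
    }
)
# reply = send(request)  # requires a running server built with the prototype
```

Building the request separately from sending it keeps the payload easy to inspect before any model is loaded.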
api/chat
Request fields
- model (required): language model to chat with
- speech (required):
  - model (required): path to whisper model
  - keep_alive (optional): sets how long the model is stored in memory (default: 5m)
- messages/message/audio (required): path to audio file
curl http://localhost:11434/api/chat -d '{
  "model": "llama3",
  "speech": {
    "model": "/Users/royhan-ollama/.ollama/whisper/ggml-base.en.bin",
    "keep_alive": "10m"
  },
  "messages": [
    {
      "role": "system",
      "content": "You are a Canadian Nationalist"
    },
    {
      "role": "user",
      "content": "What do you think about this quote?",
      "audio": "/Users/royhan-ollama/ollama/llm/whisper.cpp/samples/jfk.wav"
    }
  ],
  "stream": false
}' | jq
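For api/chat the audio path travels on an individual message rather than inside speech. A minimal sketch of assembling such a body follows; the helper names message and build_chat_body are hypothetical, the paths are placeholders, and the system prompt is only an example.

```python
import json


def message(role, content, audio=None):
    """Build one chat message; `audio` is an optional path to an audio file."""
    msg = {"role": role, "content": content}
    if audio is not None:
        msg["audio"] = audio
    return msg


def build_chat_body(model, whisper_model, messages, keep_alive=None, stream=False):
    """Assemble the JSON body for api/chat (hypothetical helper)."""
    speech = {"model": whisper_model}
    # keep_alive is optional; leaving it out lets the server default apply.
    if keep_alive is not None:
        speech["keep_alive"] = keep_alive
    return {
        "model": model,
        "speech": speech,
        "messages": list(messages),
        "stream": stream,
    }


chat_body = build_chat_body(
    model="llama3",
    whisper_model="/path/to/ggml-base.en.bin",  # placeholder path
    keep_alive="10m",
    messages=[
        message("system", "You are a helpful assistant"),
        message(
            "user",
            "What do you think about this quote?",
            audio="/path/to/audio.wav",  # placeholder path
        ),
    ],
)
print(json.dumps(chat_body, indent=2))
```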