ollama/docs/speech.md
2024-08-07 13:04:57 -07:00

2.1 KiB

Speech to Text Prototype

To run

make {/path/to/whisper.cpp/server}

Update routes.go

  • replace whisperServer with path to server

api/generate

Request fields

  • speech (required):
    • audio (required): path to audio file
    • model (required): path to whisper model
    • transcribe (optional): if true, will transcribe and return the audio file
    • keep_alive: (optional): sets how long the model is stored in memory (default: 5m)
  • prompt (optional): if not null, passed in with the transcribed audio

Transcription

curl http://localhost:11434/api/generate -d '{
    "speech": {
        "model": "/Users/royhan-ollama/.ollama/whisper/ggml-base.en.bin",
        "audio": "/Users/royhan-ollama/ollama/llm/whisper.cpp/samples/jfk.wav",
        "transcribe": true,
        "keep_alive": "1m"
    },
    "stream": false
}' | jq

Response Generation

curl http://localhost:11434/api/generate -d '{
    "model": "llama3",
    "prompt": "What do you think about this quote?",
    "speech": {
        "model": "/Users/royhan-ollama/.ollama/whisper/ggml-base.en.bin",
        "audio": "/Users/royhan-ollama/ollama/llm/whisper.cpp/samples/jfk.wav",
        "keep_alive": "1m"
    },
    "stream": false
}' | jq

api/chat

Request fields

  • model (required): language model to chat with
  • speech (required):
    • model (required): path to whisper model
    • keep_alive: (optional): sets how long the model is stored in memory (default: 5m)
  • messages/message/audio (required): path to audio file
curl http://localhost:11434/api/chat -d '{
    "model": "llama3",
    "speech": {
        "model": "/Users/royhan-ollama/.ollama/whisper/ggml-base.en.bin",
        "keep_alive": "10m"
    },
    "messages": [
        {
            "role": "system",
            "content": "You are a Canadian Nationalist"
        },
        {
            "role": "user",
            "content": "What do you think about this quote?",
            "audio": "/Users/royhan-ollama/ollama/llm/whisper.cpp/samples/jfk.wav"
        }
    ],
    "stream": false
}' | jq