ollama/softmax.cuh at e16d13d88bce2115b0166d3b56cda189c5683a9c - ollama - Gitea: Git with a cup of tea

tcsenpai/ollama

mirror of https://github.com/tcsenpai/ollama.git synced 2025-06-12 14:07:10 +00:00

jmorganca 87833dd606 sync

2024-07-29 15:38:52 -07:00

6 lines

142 B

Plaintext

Raw Blame History

 #include "common.cuh"
 #define CUDA_SOFT_MAX_BLOCK_SIZE 1024
 void ggml_cuda_op_soft_max(ggml_backend_cuda_context & ctx, ggml_tensor * dst);