3266 Commits

Author SHA1 Message Date
jmorganca
1a801fba2a remove dependency on llm 2024-07-29 15:38:51 -07:00
jmorganca
727494ea54 grammar 2024-07-29 15:38:51 -07:00
jmorganca
b39fca7088 sampling 2024-07-29 15:38:51 -07:00
jmorganca
db55b1b89d better example module, add port 2024-07-29 15:38:51 -07:00
jmorganca
1124e24aff wip 2024-07-29 15:38:51 -07:00
jmorganca
df44d119a3 add llava to runner 2024-07-29 15:38:51 -07:00
jmorganca
86955c3014 fix output in build_hipblas.sh 2024-07-29 15:38:51 -07:00
jmorganca
c05ba504ef mods to build_hipblas.sh for linux 2024-07-29 15:38:51 -07:00
jmorganca
aaca2ce093 wip 2024-07-29 15:38:51 -07:00
jmorganca
921708003e improve cuda and hipblas build scripts 2024-07-29 15:38:51 -07:00
jmorganca
323a3f1f3a cuda linux 2024-07-29 15:38:51 -07:00
Jeffrey Morgan
07d6e589ca Update README.md 2024-07-29 15:38:51 -07:00
Jeffrey Morgan
aa52dfcaaf Update README.md 2024-07-29 15:38:51 -07:00
jmorganca
31e0de825e disable log file 2024-07-29 15:38:51 -07:00
jmorganca
d65b4ea480 fix readme for llava 2024-07-29 15:38:51 -07:00
jmorganca
878eb9a19f add llava 2024-07-29 15:38:51 -07:00
jmorganca
5818e3b210 llama: add clip dependencies 2024-07-29 15:38:51 -07:00
jmorganca
2a41ad5b1f add clip and parallel requests to the todo list 2024-07-29 15:38:51 -07:00
jmorganca
cf1ec78071 fix cuda build 2024-07-29 15:38:51 -07:00
jmorganca
57d03929cd fix build on windows 2024-07-29 15:38:51 -07:00
jmorganca
0a6b1adbd7 fix ggml-metal.m build constraints 2024-07-29 15:38:51 -07:00
jmorganca
ec60d79a67 fix ggml-metal.m 2024-07-29 15:38:51 -07:00
jmorganca
3d656588a7 avx2 should only add avx2 2024-07-29 15:38:51 -07:00
jmorganca
460d9857e2 fix sync script 2024-07-29 15:38:51 -07:00
jmorganca
a5548a81fc fix ggml-metal.m 2024-07-29 15:38:51 -07:00
jmorganca
634f6a75d0 fix ggml-metal.m 2024-07-29 15:38:51 -07:00
jmorganca
3b5e5a6280 add license headers 2024-07-29 15:38:51 -07:00
jmorganca
853d96b1b1 pre-patch 2024-07-29 15:38:51 -07:00
jmorganca
4dd63c1fef move runner package down 2024-07-29 15:38:51 -07:00
jmorganca
82214396b5 replace static build in llm 2024-07-29 15:38:51 -07:00
jmorganca
8ca4a9a70a fix build 2024-07-29 15:35:09 -07:00
jmorganca
25fd8fd045 wip... 2024-07-29 15:35:09 -07:00
jmorganca
be2f37b5d4 rename server to runner 2024-07-29 15:35:09 -07:00
Jeffrey Morgan
9e28405c54 Update README.md 2024-07-29 15:35:09 -07:00
Jeffrey Morgan
9f3e950120 Update README.md 2024-07-29 15:35:09 -07:00
Jeffrey Morgan
951104045f Update README.md 2024-07-29 15:35:09 -07:00
Jeffrey Morgan
597712006c Update README.md 2024-07-29 15:35:09 -07:00
jmorganca
64e712b12b Add missing hipcc flags 2024-07-29 15:35:09 -07:00
jmorganca
85aea62997 fix .gitattributes 2024-07-29 15:35:09 -07:00
jmorganca
491ff41675 Initial llama Go module 2024-07-29 15:35:09 -07:00
jmorganca
075f2e88d9 add sync of llama.cpp 2024-07-29 15:35:09 -07:00
Daniel Hiltgen
1a83581a8e
Merge pull request #5895 from dhiltgen/sched_faq
Better explain multi-gpu behavior
2024-07-29 14:25:41 -07:00
Daniel Hiltgen
37926eb991
Merge pull request #5927 from dhiltgen/high_cpu_count
Ensure amd gpu nodes are numerically sorted
2024-07-29 14:24:57 -07:00
Daniel Hiltgen
3d4634fdff
Merge pull request #5934 from dhiltgen/missing_cuda_repo
Report better error on cuda unsupported os/arch
2024-07-29 14:24:20 -07:00
royjhan
365431d406
return tool calls finish reason for openai (#5995)
* hot fix

* backend stream support

* clean up

* finish reason

* move to openai
2024-07-29 13:56:57 -07:00
Daniel Hiltgen
161e12cecf
Merge pull request #5932 from dhiltgen/win_font
Explain font problems on windows 10
2024-07-29 13:40:24 -07:00
Jeffrey Morgan
46e6327e0f
api: add stringifier for Tool (#5891) 2024-07-29 13:35:16 -07:00
Jeffrey Morgan
68ee42f995
update llama.cpp submodule to 6eeaeba1 (#6039) 2024-07-29 13:20:26 -07:00
Ikko Eltociear Ashimine
f26aef9a8b
docs: update README.md (#6059)
HuggingFace -> Hugging Face
2024-07-29 10:53:30 -07:00
Michael Yang
38d9036b59
Merge pull request #5992 from ollama/mxyng/save
fix: model save
2024-07-29 09:53:19 -07:00