3315 Commits

Author SHA1 Message Date
Daniel Hiltgen
049f40e4e2 Merge pull request #6107 from dhiltgen/go_server_parallel
llama: Fix parallel requests
2024-07-31 16:36:49 -07:00
Daniel Hiltgen
71b287264d Fix parallel requests 2024-07-31 15:21:43 -07:00
Daniel Hiltgen
41bf8d9932 Update sync with latest llama.cpp layout, and run against b3485 2024-07-31 09:50:39 -07:00
Daniel Hiltgen
5152a430f5 Prefix all build artifacts with an OS/ARCH dir
This will help keep incremental builds from stomping on each other and make it
easier to stitch together the final runner payloads
2024-07-29 15:38:52 -07:00
Daniel Hiltgen
6456a7fd73 Get linux building
Still needs a bit more refinement to (auto)detect cuda/hip and fallback
gracefully if not detected.
2024-07-29 15:38:52 -07:00
jmorganca
8931724a7a add note in readme 2024-07-29 15:38:52 -07:00
jmorganca
7ad4c5334e clean up metal code 2024-07-29 15:38:52 -07:00
jmorganca
9caee9f8e3 fix Makefile on windows 2024-07-29 15:38:52 -07:00
jmorganca
518ba1c793 remove printing 2024-07-29 15:38:52 -07:00
jmorganca
6dfd6db10c dont apply license to stb_image.h and json.hpp 2024-07-29 15:38:52 -07:00
jmorganca
2abf81885d lint 2024-07-29 15:38:52 -07:00
jmorganca
a50c1070f1 update sync header 2024-07-29 15:38:52 -07:00
jmorganca
a48179c340 remove unused script 2024-07-29 15:38:52 -07:00
jmorganca
f6faf66dac fix metal 2024-07-29 15:38:52 -07:00
jmorganca
fd15793930 add header to not edit 2024-07-29 15:38:52 -07:00
jmorganca
5d6a6e5282 add header to not edit 2024-07-29 15:38:52 -07:00
jmorganca
f8424faf75 fix build on windows 2024-07-29 15:38:52 -07:00
jmorganca
095e7a9d7d fix Makefile 2024-07-29 15:38:52 -07:00
jmorganca
e16d13d88b fix README.md 2024-07-29 15:38:52 -07:00
jmorganca
595d8878c4 fix README.md 2024-07-29 15:38:52 -07:00
jmorganca
e80789bd86 consistent whitespace 2024-07-29 15:38:52 -07:00
jmorganca
a2f44f0af5 update .gitattributes 2024-07-29 15:38:52 -07:00
jmorganca
295c202b2f link metal 2024-07-29 15:38:52 -07:00
jmorganca
f96cade3a6 wip 2024-07-29 15:38:52 -07:00
jmorganca
b767f6554c wip meta 2024-07-29 15:38:52 -07:00
jmorganca
87833dd606 sync 2024-07-29 15:38:52 -07:00
jmorganca
2f94ffd801 remove perl docs 2024-07-29 15:38:52 -07:00
jmorganca
e9d15eb277 remove build scripts 2024-07-29 15:38:52 -07:00
jmorganca
4051a26f6f remove need for perl 2024-07-29 15:38:52 -07:00
jmorganca
a687913a97 fix output 2024-07-29 15:38:52 -07:00
jmorganca
6110d25dce arch build 2024-07-29 15:38:52 -07:00
jmorganca
2081ec9ba1 add temporary makefile 2024-07-29 15:38:52 -07:00
jmorganca
4b13e564eb fix cuda and rocm builds 2024-07-29 15:38:51 -07:00
jmorganca
34015ca10d fix cgo flags for darwin amd64 2024-07-29 15:38:51 -07:00
jmorganca
11508826b2 remove -fPIC from build_hipblas.sh 2024-07-29 15:38:51 -07:00
jmorganca
ac090b6b71 fix issues with runner 2024-07-29 15:38:51 -07:00
jmorganca
6dab2a9d3a move sync script back in for now 2024-07-29 15:38:51 -07:00
jmorganca
834aac8450 llama: sync 2024-07-29 15:38:51 -07:00
jmorganca
ac6b154cc4 update to d5c938cd 2024-07-29 15:38:51 -07:00
jmorganca
0574fe199a add patches 2024-07-29 15:38:51 -07:00
jmorganca
028fda3582 cleanup stop code 2024-07-29 15:38:51 -07:00
jmorganca
8ef58a6695 fix example 2024-07-29 15:38:51 -07:00
jmorganca
b9db5ab5d0 revert llm changes 2024-07-29 15:38:51 -07:00
jmorganca
a796b7aeaf num predict 2024-07-29 15:38:51 -07:00
jmorganca
89cb4b8d6b basic progress 2024-07-29 15:38:51 -07:00
jmorganca
0d365e8d34 add more runner params 2024-07-29 15:38:51 -07:00
jmorganca
72ff94efe0 truncate stop properly 2024-07-29 15:38:51 -07:00
jmorganca
240d4cf0aa wip stop tokens 2024-07-29 15:38:51 -07:00
jmorganca
424627c347 embeddings 2024-07-29 15:38:51 -07:00
jmorganca
1a801fba2a remove dependency on llm 2024-07-29 15:38:51 -07:00