Daniel Hiltgen
|
049f40e4e2
|
Merge pull request #6107 from dhiltgen/go_server_parallel
llama: Fix parallel requests
|
2024-07-31 16:36:49 -07:00 |
|
Daniel Hiltgen
|
71b287264d
|
Fix parallel requests
|
2024-07-31 15:21:43 -07:00 |
|
Daniel Hiltgen
|
41bf8d9932
|
Update sync with latest llama.cpp layout, and run against b3485
|
2024-07-31 09:50:39 -07:00 |
|
Daniel Hiltgen
|
5152a430f5
|
Prefix all build artifacts with an OS/ARCH dir
This will help keep incremental builds from stomping on each other and make it
easier to stitch together the final runner payloads
|
2024-07-29 15:38:52 -07:00 |
|
Daniel Hiltgen
|
6456a7fd73
|
Get linux building
Still needs a bit more refinement to (auto)detect cuda/hip and fallback
gracefully if not detected.
|
2024-07-29 15:38:52 -07:00 |
|
jmorganca
|
8931724a7a
|
add note in readme
|
2024-07-29 15:38:52 -07:00 |
|
jmorganca
|
7ad4c5334e
|
clean up metal code
|
2024-07-29 15:38:52 -07:00 |
|
jmorganca
|
9caee9f8e3
|
fix Makefile on windows
|
2024-07-29 15:38:52 -07:00 |
|
jmorganca
|
518ba1c793
|
remove printing
|
2024-07-29 15:38:52 -07:00 |
|
jmorganca
|
6dfd6db10c
|
dont apply license to stb_image.h and json.hpp
|
2024-07-29 15:38:52 -07:00 |
|
jmorganca
|
2abf81885d
|
lint
|
2024-07-29 15:38:52 -07:00 |
|
jmorganca
|
a50c1070f1
|
update sync header
|
2024-07-29 15:38:52 -07:00 |
|
jmorganca
|
a48179c340
|
remove unused script
|
2024-07-29 15:38:52 -07:00 |
|
jmorganca
|
f6faf66dac
|
fix metal
|
2024-07-29 15:38:52 -07:00 |
|
jmorganca
|
fd15793930
|
add header to not edit
|
2024-07-29 15:38:52 -07:00 |
|
jmorganca
|
5d6a6e5282
|
add header to not edit
|
2024-07-29 15:38:52 -07:00 |
|
jmorganca
|
f8424faf75
|
fix build on windows
|
2024-07-29 15:38:52 -07:00 |
|
jmorganca
|
095e7a9d7d
|
fix Makefile
|
2024-07-29 15:38:52 -07:00 |
|
jmorganca
|
e16d13d88b
|
fix README.md
|
2024-07-29 15:38:52 -07:00 |
|
jmorganca
|
595d8878c4
|
fix README.md
|
2024-07-29 15:38:52 -07:00 |
|
jmorganca
|
e80789bd86
|
consistent whitespace
|
2024-07-29 15:38:52 -07:00 |
|
jmorganca
|
a2f44f0af5
|
update .gitattributes
|
2024-07-29 15:38:52 -07:00 |
|
jmorganca
|
295c202b2f
|
link metal
|
2024-07-29 15:38:52 -07:00 |
|
jmorganca
|
f96cade3a6
|
wip
|
2024-07-29 15:38:52 -07:00 |
|
jmorganca
|
b767f6554c
|
wip meta
|
2024-07-29 15:38:52 -07:00 |
|
jmorganca
|
87833dd606
|
sync
|
2024-07-29 15:38:52 -07:00 |
|
jmorganca
|
2f94ffd801
|
remove perl docs
|
2024-07-29 15:38:52 -07:00 |
|
jmorganca
|
e9d15eb277
|
remove build scripts
|
2024-07-29 15:38:52 -07:00 |
|
jmorganca
|
4051a26f6f
|
remove need for perl
|
2024-07-29 15:38:52 -07:00 |
|
jmorganca
|
a687913a97
|
fix output
|
2024-07-29 15:38:52 -07:00 |
|
jmorganca
|
6110d25dce
|
arch build
|
2024-07-29 15:38:52 -07:00 |
|
jmorganca
|
2081ec9ba1
|
add temporary makefile
|
2024-07-29 15:38:52 -07:00 |
|
jmorganca
|
4b13e564eb
|
fix cuda and rocm builds
|
2024-07-29 15:38:51 -07:00 |
|
jmorganca
|
34015ca10d
|
fix cgo flags for darwin amd64
|
2024-07-29 15:38:51 -07:00 |
|
jmorganca
|
11508826b2
|
remove -fPIC from build_hipblas.sh
|
2024-07-29 15:38:51 -07:00 |
|
jmorganca
|
ac090b6b71
|
fix issues with runner
|
2024-07-29 15:38:51 -07:00 |
|
jmorganca
|
6dab2a9d3a
|
move sync script back in for now
|
2024-07-29 15:38:51 -07:00 |
|
jmorganca
|
834aac8450
|
llama: sync
|
2024-07-29 15:38:51 -07:00 |
|
jmorganca
|
ac6b154cc4
|
update to d5c938cd
|
2024-07-29 15:38:51 -07:00 |
|
jmorganca
|
0574fe199a
|
add patches
|
2024-07-29 15:38:51 -07:00 |
|
jmorganca
|
028fda3582
|
cleanup stop code
|
2024-07-29 15:38:51 -07:00 |
|
jmorganca
|
8ef58a6695
|
fix example
|
2024-07-29 15:38:51 -07:00 |
|
jmorganca
|
b9db5ab5d0
|
revert llm changes
|
2024-07-29 15:38:51 -07:00 |
|
jmorganca
|
a796b7aeaf
|
num predict
|
2024-07-29 15:38:51 -07:00 |
|
jmorganca
|
89cb4b8d6b
|
basic progress
|
2024-07-29 15:38:51 -07:00 |
|
jmorganca
|
0d365e8d34
|
add more runner params
|
2024-07-29 15:38:51 -07:00 |
|
jmorganca
|
72ff94efe0
|
truncate stop properly
|
2024-07-29 15:38:51 -07:00 |
|
jmorganca
|
240d4cf0aa
|
wip stop tokens
|
2024-07-29 15:38:51 -07:00 |
|
jmorganca
|
424627c347
|
embeddings
|
2024-07-29 15:38:51 -07:00 |
|
jmorganca
|
1a801fba2a
|
remove dependency on llm
|
2024-07-29 15:38:51 -07:00 |
|