3323 Commits

Author SHA1 Message Date
jmorganca
cd776e49ad llama: wip vision support for runner 2024-08-12 22:18:30 -07:00
Daniel Hiltgen
e584f14e78
Merge pull request #6123 from dhiltgen/go_server_unified
llama: Runtime selection of new or old runners
2024-08-01 15:51:51 -07:00
Daniel Hiltgen
3cc7ec4429 fix dolphin-mistral 2024-08-01 14:50:36 -07:00
Daniel Hiltgen
343aba9fca harden integration tests 2024-08-01 14:50:28 -07:00
Daniel Hiltgen
46c5f5fd9e Runtime selection of new or old runners
This adjusts the new runners to comingle with existing runners so we can use an
env var to toggle the new runners on.
2024-08-01 09:06:01 -07:00
Daniel Hiltgen
c1384c8bcc Implement timings response in Go server
This implements the fields necessary for `run --verbose`
to generate timing information.
2024-08-01 09:05:08 -07:00
Daniel Hiltgen
d0e239b85f
Merge pull request #6110 from dhiltgen/go_server_embeds
llama: Get embeddings working
2024-08-01 07:59:35 -07:00
Daniel Hiltgen
b2f8a6120c Get embeddings working
Truncation doesn't pass, but the other embeddings tests pass
2024-07-31 17:03:05 -07:00
Daniel Hiltgen
049f40e4e2
Merge pull request #6107 from dhiltgen/go_server_parallel
llama: Fix parallel requests
2024-07-31 16:36:49 -07:00
Daniel Hiltgen
71b287264d Fix parallel requests 2024-07-31 15:21:43 -07:00
Daniel Hiltgen
41bf8d9932 Update sync with latest llama.cpp layout, and run against b3485 2024-07-31 09:50:39 -07:00
Daniel Hiltgen
5152a430f5 Prefix all build artifacts with an OS/ARCH dir
This will help keep incremental builds from stomping on each other and make it
easier to stitch together the final runner payloads
2024-07-29 15:38:52 -07:00
Daniel Hiltgen
6456a7fd73 Get linux building
Still needs a bit more refinement to (auto)detect cuda/hip and fallback
gracefully if not detected.
2024-07-29 15:38:52 -07:00
jmorganca
8931724a7a add note in readme 2024-07-29 15:38:52 -07:00
jmorganca
7ad4c5334e clean up metal code 2024-07-29 15:38:52 -07:00
jmorganca
9caee9f8e3 fix Makefile on windows 2024-07-29 15:38:52 -07:00
jmorganca
518ba1c793 remove printing 2024-07-29 15:38:52 -07:00
jmorganca
6dfd6db10c dont apply license to stb_image.h and json.hpp 2024-07-29 15:38:52 -07:00
jmorganca
2abf81885d lint 2024-07-29 15:38:52 -07:00
jmorganca
a50c1070f1 update sync header 2024-07-29 15:38:52 -07:00
jmorganca
a48179c340 remove unused script 2024-07-29 15:38:52 -07:00
jmorganca
f6faf66dac fix metal 2024-07-29 15:38:52 -07:00
jmorganca
fd15793930 add header to not edit 2024-07-29 15:38:52 -07:00
jmorganca
5d6a6e5282 add header to not edit 2024-07-29 15:38:52 -07:00
jmorganca
f8424faf75 fix build on windows 2024-07-29 15:38:52 -07:00
jmorganca
095e7a9d7d fix Makefile 2024-07-29 15:38:52 -07:00
jmorganca
e16d13d88b fix README.md 2024-07-29 15:38:52 -07:00
jmorganca
595d8878c4 fix README.md 2024-07-29 15:38:52 -07:00
jmorganca
e80789bd86 consistent whitespace 2024-07-29 15:38:52 -07:00
jmorganca
a2f44f0af5 update .gitattributes 2024-07-29 15:38:52 -07:00
jmorganca
295c202b2f link metal 2024-07-29 15:38:52 -07:00
jmorganca
f96cade3a6 wip 2024-07-29 15:38:52 -07:00
jmorganca
b767f6554c wip meta 2024-07-29 15:38:52 -07:00
jmorganca
87833dd606 sync 2024-07-29 15:38:52 -07:00
jmorganca
2f94ffd801 remove perl docs 2024-07-29 15:38:52 -07:00
jmorganca
e9d15eb277 remove build scripts 2024-07-29 15:38:52 -07:00
jmorganca
4051a26f6f remove need for perl 2024-07-29 15:38:52 -07:00
jmorganca
a687913a97 fix output 2024-07-29 15:38:52 -07:00
jmorganca
6110d25dce arch build 2024-07-29 15:38:52 -07:00
jmorganca
2081ec9ba1 add temporary makefile 2024-07-29 15:38:52 -07:00
jmorganca
4b13e564eb fix cuda and rocm builds 2024-07-29 15:38:51 -07:00
jmorganca
34015ca10d fix cgo flags for darwin amd64 2024-07-29 15:38:51 -07:00
jmorganca
11508826b2 remove -fPIC from build_hipblas.sh 2024-07-29 15:38:51 -07:00
jmorganca
ac090b6b71 fix issues with runner 2024-07-29 15:38:51 -07:00
jmorganca
6dab2a9d3a move sync script back in for now 2024-07-29 15:38:51 -07:00
jmorganca
834aac8450 llama: sync 2024-07-29 15:38:51 -07:00
jmorganca
ac6b154cc4 update to d5c938cd 2024-07-29 15:38:51 -07:00
jmorganca
0574fe199a add patches 2024-07-29 15:38:51 -07:00
jmorganca
028fda3582 cleanup stop code 2024-07-29 15:38:51 -07:00
jmorganca
8ef58a6695 fix example 2024-07-29 15:38:51 -07:00