jmorganca
|
cd776e49ad
|
llama: wip vision support for runner
|
2024-08-12 22:18:30 -07:00 |
|
Daniel Hiltgen
|
e584f14e78
|
Merge pull request #6123 from dhiltgen/go_server_unified
llama: Runtime selection of new or old runners
|
2024-08-01 15:51:51 -07:00 |
|
Daniel Hiltgen
|
3cc7ec4429
|
fix dolphin-mistral
|
2024-08-01 14:50:36 -07:00 |
|
Daniel Hiltgen
|
343aba9fca
|
harden integration tests
|
2024-08-01 14:50:28 -07:00 |
|
Daniel Hiltgen
|
46c5f5fd9e
|
Runtime selection of new or old runners
This adjusts the new runners to comingle with existing runners so we can use an
env var to toggle the new runners on.
|
2024-08-01 09:06:01 -07:00 |
|
Daniel Hiltgen
|
c1384c8bcc
|
Implement timings response in Go server
This implements the fields necessary for `run --verbose`
to generate timing information.
|
2024-08-01 09:05:08 -07:00 |
|
Daniel Hiltgen
|
d0e239b85f
|
Merge pull request #6110 from dhiltgen/go_server_embeds
llama: Get embeddings working
|
2024-08-01 07:59:35 -07:00 |
|
Daniel Hiltgen
|
b2f8a6120c
|
Get embeddings working
Truncation doesn't pass, but the other embeddings tests pass
|
2024-07-31 17:03:05 -07:00 |
|
Daniel Hiltgen
|
049f40e4e2
|
Merge pull request #6107 from dhiltgen/go_server_parallel
llama: Fix parallel requests
|
2024-07-31 16:36:49 -07:00 |
|
Daniel Hiltgen
|
71b287264d
|
Fix parallel requests
|
2024-07-31 15:21:43 -07:00 |
|
Daniel Hiltgen
|
41bf8d9932
|
Update sync with latest llama.cpp layout, and run against b3485
|
2024-07-31 09:50:39 -07:00 |
|
Daniel Hiltgen
|
5152a430f5
|
Prefix all build artifacts with an OS/ARCH dir
This will help keep incremental builds from stomping on each other and make it
easier to stitch together the final runner payloads
|
2024-07-29 15:38:52 -07:00 |
|
Daniel Hiltgen
|
6456a7fd73
|
Get linux building
Still needs a bit more refinement to (auto)detect cuda/hip and fallback
gracefully if not detected.
|
2024-07-29 15:38:52 -07:00 |
|
jmorganca
|
8931724a7a
|
add note in readme
|
2024-07-29 15:38:52 -07:00 |
|
jmorganca
|
7ad4c5334e
|
clean up metal code
|
2024-07-29 15:38:52 -07:00 |
|
jmorganca
|
9caee9f8e3
|
fix Makefile on windows
|
2024-07-29 15:38:52 -07:00 |
|
jmorganca
|
518ba1c793
|
remove printing
|
2024-07-29 15:38:52 -07:00 |
|
jmorganca
|
6dfd6db10c
|
dont apply license to stb_image.h and json.hpp
|
2024-07-29 15:38:52 -07:00 |
|
jmorganca
|
2abf81885d
|
lint
|
2024-07-29 15:38:52 -07:00 |
|
jmorganca
|
a50c1070f1
|
update sync header
|
2024-07-29 15:38:52 -07:00 |
|
jmorganca
|
a48179c340
|
remove unused script
|
2024-07-29 15:38:52 -07:00 |
|
jmorganca
|
f6faf66dac
|
fix metal
|
2024-07-29 15:38:52 -07:00 |
|
jmorganca
|
fd15793930
|
add header to not edit
|
2024-07-29 15:38:52 -07:00 |
|
jmorganca
|
5d6a6e5282
|
add header to not edit
|
2024-07-29 15:38:52 -07:00 |
|
jmorganca
|
f8424faf75
|
fix build on windows
|
2024-07-29 15:38:52 -07:00 |
|
jmorganca
|
095e7a9d7d
|
fix Makefile
|
2024-07-29 15:38:52 -07:00 |
|
jmorganca
|
e16d13d88b
|
fix README.md
|
2024-07-29 15:38:52 -07:00 |
|
jmorganca
|
595d8878c4
|
fix README.md
|
2024-07-29 15:38:52 -07:00 |
|
jmorganca
|
e80789bd86
|
consistent whitespace
|
2024-07-29 15:38:52 -07:00 |
|
jmorganca
|
a2f44f0af5
|
update .gitattributes
|
2024-07-29 15:38:52 -07:00 |
|
jmorganca
|
295c202b2f
|
link metal
|
2024-07-29 15:38:52 -07:00 |
|
jmorganca
|
f96cade3a6
|
wip
|
2024-07-29 15:38:52 -07:00 |
|
jmorganca
|
b767f6554c
|
wip meta
|
2024-07-29 15:38:52 -07:00 |
|
jmorganca
|
87833dd606
|
sync
|
2024-07-29 15:38:52 -07:00 |
|
jmorganca
|
2f94ffd801
|
remove perl docs
|
2024-07-29 15:38:52 -07:00 |
|
jmorganca
|
e9d15eb277
|
remove build scripts
|
2024-07-29 15:38:52 -07:00 |
|
jmorganca
|
4051a26f6f
|
remove need for perl
|
2024-07-29 15:38:52 -07:00 |
|
jmorganca
|
a687913a97
|
fix output
|
2024-07-29 15:38:52 -07:00 |
|
jmorganca
|
6110d25dce
|
arch build
|
2024-07-29 15:38:52 -07:00 |
|
jmorganca
|
2081ec9ba1
|
add temporary makefile
|
2024-07-29 15:38:52 -07:00 |
|
jmorganca
|
4b13e564eb
|
fix cuda and rocm builds
|
2024-07-29 15:38:51 -07:00 |
|
jmorganca
|
34015ca10d
|
fix cgo flags for darwin amd64
|
2024-07-29 15:38:51 -07:00 |
|
jmorganca
|
11508826b2
|
remove -fPIC from build_hipblas.sh
|
2024-07-29 15:38:51 -07:00 |
|
jmorganca
|
ac090b6b71
|
fix issues with runner
|
2024-07-29 15:38:51 -07:00 |
|
jmorganca
|
6dab2a9d3a
|
move sync script back in for now
|
2024-07-29 15:38:51 -07:00 |
|
jmorganca
|
834aac8450
|
llama: sync
|
2024-07-29 15:38:51 -07:00 |
|
jmorganca
|
ac6b154cc4
|
update to d5c938cd
|
2024-07-29 15:38:51 -07:00 |
|
jmorganca
|
0574fe199a
|
add patches
|
2024-07-29 15:38:51 -07:00 |
|
jmorganca
|
028fda3582
|
cleanup stop code
|
2024-07-29 15:38:51 -07:00 |
|
jmorganca
|
8ef58a6695
|
fix example
|
2024-07-29 15:38:51 -07:00 |
|