Commit Graph

  • 7aa7a3c1e5 remove -fPIC from build_hipblas.sh jmorganca 2024-06-07 12:52:49 -04:00
  • de634b7fd7 fix issues with runner jmorganca 2024-06-07 09:32:52 -07:00
  • 795753be7e move sync script back in for now jmorganca 2024-06-07 09:26:44 -07:00
  • 0eed68fed4 llama: sync jmorganca 2024-06-07 00:27:24 -07:00
  • 783134a3bb update to d5c938cd jmorganca 2024-06-07 00:15:58 -07:00
  • 74a158a79e add patches jmorganca 2024-06-06 23:55:47 -07:00
  • 8f79a2e86a cleanup stop code jmorganca 2024-06-04 00:58:58 -07:00
  • a4d402c403 fix example jmorganca 2024-06-04 00:43:03 -07:00
  • e1dfc757b3 revert llm changes jmorganca 2024-06-04 00:40:19 -07:00
  • 7d0a452938 num predict jmorganca 2024-05-28 23:38:44 -07:00
  • 43efc893d7 basic progress jmorganca 2024-05-28 23:11:48 -07:00
  • 20afaae020 add more runner params jmorganca 2024-05-28 00:02:01 -07:00
  • 72f3fe4b94 truncate stop properly jmorganca 2024-05-27 23:09:56 -07:00
  • a379d68aa9 wip stop tokens jmorganca 2024-05-27 14:38:44 -07:00
  • b2ef3bf490 embeddings jmorganca 2024-05-27 11:33:47 -07:00
  • ce15ed6d69 remove dependency on llm jmorganca 2024-05-26 23:23:09 -07:00
  • c0b94376b2 grammar jmorganca 2024-05-26 23:14:44 -07:00
  • 72be8e27c4 sampling jmorganca 2024-05-26 23:01:05 -07:00
  • d12db0568e better example module, add port jmorganca 2024-05-25 20:11:57 -07:00
  • ec17359a68 wip jmorganca 2024-05-24 10:09:35 -07:00
  • fbc8572859 add llava to runner jmorganca 2024-05-23 18:22:15 -07:00
  • 87af27dac0 fix output in build_hipblas.sh jmorganca 2024-05-20 16:43:53 -07:00
  • 54f391309f mods to build_hipblas.sh for linux jmorganca 2024-05-20 16:15:16 -07:00
  • 28bedcd807 wip jmorganca 2024-05-20 15:27:10 -07:00
  • 922d0acbdb improve cuda and hipblas build scripts jmorganca 2024-05-20 16:17:13 -04:00
  • b22d78720e cuda linux jmorganca 2024-05-19 23:11:30 -07:00
  • 905568a47f Update README.md Jeffrey Morgan 2024-05-19 16:47:50 -07:00
  • a15ac52fbe Update README.md Jeffrey Morgan 2024-05-19 16:47:19 -07:00
  • 9547aa53ff disable log file jmorganca 2024-05-19 16:36:32 -07:00
  • e29205ad6d fix readme for llava jmorganca 2024-05-19 16:33:37 -07:00
  • a8f91d3cc1 add llava jmorganca 2024-05-19 16:30:11 -07:00
  • a9884ae136 llama: add clip dependencies jmorganca 2024-05-19 14:06:46 -07:00
  • e37651cca0 add clip and parallel requests to the todo list jmorganca 2024-05-19 14:01:52 -07:00
  • 593d6836ab fix cuda build jmorganca 2024-05-19 03:34:24 -04:00
  • 533a7e7d50 fix build on windows jmorganca 2024-05-19 03:19:41 -04:00
  • 0873d28b16 fix ggml-metal.m build constraints jmorganca 2024-05-19 00:10:15 -07:00
  • bb795faa6c fix ggml-metal.m jmorganca 2024-05-19 00:06:26 -07:00
  • e86db9381a avx2 should only add avx2 jmorganca 2024-05-18 23:53:29 -07:00
  • 4a5633e4bc fix sync script jmorganca 2024-05-18 23:50:50 -07:00
  • 86f453252b fix ggml-metal.m jmorganca 2024-05-18 23:34:58 -07:00
  • dfd8f34806 fix ggml-metal.m jmorganca 2024-05-18 23:31:41 -07:00
  • beb847b40f add license headers jmorganca 2024-05-18 23:30:28 -07:00
  • 785f76d390 pre-patch jmorganca 2024-05-18 23:27:01 -07:00
  • 9fe48978a8 move runner package down jmorganca 2024-05-18 23:15:51 -07:00
  • 01ccbc07fe replace static build in llm jmorganca 2024-05-18 22:22:46 -07:00
  • ec09be97e8 fix build jmorganca 2024-05-18 21:23:53 -07:00
  • 6129f30479 wip... jmorganca 2024-05-16 13:52:38 -07:00
  • eb1aa97961 rename server to runner jmorganca 2024-05-19 00:13:30 -04:00
  • 5e921e06ac Update README.md Jeffrey Morgan 2024-05-18 19:50:23 -07:00
  • 02089baf70 Update README.md Jeffrey Morgan 2024-05-18 19:49:43 -07:00
  • 870e91be76 Update README.md Jeffrey Morgan 2024-05-18 19:47:19 -07:00
  • 7ecc8e86c4 Update README.md Jeffrey Morgan 2024-05-18 19:46:44 -07:00
  • b1696e308e Add missing hipcc flags jmorganca 2024-05-18 23:07:19 -04:00
  • c646115b31 fix .gitattributes jmorganca 2024-05-18 22:39:41 -04:00
  • 0110994d06 Initial llama Go module jmorganca 2024-04-20 20:44:01 -04:00
  • 2ef3a217d1 add sync of llama.cpp jmorganca 2024-04-20 18:08:09 -04:00
  • 5e2653f9fe
    llm: update llama.cpp commit to 8962422 (#6618) Jeffrey Morgan 2024-09-03 21:12:39 -04:00
  • f29b167e1a
    Use cuda v11 for driver 525 and older (#6620) Daniel Hiltgen 2024-09-03 17:15:31 -07:00
  • 037a4d103e
    Log system memory at info (#6617) Daniel Hiltgen 2024-09-03 14:55:20 -07:00
  • 50c05d57e0
    readme: add Painting Droid community integration (#5514) Mateusz Migas 2024-09-03 22:15:54 +02:00
  • 35159de18a
    readme: update Ollama4j link and add link to Ollama4j Web UI (#6608) Amith Koujalgi 2024-09-04 01:38:50 +05:30
  • 94fff5805f
    Fix sprintf to snprintf (#5664) FellowTraveler 2024-09-03 11:32:59 -05:00
  • 14d5093cd0
    readme: add PartCAD tool to readme for generating 3D CAD models using Ollama (#6605) OpenVMP 2024-09-03 09:28:01 -07:00
  • 9df5f0e8e4
    Reduce docker image size (#5847) R0CKSTAR 2024-09-04 00:25:31 +08:00
  • ad3eb00bee
    readme: add OllamaFarm project (#6508) presbrey 2024-09-02 16:05:36 -04:00
  • bfc2d61549
    readme: add go-crew and Ollamaclient projects (#6583) Jonathan Hecl 2024-09-02 16:34:26 -03:00
  • 741affdfd6
    docs: update faq.md for OLLAMA_MODELS env var permissions (#6587) SnoopyTlion 2024-09-03 03:31:29 +08:00
  • 5f7b4a5e30
    fix(cmd): show info may have nil ModelInfo (#6579) Vimal Kumar 2024-09-01 09:42:17 +05:30
  • 1aad838707
    docs: update GGUF examples and references (#6577) rayfiyo 2024-09-01 11:34:25 +09:00
  • a1cef4d0a5
    Add findutils to base images (#6581) v0.3.9 Daniel Hiltgen 2024-08-31 10:40:05 -07:00
  • c41f0b9e6c
    Merge pull request #6562 from ollama/mxyng/build-artifacts Michael Yang 2024-08-30 09:40:50 -07:00
  • 142cbb722d
    Merge pull request #6482 from ollama/mxyng/client-path Michael Yang 2024-08-30 09:40:34 -07:00
  • 9468c6824a
    Merge pull request #6534 from ollama/mxyng/messages Michael Yang 2024-08-30 09:39:59 -07:00
  • faf1a6ac5a update push to use model.Name mxyng/modelname-7 Michael Yang 2024-05-08 17:34:54 -07:00
  • 11018196e0 remove any unneeded build artifacts Michael Yang 2024-08-29 13:40:43 -07:00
  • 56346ccfa3
    doc: Add Nix and Flox to package manager listing (#6074) Bryan Honof 2024-08-29 18:45:35 +02:00
  • 8e4e509fa4
    update the openai docs to explain how to set the context size (#6548) Patrick Devine 2024-08-28 17:11:46 -07:00
  • 6de85f5c00 slog gin logging mxyng/gin-slog Michael Yang 2024-02-08 11:05:16 -08:00
  • dc08a27d54 remove merges jyan/convert-cmdr Josh Yan 2024-08-28 16:01:13 -07:00
  • cf8af774ab renaming Josh Yan 2024-08-28 15:44:21 -07:00
  • c41bbb45bd linter Josh Yan 2024-08-28 15:40:36 -07:00
  • d073220b65 rebased Josh Yan 2024-08-28 15:39:24 -07:00
  • 47c2b947a9
    Merge pull request #6546 from ollama/mxyng/fix-test Michael Yang 2024-08-28 15:37:47 -07:00
  • 745706c765 refactor layer pruning mxyng/modelname-6 Michael Yang 2024-08-28 13:13:02 -07:00
  • 5eb77bf976
    Merge pull request #6539 from ollama/mxyng/validate-modelpath Michael Yang 2024-08-28 14:38:27 -07:00
  • e4d0a9c325 fix(test): do not clobber models directory Michael Yang 2024-08-28 14:07:48 -07:00
  • 7416ced70f
    add llama3.1 chat template (#6545) Patrick Devine 2024-08-28 14:03:20 -07:00
  • 6761aca1e1 update pull handler to use model.Name mxyng/modelname-5 Michael Yang 2024-08-28 13:06:41 -07:00
  • 3e24edd9ed update push to use model.Name Michael Yang 2024-05-08 17:34:54 -07:00
  • 0e1ec461f9 import jyan/convert-prog Josh Yan 2024-08-28 11:18:23 -07:00
  • 52ef79bb7d last lint (hopefully) Josh Yan 2024-08-28 11:12:39 -07:00
  • 800edd7884 lint again Josh Yan 2024-08-28 11:10:03 -07:00
  • 01b20fe6f1 lint Josh Yan 2024-08-28 11:07:43 -07:00
  • 9cfd2dd3e3
    Merge pull request #6522 from ollama/mxyng/detect-chat Michael Yang 2024-08-28 11:04:18 -07:00
  • 340162fbc3 convert progress Josh Yan 2024-08-28 10:54:52 -07:00
  • 4da5d5beaa lint jyan/quant5 Josh Yan 2024-08-28 10:23:41 -07:00
  • cc17b02b23 update Josh Yan 2024-08-28 09:58:23 -07:00
  • 8e6da3cbc5 update deprecated warnings Michael Yang 2024-08-27 17:57:34 -07:00
  • d9d50c43cc validate model path Michael Yang 2024-08-27 17:56:04 -07:00
  • 6c1c1ad6a9
    throw an error when encountering unsupport tensor sizes (#6538) Patrick Devine 2024-08-27 17:54:04 -07:00