Commit Graph

  • 48685c6ed0
    Document uninstall on windows (#6663) Daniel Hiltgen 2024-09-05 15:57:38 -07:00
  • 9565fa64a8
    Revert "Detect running in a container (#6495)" (#6662) Daniel Hiltgen 2024-09-05 14:26:00 -07:00
  • 6719097649
    llm: make load time stall duration configurable via OLLAMA_LOAD_TIMEOUT Daniel Hiltgen 2024-09-05 14:00:08 -07:00
  • b05c9e83d9
    Introduce GPU Overhead env var (#5922) Daniel Hiltgen 2024-09-05 13:46:35 -07:00
  • a60d9b89ce
    Detect running in a container (#6495) Daniel Hiltgen 2024-09-05 13:24:51 -07:00
  • bf612cd608
    Merge pull request #6260 from ollama/mxyng/mem Michael Yang 2024-09-05 13:22:08 -07:00
  • ef98e56122
    readme: add AiLama to the list of community integrations (#4957) Zeyo 2024-09-06 01:40:44 +05:30
  • 5f944baac7
    Update gpu.md: Add RTX 3050 Ti and RTX 3050 Ti (#5888) Michael 2024-09-05 12:24:26 -06:00
  • 6fc9d22707
    server: fix blob download when receiving a 200 response (#6656) Tobias Heinze 2024-09-05 19:48:26 +02:00
  • f27c00d8c5
    readme: add Gentoo package manager entry to community integrations (#5714) Vitaly Zdanevich 2024-09-05 20:58:14 +04:00
  • c7c845ec52
    Update install.sh:Replace "command -v" with encapsulated functionality (#6035) 王卿 2024-09-06 00:49:48 +08:00
  • cf48603943
    readme: include Enchanted for Apple Vision Pro (#4949) Augustinas Malinauskas 2024-09-05 06:30:19 +01:00
  • 6e67be09b6
    readme: add lsp-ai to community integrations (#5063) Silas Marvin 2024-09-04 22:17:34 -07:00
  • 0f5f060d2b
    readme: add ollama-php library to community integrations (#6361) Arda Günsüren 2024-09-05 08:01:14 +03:00
  • db8c944498 fix gemma2-2b conversion pdevine/geems-2b Patrick Devine 2024-09-04 16:59:23 -07:00
  • b3554778bd
    readme: add vnc-lm discord bot community integration (#6644) jk011ru 2024-09-04 19:46:02 -04:00
  • bbe7b96ded
    llm: use json.hpp from common (#6642) Pascal Patry 2024-09-04 19:34:42 -04:00
  • c18ff18b2c
    readme: add confichat to community integrations (#6378) Rune Berg 2024-09-05 09:26:02 +12:00
  • 133770a548
    docs: add group to manual Linux isntructions and verify service is running (#6430) Tomoya Fujita 2024-09-04 11:45:09 -07:00
  • f36ebfb478
    readme: add gollm to the list of community libraries (#6099) Teïlo M 2024-09-04 20:19:41 +02:00
  • 5b55379651
    readme: add Cherry Studio to community integrations (#6633) 亢奋猫 2024-09-04 22:53:36 +08:00
  • 93eb43d020
    readme: add Go fun package (#6421) Mitar 2024-09-04 16:52:46 +02:00
  • 369479cc30
    docs: fix spelling error (#6391) Carter 2024-09-04 09:42:33 -04:00
  • 7d89e48f5c
    install.sh: update instructions to use WSL2 (#6450) Erkin Alp Güney 2024-09-04 16:34:53 +03:00
  • 27bcce6d9f
    readme: add claude-dev to community integrations (#6630) Sam 2024-09-04 23:32:26 +10:00
  • 491fc312ae
    readme: add PyOllaMx project (#6624) Viz 2024-09-03 23:10:53 -04:00
  • f443dd7b81 llama: sync llama.cpp to commit 8962422 jmorganca/llama-bump jmorganca 2024-09-03 21:23:46 -04:00
  • 8db94469e0 runner.go: Support GGUF LoRAs Jesse Gross 2024-08-28 17:12:06 -07:00
  • c989321509 runner.go: Don't cast a Go handle to a C void * Jesse Gross 2024-08-28 21:07:16 -07:00
  • e4a091bafd runner.go: Support resource usage command line options Jesse Gross 2024-08-28 09:29:09 -07:00
  • fd4ecd1ff5 llama: fix sync script ggml-metal_darwin_arm64.m filename (#6610) Jeffrey Morgan 2024-09-03 14:01:52 -04:00
  • 9d8129b8bb llama: delete unused files (#6523) Jeffrey Morgan 2024-08-29 17:30:11 -07:00
  • c8a1741d9b runner.go: Update TODOs Jesse Gross 2024-08-27 17:05:23 -07:00
  • 46a7c682f2 runner.go: Fix embeddings endpoint Jesse Gross 2024-08-27 13:59:33 -07:00
  • 52e88ab7b3 runner.go: Health endpoint comments Jesse Gross 2024-08-27 13:17:04 -07:00
  • 4ca8579428 runner.go: Cleanups Jesse Gross 2024-08-27 13:19:46 -07:00
  • d022cfc9e6 runner.go: Move pieces[] into sequence Jesse Gross 2024-08-27 10:24:33 -07:00
  • 6ccd0644e1 runner.go: Fix deadlock if a connection is closed during decoding Jesse Gross 2024-08-26 14:20:50 -07:00
  • 0b73cca386 runner.go: Fix resource leaks when removing sequences Jesse Gross 2024-08-26 14:26:48 -07:00
  • 55fb0633db runner.go: Separate KV cache and context sizes Jesse Gross 2024-08-23 17:27:09 -07:00
  • 53b600921e runner.go: Hold mutex for entire time when processing batch Jesse Gross 2024-08-23 16:28:38 -07:00
  • 8e1554c91d runner.go: Scale batches to be processed by numParallel Jesse Gross 2024-08-23 13:44:30 -07:00
  • f52d4b9879 Make new tokenizer logic conditional (#6395) Daniel Hiltgen 2024-08-24 17:25:37 -07:00
  • 76718ead40 runner.go: Support MinP parameter Jesse Gross 2024-08-21 16:13:54 -07:00
  • 90d25d3b0a runner.go: Check for incomplete UTF-8 character Jesse Gross 2024-08-15 13:07:28 -07:00
  • 477f529d26 runner.go: Implement RepeatLastN to penalize repeated tokens Jesse Gross 2024-08-20 11:21:19 -07:00
  • eccd4dd8d2 runner.go: Use correct JSON field names for runners Jesse Gross 2024-08-20 16:58:09 -07:00
  • 69cc5795a7 runner.go: Shift context window when KV cache space is exceeded Jesse Gross 2024-08-14 10:35:49 -07:00
  • 5a441d227a runner.go: Don't decode if nothing has been added to the batch Jesse Gross 2024-08-14 10:32:05 -07:00
  • 8aa97b5e83 llama.go: Advance though tokens when processing multiple batches Jesse Gross 2024-08-13 16:53:35 -07:00
  • 523d84c563 llama.go: Use dynamic buffer for TokenToPiece Jesse Gross 2024-08-19 17:54:57 -07:00
  • ed19fad862 llama.go: Make batch memory allocation match configuration Jesse Gross 2024-08-13 11:18:02 -07:00
  • 5d34320b7c runner.go: Fix off by one in batch size check Jesse Gross 2024-08-13 10:51:50 -07:00
  • 1c36f36c41 llm: Fix array out-of-bounds memory access when tokenizing Jesse Gross 2024-08-15 15:25:21 -07:00
  • 0c2f95f3de runner: Initialize numPredict Jesse Gross 2024-08-13 10:38:03 -07:00
  • ebdf781397 server: Fix double free on runner subprocess error. Jesse Gross 2024-08-16 14:46:33 -07:00
  • 23c7c1326e llm: Fix lint Jesse Gross 2024-08-15 09:50:42 -07:00
  • 8fe30d161c Fix filename for non darwin arm builds Daniel Hiltgen 2024-08-13 14:33:11 -07:00
  • a483a4c4ed lint jmorganca 2024-08-13 11:24:46 -07:00
  • b267ab92b0 Add missing vendor headers to ggml sync Daniel Hiltgen 2024-08-09 16:45:18 -07:00
  • 189ca38f1d Wire up native source file dependencies Daniel Hiltgen 2024-08-05 08:56:47 -07:00
  • 80db43b7b4 Bump llama sync to 1e6f65 Daniel Hiltgen 2024-08-06 16:50:34 -07:00
  • 47b0e81219 fix dolphin-mistral Daniel Hiltgen 2024-08-01 14:47:00 -07:00
  • 21947d5c1b harden integration tests Daniel Hiltgen 2024-08-01 14:41:23 -07:00
  • 751009a5d7 Runtime selection of new or old runners Daniel Hiltgen 2024-08-01 08:54:44 -07:00
  • 8527028bf4 Implement timings response in Go server Daniel Hiltgen 2024-07-29 14:09:55 -07:00
  • e0241118d0 Get embeddings working Daniel Hiltgen 2024-07-31 11:08:09 -07:00
  • f97ee8c506 Fix parallel requests Daniel Hiltgen 2024-07-31 15:02:58 -07:00
  • e9dd656ff5 Update sync with latest llama.cpp layout, and run against b3485 Daniel Hiltgen 2024-07-29 16:21:09 -07:00
  • 6c0d892498 Prefix all build artifacts with an OS/ARCH dir Daniel Hiltgen 2024-06-24 09:23:34 -07:00
  • 13348e3629 Get linux building Daniel Hiltgen 2024-06-23 12:07:41 -07:00
  • 3d5a08c315 add note in readme jmorganca 2024-06-21 16:22:27 -04:00
  • a29851bc9b clean up metal code jmorganca 2024-06-15 10:06:36 -07:00
  • 8dda9293fa fix Makefile on windows jmorganca 2024-06-20 21:52:10 -04:00
  • b3c62dcafd remove printing jmorganca 2024-06-13 18:41:12 -07:00
  • 9b8b7cd9b5 dont apply license to stb_image.h and json.hpp jmorganca 2024-06-13 14:35:11 -07:00
  • 1da6c40f4f lint jmorganca 2024-06-13 14:21:55 -07:00
  • 76ca2de06e update sync header jmorganca 2024-06-13 14:12:23 -07:00
  • 0eabc2e34d remove unused script jmorganca 2024-06-13 14:07:05 -07:00
  • dded27dcfa fix metal jmorganca 2024-06-12 12:18:40 -07:00
  • 080b600865 add header to not edit jmorganca 2024-06-12 11:40:13 -07:00
  • d6b6de9a5a add header to not edit jmorganca 2024-06-12 11:38:42 -07:00
  • 24a741424f fix build on windows jmorganca 2024-06-12 02:47:12 -04:00
  • 4d476d894e fix Makefile jmorganca 2024-06-11 23:18:07 -07:00
  • bd94ddfc56 fix README.md jmorganca 2024-06-11 22:54:45 -07:00
  • f1f54c5bd5 fix README.md jmorganca 2024-06-11 22:54:31 -07:00
  • 18662d1180 consistent whitespace jmorganca 2024-06-11 22:50:10 -07:00
  • 3d1f3569cf update .gitattributes jmorganca 2024-06-11 22:48:06 -07:00
  • 083a9e9b4e link metal jmorganca 2024-06-11 22:46:14 -07:00
  • d0703eaf44 wip jmorganca 2024-06-11 18:53:48 -07:00
  • ce00e387c3 wip meta jmorganca 2024-06-11 11:12:00 -07:00
  • 763d7b601c sync jmorganca 2024-06-10 17:23:09 -07:00
  • 4d0e6c55b0 remove perl docs jmorganca 2024-06-10 09:26:19 -07:00
  • 3375b82c56 remove build scripts jmorganca 2024-06-10 02:56:37 -04:00
  • b8c1065ab6 remove need for perl jmorganca 2024-06-10 00:04:21 -04:00
  • a632a04426 fix output jmorganca 2024-06-09 23:53:40 -04:00
  • 110f37ffb0 arch build jmorganca 2024-06-09 20:19:11 -07:00
  • f2f03ff7f2 add temporary makefile jmorganca 2024-06-09 22:33:31 -04:00
  • ba0ff1c46a fix cuda and rocm builds jmorganca 2024-06-09 19:49:22 -04:00
  • 9966a055e5 fix cgo flags for darwin amd64 jmorganca 2024-06-09 14:30:41 -07:00