Commit Graph

  • f84cc9939c allow ollama.com to call inference and info endpoints brucemacd/allow-ollama Bruce MacDonald 2024-08-09 09:31:09 -07:00
  • f7e3b9190f
    cmd: spinner progress for transfer model data (#6100) Josh 2024-08-12 11:46:32 -07:00
  • 980dd15f81
    cmd: speed up gguf creates (#6324) Josh 2024-08-12 11:46:09 -07:00
  • 781585d9bd return 204 for cross-origin OPTIONS royh/ep-methods Roy Han 2024-07-10 12:52:34 -07:00
  • b84a54be05 return 405 for bad method Roy Han 2024-07-10 11:23:48 -07:00
  • 01d544d373
    OpenAI: Simplify input output in testing (#5858) royjhan 2024-08-12 13:33:34 -04:00
  • 1dc3ef3aa9
    Revert "server: speed up single gguf creates (#5898)" (#6323) Josh 2024-08-12 09:57:51 -07:00
  • 8aac22438e
    server: speed up single gguf creates (#5898) Josh 2024-08-12 09:28:55 -07:00
  • 15c2d8fe14
    server: parallelize embeddings in API web handler instead of in subprocess runner (#6220) v0.3.5 Jeffrey Morgan 2024-08-11 11:57:10 -07:00
  • 25906d72d1
    llm: prevent loading too large models on windows (#5926) Daniel Hiltgen 2024-08-11 11:30:20 -07:00
  • 023451ce47
    add integration obook-summary (#6305) CognitiveTech 2024-08-10 21:43:08 -04:00
  • 9b53e39d8e
    Merge pull request #6258 from coolljt0725/fix_typo Jesse Gross 2024-08-09 17:19:48 -07:00
  • 97fae2df95
    Merge pull request #6235 from Nicholas42/fix_line_endings Michael Yang 2024-08-09 17:06:30 -07:00
  • 160d9d4900
    Merge pull request #6171 from ollama/mxyng/remove-temp Michael Yang 2024-08-09 15:47:13 -07:00
  • d4e6407464 Restrict text files with explicit line feeds to *.go. Nicholas Schwab 2024-08-09 23:14:13 +02:00
  • b7f7d8cd15
    Merge pull request #6291 from dhiltgen/no_sparse_fail Daniel Hiltgen 2024-08-09 12:30:25 -07:00
  • 2fa1db4345 Don't hard fail on sparse setup error Daniel Hiltgen 2024-08-09 11:57:48 -07:00
  • 71b0945fc6
    Merge pull request #6290 from dhiltgen/intel_npe Daniel Hiltgen 2024-08-09 12:14:42 -07:00
  • 30823ec925 update readme royh/whisper Roy Han 2024-08-09 11:32:27 -07:00
  • 5bca2e60a7 Harden intel boostrap for nil pointers Daniel Hiltgen 2024-08-09 11:31:38 -07:00
  • 89f3bae306 cli Roy Han 2024-08-09 11:04:26 -07:00
  • 67472e0e89
    Also flag *.icns as binary Nicholas42 2024-08-09 13:41:20 +02:00
  • e9aa5117c4
    Merge pull request #6133 from dhiltgen/cuda_repo Daniel Hiltgen 2024-08-08 12:33:35 -07:00
  • 2473bdba5e
    Merge pull request #6182 from dhiltgen/more_patterns Daniel Hiltgen 2024-08-08 12:33:17 -07:00
  • 2003d60159 llama3.1 memory Michael Yang 2024-08-08 11:18:13 -07:00
  • 7d1c0047fa
    Merge pull request #6247 from ollama/jessegross/layers Jesse Gross 2024-08-08 10:46:43 -07:00
  • 7b61eba471 server/download.go: Fix a typo in log Jitang Lei 2024-08-08 20:28:01 +08:00
  • 7edaf6e7e8 manifest: Store layers inside manifests consistently as values. Jesse Gross 2024-08-07 14:22:17 -07:00
  • 97ec8cfd4e image: Clarify argument to WriteManifest is config Jesse Gross 2024-08-07 11:44:25 -07:00
  • 5b3a21b578
    add metrics to docs (#6079) royjhan 2024-08-07 17:43:44 -04:00
  • ad0c19dde4
    Use llama3.1 in tools example (#5985) Kyle Kelley 2024-08-07 14:20:50 -07:00
  • 2fe945412a get api models mxyng/api-models Michael Yang 2024-08-07 11:43:44 -07:00
  • ad7e822883 audio processing error prop Roy Han 2024-08-07 14:05:22 -07:00
  • d503f04b32 expiration Roy Han 2024-08-07 13:01:04 -07:00
  • 8ccf543c53 chat doc Roy Han 2024-08-06 17:31:51 -07:00
  • 75ad6309b4 chat support Roy Han 2024-08-06 16:42:02 -07:00
  • a5181a8c51 error handling Roy Han 2024-08-06 11:45:13 -07:00
  • 2a9feb0707 model flexibility Roy Han 2024-08-06 10:53:29 -07:00
  • e4d35198a2 transcribe Roy Han 2024-08-05 14:16:32 -07:00
  • 17f9dc6d08 save whisper port Roy Han 2024-08-05 13:19:01 -07:00
  • 97d9dffa80 err check Roy Han 2024-08-02 16:57:34 -07:00
  • 65483180b9 working poc Roy Han 2024-08-02 16:54:28 -07:00
  • 1ac92eae7c submodule Roy Han 2024-07-31 16:23:17 -07:00
  • 69eb06c40e
    Merge pull request #6145 from ollama/jessegross/bug5840 Jesse Gross 2024-08-07 11:24:15 -07:00
  • 1829fb61bd manifest: Fix crash on startup when trying to clean up unused files (#5840) Jesse Gross 2024-08-05 17:13:52 -07:00
  • ce67706037 Set *.png and *.ico to be treated as binary files. Nicholas Schwab 2024-08-07 18:15:17 +02:00
  • 69207b4987 fix memory mxyng/fix-memory Michael Yang 2024-08-06 19:06:40 -07:00
  • 685a53534b manifest: Don't prune layers if we can't open a manifest file Jesse Gross 2024-08-01 15:05:16 -07:00
  • de4fc29773
    llm: reserve required number of slots for embeddings (#6219) v0.3.4 Jeffrey Morgan 2024-08-06 23:20:49 -04:00
  • e04c7012c2
    update llama.cpp submodule to 1e6f6554 (#6208) Jeffrey Morgan 2024-08-06 15:11:45 -04:00
  • d4a7216c82
    Fixed invalid option provided not displaying the invalid option name problem. (#6202) Chua Chee Seng 2024-08-07 02:37:16 +08:00
  • a4fdd03c3b
    Merge pull request #6207 from dhiltgen/sparse_win Daniel Hiltgen 2024-08-06 11:06:06 -07:00
  • fc85f50a2b Ensure sparse files on windows during download Daniel Hiltgen 2024-08-06 10:46:31 -07:00
  • 86b907f82a
    sort batch results (#6189) royjhan 2024-08-05 19:55:34 -04:00
  • 10d49bce70
    Merge pull request #6190 from ollama/mxyng/fix-integration Michael Yang 2024-08-05 16:45:49 -07:00
  • 7ed367419e fix concurrency test Michael Yang 2024-08-05 16:34:54 -07:00
  • 50ee8b5f56
    Merge pull request #6186 from dhiltgen/numa Daniel Hiltgen 2024-08-05 15:20:06 -07:00
  • 03bdac0595
    Merge pull request #6146 from ollama/mxyng/testing Michael Yang 2024-08-05 13:00:05 -07:00
  • f457d63400 Implement linux NUMA detection Daniel Hiltgen 2024-08-05 12:56:20 -07:00
  • 04210aa6dd Catch one more error log Daniel Hiltgen 2024-08-05 09:28:07 -07:00
  • 43f9d92008 close pid file Michael Yang 2024-08-05 00:34:09 -07:00
  • ed6c8bfe57 removeall to remove non-empty temp dirs Michael Yang 2024-08-05 00:02:47 -07:00
  • 39f2bc6bfc
    Merge pull request #6167 from ollama/mxyng/line-feed Michael Yang 2024-08-05 00:06:28 -07:00
  • b73b0940ef
    Disable paging for journalctl (#6154) frob 2024-08-05 06:10:53 +02:00
  • 6a07344786 line feed Michael Yang 2024-08-04 17:25:33 -07:00
  • 8b920f35a4
    Add Gemma 2 2b (#6151) sryu1 2024-08-05 00:58:39 +10:00
  • 4221e39867
    Reference ollama integration with Harbor (#6147) Ivan Charapanau 2024-08-03 02:03:46 +02:00
  • a091fadfda use testing tempdirs Michael Yang 2024-08-02 15:55:34 -07:00
  • 77ccbf04dc
    Merge pull request #6128 from ollama/mxyng/lint Michael Yang 2024-08-02 14:58:40 -07:00
  • 4addf6b587
    Update OpenAI Compatibility Docs with /v1/completions (#5311) royjhan 2024-08-02 16:16:23 -04:00
  • 85c7f11170
    Update docs (#5310) royjhan 2024-08-02 16:05:57 -04:00
  • df3802a65f Adjust arm cuda repo paths Daniel Hiltgen 2024-08-01 17:22:25 -07:00
  • b732beba6a lint Michael Yang 2024-08-01 14:52:15 -07:00
  • ce1fb4447e
    Fix models/{model} URL (#6132) v0.3.3 Kim Hallberg 2024-08-02 01:31:47 +02:00
  • 558a54b098
    Update OpenAI Compatibility Docs with /v1/embeddings (#5470) royjhan 2024-08-01 19:00:29 -04:00
  • ed52833bb1
    Add to docs (#5309) royjhan 2024-08-01 18:58:13 -04:00
  • e584f14e78
    Merge pull request #6123 from dhiltgen/go_server_unified Daniel Hiltgen 2024-08-01 15:51:51 -07:00
  • 6f133a0bdd
    OpenAI: Add Usage to v1/embeddings (#5886) royjhan 2024-08-01 18:49:37 -04:00
  • f561eecfb8
    Update OpenAI Compatibility Docs with /v1/models (#5151) royjhan 2024-08-01 18:48:44 -04:00
  • ff7c9060ec
    Merge pull request #6115 from slouffka/fix-context Michael Yang 2024-08-01 15:13:59 -07:00
  • 3cc7ec4429 fix dolphin-mistral Daniel Hiltgen 2024-08-01 14:47:00 -07:00
  • 343aba9fca harden integration tests Daniel Hiltgen 2024-08-01 14:41:23 -07:00
  • 7b8ca2dc0f add q4_0_4_4/4_8/8_8 jyan/q4_4/8 Josh Yan 2024-08-01 14:31:36 -07:00
  • 0ff42e84b0
    Merge pull request #4756 from ollama/mxyng/convert2 Michael Yang 2024-08-01 14:16:30 -07:00
  • 8a9f946ca7 Refactor and format code. Vyacheslav Moskalev 2024-08-02 03:50:05 +07:00
  • 46c5f5fd9e Runtime selection of new or old runners Daniel Hiltgen 2024-08-01 08:54:44 -07:00
  • c1384c8bcc Implement timings response in Go server Daniel Hiltgen 2024-07-29 14:09:55 -07:00
  • d0e239b85f
    Merge pull request #6110 from dhiltgen/go_server_embeds Daniel Hiltgen 2024-08-01 07:59:35 -07:00
  • 3b5210548e Refactor code. Remove extra variable. Vyacheslav Moskalev 2024-08-01 19:56:15 +07:00
  • b0c216584c Better types and naming closer to style. Vyacheslav Moskalev 2024-08-01 19:43:44 +07:00
  • 49a5483139 Change the order of context and prompt. Vyacheslav Moskalev 2024-08-01 19:25:56 +07:00
  • 6bc5c13758 Fix extra context concatenation in generate handler (#5980). Vyacheslav Moskalev 2024-08-01 15:45:41 +07:00
  • 3e614260af
    Merge pull request #6109 from ollama/mxyng/fix-modelfile Michael Yang 2024-07-31 17:05:43 -07:00
  • b2f8a6120c Get embeddings working Daniel Hiltgen 2024-07-31 11:08:09 -07:00
  • d87b4a488e fix modelfile message quotes Michael Yang 2024-07-31 16:52:09 -07:00
  • 049f40e4e2
    Merge pull request #6107 from dhiltgen/go_server_parallel Daniel Hiltgen 2024-07-31 16:36:49 -07:00
  • 4c14855ad7
    Merge pull request #6106 from ollama/mxyng/default-sliding-window-attention v0.3.2 Michael Yang 2024-07-31 16:12:06 -07:00
  • dc77bbcfa4
    server: fix json marshalling of downloadBlobPart (#6108) Blake Mizerany 2024-07-31 16:01:24 -07:00
  • d8e2664c33 convert: fix parse functions Michael Yang 2024-07-31 15:39:11 -07:00
  • eafc607abb convert: only extract large files Michael Yang 2024-06-29 16:53:59 -07:00