Commit Graph

  • 73d69bc90b remove types Josh Yan 2024-08-12 09:47:05 -07:00
  • 9bc42f532b rmv api type Josh Yan 2024-08-12 09:45:44 -07:00
  • 07c0f66f5e rm print Josh Yan 2024-08-01 16:49:03 -07:00
  • 4a7bfca902 change progress msg Josh Yan 2024-07-31 10:57:25 -07:00
  • 04f2154505 fixed cgo Josh Yan 2024-07-31 10:52:11 -07:00
  • de9b21b472 quantize progress Josh Yan 2024-07-31 10:48:18 -07:00
  • 93ea9240ae
    Move ollama executable out of bin dir (#6535) v0.3.8 Daniel Hiltgen 2024-08-27 16:19:00 -07:00
  • 413ae39f3c update templates to use messages Michael Yang 2024-08-27 11:34:30 -07:00
  • 60e47573a6 more tokenizer tests Michael Yang 2024-08-27 11:11:53 -07:00
  • d13c3daa0b
    add safetensors to the modelfile docs (#6532) Patrick Devine 2024-08-27 14:46:47 -07:00
  • 1713eddcd0
    Fix import image width (#6528) Patrick Devine 2024-08-27 14:19:47 -07:00
  • 4e1c4f6e0b
    Update manual instructions with discrete ROCm bundle (#6445) Daniel Hiltgen 2024-08-27 13:42:28 -07:00
  • 397cae7962
    llm: fix typo in comment (#6530) Sean Khatiri 2024-08-27 16:28:29 -04:00
  • a6d30ecefe working causal attention paligemma-support Josh Yan 2024-08-27 11:34:32 -07:00
  • 1c70a00f71 adjust image sizes Patrick Devine 2024-08-27 11:15:25 -07:00
  • eae3af6807 clean up convert tokenizer Michael Yang 2024-08-27 10:45:39 -07:00
  • 3eb08377f8 detect chat template from configs that contain lists Michael Yang 2024-08-26 16:36:50 -07:00
  • 80eef7c7b1 changes Josh Yan 2024-08-27 10:47:13 -07:00
  • cb576a6b23 fix ref pdevine/import-docs Patrick Devine 2024-08-26 19:59:33 -07:00
  • ac80010db8
    update the import docs (#6104) Patrick Devine 2024-08-26 19:57:26 -07:00
  • 15b7ff3a89 more comments Patrick Devine 2024-08-26 19:56:45 -07:00
  • 3ad243466b comments Patrick Devine 2024-08-26 19:54:06 -07:00
  • 47fa0839b9
    server: clean up route names for consistency (#6524) Jeffrey Morgan 2024-08-26 19:36:11 -07:00
  • a13e583c49 cleanup whitespace Patrick Devine 2024-08-26 18:09:21 -07:00
  • 3c1994d0ee small change Patrick Devine 2024-07-31 15:15:33 -07:00
  • 1b2da3829d update the import docs Patrick Devine 2024-08-26 18:04:46 -07:00
  • 5a67f93eae fix tests jmorganca/openai-context jmorganca 2024-08-25 12:45:51 -07:00
  • dc04f41eb7 fix linter issues jmorganca 2024-08-25 12:41:37 -07:00
  • 9899f18e18 openai: increase context window when max_tokens is provided jmorganca 2024-08-25 12:31:47 -07:00
  • 0f92b19bec
    Only enable numa on CPUs (#6484) v0.3.7 Daniel Hiltgen 2024-08-24 17:24:50 -07:00
  • a33e56cddb uses input prompt Josh Yan 2024-08-23 16:29:59 -07:00
  • 69be940bf6
    gpu: Group GPU Library sets by variant (#6483) v0.3.7-rc6 Daniel Hiltgen 2024-08-23 15:11:56 -07:00
  • e6802df906 fixed patches, llava jyan/paligemma Josh Yan 2024-08-23 14:12:26 -07:00
  • 9638c24c58
    Merge pull request #5446 from ollama/mxyng/faq Michael Yang 2024-08-23 14:05:59 -07:00
  • bb362caf88 update faq Michael Yang 2024-07-02 15:02:07 -07:00
  • 386af6c1a0 passthrough OLLAMA_HOST path to client Michael Yang 2024-08-23 13:16:30 -07:00
  • c631633bce paligemma demo works Josh Yan 2024-08-23 13:18:26 -07:00
  • 7de230f005 paligemma patch Roy Han 2024-08-16 11:51:23 -07:00
  • a62817d677 demo jyan/p2 Josh Yan 2024-08-23 13:01:23 -07:00
  • 0c819e167b
    convert safetensor adapters into GGUF (#6327) Patrick Devine 2024-08-23 11:29:56 -07:00
  • 7a1e1c1caf
    gpu: Ensure driver version set before variant (#6480) Daniel Hiltgen 2024-08-23 11:21:12 -07:00
  • 0b03b9c32f
    llm: Align cmake define for cuda no peer copy (#6455) Daniel Hiltgen 2024-08-23 11:20:39 -07:00
  • 90ca84172c
    Fix embeddings memory corruption (#6467) Daniel Hiltgen 2024-08-22 14:51:42 -07:00
  • 30dd74930d mid Josh Yan 2024-08-21 16:03:15 -07:00
  • 6bd8a4b0a1
    Merge pull request #6064 from ollama/mxyng/convert-llama3 Michael Yang 2024-08-21 12:57:09 -07:00
  • 77903ab8b4 llama3.1 Michael Yang 2024-07-29 14:53:02 -07:00
  • e22286c9e1
    Merge pull request #5365 from ollama/mxyng/convert-gemma2 Michael Yang 2024-08-21 11:48:43 -07:00
  • 107f695929
    Merge pull request #4917 from ollama/mxyng/convert-bert Michael Yang 2024-08-21 11:48:29 -07:00
  • 4ecc70d3b4
    Merge pull request #6386 from zwwhdls/fix-new-layer Michael Yang 2024-08-21 10:58:45 -07:00
  • 3546bbd08c convert gemma2 Michael Yang 2024-06-28 13:27:05 -07:00
  • beb49eef65 create bert models from cli Michael Yang 2024-06-07 14:55:56 -07:00
  • 5a28b9cf5f bert Michael Yang 2024-06-06 08:59:04 -07:00
  • ce78e400c2 trying Josh Yan 2024-08-20 13:38:33 -07:00
  • edeea1d6f0 server jyan/palitest Josh Yan 2024-08-20 09:17:17 -07:00
  • a017cf2fea
    Split rocm back out of bundle (#6432) v0.3.7-rc5 Daniel Hiltgen 2024-08-20 07:26:38 -07:00
  • 19e5a890f7
    CI: remove directories from dist dir before upload step (#6429) v0.3.7-rc4 Daniel Hiltgen 2024-08-19 15:19:21 -07:00
  • f91c9e3709
    CI: handle directories during checksum (#6427) v0.3.7-rc3 Daniel Hiltgen 2024-08-19 13:48:45 -07:00
  • 2df6905ede
    Merge pull request #6424 from dhiltgen/cuda_v12 v0.3.7-rc2 Daniel Hiltgen 2024-08-19 12:11:58 -07:00
  • d8be22e47d Fix overlapping artifact name on CI Daniel Hiltgen 2024-08-19 12:07:18 -07:00
  • 652c273f0e
    Merge pull request #5049 from dhiltgen/cuda_v12 v0.3.7-rc1 Daniel Hiltgen 2024-08-19 11:14:24 -07:00
  • 88e7705079
    Merge pull request #6402 from rick-github/numParallel Daniel Hiltgen 2024-08-19 11:07:22 -07:00
  • f9e31da946 Review comments Daniel Hiltgen 2024-08-15 14:38:14 -07:00
  • 88bb9e3328 Adjust layout to bin+lib/ollama Daniel Hiltgen 2024-08-14 16:32:57 -07:00
  • 3b19cdba2a Remove Jetpack Daniel Hiltgen 2024-08-13 13:30:28 -07:00
  • 927d98a6cd Add windows cuda v12 + v11 support Daniel Hiltgen 2024-07-12 14:33:13 -07:00
  • f6c811b320 Enable cuda v12 flags Daniel Hiltgen 2024-07-12 11:35:41 -07:00
  • 4fe3a556fa Add cuda v12 variant and selection logic Daniel Hiltgen 2024-06-13 20:46:14 -07:00
  • fc3b4cda89 Report GPU variant in log Daniel Hiltgen 2024-06-19 09:36:30 -07:00
  • d470ebe78b Add Jetson cuda variants for arm Daniel Hiltgen 2024-05-30 21:54:07 -07:00
  • c7bcb00319 Wire up ccache and pigz in the docker based build Daniel Hiltgen 2024-08-09 07:21:40 -07:00
  • 74d45f0102 Refactor linux packaging Daniel Hiltgen 2024-07-08 12:50:11 -07:00
  • 9fddef3731
    server: limit upload parts to 16 (#6411) Jeffrey Morgan 2024-08-19 09:20:52 -07:00
  • 885cf45087 Fix white space. Richard Lyons 2024-08-18 03:07:16 +02:00
  • 9352eeb752 Reset NumCtx. Richard Lyons 2024-08-18 02:55:01 +02:00
  • 0ad0e738cd Override numParallel only if unset. Richard Lyons 2024-08-18 01:43:26 +02:00
  • 450400107b paligemma patch Roy Han 2024-08-16 11:51:23 -07:00
  • bdc4308afb fix: chmod new layer to 0o644 when creating it zwwhdls 2024-08-16 11:43:19 +08:00
  • d29cd4c2ed
    Merge pull request #6381 from eust-w/main Daniel Hiltgen 2024-08-15 15:31:15 -07:00
  • a84c05cf91 fix: Add tooltip to system tray icon eust-w 2024-08-16 06:00:12 +08:00
  • e3d7f32af7
    Merge pull request #6363 from ollama/mxyng/fix-noprune Michael Yang 2024-08-15 12:20:38 -07:00
  • 3a75e74e34 only skip invalid json manifests Michael Yang 2024-08-15 10:29:14 -07:00
  • 237dccba1e skip invalid manifest files Michael Yang 2024-08-14 16:36:07 -07:00
  • b3f75fc812 fix noprune Michael Yang 2024-08-14 14:37:51 -07:00
  • 8200c371ae
    add CONTRIBUTING.md (#6349) Jeffrey Morgan 2024-08-14 15:19:50 -07:00
  • e7254617e3 ......... bmizerany/embedspeedup Blake Mizerany 2024-08-14 01:22:23 -07:00
  • 0a8d6ea86d
    Fix typo and improve readability (#5964) longtao 2024-08-14 08:54:19 +08:00
  • 8e1050f366
    server: reduce max connections used in download (#6347) Blake Mizerany 2024-08-13 16:47:35 -07:00
  • eda8a32a09
    update chatml template format to latest in docs (#6344) Bruce MacDonald 2024-08-13 23:39:18 +00:00
  • a0a40aa20c
    Merge pull request #6346 from ollama/mxyng/lint Michael Yang 2024-08-13 14:58:35 -07:00
  • 2697d7f5aa lint Michael Yang 2024-08-13 13:40:37 -07:00
  • 1f32276178
    Update openai.md to remove extra checkbox (#6345) Pamela Fox 2024-08-13 13:36:05 -07:00
  • 4c4fe3f87f
    Merge pull request #6343 from dhiltgen/revert_win_go_version v0.3.6 Daniel Hiltgen 2024-08-13 11:53:49 -07:00
  • feedf49c71 Go back to a pinned Go version Daniel Hiltgen 2024-08-13 11:44:50 -07:00
  • 8b00a415ab
    Load Embedding Model on Empty Input (#6325) royjhan 2024-08-13 13:19:56 -04:00
  • 48de4b56c8 cleanup jmorganca/llama-vit jmorganca 2024-08-12 22:19:26 -07:00
  • cd776e49ad llama: wip vision support for runner jmorganca 2024-08-12 22:18:30 -07:00
  • 01b80e9ffc
    Merge pull request #5443 from ollama/mxyng/convert-phi3 Michael Yang 2024-08-12 15:47:58 -07:00
  • bd5e432630 update import.md Michael Yang 2024-08-05 10:30:32 -07:00
  • aec77d6a05 support new "longrope" attention factor Bruce MacDonald 2024-07-02 14:40:01 -07:00
  • 6ffb5cb017 add conversion for microsoft phi 3 mini/medium 4k, 128 Michael Yang 2024-06-03 15:53:58 -07:00