test ci #3

dhiltgen · 2024-09-11T20:35:20Z

No description provided.

This adds back a check, lost many releases ago, that verifies /dev/kfd permissions. When those permissions are lacking, the result can be a confusing failure such as: "rocBLAS error: Could not initialize Tensile host: No devices found". This implementation does not hard-fail the serve command; instead it falls back to CPU and logs an error. In the future we can include this in the GPU discovery UX to show devices that were detected but are unsupported.
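As a rough illustration of the kind of check described above, the following Go sketch tries to open the device node read-write and reports a permission problem instead of aborting. The function name `checkDeviceAccess` and the messages are hypothetical and not taken from the ollama source; only the `/dev/kfd` path and the fall-back-to-CPU behaviour come from the commit message.

```go
package main

import (
	"errors"
	"fmt"
	"io/fs"
	"os"
)

// checkDeviceAccess (hypothetical helper) reports whether the device node at
// path can be opened read-write by the current user.
func checkDeviceAccess(path string) error {
	f, err := os.OpenFile(path, os.O_RDWR, 0)
	if err != nil {
		if errors.Is(err, fs.ErrPermission) {
			return fmt.Errorf("permission denied opening %s: %w", path, err)
		}
		return fmt.Errorf("cannot open %s: %w", path, err)
	}
	return f.Close()
}

func main() {
	// Assumption: a failed check is logged and the caller continues on CPU
	// rather than exiting, mirroring the behaviour the commit describes.
	if err := checkDeviceAccess("/dev/kfd"); err != nil {
		fmt.Fprintln(os.Stderr, "AMD GPU unavailable, falling back to CPU:", err)
		return
	}
	fmt.Println("/dev/kfd is accessible")
}
```

Distinguishing `fs.ErrPermission` from other errors lets the log say why the device was skipped, which is the troubleshooting gap the commit aims to close.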

* Optimize container images for startup

  This change adjusts how to handle runner payloads to support container builds where we keep them extracted in the filesystem. This makes it easier to optimize the cpu/cuda vs cpu/rocm images for size, and should result in faster startup times for container images.
* Refactor payload logic and add buildx support for faster builds
* Move payloads around
* Review comments
* Converge to buildx based helper scripts
* Use docker buildx action for release

rick-github and others added 30 commits September 6, 2024 01:16

openai: fix "presence_penalty" typo and add test (ollama#6665)

fe91d7f

Improve logging on GPU too small (ollama#6666)

56318fb

When we determine a GPU is too small for any layers, it's not always clear why. This will help troubleshoot those scenarios.

readme: add Plasmoid Ollama Control to community integrations (ollama#6681)

5446903

readme: add Archyve to community integrations (ollama#6680)

8a027bc

openai: don't scale temperature or frequency_penalty (ollama#6514)

da91534

docs: improve linux install documentation (ollama#6683)

108fb6c

Includes small improvements to document layout and code blocks

openai: align chat temperature and frequency_penalty options with completion (ollama#6688)

06d4fba

readme: add crewAI with mesop to community integrations

30c8f20

readme: add crewAI to community integrations (ollama#6699)

bb6a086

catch when model vocab size is set correctly (ollama#6714)

84b84ce

Quiet down Docker's new lint warnings (ollama#6716)

4a8069f

* Quiet down Docker's new lint warnings

  Docker has recently added lint warnings to build. This cleans up those warnings.
* Fix go lint regression

docs: update examples to use llama3.1 (ollama#6718)

83a9b52

add *_proxy for debugging

dddb72e

Merge pull request ollama#6732 from ollama/mxyng/debug-proxy

735a0ca

add *_proxy to env map for debugging

readme: add QodeAssist to community integrations (ollama#6754)

7d69008

refactor show output

ecab6f1

fixes line wrapping on long texts

Merge pull request ollama#6762 from ollama/mxyng/show-output

0343926

refactor show output

add "stop" command (ollama#6739)

abed273

runner: Flush pending responses before returning

93ac376

If there are any pending responses (such as from potential stop tokens), we should send them back before ending the sequence. Otherwise, tokens can be missing from the end of a response. Fixes ollama#6707
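To make the failure mode concrete, here is a minimal Go sketch of a generator that holds back text that might be the start of a stop sequence and flushes whatever is still held when the sequence ends. It is an illustration under assumptions, not the runner's actual code: the names `generate` and `overlap` are hypothetical, and matching is done on bytes, so it assumes ASCII stop sequences.

```go
package main

import (
	"fmt"
	"strings"
)

// overlap returns the length of the longest suffix of s that is a proper
// prefix of stop, i.e. text that could still grow into the stop sequence.
func overlap(s, stop string) int {
	max := len(stop) - 1
	if len(s) < max {
		max = len(s)
	}
	for n := max; n > 0; n-- {
		if strings.HasSuffix(s, stop[:n]) {
			return n
		}
	}
	return 0
}

// generate emits pieces as they arrive, holding back any tail that might be
// the beginning of stop. When the input ends without a stop match, the held
// text is flushed so no tokens are lost.
func generate(pieces []string, stop string) []string {
	var out []string
	pending := ""
	for _, p := range pieces {
		pending += p
		if stop != "" {
			if i := strings.Index(pending, stop); i >= 0 {
				// Full stop sequence seen: emit the text before it and end.
				if i > 0 {
					out = append(out, pending[:i])
				}
				return out
			}
		}
		hold := overlap(pending, stop)
		if emit := pending[:len(pending)-hold]; emit != "" {
			out = append(out, emit)
		}
		pending = pending[len(pending)-hold:]
	}
	// Flush pending text before returning; skipping this step drops the tail.
	if pending != "" {
		out = append(out, pending)
	}
	return out
}

func main() {
	fmt.Printf("%q\n", generate([]string{"Hello", " wor", "ld<"}, "<end>"))
}
```

In the `main` example the trailing `<` is held back as a possible start of `<end>`; without the final flush it would never reach the caller, which is the symptom described in the commit.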

Merge pull request ollama#6767 from ollama/jessegross/bug_6707

c354e87

runner: Flush pending responses before returning

readme: add ollama_moe to community integrations (ollama#6752)

5a00dc9

examples: polish loganalyzer example (ollama#6744)

d066d9b

examples: updated requirements.txt for privategpt example

fef257c

Use GOARCH for build dirs (ollama#6779)

fda0d3b

Corrects x86_64 vs amd64 discrepancy

Fix incremental builds on linux (ollama#6780)

56b9af3

scripts: fix incremental builds on linux or similar

readme: add Obsidian Quiz Generator plugin to community integrations (ollama#6789)

d889c6f

readme: add vim-intelligence-bridge to Terminal section (ollama#6818)

b330c83

fix typo in import docs (ollama#6828)

d81cfd7

TESTING

2efce80

dhiltgen force-pushed the dumm_ci branch from e987658 to 2efce80 on September 16, 2024 22:56
