BUG: fix loading multiple gguf parts #1987

qinxuye · 2024-07-31T11:53:20Z

#1075 introduced multiple gguf parts, in that PR, the gguf parts were merged into one file, but now llama.cpp will fail to load this file, instead, the llama.cpp could recognize the whole files with given the first part, thus, in this PR, we just symlink the gguf file to the first part, then when loading, we tell llama.cpp the realpath.

Fixes #1912

BUG: fix loading multiple gguf parts

7be6ca9

XprobeBot added the bug Something isn't working label Jul 31, 2024

XprobeBot added this to the v0.14.0 milestone Jul 31, 2024

qinxuye mentioned this pull request Jul 31, 2024

llama.cpp引擎加载qwen2-72b会报错 #1912

Closed

3 tasks

amumu96 approved these changes Aug 2, 2024

View reviewed changes

qinxuye merged commit 32ee89b into xorbitsai:main Aug 2, 2024

qinxuye deleted the bug/gguf-parts branch August 2, 2024 04:10

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

BUG: fix loading multiple gguf parts #1987

BUG: fix loading multiple gguf parts #1987

Uh oh!

qinxuye commented Jul 31, 2024 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

BUG: fix loading multiple gguf parts #1987

BUG: fix loading multiple gguf parts #1987

Uh oh!

Conversation

qinxuye commented Jul 31, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

qinxuye commented Jul 31, 2024 •

edited

Loading