Skip to content

Conversation

@qinxuye
Copy link
Contributor

@qinxuye qinxuye commented Jul 31, 2024

#1075 introduced multiple gguf parts, in that PR, the gguf parts were merged into one file, but now llama.cpp will fail to load this file, instead, the llama.cpp could recognize the whole files with given the first part, thus, in this PR, we just symlink the gguf file to the first part, then when loading, we tell llama.cpp the realpath.

Fixes #1912

@XprobeBot XprobeBot added the bug Something isn't working label Jul 31, 2024
@XprobeBot XprobeBot added this to the v0.14.0 milestone Jul 31, 2024
@qinxuye qinxuye merged commit 32ee89b into xorbitsai:main Aug 2, 2024
@qinxuye qinxuye deleted the bug/gguf-parts branch August 2, 2024 04:10
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

bug Something isn't working

Projects

None yet

Development

Successfully merging this pull request may close these issues.

llama.cpp引擎加载qwen2-72b会报错

3 participants