-
Notifications
You must be signed in to change notification settings - Fork 777
FEAT: Support QwQ-32B #3005
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
FEAT: Support QwQ-32B #3005
Conversation
|
About MLX version, you could refer to codes below to make it more compact. inference/xinference/model/llm/llm_family.json Lines 1446 to 1456 in aeb1ccd
|
Modified. |
|
All tests failed, seems some places in your jsons have wrong syntax. |
|
uh, I use vscode instead of vim to look for json formatting issues, maybe the encoding is different |
OK, could you resolve it? |
Solved. |
qinxuye
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
|
修改容器之后 commit 成新镜像了吗?不然不是自动还原了? |
docker exec进入容器,修改代码后,docker restart容器,这样不行吗? |
|
Only ['qwen1.5-chat', 'qwen1.5-moe-chat', 'qwen2-instruct', 'qwen2-moe-instruct', 'qwen2.5-instruct', 'qwen2.5-coder-instruct', 'glm4-chat', 'glm4-chat-1m', 'llama-3.1-instruct', 'deepseek-r1-distill-qwen', 'deepseek-r1-distill-llama'] support tool calls。 |
|
QwQ 32B启用函数调用,+1 |
开个新 issue 把。 |
No description provided.