Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Support Rerank models and API #214

Closed
pengjiang80 opened this issue Sep 4, 2024 · 3 comments
Closed

Support Rerank models and API #214

pengjiang80 opened this issue Sep 4, 2024 · 3 comments
Labels
enhancement New feature or request
Milestone

Comments

@pengjiang80
Copy link

Rerank support is a very common requirement in RAG scenarios, we should support Rerank models and API asap.

@pengjiang80 pengjiang80 added the enhancement New feature or request label Sep 4, 2024
@gitlawr gitlawr added this to the 0.4.0 milestone Sep 29, 2024
@pengjiang80
Copy link
Author

Refer to ggerganov/llama.cpp#9510. As llama.cpp already has initial support for reranking, we may support this with llama-box first.

@Finenyaco
Copy link
Contributor

Finenyaco commented Oct 14, 2024

GPUStack version: main(72d9d5f)

When integrating with Dify, using the port of the GPUStack server results in an error, but integration is successful when using the port of the reranker model instance directly.

image

@Finenyaco
Copy link
Contributor

Verified on main de9a368

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

3 participants