Skip to content

ORT内存复用,HTTP Server自动减容 #14

@lona-cn

Description

@lona-cn
  • 在使用ORT推理时,需要对Ort::Value的使用进行优化
  • HTTP Server应该支持/infer/unload来手动卸载模型以及提供/infer/stats查询运行状态,并且内部设置定时器自动unload模型

Metadata

Metadata

Assignees

No one assigned

    Labels

    bugSomething isn't workingenhancementNew feature or request

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions