Provides a GGUF model loader and parser to facilitate reading llm model params from dist to memory.
It's modified from llama.cpp with tag:b4721, we removed irrelevant code and keep the interface simple.
- Run and pass the gguf test
- Simplify the code and remove the ggml calculation code
- Simplify the inferface