I am interested in NLP, and Machine Learning for Healthcare. Currently, I am interested in training and serving LLM (Large Language Models).
-
KAIST
- Seoul, Republic of Korea
-
18:11
(UTC +09:00) - https://ljm565.github.io/
- in/jun-min-lee-189383264
Highlights
- Pro
Pinned Loading
-
tensorrtllm_backend_tutorial
tensorrtllm_backend_tutorial PublicForked from triton-inference-server/tensorrtllm_backend
The Triton TensorRT-LLM Backend
Python
-
universal-llm-trainer
universal-llm-trainer PublicUniversal LLM Trainer including LoRA, QLoRA, deepspeed, etc.
Python 3
-
deepseek-r1-local-serving
deepseek-r1-local-serving PublicHere, we provide a comprehensive guide on serving the 671B DeepSeek-R1 model on a local GPU setup from start to finish. All you need are 2 A100 GPUs to get started.
Dockerfile 2
-
-
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.