Pinned
- CS336-LLM_from_scratch-assginment1 (Public): A from-scratch implementation of a Transformer LM, trained on TinyStories. Extensions: KV-cache decoding optimization, plus a small-scale reproduction and ablation study of the Attention Residuals architecture.
Python 5