Solutions Architect and AI developer
-
naranjositos.tech
- Santiago
-
10:47
(UTC -04:00) - https://huggingface.co/xaskasdf
- https://naranjositos.tech/
Pinned Loading
-
ntransformer
ntransformer PublicHigh-efficiency LLM inference engine in C++/CUDA. Run Llama 70B on RTX 3090.
-
gpu-nvme-direct
gpu-nvme-direct PublicGPU-initiated NVMe I/O via PCIe BAR MMIO — CUDA kernels directly issue NVMe commands, eliminating CPU from the storage data path
-
brandon-tiny
brandon-tiny PublicUltra-small instruction-following language models (10M-110M params) that run on a PlayStation 2. BLiMP 73.3%, HellaSwag 32.4% at just 10.7M parameters.
Python 2
-
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.