-
SEU
-
16:59
(UTC -12:00)
-
-
risc0 Public
Forked from risc0/risc0RISC Zero is a zero-knowledge verifiable general computing platform based on zk-STARKs and the RISC-V microarchitecture.
C++ Apache License 2.0 UpdatedOct 16, 2025 -
KernelBench Public
Forked from ScalingIntelligence/KernelBenchKernelBench: Can LLMs Write GPU Kernels? - Benchmark with Torch -> CUDA problems
Python Other UpdatedAug 29, 2025 -
assignment1-basics Public
Forked from stanford-cs336/assignment1-basicsStudent version of Assignment 1 for Stanford CS336 - Language Modeling From Scratch
Python MIT License UpdatedAug 29, 2025 -
-
FastDeploy Public
Forked from PaddlePaddle/FastDeployLarge Language Model Deployment Toolkit
Python Apache License 2.0 UpdatedAug 6, 2025 -
flux Public
Forked from bytedance/fluxA fast communication-overlapping library for tensor/expert parallelism on GPUs.
C++ Apache License 2.0 UpdatedJul 30, 2025 -
Athena_torch Public
Forked from hxzd5568/Athena_torchA torch model extract tool which is helpful in building the torch unit test files.
Python UpdatedJul 15, 2025 -
-
-
vllm Public
Forked from vllm-project/vllmA high-throughput and memory-efficient inference and serving engine for LLMs
-
-
-
How_to_optimize_in_GPU Public
Forked from Liu-xiandong/How_to_optimize_in_GPUThis is a series of GPU optimization topics. Here we will introduce how to optimize the CUDA kernel in detail. I will introduce several basic kernel optimizations, including: elementwise, reduce, s…
Cuda Apache License 2.0 UpdatedApr 4, 2025 -
A_Share_investment_Agent Public
Forked from 24mlight/A_Share_investment_AgentPython MIT License UpdatedFeb 9, 2025 -
CUDA-Learn-Notes Public
Forked from xlite-dev/LeetCUDA📚200+ Tensor/CUDA Cores Kernels, ⚡️flash-attn-mma, ⚡️hgemm with WMMA, MMA and CuTe (98%~100% TFLOPS of cuBLAS/FA2 🎉🎉).
Cuda GNU General Public License v3.0 UpdatedFeb 7, 2025 -
-
pintos Public
Forked from PKU-OS/pintosThe pintos source distribution for PKU Operating System Course projects
C UpdatedNov 14, 2024 -
CUDATutorial Public
Forked from RussWong/CUDATutorialA CUDA tutorial to make people learn CUDA program from 0
Cuda UpdatedJul 9, 2024 -
-
-
plonk-intro-notebook Public
Forked from coset-io/plonk-intro-notebookJupyter Notebook UpdatedJun 10, 2024 -
-
-
alpha-llm Public
Dig the alpha and signals from Chinese stock market
MIT License UpdatedMar 24, 2024 -
-
-
-
-
icicle Public
Forked from ingonyama-zk/iciclea GPU Library for Zero-Knowledge Acceleration
C MIT License UpdatedDec 6, 2023