-
The University of Texas at Austin
- United States
- https://www.bagus.my.id
- https://orcid.org/0000-0002-8485-581X
Highlights
- Pro
-
SC26_power-dev Public
Forked from mlcommons/power-devDev repo for power measurement for the MLPerf™ benchmarks
Python Apache License 2.0 UpdatedApr 27, 2026 -
SC26_MLPerf_Inference_Loadgen Public
Forked from mlcommons/inferenceReference implementations of MLPerf® inference benchmarks
Python Apache License 2.0 UpdatedApr 27, 2026 -
SC26_NVLink_vs_PCIe Public
Artifact Description / Artifact Evaluation for SC26 Paper
Python MIT License UpdatedApr 27, 2026 -
SC26_TensorRT-LLM Public
Forked from NVIDIA/TensorRT-LLMTensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. Tensor…
C++ Apache License 2.0 UpdatedApr 27, 2026 -
SC25_AMD_CDNA3_Artifact Public
This is the Artifact Description and Artifact Evaluation (AD/AE) for Supercomputing 2025 Paper
Shell MIT License UpdatedApr 23, 2026 -
SC26_MLPerf_inference_results_v5.0 Public
Forked from mlcommons/inference_results_v5.0This repository contains the results and code for the MLPerf® Inference v5.0 benchmark.
HTML UpdatedFeb 12, 2026 -
tt-vllm Public
Forked from tenstorrent/vllmA high-throughput and memory-efficient inference and serving engine for LLMs
Python Apache License 2.0 UpdatedJan 12, 2026 -
tt-installer Public
Forked from tenstorrent/tt-installerInstall the tenstorrent stack with one command
Shell Apache License 2.0 UpdatedSep 15, 2025 -
GPU_Roofline_Tools Public
This is experimental repository to create roofline of GPU (NVIDIA, AMD, Intel) similar to Empirical Roofline Tool (ERT)
-
SC2025_RRZE_HPC_gpu-benches Public
Forked from RRZE-HPC/gpu-benchescollection of benchmarks to measure basic GPU capabilities
C++ GNU General Public License v3.0 UpdatedAug 25, 2025 -
SC2025_TransferBench Public
Forked from ROCm/TransferBenchTransferBench is a utility capable of benchmarking simultaneous copies between user-specified devices (CPUs/GPUs)
C++ MIT License UpdatedAug 25, 2025 -
SC2025_rocHPCG Public
Forked from ROCm/rocHPCGHPCG benchmark based on ROCm platform
C++ BSD 3-Clause "New" or "Revised" License UpdatedAug 25, 2025 -
SC2025_rocHPL Public
Forked from ROCm/rocHPLHigh Performance Linpack for Next-Generation AMD HPC Accelerators
C++ Other UpdatedAug 25, 2025 -
SC25_ROCM_GROMACS_2025.2 Public
ROCM Forks of GROMACS 2025.2 with modifications to support HEFFTE in HIP
C++ GNU Lesser General Public License v2.1 UpdatedAug 25, 2025 -
PerformanceProfiling Public
Tools/Configs/Scripts that I create for My Projects
Python MIT License UpdatedApr 13, 2025 -
-
QLoRA-Experiment Public
Experiment with QLoRA, LLM, and BitandBytes of HuggingFace.
-
-
unifios-utilities Public
Forked from unifi-utilities/unifi-commonA collection of enhancements for UnifiOS based devices
Shell GNU General Public License v3.0 UpdatedJan 8, 2025 -
-
-
ECE382N-GPU-Lab Public
GPU Lab for ECE382N Computer Performance Evaluation/Benchmark
Python MIT License UpdatedMar 6, 2024 -
unifi-ddns Public
Forked from willswire/unifi-ddnsCloudflare DDNS (Dynamic DNS) support for UniFi OS
JavaScript UpdatedFeb 29, 2024 -
MeZO Public
Forked from princeton-nlp/MeZO[NeurIPS 2023] MeZO: Fine-Tuning Language Models with Just Forward Passes. https://arxiv.org/abs/2305.17333
Python MIT License UpdatedJan 11, 2024 -
gnn Public
GNN Comparison using Torch Geometric
GNU General Public License v2.0 UpdatedFeb 27, 2023 -
CAGCN Public
Forked from yuwvandy/CAGCNThis repository includes the implementation of CAGCN and the experiments on link prediction/node classification
Python MIT License UpdatedFeb 23, 2023 -
-
pygcn Public
Forked from tkipf/pygcnGraph Convolutional Networks in PyTorch
Python MIT License UpdatedFeb 2, 2023 -
Megatron-DeepSpeed Public
Forked from deepspeedai/Megatron-DeepSpeedOngoing research training transformer language models at scale, including: BERT & GPT-2
Python Other UpdatedAug 7, 2022 -
LightGCN-PyTorch Public
Forked from gusye1234/LightGCN-PyTorchThe PyTorch implementation of LightGCN
Python UpdatedJun 29, 2022