Ldpe2G
45 results for forked starred repositories

A tiny demo of interfacing CUDA via nanobind with a pytorch tensor

Cuda 7 Updated Dec 24, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 71 2 Updated Oct 28, 2024

Agents of C.L.I.

TypeScript 144 19 Updated Sep 10, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 113 50 Updated Dec 24, 2025

Fast and memory-efficient exact attention

Python 204 69 Updated Dec 24, 2025

Fast and memory-efficient exact attention ported to rocm

Python 12 1 Updated Dec 1, 2023

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Python 247 75 Updated Dec 23, 2025

Development repository for the Triton language and compiler

Python 138 37 Updated Dec 23, 2025

A rule-based tunnel in Go.

Go 506 491 Updated Oct 2, 2024

Llama 2 Everywhere (L2E)

C 1,523 45 Updated Aug 27, 2025

Instruct-tune LLaMA on consumer hardware

Jupyter Notebook 3 1 Updated Mar 31, 2023

Llama 2 inference in one file of pure Go

Python 109 13 Updated Jul 29, 2023

Inference Llama 2 in one file of pure Rust 🦀

Python 235 6 Updated Sep 11, 2023

Inference Llama 2 in one file of pure C

Python 43 3 Updated Jul 27, 2023

Inference Llama 2 in one file of pure C++

Python 86 14 Updated Aug 4, 2023

Ongoing research training transformer language models at scale, including: BERT & GPT-2

Python 2,208 365 Updated Aug 14, 2025

Deploy über-JARs. Restart processes. (port of codahale/assembly-sbt)

Scala 1,961 224 Updated Sep 29, 2025

An unofficial cuda assembler, for all generations of SASS, hopefully :)

Python 84 10 Updated Mar 20, 2023

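📚 A summary of basic C/C++ technical interview knowledge, covering the language, libraries, data structures, algorithms, systems, networking, and linking and loading of libraries, plus interview experience, recruitment, and referral information. This repository is a summary of the basic knowledge of recruiting job seekers and beginners in the direction of C/C++ technology, in…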

C++ 14 6 Updated Aug 7, 2022

Public repo for the GANFT video

Python 102 25 Updated May 5, 2022

Code + Playground Colab for the paper "Language Models are Unsupervised Multitask Learners"

Jupyter Notebook 107 45 Updated Oct 7, 2025

My blog~

HTML 1 Updated Nov 27, 2022

YOLOv4 / Scaled-YOLOv4 / YOLO - Neural Networks for Object Detection (Windows and Linux version of Darknet)

C 22,195 7,948 Updated Dec 15, 2025

Simple Scala interface for Graphviz

Scala 2 Updated Jun 22, 2019

StyleGAN Encoder - converts real images to latent space

Jupyter Notebook 750 180 Updated Dec 1, 2022
Jupyter Notebook 65 28 Updated Apr 2, 2019

COCO API - Dataset @ http://cocodataset.org/

C++ 54 26 Updated Sep 10, 2024

Matrix Shadow: Lightweight CPU/GPU Matrix and Tensor Template Library in C++/CUDA for (Deep) Machine Learning

C++ 33 11 Updated Nov 1, 2016

Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Julia, Scala, Go, Javascript and more

C++ 12 9 Updated Aug 13, 2020

Bounding Box Regression with Uncertainty for Accurate Object Detection (CVPR'19)

Python 366 34 Updated May 2, 2024