Skip to content
View nouniiefhizuf's full-sized avatar

Block or report nouniiefhizuf

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Python 1 Updated Feb 5, 2026

Qwen3-0.6B megakernel: 527 tok/s decode on RTX 3090 (3.8x faster than PyTorch)

Cuda 46 4 Updated Feb 2, 2026

12 Lessons to Get Started Building AI Agents

Jupyter Notebook 50,154 17,546 Updated Feb 6, 2026

All Algorithms implemented in Python

Python 217,501 50,020 Updated Feb 2, 2026

A tiny edit to nGPT and some custom kernels to speed it up

Python 2 Updated Apr 1, 2025

A minimal Agentic RAG built with LangGraph — learn Retrieval-Augmented Generation Agents in minutes.

Jupyter Notebook 1,997 274 Updated Jan 21, 2026

Ecosystem of libraries and tools for writing and executing fast GPU code fully in Rust.

Rust 5,045 224 Updated Jan 15, 2026

Karras et al. (2022) diffusion models for PyTorch

Jupyter Notebook 21 7 Updated May 25, 2024

PyTorch half precision gemm lib w/ fused optional bias + optional relu/gelu

Cuda 2 Updated Dec 2, 2024

Neighborhood Attention Extension. Bringing attention to a neighborhood near you!

Cuda 1 Updated Aug 5, 2024

In this project, I developed a live sketching functionlity using open-cv and created an app using streamlit

Python 1 Updated Dec 12, 2022
Jupyter Notebook 46 19 Updated Feb 3, 2026

This is a python implementation of the Direct Linear Transform for 3d coordinates to 2d image coordinates and vice versa

Python 1 Updated Apr 29, 2024

Learn the building blocks of how to build DeepSeek from scratch.

Jupyter Notebook 99 28 Updated Sep 22, 2025

In-depth tutorials on LLMs, RAGs and real-world AI agent applications.

Jupyter Notebook 27,998 4,577 Updated Jan 30, 2026

A Hands on series on developing LLM applications

68 12 Updated Sep 28, 2024

[CCS'24] A dataset consists of 15,140 ChatGPT prompts from Reddit, Discord, websites, and open-source datasets (including 1,405 jailbreak prompts).

Jupyter Notebook 2 Updated Jun 10, 2024

CUDA by Example, written by two senior members of the CUDA software platform team, shows programmers how to employ this new technology. The authors introduce each area of CUDA development through w…

C 475 148 Updated Jun 30, 2023
C++ 3 2 Updated Mar 8, 2025

100 days of building GPU kernels!

Cuda 569 64 Updated Apr 27, 2025

GPU Kernels

Cuda 220 20 Updated Apr 27, 2025

Learn the building blocks of how to build nano-kimi from scratch

Jupyter Notebook 7 1 Updated Nov 19, 2025

Jarvis is a voice-activated, conversational AI assistant powered by a local LLM (Qwen via Ollama). It listens for a wake word, processes spoken commands using a local language model with LangChain,…

Python 273 95 Updated Sep 8, 2025

21 Lessons, Get Started Building with Generative AI

Jupyter Notebook 106,150 56,890 Updated Feb 6, 2026

Kimi K2 is the large language model series developed by Moonshot AI team

10,303 777 Updated Jan 21, 2026
Next