Starred repositories

[ICLR 2026] Official PyTorch implementation of The Unseen Frontier: Pushing the Limits of LLM Sparsity with Surrogate-Free ADMM

Python 12 3 Updated Feb 21, 2026

Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"

Python 13,594 909 Updated Dec 17, 2024

[ICLR 2024] Dynamic Sparse Training with Structured Sparsity

Jupyter Notebook 25 7 Updated Apr 12, 2024

RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable). We are at RWKV-7 "Goose". So it's combining the best of RN…

Python 14,563 1,007 Updated Jun 13, 2026
Python 4,524 491 Updated Apr 22, 2026

Code for Winning the Lottery Ahead of Time: Efficient Early Network Pruning (ICML 2022)

Python 31 3 Updated Nov 15, 2023

Introduction to Parallel Programming class code

Cuda 1,353 1,145 Updated Jun 27, 2022

Material for gpu-mode lectures

Jupyter Notebook 6,179 623 Updated Jun 15, 2026

🧠 Train a 64M-parameter LLM completely from scratch in just 2 hours!

Python 51,778 6,652 Updated Jun 1, 2026

List of AI Residency Programs

3,296 270 Updated Apr 4, 2025

This repository is a curated collection of resources, tutorials, and practical examples designed to guide you through the journey of mastering CUDA programming. Whether you're just starting or look…

453 45 Updated Feb 22, 2025

InferX: Inference as a Service Platform

Rust 217 25 Updated Jun 15, 2026

Pruna is a model optimization framework built for developers, enabling you to deliver faster, more efficient models with minimal overhead.

Python 1,215 91 Updated Jun 15, 2026

The fastai deep learning library

Jupyter Notebook 28,033 7,660 Updated May 20, 2026
Python 5 Updated Jun 6, 2025

Clinical NLP Shared Task @ NAACL'24

Python 45 12 Updated Aug 20, 2025

DSPy: The framework for programming—not prompting—language models

Python 35,048 2,978 Updated Jun 11, 2026

TextGrad: Automatic "Differentiation" via Text, using large language models to backpropagate textual gradients. Published in Nature.

Python 3,607 294 Updated Jul 25, 2025

You like pytorch? You like micrograd? You love tinygrad! ❤️

Python 33,093 4,185 Updated Jun 15, 2026

[NeurIPS'22] EHRSQL: A Practical Text-to-SQL Benchmark for Electronic Health Records

Python 109 17 Updated Apr 28, 2026

Build reliable customer-facing AI agents with Parlant: an interaction control harness optimized for controlled, consistent, and predictable LLM interactions.

Python 18,116 1,535 Updated Jun 14, 2026

Research code artifacts for Code World Model (CWM) including inference tools, reproducibility, and documentation.

Python 883 71 Updated Jun 11, 2026

[ACL 2025 Industry Track, Oral] Sentiment Reasoning for Healthcare

Jupyter Notebook 164 22 Updated Jan 5, 2026

An associative memory system that stores and retrieves experiences using the 5W1H framework (Who, What, When, Where, Why, How) and content-addressable memory.

Python 173 28 Updated Sep 15, 2025

🤖 Chat with your SQL database 📊. Accurate Text-to-SQL Generation via LLMs using Agentic Retrieval 🔄.

Python 23,622 2,427 Updated Feb 2, 2026

🚀 The fast, Pythonic way to build MCP servers and clients.

Python 25,642 2,067 Updated Jun 6, 2026

A framework for prompt tuning using Intent-based Prompt Calibration

Python 2,987 265 Updated Dec 2, 2025

Build Real-Time Knowledge Graphs for AI Agents

Python 27,465 2,746 Updated Jun 15, 2026

Code for Retrieval augmented text-to-SQL generation for epidemiological question answering using electronic health records

Python 24 7 Updated May 15, 2024