gokunwu
  • Beijing

Starred repositories

Python 4,628 373 Updated Mar 16, 2026

🚀 [Large models] Train a 67M-parameter vision-language model (VLM) from scratch in just 1 hour!

Python 7,273 796 Updated Apr 4, 2026

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 89,935 13,744 Updated Apr 1, 2026

Simple RL training for reasoning

Python 3,846 289 Updated Dec 23, 2025

Download web video and audio

C# 5,429 249 Updated Apr 4, 2026

A list of AI autonomous agents

27,044 2,620 Updated Feb 26, 2025

[ICML 2025] Predictive Data Selection: The Data That Predicts Is the Data That Teaches

Python 63 9 Updated Mar 4, 2025

No fortress, purely open ground. OpenManus is Coming.

Python 55,616 9,700 Updated Feb 11, 2026

Paragraph-by-paragraph close reading of classic and new deep learning papers

32,819 2,781 Updated Mar 22, 2025

Fully open reproduction of DeepSeek-R1

Python 25,964 2,408 Updated Apr 2, 2026

Awesome-LLM: a curated list of Large Language Models

26,587 2,415 Updated Jul 31, 2025

The Most Comprehensive Survey of Video Quality Assessment to Date.

96 2 Updated Dec 24, 2024

DataDreamer: Prompt. Generate Synthetic Data. Train & Align Models.   🤖💤

Python 1,107 60 Updated Feb 2, 2025

Making large AI models cheaper, faster and more accessible

Python 41,371 4,520 Updated Mar 30, 2026

LongLLaMA is a large language model capable of handling long contexts. It is based on OpenLLaMA and fine-tuned with the Focused Transformer (FoT) method.

Python 1,464 85 Updated Nov 7, 2023

Implementation of plug-and-play attention from "LongNet: Scaling Transformers to 1,000,000,000 Tokens"

Python 716 60 Updated Jan 7, 2024

Awesome Pretrained Chinese NLP Models: a collection of high-quality Chinese pretrained models, large models, multimodal models, and large language models

Python 5,542 510 Updated Mar 22, 2026

A 13B large language model developed by Baichuan Intelligent Technology

Python 2,939 235 Updated Sep 6, 2023

Example models using DeepSpeed

Python 1 Updated Apr 12, 2023

CCF ADL 2019 slides for knowledge graph fusion

140 29 Updated Oct 17, 2020

⏰ Agentically track worldwide conference deadlines (website, Python CLI, WeChat applet)

Rust 8,832 583 Updated Apr 2, 2026

A collection of research and survey papers of real-time bidding (RTB) based display advertising techniques.

3,690 939 Updated Dec 20, 2024

Papers on Computational Advertising

Python 4,379 1,185 Updated Feb 9, 2021

LeetCode Solutions: A Record of My Problem Solving Journey.

JavaScript 55,799 9,435 Updated Jul 16, 2025

OneFlow is a deep learning framework designed to be user-friendly, scalable and efficient.

C++ 9,391 1,015 Updated Dec 4, 2025

GPT-3: Language Models are Few-Shot Learners

15,750 2,264 Updated Sep 18, 2020

Similarity search engine built around Faiss library

Python 78 9 Updated Dec 8, 2022