Skip to content
View gokunwu's full-sized avatar
😀
Focusing
😀
Focusing
  • beijing

Block or report gokunwu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

🚀 「大模型」1小时从0训练26M参数的视觉多模态VLM!🌏 Train a 26M-parameter VLM from scratch in just 1 hours!

Python 5,208 546 Updated Oct 30, 2025

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 77,938 11,503 Updated Nov 3, 2025

Simple RL training for reasoning

Python 3,783 279 Updated Aug 3, 2025

Download web video and audio

C++ 4,139 191 Updated Oct 18, 2025

A list of AI autonomous agents

23,828 1,971 Updated Feb 26, 2025

[ICML 2025] Predictive Data Selection: The Data That Predicts Is the Data That Teaches

Python 56 8 Updated Mar 4, 2025

No fortress, purely open ground. OpenManus is Coming.

Python 50,668 8,845 Updated Nov 3, 2025

深度学习经典、新论文逐段精读

31,846 2,738 Updated Mar 22, 2025

Fully open reproduction of DeepSeek-R1

Python 25,613 2,401 Updated Sep 8, 2025

Awesome-LLM: a curated list of Large Language Model

25,467 2,165 Updated Jul 31, 2025

The Most Comprehensive Survey of Video Quality Assessment to Date.

87 2 Updated Dec 24, 2024

DataDreamer: Prompt. Generate Synthetic Data. Train & Align Models.   🤖💤

Python 1,073 55 Updated Feb 2, 2025

Making large AI models cheaper, faster and more accessible

Python 41,221 4,536 Updated Oct 13, 2025

LongLLaMA is a large language model capable of handling long contexts. It is based on OpenLLaMA and fine-tuned with the Focused Transformer (FoT) method.

Python 1,462 85 Updated Nov 7, 2023

Implementation of plug in and play Attention from "LongNet: Scaling Transformers to 1,000,000,000 Tokens"

Python 712 62 Updated Jan 7, 2024

Awesome Pretrained Chinese NLP Models,高质量中文预训练模型&大模型&多模态模型&大语言模型集合

Python 5,445 509 Updated Oct 25, 2025

A 13B large language model developed by Baichuan Intelligent Technology

Python 2,960 237 Updated Sep 6, 2023

Example models using DeepSpeed

Python 1 Updated Apr 12, 2023

CCF ADL 2019 slides for knowledge graph fusion

140 29 Updated Oct 17, 2020

⏰ Collaboratively track worldwide conference deadlines (Website, Python Cli, Wechat Applet) / If you find it useful, please star this project, thanks~

Rust 8,116 545 Updated Nov 5, 2025

A collection of research and survey papers of real-time bidding (RTB) based display advertising techniques.

3,674 946 Updated Dec 20, 2024

Papers on Computational Advertising

Python 4,355 1,195 Updated Feb 9, 2021

LeetCode Solutions: A Record of My Problem Solving Journey.( leetcode题解,记录自己的leetcode解题之路。)

JavaScript 55,696 9,479 Updated Jul 16, 2025

OneFlow is a deep learning framework designed to be user-friendly, scalable and efficient.

C++ 9,367 1,010 Updated Aug 20, 2025

GPT-3: Language Models are Few-Shot Learners

15,782 2,297 Updated Sep 18, 2020

Similarity search engine built around Faiss library

Python 78 9 Updated Dec 8, 2022

UNF(Universal NLP Framework)

Python 71 10 Updated Mar 6, 2020
Next