gokunwu

😀

Focusing

WuKun gokunwu

I Do NLP & ML

17 followers · 57 following

Beijing

Starred repositories

38 stars written in Python

deepseek-ai / DeepSeek-V3

Python 100,185 16,322 Updated Aug 28, 2025

d2l-ai / d2l-zh

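Dive into Deep Learning: written for Chinese readers, runnable, and open to discussion. The Chinese and English editions are used for teaching at more than 500 universities in over 70 countries.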

Python 73,690 11,976 Updated Jul 30, 2024

josephmisiti / awesome-machine-learning

A curated list of awesome Machine Learning frameworks, libraries and software.

Python 70,502 15,149 Updated Oct 28, 2025

scikit-learn / scikit-learn

scikit-learn: machine learning in Python

Python 63,940 26,412 Updated Nov 6, 2025

FoundationAgents / OpenManus

No fortress, purely open ground. OpenManus is Coming.

Python 50,709 8,852 Updated Nov 3, 2025

hpcaitech / ColossalAI

Making large AI models cheaper, faster and more accessible

Python 41,223 4,538 Updated Nov 7, 2025

huggingface / open-r1

Fully open reproduction of DeepSeek-R1

Python 25,617 2,400 Updated Sep 8, 2025

MorvanZhou / tutorials

Machine learning tutorials

Python 12,666 5,716 Updated Dec 22, 2020

rushter / MLAlgorithms

Minimal and clean examples of machine learning algorithms implementations

Python 10,921 1,774 Updated Jun 15, 2025

lauris / awesome-scala

A community driven list of useful Scala libraries, frameworks and software.

Python 9,167 1,268 Updated Sep 20, 2024

clips / pattern

Web mining module for Python, with tools for scraping, natural language processing, machine learning, network analysis and visualization.

Python 8,841 1,580 Updated Jun 10, 2024

numenta / nupic-legacy

Numenta Platform for Intelligent Computing is an implementation of Hierarchical Temporal Memory (HTM), a theory of intelligence based strictly on the neuroscience of the neocortex.

Python 6,352 1,550 Updated Dec 3, 2024

zihangdai / xlnet

XLNet: Generalized Autoregressive Pretraining for Language Understanding

Python 6,177 1,167 Updated May 28, 2023

lonePatient / awesome-pretrained-chinese-nlp-models

Awesome Pretrained Chinese NLP Models: a collection of high-quality Chinese pretrained models, large models, multimodal models, and large language models

Python 5,449 509 Updated Oct 25, 2025

jingyaogong / minimind-v

🚀 Train a 26M-parameter vision-language model (VLM) from scratch in just 1 hour!

Python 5,227 551 Updated Oct 30, 2025

wzhe06 / Ad-papers

Papers on Computational Advertising

Python 4,356 1,195 Updated Feb 9, 2021

hkust-nlp / simpleRL-reason

Simple RL training for reasoning

Python 3,783 279 Updated Aug 3, 2025

rlcode / reinforcement-learning

Minimal and Clean Reinforcement Learning Examples

Python 3,596 741 Updated Mar 24, 2023

baichuan-inc / Baichuan-13B

A 13B large language model developed by Baichuan Intelligent Technology

Python 2,959 237 Updated Sep 6, 2023

tensorflow / lingvo

Lingvo

Python 2,852 452 Updated Oct 29, 2025

jiesutd / NCRFpp

NCRF++, a Neural Sequence Labeling Toolkit. Easy to apply to any sequence labeling task (e.g. NER, POS, Segmentation). It includes character LSTM/CNN, word LSTM/CNN and softmax/CRF components.

Python 1,897 443 Updated Jun 30, 2022

CStanKonrad / long_llama

LongLLaMA is a large language model capable of handling long contexts. It is based on OpenLLaMA and fine-tuned with the Focused Transformer (FoT) method.

Python 1,462 85 Updated Nov 7, 2023

andersbll / deeppy

Deep learning in Python

Python 1,380 303 Updated Dec 28, 2020

fendouai / Awesome-TensorFlow-Chinese

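Awesome-TensorFlow-Chinese: a curated selection of Chinese-language TensorFlow resources, including the official website, installation guides, introductory tutorials, video tutorials, hands-on projects, and learning paths. QQ group: 167122861; WeChat official account: 磐创AI; WeChat group QR code: http://www.tensorflownews.com/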

Python 1,372 329 Updated Jan 30, 2021

pbloem / former

Simple transformer implementation from scratch in pytorch. (archival, latest version on codeberg)

Python 1,093 171 Updated Mar 20, 2025

datadreamer-dev / DataDreamer

DataDreamer: Prompt. Generate Synthetic Data. Train & Align Models. 🤖💤

Python 1,074 55 Updated Feb 2, 2025

kyegomez / LongNet

Implementation of plug-and-play Attention from "LongNet: Scaling Transformers to 1,000,000,000 Tokens"

Python 712 62 Updated Jan 7, 2024

attardi / deepnl

Deep Learning for Natural Language Processing

Python 463 116 Updated Jan 17, 2019

erickrf / nlpnet

A neural network architecture for NLP tasks, using cython for fast performance. Currently, it can perform POS tagging, SRL and dependency parsing.

Python 408 104 Updated Nov 19, 2021

UKPLab / elmo-bilstm-cnn-crf

BiLSTM-CNN-CRF architecture for sequence tagging using ELMo representations.

Python 388 80 Updated Nov 21, 2022

Starred topics

Natural language processing

Machine learning

Deep learning

llama