Skip to content
View lzzk's full-sized avatar

Block or report lzzk

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 89,351 13,638 Updated Mar 26, 2026

删库

9,733 1,687 Updated Oct 20, 2025

Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.

Python 18,090 2,908 Updated Mar 26, 2026

Awesome-LLM: a curated list of Large Language Model

26,553 2,405 Updated Jul 31, 2025

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 22,067 2,695 Updated Jan 23, 2026

Making large AI models cheaper, faster and more accessible

Python 41,377 4,524 Updated Mar 16, 2026

Simple UI for LLM Model Finetuning

Jupyter Notebook 2,060 130 Updated Dec 21, 2023

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Python 32,200 6,671 Updated Sep 30, 2025

Reading list for research topics in multimodal machine learning

6,843 897 Updated Aug 20, 2024

中英文敏感词、语言检测、中外手机/电话归属地/运营商查询、名字推断性别、手机号抽取、身份证抽取、邮箱抽取、中日文人名库、中文缩写库、拆字词典、词汇情感值、停用词、反动词表、暴恐词表、繁简体转换、英文模拟中文发音、汪峰歌词生成器、职业名称词库、同义词库、反义词库、否定词库、汽车品牌词库、汽车零件词库、连续英文切割、各种中文词向量、公司名字大全、古诗词库、IT词库、财经词库、成语词库、地名词库、…

Python 79,644 15,143 Updated May 10, 2024

A machine translation reading list maintained by Tsinghua Natural Language Processing Group

TeX 2,443 440 Updated Aug 9, 2024

Unsupervised text tokenizer for Neural Network-based text generation.

C++ 11,718 1,332 Updated Mar 26, 2026

Open Source Neural Machine Translation and (Large) Language Models in PyTorch

Python 7,000 2,252 Updated Oct 14, 2025

CoreNLP: A Java suite of core NLP tools for tokenization, sentence segmentation, NER, parsing, coreference, sentiment analysis, etc.

Java 10,062 2,718 Updated Feb 10, 2026

Implementation of Reinforcement Learning Algorithms. Python, OpenAI Gym, Tensorflow. Exercises and Solutions to accompany Sutton's Book and David Silver's course.

Jupyter Notebook 21,932 6,153 Updated Jul 13, 2023

A curated list of speech and natural language processing resources

2,227 291 Updated Apr 2, 2019

NanGe - A Rule-based Chinese-English Machine Translation System

C++ 20 8 Updated Jul 23, 2017

The "Python Machine Learning (1st edition)" book code repository and info resource

Jupyter Notebook 12,602 4,396 Updated Nov 20, 2024