Starred repositories
《动手学深度学习》:面向中文读者、能运行、可讨论。中英文版被70多个国家的500多所大学用于教学。
A curated list of awesome Machine Learning frameworks, libraries and software.
scikit-learn: machine learning in Python
No fortress, purely open ground. OpenManus is Coming.
Making large AI models cheaper, faster and more accessible
Fully open reproduction of DeepSeek-R1
Minimal and clean examples of machine learning algorithms implementations
A community driven list of useful Scala libraries, frameworks and software.
Web mining module for Python, with tools for scraping, natural language processing, machine learning, network analysis and visualization.
Numenta Platform for Intelligent Computing is an implementation of Hierarchical Temporal Memory (HTM), a theory of intelligence based strictly on the neuroscience of the neocortex.
XLNet: Generalized Autoregressive Pretraining for Language Understanding
Awesome Pretrained Chinese NLP Models,高质量中文预训练模型&大模型&多模态模型&大语言模型集合
🚀 「大模型」1小时从0训练26M参数的视觉多模态VLM!🌏 Train a 26M-parameter VLM from scratch in just 1 hours!
Minimal and Clean Reinforcement Learning Examples
A 13B large language model developed by Baichuan Intelligent Technology
NCRF++, a Neural Sequence Labeling Toolkit. Easy use to any sequence labeling tasks (e.g. NER, POS, Segmentation). It includes character LSTM/CNN, word LSTM/CNN and softmax/CRF components.
LongLLaMA is a large language model capable of handling long contexts. It is based on OpenLLaMA and fine-tuned with the Focused Transformer (FoT) method.
Awesome-TensorFlow-Chinese,TensorFlow 中文资源精选,官方网站,安装教程,入门教程,视频教程,实战项目,学习路径。QQ群:167122861,公众号:磐创AI,微信群二维码:http://www.tensorflownews.com/
Simple transformer implementation from scratch in pytorch. (archival, latest version on codeberg)
DataDreamer: Prompt. Generate Synthetic Data. Train & Align Models. 🤖💤
Implementation of plug in and play Attention from "LongNet: Scaling Transformers to 1,000,000,000 Tokens"
A neural network architecture for NLP tasks, using cython for fast performance. Currently, it can perform POS tagging, SRL and dependency parsing.
BiLSTM-CNN-CRF architecture for sequence tagging using ELMo representations.