Stars
A lightweight coding agent for open models like Deepseek, Kimi, and Qwen
SuperCLUE: 中文通用大模型综合性基准 | A Benchmark for Foundation Models in Chinese
Benchmarking large language models' complex reasoning ability with chain-of-thought prompting
📋 A list of open LLMs available for commercial use.
Alpaca dataset from Stanford, cleaned and curated
Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.
The simplest, fastest repository for training/finetuning medium-sized GPTs.
A minimalist revision of the miniKanren language with no additional features
LBL model for computing distributed representtion of words
Repo for counting stars and contributing. Press F to pay respect to glorious developers.
A library for efficient similarity search and clustering of dense vectors.
Library for fast text representation and classification.
Demo code for the paper STC2 which released three short text datasets for clustering and classification
The most cited deep learning papers
Working Theano implementation of Pixel RNN on MNIST.
Bi-directional Attention Flow (BiDAF) network is a multi-stage hierarchical process that represents context at different levels of granularity and uses a bi-directional attention flow mechanism to …
A C++ toolkit for neural machine translation for CPU
Recurrent Neural Network - A curated list of resources dedicated to RNN
📖 A curated list of resources dedicated to Natural Language Processing (NLP)
Theano was a Python library that allows you to define, optimize, and evaluate mathematical expressions involving multi-dimensional arrays efficiently. It is being continued as PyTensor: www.github.…
Deep Learning (Python, C, C++, Java, Scala, Go)
A Python implementation of Probabilistic Context-Free Grammar Parser.