Skip to content
View bohanli's full-sized avatar

Highlights

  • Pro

Block or report bohanli

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Simple RL training for reasoning

Python 3,846 289 Updated Dec 23, 2025

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Python 8,933 605 Updated May 3, 2024

推荐/广告/搜索领域工业界经典以及最前沿论文集合。A collection of industry classics and cutting-edge papers in the field of recommendation/advertising/search.

Python 2,107 267 Updated Mar 25, 2026

LibRerank is a toolkit for re-ranking algorithms. There are a number of re-ranking algorithms, such as PRM, DLCM, GSF, miDNN, SetRank, EGRerank, Seq2Slate.

Python 269 47 Updated Feb 21, 2022

Controllable Multi-Objective Re-ranking with Policy Hypernetworks (KDD 2023)

Python 38 5 Updated Oct 6, 2024

An open-source tool-augmented conversational language model from Fudan University

Python 12,086 1,132 Updated Jul 13, 2024

From Chain-of-Thought prompting to OpenAI o1 and DeepSeek-R1 🍓

3,582 202 Updated May 7, 2025

Benchmarking large language models' complex reasoning ability with chain-of-thought prompting

Jupyter Notebook 2,770 143 Updated Aug 4, 2024

sparse word2vec

C++ 107 34 Updated Jul 7, 2022

4mc - splittable lz4 and zstd in hadoop/spark/flink

C 109 37 Updated Apr 21, 2023

Automatically exported from code.google.com/p/word2vec

C 1,584 546 Updated Feb 28, 2023

Pytorch implementation of "A Probabilistic Formulation of Unsupervised Text Style Transfer" by He. et. al. at ICLR 2020

Python 162 26 Updated Oct 19, 2022

Free online textbook of Jupyter notebooks for fast.ai Computational Linear Algebra course

Jupyter Notebook 10,791 2,675 Updated Apr 16, 2024

Optimus: the first large-scale pre-trained VAE language model

Python 393 41 Updated Sep 6, 2023

XTREME is a benchmark for the evaluation of the cross-lingual generalization ability of pre-trained multilingual models that covers 40 typologically diverse languages and includes nine tasks.

Python 652 110 Updated Jan 4, 2023

我的信息学竞赛讲课课件

TeX 1,202 254 Updated Jan 31, 2020

Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more

Python 35,339 3,512 Updated Apr 9, 2026

Script that crawls meta data from ICLR OpenReview webpage. Tutorials on installing and using Selenium and ChromeDriver on Ubuntu.

Jupyter Notebook 461 41 Updated Dec 25, 2019

A booklet on machine learning systems design with exercises. NOT the repo for the book "Designing Machine Learning Systems", which is `dmls-book`

HTML 10,327 1,610 Updated Apr 15, 2023
Jupyter Notebook 45 5 Updated Nov 3, 2019

Codes for <Kernelized Bayesian Softmax for Text Generation> in NeurIPS 2019

Python 16 3 Updated Nov 20, 2019

Data and Code for ICLR2020 Paper "TabFact: A Large-scale Dataset for Table-based Fact Verification"

Python 414 49 Updated Sep 19, 2023

AAAI-20 paper: Cross-Lingual Natural Language Generation via Pre-Training

Python 129 16 Updated Aug 4, 2021

Implementation of INLG 19 paper: Rethinking Text Attribute Transfer: A Lexical Analysis

Python 16 5 Updated Sep 30, 2019

Fast, general, and tested differentiable structured prediction in PyTorch

Jupyter Notebook 1,127 94 Updated Apr 20, 2022

LAnguage Model Analysis

Python 1,389 187 Updated Jul 7, 2024

PyTorch implementation of A Surprisingly Effective Fix for Deep Latent Variable Modeling of Text (EMNLP 2019)

Python 48 7 Updated Feb 16, 2020

Code examples for CMU CS11-731, Machine Translation and Sequence-to-sequence Models

Python 35 9 Updated Nov 4, 2019
Python 42 5 Updated Nov 9, 2019
Next