Python 1 Updated Mar 28, 2026

verl: Volcano Engine Reinforcement Learning for LLMs

Python 20,283 3,529 Updated Mar 28, 2026

LLM101n: Let's build a Storyteller

36,624 2,002 Updated Aug 1, 2024

A tiny scalar-valued autograd engine and a neural net library on top of it with PyTorch-like API

Jupyter Notebook 15,255 2,312 Updated Aug 8, 2024

Reproducing Yann LeCun's 1989 paper "Backpropagation Applied to Handwritten Zip Code Recognition", to my knowledge the earliest real-world application of a neural net trained with backpropagation.

Jupyter Notebook 732 83 Updated Feb 3, 2024

Implementation of Denoising Diffusion Probabilistic Model in PyTorch

Python 10,466 1,271 Updated Feb 11, 2026

Consistency Distilled Diff VAE

Python 2,212 79 Updated Nov 7, 2023

Code for the paper "Planning with Diffusion for Flexible Behavior Synthesis"

Python 1,256 200 Updated Jul 18, 2024

An introductory LLM tutorial for developers: the Chinese edition of Andrew Ng's large language model course series

Jupyter Notebook 23,646 2,865 Updated Jun 12, 2025

✨ Light and Fast AI Assistant. Support: Web | iOS | MacOS | Android | Linux | Windows

TypeScript 87,603 60,026 Updated Dec 2, 2025

Inference code for Llama models

Python 59,274 9,827 Updated Jan 26, 2025

This repo contains the source code in my personal column (https://zhuanlan.zhihu.com/zhaoyeyu), implemented using Python 3.6. Including Natural Language Processing and Computer Vision projects, suc…

Jupyter Notebook 3,527 2,109 Updated Jun 14, 2021

Deep Recommenders

Python 328 107 Updated Jul 6, 2023

Budget Constrained Bidding for Display Advertising using Model-free Reinforcement Learning

Python 48 19 Updated Dec 13, 2019

Papers on Computational Advertising

Python 4,380 1,185 Updated Feb 9, 2021

Code of ICML-2020 paper Dynamic Knapsack Optimization Towards Efficient Multi-Channel Sequential Advertising

Python 27 8 Updated Aug 12, 2020

Uplift modeling and causal inference with machine learning algorithms

Python 5,782 856 Updated Mar 21, 2026

Learning Scheduling Algorithms for Data Processing Clusters

Python 319 93 Updated Jun 15, 2021

An implementation of GCN-NPEC for VRP

Python 37 4 Updated Jul 14, 2021

Illustrated Examples from Sutton and Barto

Jupyter Notebook 38 10 Updated May 11, 2023

xingtian is a componentized library for the development and verification of reinforcement learning algorithms

Python 317 89 Updated Sep 12, 2023

Code associated with the NeurIPS19 paper "Weighted Linear Bandits in Non-Stationary Environments"

Jupyter Notebook 17 2 Updated Nov 14, 2019

Grokking Deep Reinforcement Learning

Jupyter Notebook 1,009 276 Updated Feb 4, 2022

This repository contains implementations and illustrative code to accompany DeepMind publications

Jupyter Notebook 14,790 2,861 Updated Mar 27, 2026

Machine Learning for Combinatorial Optimization - NeurIPS'21 competition

Python 138 32 Updated Aug 29, 2022

Exact Combinatorial Optimization with Graph Convolutional Neural Networks (NeurIPS 2019)

Python 406 115 Updated Dec 21, 2021

Adds CityFlow to Gym

Python 32 17 Updated Nov 15, 2021