Skip to content
View Jiaxin-Pei's full-sized avatar

Block or report Jiaxin-Pei

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

77,566 8,971 Updated Feb 5, 2026

Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM

Python 7,875 680 Updated Oct 11, 2025

A modern, minimalist portfolio template built with Astro and Tailwind CSS. Perfect for developers looking to showcase their skills, experience, and projects in a clean, professional way.

Astro 4,837 4,124 Updated Mar 26, 2026

A simple tool to update bib entries with their official information (e.g., DBLP or the ACL anthology).

Python 3,001 166 Updated Jul 9, 2025

Crawl BookCorpus

Python 854 112 Updated Jul 14, 2023

A curated list of awesome Active Learning

796 74 Updated Mar 26, 2026

A PyTorch implementation of "TextFuseNet: Scene Text Detection with Richer Fused Features".

Python 486 122 Updated Jul 2, 2021

potato: the portable annotation tool

Python 375 69 Updated Mar 28, 2026

Charsiu: A neural phonetic aligner.

Jupyter Notebook 336 43 Updated Sep 19, 2022

A simple interface to the Project Gutenberg corpus.

Python 332 61 Updated Jan 12, 2023

A suite of tools for managing crowdsourcing tasks from the inception through to data packaging for research use.

Python 312 76 Updated Dec 13, 2024

程序员的浪漫方式集合,欢迎大家积极补充...

Python 133 35 Updated Oct 3, 2023

Tools for collecting social media data around focal events

Python 85 15 Updated Mar 29, 2022

Mapping of US Zipcode, county, and state information from Census data

Ruby 79 37 Updated Sep 6, 2024

A dataset contains 37 million douban dushu comments

70 5 Updated Dec 1, 2018

计算精神病学在线文献报告讨论会(Computational psychiatry online journal club(CPoJC))

54 9 Updated Aug 31, 2022
Jupyter Notebook 45 3 Updated Oct 14, 2024

ReCAP: Recursive Context-Aware Reasoning and Planning for Large Language Model Agents, NeurIPS 2025

PDDL 35 1 Updated Nov 15, 2025

An NLP processing pipeline for characters in fanfiction. Developed by students at Carnegie Mellon University from 2019-2021.

Python 34 7 Updated Feb 2, 2026

For associated data that can be mashed up with ours. This is data like Census demographics, CDC flu rates, and hospital beds

Jupyter Notebook 27 19 Updated May 26, 2020

Official code of "The Automated but Risky Game: Modeling Agent-to-Agent Negotiations and Transactions in Consumer Markets"

Jupyter Notebook 25 4 Updated Mar 24, 2026

The official repo for SocKET: Social Knowledge Evaluation Tests

Python 24 2 Updated May 12, 2025

Topic Modeling for The New York Times News Dataset

Python 20 11 Updated May 23, 2017

County Level Election Results Analysis (2016)

R 14 3 Updated Nov 27, 2016

Code for the paper "Modeling Information Change in Science Communication with Semantically Matched Paraphrases" from EMNLP 2022

Python 13 1 Updated Oct 20, 2022

Official repository for the ICWSM '21 paper "More than meets the tie: Examining the Role of Interpersonal Relationships in Social Networks"

Python 12 2 Updated Apr 26, 2023

robots.txt-style permission manifest for web agents

8 Updated Dec 10, 2025
Next