- Georgetown University
- Washington DC
- https://www.linkedin.com/in/peijin-li-pl724georgetown/
- https://www.datascienceportfol.io/peijinli
- https://medium.com/@lipeijin0405_41840
goldenvisa-rag-chat Public
LusAI is a local, RAG-based assistant for navigating Portugal’s Golden Visa policies. It combines document ingestion, semantic search, and LLM generation to deliver reliable answers from official d…
HTML Updated Jul 3, 2025 -
This project applies XGBoost to predict user behavior in the online travel industry, optimizing hotel bookings and enhancing personalized marketing through data-driven insights.
-
Gender-Differences-in-Competitive-Persistence-An-Empirical-Study-Using-Dutch-Math-Olympiad-Data Public
Replication: Do Women Give Up Competing More Easily? Evidence from the Lab and the Dutch Math Olympiad
r gender-equality replication-study applied-economics regression-discontinuity-designs sharp-regression-discontinuity HTML Updated Sep 6, 2024 -
This app presents the worldwide distribution of U.S. Grants spending from FY 2014 to FY 2022. The dashboard offers insights into how grants were allocated globally, highlighting trends, growth rate…
-
Project website: https://peijin0405.github.io/WorldNews-Subreddit-Analysis-PySpark/
Jupyter Notebook MIT License Updated Sep 4, 2024 -
-
This project uses XGBoost regression and classification models to predict user orders and user churn on online travel agency data, reaching 97% prediction accuracy.
Jupyter Notebook Updated May 6, 2024 -
This project studies the factors that influence international student mobility, using the case of students from B&R countries studying in China.
-
In this report, I used social network analysis techniques to study Huawei's customer connection patterns. The report is based on the Huawei Social Network Data dataset on Kaggle. Dat…
-
This project aims to identify the common features of successful social enterprises by applying unsupervised learning to impact data from 5,210 B Corporations.
mapping unsupervised-learning k-means-implementation-in-python k-means-clustering empirical-research unsupervised-clustering tfidf-text-analysis Jupyter Notebook Updated Jan 31, 2023 -
Data-Viz-R Public
This repo shows some data viz I created with R.
-
Builds a Naive Bayes classification model from scratch and evaluates its performance on the DBpedia14 dataset.
Jupyter Notebook Updated Jan 6, 2023 -
An app built with Streamlit. It reveals the whole picture of government grant funding in a specific state.