- San Francisco Bay Area
- https://www.linkedin.com/in/changlinz/
Lists (1)
Sort Name ascending (A-Z)
Stars
Data Engineering Zoomcamp is a free 9-week course on building production-ready data pipelines. The next cohort starts in January 2026. Join the course here 👇🏼
FinGPT: Open-Source Financial Large Language Models! Revolutionize 🔥 We release the trained model on HuggingFace.
Free MLOps course from DataTalks.Club
QLoRA: Efficient Finetuning of Quantized LLMs
This repo is meant to serve as a guide for Machine Learning/AI technical interviews.
My implementation of the original GAT paper (Veličković et al.). I've additionally included the playground.py file for visualizing the Cora dataset, GAT embeddings, an attention mechanism, and entr…
My solution to the book <A collection of Data Science Take-home Challenges>
Compute Sentence Embeddings Fast!
Multilabel classification for Toxic comments challenge using Bert
Learning Graph Normalization for Graph Neural Networks
Jupyter Notebook + Python code of twitter sentiment analysis
🏆 Kaggle 8th place solution
Image similarity using Triplet Loss
Analysis of NYC Green Taxi and a model to predict the tip as a percentage of the total fare
In this repo, I've trained an object detection model to find the number of RBC, WBC, PLATELETS Count from the microscopic blood-smeared images.
Clustering analysis of one million tweets using scikit-learn, including basic benchmarking of various clustering algorithms
Social Network Facebook Analysis (Python, Networkx)
Using tessseract library API in python with cffi
Deep Learning for Semantic Text Matching
X-ray diffraction data analysis for high pressure and high temperature experiments
Using NLP techniques to classify companies according to their descriptions
A collection of machine learning examples using PySpark
Extracting relevant information like invoice number, date, amount etc. from PDF files using OCR and NLP techniques
Kaggle-Porto Seguro’s Safe Driver Prediction:Predict if a driver will file an insurance claim next year.