Skip to content
View zskigen's full-sized avatar

Block or report zskigen

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this userโ€™s behavior. Learn more about reporting abuse.

Report abuse
zskigen/README.md

Hi there ๐Ÿ‘‹ I'm Z (they/them)

๐Ÿ“Š Iโ€™m a Senior at Pomona College, majoring in Mathematics & Statistics with a minor in Data Science.

๐Ÿ” Iโ€™m interested in Bayesian methods, probabilistic modeling, and applied statistics, with projects spanning healthcare, biomechanics, and baseball analytics.


๐Ÿ“ Current Projects

  • Senior Thesis (2025โ€“26): Bayesian Joint Modeling of Pain and Depression

    • Developing a hierarchical Bayesian framework to jointly model chronic pain and depression outcomes in transgender healthcare.
    • Comparing empirical Bayes vs. full Bayes approaches, with EB+ correction.
    • Running simulation studies on posterior contraction for small subgroups and building decision-focused MCID calibration curves.
  • Data Science Capstone (2025โ€“26): Cloud-Based Baseball Analytics Infrastructure

    • Designing a cloud-based architecture to ingest and persist tracking API data using PostgreSQL and Python ETL pipelines.
    • Building automated statistical reporting workflows (Python, SQL, CI) to deliver reproducible summaries of pitch-level data.
    • Prototyping a Streamlit app for interactive data visualization and model exploration.

๐Ÿงช Past Projects

  • Quantitative Analyst Associate, Philadelphia Phillies (Summer 2025)

    • Directed independent research on the Automated Ball-Strike Challenge System (AAA level).
    • Built SQL + Python pipelines (BigQuery, nested CTEs, window functions) to compute per-pitch challenge run values and estimate opportunity cost ($L$).
  • Biomechanical Drivers of Pitch Velocity (2024)

    • Analyzed Driveline OpenBiomechanics and TrackMan datasets to identify biomechanical predictors of velocity.
    • Built nonlinear models (XGBoost) with tuned CV, achieving ~2โ€“3 mph prediction error.
    • Applied feature importance analysis to study energy transfer through the pitching kinetic chain.
  • Pomona-Pitzer Baseball Analytics (2023โ€“present)

    • Co-Director of Analytics & Data Engineering.
    • Developed Stuff+/Pitching+/Location+ models, opponent scouting pipelines, and automated workflows to support NCAA DIII decision-making.

๐Ÿ› ๏ธ Skills

Programming: Python, R, SQL, Bash
Statistical & ML Methods: Bayesian inference, probabilistic modeling, calibration, cross-validation, feature engineering
Libraries: XGBoost, scikit-learn, tidymodels, tidyverse, pandas, NumPy, Matplotlib, SHAP
Databases & Workflow: Google BigQuery, PostgreSQL, Git/GitHub, LaTeX


Pinned Loading

  1. Trackman-API-ETL Trackman-API-ETL Public

    Data Science Minor Capstone

    Python

  2. mysite mysite Public

    CSS

  3. TidyTuesdays TidyTuesdays Public

    TidyTuesday is a weekly social data project and is organized by the Data Science Learning Community. Here are some of my projects!

    HTML 1