Senior Computational Scientist
Translational Genomics • Single-cell & Bulk RNA-seq • ML-driven Bioinformatics
I design, own, and execute end-to-end pipelines that transform high-dimensional genomics and proteomics data into reproducible, actionable insights.
I lead translational genomics projects, from data generation to reproducible pipelines, spanning transcriptomics, proteomics, and genome-scale network modeling.
-
🧬 Nature Communications (2025)
End-to-end scRNA-seq pipeline for drug perturbation analysis in SARS-CoV-2 lung organoids -
🧫 Nature Immunology (2023)
End-to-end CyTOF analysis for immune profiling and condition comparison -
🧪 BMC Bioinformatics (2022)
G4Boost: Machine-learning framework for G-quadruplex prediction (Cited in Cell & Nature Comms) -
🕸️ BMC Plant Biology (2022)
Pan-genome co-expression network analysis in maize (First author, conceptual precursor to PanPPI) -
🕸️ G3 (2024)
PanPPI: Pan-genome protein-protein interaction networks (Co-first author) -
🌱 JoVE (2021)
mirMachine: Automated miRNA annotation pipeline for plant genomes
- Single-cell RNA-seq (10x Genomics, PARSE)
- Bulk RNA-seq (differential expression, pathway analysis)
- CyTOF / mass cytometry (immune profiling)
- Genome & transcriptome assembly
- Variant calling & functional annotation
- Drug perturbation & condition-response analysis
Python R Perl • scanpy Seurat pandas NumPy DESeq2 • scikit-learn XGBoost RandomForest • Snakemake Docker Conda HPC (Slurm)
LinkedIn • Google Scholar • 📍 San Mateo, CA