Research focus: probabilistic models for multi-omics integration, latent factor models, clinical genomics, HPC workflows.
Data Scientist · Bioinformatics · Multi-Omics Integration · Machine Learning
📍 Ecuador 📞 +593 963500658 ✉️ eddgeag@gmail.com · egeraud@uoc.edu · egeraud@doctor.upv.es
Data scientist and bioinformatician with ~10 years of experience in biostatistics, omics data processing, and machine learning. My work focuses on multi-omics integration, biomarker discovery, and statistical modelling for clinical and biomedical applications. I routinely develop HPC-based pipelines (R, Python, Bash) for whole-exome sequencing, multi-omics integration, and computational genomics.
I am currently pursuing a PhD in Statistics & Optimization (UPV & UV), specialising in multi-omics methods applied to PCOS, metabolic risk, and plant-omics. I am seeking a short research stay to deepen my training in statistical multi-omics, modelling, and computational biology.
- Multi-omics integration (MOFA2, mixOmics, latent factor models)
- Bayesian models, probabilistic machine learning
- Clinical genomics and WES-based biomarker discovery
- Single-cell and population-level omics
- HPC workflows for reproducible computational biology
Universidad Politécnica Salesiana — Professor (2022–Present) Molecular Genetics, Machine Learning, Bioinformatics; thesis advising; scientific writing.
Laboratorio Biomolecular — Bioinformatician (2021–Present) HPC WES pipeline (R/Bash); Sanger automation; SQL + Power BI; nutrigenomics.
KAUST (Remote) — Bioinformatician (2021) MOFA2 Bayesian multi-omics integration for leukemia biomarker discovery.
IBEC — Bioinformatician (2019–2020) R pipelines for fraud detection in food samples.
Fluttr — Data Scientist (2017–2019) Machine-learning models for intelligent recruitment.
IDIBAPS — Bioinformatician (2015–2017) Neuronal signal processing; entropy-based metrics.
PhD in Statistics & Optimization – UPV & UV (2022–Present) Focus: multi-omics integration; PCOS (N-omics), metabolic risk (P-omics), plant biomarker discovery (P-omics).
MSc Bioinformatics & Biostatistics — UOC & UB (2022) Thesis: Multi-omics Integration of Polycystic Ovary Syndrome
MSc Biomedical Engineering — UB & UPC (2015) Thesis: Anaemia Detector at Point of Care
BSc Biomedical Engineering — UB & UPC (2014) Thesis: Nanoparticle Detector at Point of Care
- Programming: R, Python, SQL, Bash/zsh, Git, Docker, HPC
- Bioinformatics: WES/NGS pipelines, Sanger, BWA, Samtools, Bedtools, Vcftools, FastQC, GATK
- Omics/Databases: ENSEMBL, refSeq, UCSC, OMIM, HPO, GO, DO, GSEA
- Statistics & ML: regression, classification, Bayesian models, probabilistic methods
- Other: Power BI, image/signal processing, scientific communication, teaching
metatest_final — Structured framework for multi-omics integration using MOFA2 + supervised modelling. Includes latent factor analysis, feature selection and classification pipelines applied to metabolic risk datasets.
repo_cedia — Whole-exome sequencing (WES) diagnostic pipeline developed and used in the laboratory. R/Bash workflow for alignment, QC, variant calling, annotation and reporting.
CNV_exomes — Complementary CNV detection workflow for exome data. Implements coverage-based CNV calling, QC, segmentation and integration into the diagnostic WES pipeline.
TFM_project — Scripts and exploratory notebooks from my MSc Bioinformatics & Biostatistics final project, focused on multi-omics integration (transcriptomics, metabolomics, clinical data) with MOFA2.
integromics_2 — Continuation of the MSc multi-omics integration project (PCOS). Contains extended modelling attempts, exploratory analyses and prototype pipelines leading to later structured work.
Two talks derived from PhD research on multi-omics analysis of PCOS. Authors: E. Géraud-Aguilar et al. (UPV, Ramón y Cajal / CIBERDEM, CIPF)
Spanish (Native) · Catalan (Native) · English (B2)
If you would like to discuss collaborations, research opportunities, or computational multi-omics, feel free to reach out via email.
- metatest_final — Final structured framework for MOFA2 + supervised modelling (PCOS multi-omics).
- repo_cedia — Real-world WES diagnostic pipeline (R/Bash/SQL).
- integromics_2 — Exploratory multi-omics integration workflow used during MSc research.