Skip to content
View eddgeag's full-sized avatar

Block or report eddgeag

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
eddgeag/README.md

👋 Edmond Géraud Aguilar

🔬 Computational Biology • Multi-Omics Integration • Bayesian Modelling

Research focus: probabilistic models for multi-omics integration, latent factor models, clinical genomics, HPC workflows.

Data Scientist · Bioinformatics · Multi-Omics Integration · Machine Learning

📍 Ecuador 📞 +593 963500658 ✉️ eddgeag@gmail.com · egeraud@uoc.edu · egeraud@doctor.upv.es

About me

Data scientist and bioinformatician with ~10 years of experience in biostatistics, omics data processing, and machine learning. My work focuses on multi-omics integration, biomarker discovery, and statistical modelling for clinical and biomedical applications. I routinely develop HPC-based pipelines (R, Python, Bash) for whole-exome sequencing, multi-omics integration and computational genomics.

I am currently pursuing a PhD in Statistics & Optimization (UPV & UV), specialising in multi-omics methods applied to PCOS, metabolic risk, and plant-omics. I am seeking a short research stay to deepen my training in statistical multi-omics, modelling, and computational biology.


Research interests

  • Multi-omics integration (MOFA2, mixOmics, latent factor models)
  • Bayesian models, probabilistic machine learning
  • Clinical genomics and WES-based biomarker discovery
  • Single-cell and population-level omics
  • HPC workflows for reproducible computational biology

Experience

Universidad Politécnica Salesiana — Professor (2022–Present) Molecular Genetics, Machine Learning, Bioinformatics; thesis advising; scientific writing.

Laboratorio Biomolecular — Bioinformatician (2021–Present) HPC WES pipeline (R/Bash); Sanger automation; SQL + PowerBI; nutrigenomics.

KAUST (Remote) — Bioinformatician (2021) MOFA2 Bayesian multi-omics integration for leukemia biomarker discovery.

IBEC — Bioinformatician (2019–2020) R pipelines for fraud detection in food samples.

Fluttr — Data Scientist (2017–2019) Machine-learning models for intelligent recruitment.

IDIBAPS — Bioinformatician (2015–2017) Neuronal signal processing; entropy-based metrics.


Education

PhD in Statistics & Optimization – UPV & UV (2022–Present) Focus: multi-omics integration; PCOS (N-omics), metabolic risk (P-omics), plant biomarker discovery (P-OMICS)

MSc Bioinformatics & Biostatistics — UOC & UB (2022) Thesis: Multi-omics Integration of Polycystic Ovary Syndrome

MSc Biomedical Engineering — UB & UPC (2015) Thesis: Anaemia Detector at Point of Care

BSc Biomedical Engineering — UB & UPC (2014) Thesis: Nanoparticle Detector at Point of Care


Technical skills

Programming: R, Python, SQL, Bash/zsh, Git, Docker, HPC Bioinformatics: WES/NGS pipelines, Sanger, BWA, Samtools, Bedtools, Vcftools, FastQC, GATK Omics/Databases: ENSEMBL, refSeq, UCSC, OMIM, HPO, GO, DO, GSEA Statistics & ML: regression, classification, Bayesian models, probabilistic methods Other: Power BI, image/signal processing, scientific communication, teaching


Selected Projects

metatest_final — Structured framework for multi-omics integration using MOFA2 + supervised modelling. Includes latent factor analysis, feature selection and classification pipelines applied to metabolic risk datasets.

repo_cedia — Whole-exome sequencing (WES) diagnostic pipeline developed and used in the laboratory. R/Bash workflow for alignment, QC, variant calling, annotation and reporting.

CNV_exomes — Complementary CNV detection workflow for exome data. Implements coverage-based CNV calling, QC, segmentation and integration into the diagnostic WES pipeline.

TFM_project — Scripts and exploratory notebooks from my MSc Bioinformatics & Biostatistics final project, focused on multi-omics integration (transcriptomics, metabolomics, clinical data) including MOFA2..

integromics_2 — Continuation of the MSc multi-omics integration project (PCOS). Contains extended modelling attempts, exploratory analyses and prototype pipelines leading to later structured work.


Presentations

Two talks derived from PhD research on multi-omics analysis of PCOS Authors: E. Géraud-Aguilar et al. (UPV, Ramón y Cajal / CIBERDEM, CIPF)


Languages

Spanish (Native) · Catalan (Native) · English (B2)


Contact

If you would like to discuss collaborations, research opportunities, or computational multi-omics, feel free to reach out via email.

Selected Projects

  • metatest_final — Final structured framework for MOFA2 + supervised modelling (PCOS multi-omics).
  • repo_cedia — Real-world WES diagnostic pipeline (R/Bash/SQL).
  • integromics_2 — Exploratory multi-omics integration workflow used during MSc research.

Pinned Loading

  1. repo_cedia repo_cedia Public

    EXOMAS

    R

  2. TFM TFM Public

    Master thesis UOC

    HTML

  3. CNVs CNVs Public

    repo_CNV

    Shell

  4. integromics_2 integromics_2 Public

    R

  5. metatest_final metatest_final Public

    R