Skip to content
View alesanzdro's full-sized avatar
  • FISABIO
  • Valencian Community (SPAIN)

Block or report alesanzdro

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
alesanzdro/README.md

Hi there, I'm Alejandro πŸ‘‹

Bioinformatician Β Β·Β  HPC Systems Administrator Β Β·Β  Pipeline Engineer

Genomic Epidemiology & Pathogen Surveillance @ FISABIO β€” Valencia, Spain πŸ‡ͺπŸ‡Έ


🧬 About me

I'm a bioinformatician with 10+ years of experience in genomic data analysis and Linux/HPC administration. I work at the EPIMOL group (FISABIO) on the genomic surveillance program of the Valencian Community, where I split my time between designing reproducible pipelines, keeping shared HPC infrastructure healthy, and acting as the technical bridge between the molecular epidemiology team and IT.

I tend to identify as a problem solver β€” whether that means recovering a failed system at 11 pm, debugging an obscure pipeline error, or tracking down why a 10 GbE link suddenly stopped behaving. I care about solutions that are robust, reproducible, and useful for the people who actually have to run them.

  • Currently working on respiratory viruses (SARS‑CoV‑2, Influenza, RSV) and antimicrobial resistance in clinical and foodborne pathogens, including Candida auris.
  • Previously: WGS / WES + CNV analysis on FFPE prostate cancer samples at CNIO under a US Department of Defense contract; transcriptomics & miRNA regulatory networks at I2SysBio.
  • Hands‑on experience with Nanopore (MinION / PromethION) and Illumina (MiSeq / NextSeq), plus epigenomics (modified base detection in Neisseria gonorrhoeae).
  • Day‑job sysadmin for shared HPC servers (storage, GPU basecalling, NFS, Samba, networking, user management).

⭐ Flagship project β€” EPILIMS

A complete LIMS platform for molecular epidemiology, designed and built from scratch to scale genomic surveillance across hospitals of the Valencian Community.

EPILIMS is my main ongoing project: a multi-container web application that centralises the full sample lifecycle β€” from reception and wet-lab tracking, to sequencing runs, to bioinformatic results across viral, bacterial and fungal pipelines.

  • Multi-container architecture β€” Django web app, PostgreSQL database, reverse proxy, background workers, and pipeline integration, all orchestrated via Docker.
  • Designed for sensitive clinical data β€” role-based access (lab, bioinformatics, hospital read-only), audit trail, and protection of personally identifiable patient information in compliance with GDPR / LOPDGDD.
  • Built around real surveillance workflows β€” supports multiple pipelines per sequencing run, hybrid Illumina + Nanopore data, lineage assignment, AMR profiling, and per-hospital dashboards.
  • 40+ tables, normalised schema β€” patients, samples, wet-lab extractions, sequencing runs, pipeline executions and results, with materialised views for fast hospital-facing reporting.

πŸ§ͺ Open-source pipelines

Pipeline Stack Description
epitaxmag Nextflow DSL2 Β· Nanopore Modular environmental metagenomics pipeline: multi-tool taxonomic profiling, MAG recovery, virulence screening and AMR linkage.
epicandi Snakemake Β· Illumina + ONT WGS surveillance pipeline for Candida auris (assembly, polishing, annotation, AMR mutations, alternative genetic code).
epibac Snakemake Β· Python Bacterial genomic analysis pipeline: assembly, MLST, virulence and antimicrobial resistance characterisation.
DGSPefsa Python Β· Bash Foodborne pathogen surveillance β€” supporting routine reporting in collaboration with public-health authorities.

🧰 Tech stack

Languages

Python R Bash SQL

Workflow & pipelines

Nextflow Snakemake nf-core

Web & data

Django PostgreSQL FastAPI

AI & LLM β€” providers and models

Claude OpenAI Gemini DeepSeek Ollama Hugging%20Face Llama

Agentic development & ML

Claude%20Code MCP Cursor PyTorch

Containers, environments & HPC

Docker Singularity Conda SLURM Lmod

Infrastructure & ops

Linux Git GitHub%20Actions NFS


🧠 Areas of expertise

  • Sequencing β€” end‑to‑end Nanopore (MinION / PromethION) and Illumina (MiSeq / NextSeq) workflows, including duplex basecalling and modified base calling
  • Variant analysis β€” germline, somatic and CNV calling, including challenging FFPE samples
  • Pathogen genomics β€” viral lineage assignment, AMR, MLST, phylogenetics, outbreak investigation
  • Epigenomics β€” methylation calling and modified base detection
  • Pipeline engineering β€” Nextflow DSL2 & Snakemake, parallelization strategies, reproducible environments
  • Full-stack development β€” Django / FastAPI web apps with PostgreSQL backends, deployed in Docker
  • AI-assisted engineering β€” daily multi-provider LLM workflow (Claude, ChatGPT/OpenAI, Gemini, DeepSeek) and self-hosted open-weight models via Ollama (Llama, Qwen, Mistral, DeepSeek-Coder, Gemma). Hands-on prompt engineering, agentic coding with Claude Code and Cursor, custom MCP servers and integrations, and LLM-powered automation for pipeline scaffolding, code review and internal tooling. Comfortable embedding deep-learning components (neural basecallers, ML-based variant callers) into production bioinformatics pipelines.
  • HPC sysadmin β€” SLURM, Lmod, conda/mamba, Docker / Singularity, NFS, Samba, RAID storage, networking
  • Troubleshooting β€” pipelines, systems, networks, the occasional human

πŸ“Š GitHub stats


πŸ“« Get in touch


"Everything you do should be done with passion."

Pinned Loading

  1. DGSPefsa DGSPefsa Public

    Identification and surveillance of antimicrobial resistance (AMR), EFSA.

    Shell

  2. epibac epibac Public

    Pipeline for basic bioinformatic analysis of bacteria and study of AMR and MLST.

    Python 3