- Written by: Mauricio Peñuela
- Last update: 02/07/25
This repository contains dataframes, code, results, and figures for data exploration, model training, and validation related to the recombination prediction project from the Genome Biology Unit, Graduate School of Experimental Plant Sciences, Wageningen University.
The datasets and analyses cover:
- Interspecific recombinant populations of Brassica oleracea
- An interspecific dataset of tomato (Solanum lycopersicum × S. pimpinellifolium cross)
Folder names reflect the type of training features used in each analysis:
- k-mers: k-mers length 3
- expression: expression elements
- repetitive: repetitive elements
- all: all feature types combined
Each subfolder contains results and models corresponding to the respective input features.
Clone the repository:
git clone git@github.com:maurope/wur-recombination-prediction.git
Install all of the modules listed in the Python requirements file into the project environment:
pip install -r requirements.txt
- All analyses are conducted in Python using Jupyter Notebooks.
- Make sure to use an updated Python version (3.9+ recommended).
This project is open access and intended for academic and research purposes. You may freely use and adapt the materials, provided appropriate attribution is given.
For questions or collaborative opportunities, feel free to reach out to the author.