A pipeline for processing Illumina DNA Methylation array datasets: 450k, EPIC and EPICv2
Goals
- tidy up whatever has been written now and write custom containers for the whole pipeline (3 processes for now)
- attach the Dockerfile to the pipeline itself (cellranger in scrnaseq cannot be run with conda so I think this pipeline could rely on containers only until we find a fix)
- scripts rely on the directory access/path rather than the file access/paths (difficulty in fetching the directory itself from the test-data repo: perhaps add module for creating the CSV file with the input path as the scripts currently are expecting it and align with the requirements for proper -profile test setup.
- tidy up the R scripts and parametrize hardcoded options
- add a disclaimer that this will not run with conda
- update docs
- coordinate if the output of the pipeline generates same results using previously analyzed projects (align with @Ghada Nouairia)
- remove FastQC module and remove MultiQC module or add custom QC plots with versions to the report
- currently two tests are failing on GitHub actions related to pipeline logos, but these pass the nfcore tools repo.
Motivation
Differential analysis of DNA methylation can come from sequencing reads, but also from Illumina methylation array which are for detecting methylation intensity at certain pre-programmed sites. This pipeline is for the latter technique as there already is a nf-core project for the former (methylseq).
category
pipelines