Skip to content

Single-end pipelines non-conditional

ewallac2 requested to merge ewallac2/crac_pipelines:SE_EW into master

Adds new pipeline scripts, described in README.md

The new scripts include some features that could be more widely useful and adapted in other pipelines:

  • runs pipeline_printout_graph for a graphical description of pipeline to flowchart.jpg
  • runs fastqc on demultiplexed fastq files to count reads and assess quality
  • runs bamqc on alignment files to assess alignment quality
  • relies on novo putting out sam-format output with broad compatibility
  • runs bamtools sort/index converting sam alignment files to bam files for input into genome browsers, bedtools, etc.
  • runs multiBamCov from bedtools (uses pipeline merge, including keeping track of input file order in columns via the header)
  • runs pyPileup.py if a genelist is supplied
  • runs pyReadCounters.py twice, once with blocks and muts, once without.

These scripts avoid the conditional running allowed in original CRAC_pipeline_SE.py, because that was not playing well with pipeline_printout_graph.

CRAC_pipeline_SE_demultonly.py

CRAC pipeline for single-end reads, adapted by Edward Wallace, that does demultiplexing and not deduplicating. This adds extra QC and readout steps.

CRAC_pipeline_SE_demult_dedup.py

CRAC pipeline for single-end reads, adapted by Edward Wallace, that does both demultiplexing and deduplicating. This adds extra QC and readout steps.

test_pipeline_flexbar_SE.py

All this does is run flexbar on the input file, to test that you have ruffus and flexbar both work.

Edited by ewallac2

Merge request reports