CWL + Docker Workflow

Overview

A workflow management approach that combines Common Workflow Language (CWL) for portable pipeline specification with Docker containers for reproducible execution environments. CWL describes tool inputs, outputs, and computational steps in a language-agnostic YAML/JSON format; Docker encapsulates software dependencies into isolated images. Together, they enable cloud-platform-independent reproducibility of bioinformatics pipelines.

Used by

  • The TCGA MC3 somatic calling pipeline was distributed as a CWL + Docker workflow at https://github.com/OpenGenomics/mc3, enabling reproducible execution of the seven-caller ensemble across DNAnexus and other cloud platforms PMID:29596782

Notes

  • CWL workflows are portable across compliant executors (cwltool, Toil, Cromwell, etc.) and cloud platforms (AWS, GCP, Azure, DNAnexus).
  • Docker container versioning is critical for reproducibility; pinning image digests (not just tags) is recommended.
  • The MC3 CWL release was intended as a reference implementation to enable other groups to replicate or extend the pipeline.

Sources

This page was processed by entity-page-writer on 2026-05-15.