Methodology

A breakdown of the analyses performed by this workflow.

Cheat-Sheets

Table of contents

The VCF Validation Workflow design follows the following order-of-events graph:

Rule Map/Diagram
---
title: VCF Validation Workflow
---
flowchart TD

  classDef bcftools stroke:#FF5733,fill:#D3D3D3,stroke-width:4px,color:black;
  classDef plink stroke:#36454F,fill:#D3D3D3,stroke-width:4px,color:black;
  classDef python stroke:#FEBE10,fill:#D3D3D3,stroke-width:4px,color:black;
  classDef admixture stroke:#333,fill:#D3D3D3,stroke-width:4px,color:black;
  classDef tabix stroke:#023020,fill:#D3D3D3,stroke-width:4px,color:black;
  classDef gatk stroke:#007FFF,fill:#D3D3D3,stroke-width:4px,color:black;
  classDef workflow stroke:#fff,fill:#000000,stroke-width:4px,color:white;

  subgraph ValidateVcfWorkflow [Validate VCF Workflow]
      wipeInfo[[**wipeInfo**:
  Remove INFO column for computational processing  efficiency]]
      normalize[[**normalize**:
  Normalize all SNPs]]
      sort[[**sort**:
  Ensure correct variant order]]
      filter[[**filter**:
  Remove all variants except SNPs]]
      annotate[[**annotate**:
  Annotate VCF against given reference VCF such as dbSNP, and rename any unknown variants.]]

      tabix[[**tabix**: Generate tabix-index for targeted decompression]]

      class tabix tabix;
      class wipeInfo,normalize,sort,filter,annotate bcftools;

      subgraph liftoverProtocol [Liftover]
          direction LR
          liftoverProtocolStart(((Start)))
          liftover[[liftover:
  Perform reference genome
  liftover]]
          liftoverProtocolEnd(((End)))
          ifLiftoverRequired{Is a
  liftover
  required?}

          liftoverProtocolStart --> ifLiftoverRequired
          ifLiftoverRequired --> |yes| liftover --> liftoverProtocolEnd
          ifLiftoverRequired --> |no| liftoverProtocolEnd
          class liftover gatk;
      end

      tabix --> wipeInfo & normalize & sort & filter & annotate
      wipeInfo --> normalize --> sort --> filter --> annotate --> liftoverProtocol
  end

Creative Commons by Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0) Powered by Snakemake

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.. This project is managed by the Institute for Cellular and Molecular Medicine at the University of Pretoria.