GATK Data Pre-processing
October 2, 2022
GATK is a Genome Analysis Toolkit. Here are some notes about GATK:
GATK data pre-processing
Here is a workflow from GATK:
- Raw Mapped Reads (Bam) ->
MarkDuplicates
Check and compare results
samtools view BEFORE_MARKDUPLICATES.bam | wc -l
samtools view AFTER_MARKDUPLICATES.bam | wc -l
- ->
BaseRecalibrator
+ApplyBQSR
-> Analysis-Ready Rads (Bam)