Professional Documents
Culture Documents
Indels
coming
soon!
(M2)
+
some
post-processing
to
rescue
TiN
variants
and
eliminate
ar<facts
Would
you
trust
a
variant
call
made
at
this
site?
Disambigua#ng
types
of
contamina#on
Tumor cells
Normal cells
Method
described
in
Cibulskis
et
al.,
2011
bioinforma#cs.oxfordjournals.org/
content/27/18/2601
Contamina#ng
popula#on
of
samples
Your
Sample
Wait,
how
do
I
know
which
sites
are
hom-var?
Contamina#ng
popula#on
of
samples
Your
Sample
The
underlying
algorithm
c: contamination
f: minor allele frequency
e: sequencing error rate
1-c c
Bayesian
approach
to
calculate
the
posterior
probability
of
the
f 1-f
contamina#on
level
and
determine
the
maximum
a
posteriori
probability
(MAP)
1-e e 1-e e e 1-e
es#mate
of
the
MINOR MAJOR MINOR MAJOR MINOR MAJOR contamina#on
level
P(MINOR | genotype) = (1-c)(1-e) + cf(1-e) + c(1-f)(e)
P(MAJOR | genotype) = (1-c)(e) + cf(e) + c(1-f)(1-e)
How
to
run
it
java
jar
ContEst.jar
\
-T
Contamina<on
\
-R
reference.fasta
\
-I
sample.bam
\
-B:pop,vcf
popula<on_stra<ed_af_hapmap.vcf
\
-B:genotypes,vcf
normal_sample.vcf
\
-BTI
genotypes
\
-o
contamina<on_results.txt
Indels
coming
soon!
(M2)
+
some
post-processing
to
rescue
TiN
variants
and
eliminate
ar<facts
talks
Further
reading
Documenta#on
coming
soon
to
the
GATK
website
In
the
mean#me,
see
hgp://www.broadins#tute.org/cancer/cga/Home