Welcome to Scribd!

Skip carousel

GATKwr8 S 2 Contamination Estimation

Uploaded by

Kartikeya Singh

0% found this document useful (0 votes)

8 views15 pages

Copyright

Available Formats

PDF, TXT or read online from Scribd

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Report this Document

GATKwr8 S 2 Contamination Estimation

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Flag for inappropriate content

0% found this document useful (0 votes)

8 views15 pages

GATKwr8 S 2 Contamination Estimation

Uploaded by

Kartikeya Singh

GATKwr8 S 2 Contamination Estimation

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Flag for inappropriate content

Jump to Page

You are on page 1of 15

Search inside document

talks

Es#ma#ng cross-sample contamina#on

with ContEst
Soma#c Variant Discovery Workow

Indels coming
soon! (M2)

+ some post-processing
to rescue TiN variants
and eliminate ar<facts
Would you trust a variant call made at this site?
Disambigua#ng types of contamina#on

Cross-sample (dierent people)

Tumor <-> normal (dierent #ssue)
Tumor subclones (dierent cell lines)
Bacterial cells (esp. in saliva, cheek swabs)

Tumor cells

Normal cells

Normal Tumor Other contamina#ng cells

ContEst: (cross-sample) Contamina#on Estima#on

Here contamina#on = cells

from other samples

Method described in
Cibulskis et al., 2011
bioinforma#cs.oxfordjournals.org/
content/27/18/2601

ContEst is not intended to determine stromal contamina<on

(the number of normal cells in your tumor sequence)

Stromal contamina#on is es#mated in post-processing using a

tool called ABSOLUTE by Carter et al.,
www.nature.com/nbt/journal/v30/n5/abs/nbt.2203.html
ContEst method in a nutshell

Evaluate genotypes of your sample at a set of sites that

are expected to be homozygous-variant

Contamina#ng
popula#on of
samples

Your Sample
Wait, how do I know which sites are hom-var?

Genotyping array Array-free

by on-the-y genotyping
Contamina#on es#ma#on with a genotyping array

Select sites that

are HOM-VAR in
the array data

Any REF at those

sites = probably
contamina#on
Contamina#on es#ma#on with on-the-y genotyping

Iden<fy HOM-VAR sites by genotyping the matched normal

(preferred) or tumor (if unmatched)

Call HOM-VAR any site

with > 80% bases
showing ALT with at
least 50X coverage

Any REF at those sites =

probably contamina#on
Popula#on allele frequency magers

Popula#on allele frequency is important too:

the mismatching reads only reect part of the true total contamina#on

Contamina#ng
popula#on of
samples

Your Sample
The underlying algorithm

c: contamination
f: minor allele frequency
e: sequencing error rate

1-c c
Bayesian approach to
calculate the posterior
probability of the
f 1-f
contamina#on level and
determine the maximum a
posteriori probability (MAP)
1-e e 1-e e e 1-e
es#mate of the
MINOR MAJOR MINOR MAJOR MINOR MAJOR contamina#on level
P(MINOR | genotype) = (1-c)(1-e) + cf(1-e) + c(1-f)(e)
P(MAJOR | genotype) = (1-c)(e) + cf(e) + c(1-f)(1-e)
How to run it

java jar ContEst.jar \
-T Contamina<on \
-R reference.fasta \
-I sample.bam \
-B:pop,vcf popula<on_stra<ed_af_hapmap.vcf \
-B:genotypes,vcf normal_sample.vcf \
-BTI genotypes \
-o contamina<on_results.txt

Contamina#on es#ma#on for the sample overall

(used by MuTect in next step)
Contamina#on for each lane in the sample
(by read group can blacklist RGs) add -llc
XXXXXXX
LANE
to your commandline
How to interpret the contamina#on values

0-2% - Fine, everything is good!

2-5% - Slightly contaminated, might be worth looking

into if your sample produces weird downstream results

>50% unusable contamina#on,

as you approach 100%
Between and 5 and 15%, heavily contamina#on theres a chance
contaminated but salvageable, its a sample swap
watch these samples, and expect
much manual review

Between 15 and 50%, heavy contaminated, most likely worth

removing samples and follow up with project management
Soma#c Variant Discovery Workow

Indels coming
soon! (M2)

+ some post-processing
to rescue TiN variants
and eliminate ar<facts
talks

Further reading
Documenta#on coming soon to the GATK website

In the mean#me, see
hgp://www.broadins#tute.org/cancer/cga/Home

MicroParaReviewer PDF
Document12 pages
MicroParaReviewer PDF
Einah Einah
No ratings yet
Application of ANOVA
Document18 pages
Application of ANOVA
Uma Shankar
38% (8)
Macromolecules Worksheet
Document6 pages
Macromolecules Worksheet
Myka Zoldyck
0% (1)
Comparative Genomics and Target Discovery: Maarten Sollewijn Gelpke MDI, Organon
Document35 pages
Comparative Genomics and Target Discovery: Maarten Sollewijn Gelpke MDI, Organon
pinkbutter
No ratings yet
Andriani Daskalaki-Handbook of Research On Systems Biology Applications in Medicine (2008)
Document917 pages
Andriani Daskalaki-Handbook of Research On Systems Biology Applications in Medicine (2008)
Алексей Почтилейтинантзапаса
100% (2)
Erik Garrison - Iowa Talk 2
Document32 pages
Erik Garrison - Iowa Talk 2
Sergio Nemirovsky
No ratings yet
T-Tests, Anovas & Regression: and Their Application To The Statistical Analysis of Neuroimaging
Document39 pages
T-Tests, Anovas & Regression: and Their Application To The Statistical Analysis of Neuroimaging
Work Place
No ratings yet
Expl Anati Onofsi MPL Exmethod: I Ntroducti On
Document12 pages
Expl Anati Onofsi MPL Exmethod: I Ntroducti On
Maduamaka Ihejiamatu
No ratings yet
Final Capstone Poster-Mann PDF
Document1 page
Final Capstone Poster-Mann PDF
Amy Nottingham-Martin
No ratings yet
4 RNAseq-Quantification LO
Document30 pages
4 RNAseq-Quantification LO
Manovriti Thakur
No ratings yet
GATKwr8 S 3 Variant Calling With MuTect
Document37 pages
GATKwr8 S 3 Variant Calling With MuTect
Kartikeya Singh
No ratings yet
Chapter 7 Genetics
Document20 pages
Chapter 7 Genetics
edomin00
No ratings yet
SCI. Tema 2B
Document39 pages
SCI. Tema 2B
laylaestrellada
No ratings yet
Bioinformatics: Stats Bootcamp
Document63 pages
Bioinformatics: Stats Bootcamp
manisha
No ratings yet
Lva1 App6891 PDF
Document33 pages
Lva1 App6891 PDF
abhishek
No ratings yet
Science of Living System (BS20001) : - Soumya de
Document45 pages
Science of Living System (BS20001) : - Soumya de
Mayank Priayadarshi
No ratings yet
BS10003 - Transcription and Translation - December 2020
Document38 pages
BS10003 - Transcription and Translation - December 2020
dhiraj more
No ratings yet
1.RNA Seq Part1 WorkingToTheGoal
Document75 pages
1.RNA Seq Part1 WorkingToTheGoal
Parisha Singh
No ratings yet
We Are Intechopen, The World'S Leading Publisher of Open Access Books Built by Scientists, For Scientists
Document21 pages
We Are Intechopen, The World'S Leading Publisher of Open Access Books Built by Scientists, For Scientists
jibitesh
No ratings yet
GATKwr12 3 IndelRealignment PDF
Document15 pages
GATKwr12 3 IndelRealignment PDF
Alexander Louis Smith
No ratings yet
Performance of The AVENIO Tumor Tissue Analysis Kits Across Illumina Sequencing Platforms
Document4 pages
Performance of The AVENIO Tumor Tissue Analysis Kits Across Illumina Sequencing Platforms
pappu
No ratings yet
2 Geneovrh
Document28 pages
2 Geneovrh
Andres Zabala
No ratings yet
6 Molecular Markers
Document18 pages
6 Molecular Markers
sadiewang0812
No ratings yet
Application of ANOVA
Document19 pages
Application of ANOVA
abhay_prakash_ranjan
100% (2)
CS369 StringAlgs PDF
Document33 pages
CS369 StringAlgs PDF
kamalsmanek
No ratings yet
Microarray Analysis::: - Data Pre-Processing - Normalization - Molecular Diagnosis - Statistical Classification
Document34 pages
Microarray Analysis::: - Data Pre-Processing - Normalization - Molecular Diagnosis - Statistical Classification
Romlah Romlah
No ratings yet
08 Binomial Distribution Calculations
Document2 pages
08 Binomial Distribution Calculations
Mary Oviedo
No ratings yet
Chan Workshop AK Latest
Document63 pages
Chan Workshop AK Latest
Yanelisa Pulani
No ratings yet
Genome Basic Concept, Terminology and Tools
Document47 pages
Genome Basic Concept, Terminology and Tools
marina nikolidaki
No ratings yet
Mito NGS
Document49 pages
Mito NGS
Laél Bullock
No ratings yet
RNA Interference
Document23 pages
RNA Interference
choryn modina
No ratings yet
Replication-Viruses
Document69 pages
Replication-Viruses
Sadam Irshad
No ratings yet
Science of Living System: Arindam Mondal
Document48 pages
Science of Living System: Arindam Mondal
Sohini Roy
No ratings yet
Noncoding Regions - Recombination
Document39 pages
Noncoding Regions - Recombination
komaltahir2021
No ratings yet
Bim3007 Final
Document17 pages
Bim3007 Final
ljl010113
No ratings yet
ANOVA: Analysis of Variance: Prof. Rohit Joshi, Prof. Achinta Kr. Sarmah
Document40 pages
ANOVA: Analysis of Variance: Prof. Rohit Joshi, Prof. Achinta Kr. Sarmah
vvinaybhardwaj
No ratings yet
COMP90016 2023 08 Variant Calling II
Document41 pages
COMP90016 2023 08 Variant Calling II
Lynn CHEN
No ratings yet
Noise Effect On Arabic Alphadigits in Au
Document4 pages
Noise Effect On Arabic Alphadigits in Au
Abdelkbir Ws
No ratings yet
Rnaseq and Chip-Seq Principles: A) Quantifying Against A Genome
Document7 pages
Rnaseq and Chip-Seq Principles: A) Quantifying Against A Genome
hesham12345
No ratings yet
Data Poisoning Attacks: Shusen Wang
Document17 pages
Data Poisoning Attacks: Shusen Wang
MInh Thanh
No ratings yet
Week. Please Come To
Document29 pages
Week. Please Come To
mahmud000
No ratings yet
Application of Biotechnology To Wheat Improvement
Document28 pages
Application of Biotechnology To Wheat Improvement
Jagadeesh
No ratings yet
Call Drop HO
Document1 page
Call Drop HO
SABER1980
No ratings yet
GG 3
Document52 pages
GG 3
Sarah Medjadj
No ratings yet
Genome Sequence Assembly
Document7 pages
Genome Sequence Assembly
madura c
No ratings yet
The NSMS: NSMR NSML
Document30 pages
The NSMS: NSMR NSML
Antoaneta Pap
No ratings yet
02 NGS Considerations
Document10 pages
02 NGS Considerations
Dethleff90
No ratings yet
RpoBSequencerW19 Copy Lab Report
Document7 pages
RpoBSequencerW19 Copy Lab Report
Akash Mehta
No ratings yet
Gene Fine Structure Analysis in Prokaryotes and Viruses
Document32 pages
Gene Fine Structure Analysis in Prokaryotes and Viruses
erica williams
No ratings yet
Pusch Nik 2017
Document14 pages
Pusch Nik 2017
Mauricio Ríos
No ratings yet
Genome Annotation
Document25 pages
Genome Annotation
Sajjad Hossain Shuvo
No ratings yet
Genome of Virus
Document46 pages
Genome of Virus
gail
No ratings yet
Image Compression Fundamentals
Document11 pages
Image Compression Fundamentals
amant
No ratings yet
Gene Technology: Lecture 8 - Chapter 7 Mobile DNA Sequences in The Genome
Document39 pages
Gene Technology: Lecture 8 - Chapter 7 Mobile DNA Sequences in The Genome
Tania Khan
No ratings yet
Rapd &amp Sts
Document7 pages
Rapd &amp Sts
Mukul Kumar
No ratings yet
5-4188s1 4draft 1
Document48 pages
5-4188s1 4draft 1
api-414476993
No ratings yet
CLC Genetics Chapter 3
Document51 pages
CLC Genetics Chapter 3
Châu Anh Trần
No ratings yet
AlinhamentosMultiplos 2023-24
Document24 pages
AlinhamentosMultiplos 2023-24
mariana.duarte22larsen
No ratings yet
ECT423 M2 Ktunotes - in
Document25 pages
ECT423 M2 Ktunotes - in
paperprep3
No ratings yet
Lecture3 High Throughput Sequencing 2019
Document68 pages
Lecture3 High Throughput Sequencing 2019
Charlie Hou
No ratings yet
Soal Biosel PDF
Document7 pages
Soal Biosel PDF
jessica
No ratings yet
RTWP Problem Troubleshooting Guideline (Huawei)
Document4 pages
RTWP Problem Troubleshooting Guideline (Huawei)
Fachriansyah Fachruddin
No ratings yet
The Common Lisp Condition System: Beyond Exception Handling with Control Flow Mechanisms
From Everand
The Common Lisp Condition System: Beyond Exception Handling with Control Flow Mechanisms
Michał "phoe" Herda
No ratings yet
4030 Full
Document10 pages
4030 Full
Kartikeya Singh
No ratings yet
Faculty of Science Application For Computational Biology Programme FOR ACADEMIC YEAR 2017/2018
Document2 pages
Faculty of Science Application For Computational Biology Programme FOR ACADEMIC YEAR 2017/2018
Kartikeya Singh
No ratings yet
GATE 2018 Admit Card S4: Examination Centre
Document1 page
GATE 2018 Admit Card S4: Examination Centre
Kartikeya Singh
No ratings yet
Ciml v0 - 99 All
Document227 pages
Ciml v0 - 99 All
Kartikeya Singh
No ratings yet
Seminar Circuit Application Form 2015
Document2 pages
Seminar Circuit Application Form 2015
Kartikeya Singh
No ratings yet
Advertisement of Dr. Naresh Kumar in PDF
Document3 pages
Advertisement of Dr. Naresh Kumar in PDF
Kartikeya Singh
No ratings yet
Lectures Bio
Document41 pages
Lectures Bio
Kartikeya Singh
No ratings yet
PCR Primer Design 2013
Document29 pages
PCR Primer Design 2013
Kartikeya Singh
No ratings yet
Institute of Advanced Study in Science and Technology
Document1 page
Institute of Advanced Study in Science and Technology
Kartikeya Singh
No ratings yet
Projectappointments DR - Subeer 7july2017
Document2 pages
Projectappointments DR - Subeer 7july2017
Kartikeya Singh
No ratings yet
Atomic and Molecular Physics Rajkumar
Document3 pages
Atomic and Molecular Physics Rajkumar
Kartikeya Singh
25% (8)
PCR Primer Design 2013
Document29 pages
PCR Primer Design 2013
Kartikeya Singh
No ratings yet
Base Map Preparation For Master Plan Mapping of Farrukhabad - Fatehgarh Area
Document1 page
Base Map Preparation For Master Plan Mapping of Farrukhabad - Fatehgarh Area
Kartikeya Singh
No ratings yet
DBT JR F General Guidelines
Document23 pages
DBT JR F General Guidelines
Kartikeya Singh
No ratings yet
Drexel BIOMED: School of Biomedical Engineering, Science & Health Systems
Document35 pages
Drexel BIOMED: School of Biomedical Engineering, Science & Health Systems
Behnam Mirhashemi
100% (1)
Cell and Molecular Biology Lab Experiment
Document4 pages
Cell and Molecular Biology Lab Experiment
Mhel Rose Benitez
No ratings yet
Modern Concepts in Penicillium and Aspergillus Classification
Document451 pages
Modern Concepts in Penicillium and Aspergillus Classification
Thaina Araújo
50% (2)
1ST SA BIOCHEMISTRY - Almendras
Document5 pages
1ST SA BIOCHEMISTRY - Almendras
Cherry Dagohoy
No ratings yet
Conservation of Embryo and Ovules 1
Document41 pages
Conservation of Embryo and Ovules 1
INDRA RACHMAWATI
No ratings yet
Course Hero
Document17 pages
Course Hero
Bal
No ratings yet
High Sensitivity CRP - IMMULITE and IMMULITE 1000 - Rev 06 DXDCM 09017fe980297730-1538194293759
Document36 pages
High Sensitivity CRP - IMMULITE and IMMULITE 1000 - Rev 06 DXDCM 09017fe980297730-1538194293759
Deqsa Corporativo
0% (1)
Flavonoid Application
Document8 pages
Flavonoid Application
AH Siddiqui
No ratings yet
Sports Medicine and Health Science
Document9 pages
Sports Medicine and Health Science
Javier Estelles Muñoz
No ratings yet
Toxoplasmosis
Document48 pages
Toxoplasmosis
Irahmal Irahmal
No ratings yet
Samer Hawash 27s Resume
Document2 pages
Samer Hawash 27s Resume
api-438554282
No ratings yet
Natural Antimicrobial and Bioactive Compounds From Ludwigia Parviflora Roxb
Document6 pages
Natural Antimicrobial and Bioactive Compounds From Ludwigia Parviflora Roxb
nguyen ba trung
No ratings yet
Presented By: Dear Professor:: Seminar 1
Document62 pages
Presented By: Dear Professor:: Seminar 1
pradnya sadigale
No ratings yet
Nitrobacter Winogradsky
Document7 pages
Nitrobacter Winogradsky
FerryKurniawan
No ratings yet
Transgene: Transparent Frog
Document3 pages
Transgene: Transparent Frog
geobee emmanuel
No ratings yet
Mendivil-Giro 17 Is-Unive PDF
Document25 pages
Mendivil-Giro 17 Is-Unive PDF
halers
No ratings yet
An Introduction To Haematopoiesis Prof Vernon Louw Clinical Haematology University of Cape Town
Document35 pages
An Introduction To Haematopoiesis Prof Vernon Louw Clinical Haematology University of Cape Town
Ammaarah Isaacs
No ratings yet
Rainsure Company, Instrument and Assays Introduction CLV
Document61 pages
Rainsure Company, Instrument and Assays Introduction CLV
Mohammed H. Keshta
No ratings yet
Class 9th Cell - The Unit of Life
Document40 pages
Class 9th Cell - The Unit of Life
Kabir Rai
No ratings yet
Systematic Anatomy OF Dicqtyledons: Ajay Book Service
Document543 pages
Systematic Anatomy OF Dicqtyledons: Ajay Book Service
JOSE FRANCISCO FRANCO NAVIA
No ratings yet
Multiple Choice Questions: Patterns of Chromosome Inheritance
Document10 pages
Multiple Choice Questions: Patterns of Chromosome Inheritance
Arwa
No ratings yet
! Genetica - Curs + LP
Document740 pages
! Genetica - Curs + LP
Tobei Achim
No ratings yet
Ijms 22 04779
Document15 pages
Ijms 22 04779
Sofiya -
No ratings yet
MMP-1 Practical Handout v1.0
Document5 pages
MMP-1 Practical Handout v1.0
Maisha Jashim
No ratings yet
Photosynthesis PDF
Document22 pages
Photosynthesis PDF
bhaskar ray
No ratings yet
Superoxide Dismutase (SOD) A Promising Enzyme in The Area of Biopharmaceuticals in Its Native and Immobilized Form A Review
Document9 pages
Superoxide Dismutase (SOD) A Promising Enzyme in The Area of Biopharmaceuticals in Its Native and Immobilized Form A Review
IJRASETPublications
No ratings yet
Natural Selection Quiz Review Guide Answer Key
Document1 page
Natural Selection Quiz Review Guide Answer Key
Helen Saga
No ratings yet