
Machine Learning Approaches to Automatic BI-RADS Classification of Mammography Reports

Bethany Percha and Daniel Rubin


Program in Biomedical Informatics and Department of Radiology, Stanford University

Introduction

Clinical information is often recorded as narrative (unstructured) text. This is problematic for both researchers and clinicians, as free text thwarts attempts to standardize language and ensure document completeness.

Natural language processing could be used to extract relevant information from unstructured text reports, but the reports must be both complete and consistent.

A feedback system that extracts relevant information from text as it is being generated, and prompts the physician to modify the report as needed, would be useful in both physician training and clinical practice.

Here we present preliminary results on a classification system that automatically assigns BI-RADS assessment codes to mammography reports. The BI-RADS assessment codes are:
0  Incomplete
1  Negative
2  Benign finding(s)
3  Probably benign
4  Suspicious abnormality
5  Highly suggestive of malignancy
6  Known biopsy-proven malignancy

Preprocessing

41,142 reports were extracted from Stanford's radTF database. Of these, 38,665 were diagnostic mammograms (not specimen analyses or descriptions of biopsy procedures), and 22,109 of those also had BI-RADS codes (older reports frequently lack them) and were unilateral (single-breast) mammography reports.

Each remaining report was then converted into a feature vector, where each feature was the number of times a given word stem appeared in the report. There were 2,216 unique stems.
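As a concrete illustration, here is a minimal sketch of this bag-of-stems vectorization. The poster does not specify the tokenizer or stemmer used; this sketch assumes NLTK's Porter stemmer, scikit-learn's CountVectorizer, and two made-up report snippets.

    import re

    from nltk.stem import PorterStemmer
    from sklearn.feature_extraction.text import CountVectorizer

    stemmer = PorterStemmer()

    def stem_tokenizer(text):
        # Lowercase, keep alphabetic tokens, and reduce each to its stem
        # (e.g. the Porter stemmer maps "malignancy" to "malign").
        return [stemmer.stem(tok) for tok in re.findall(r"[a-z]+", text.lower())]

    reports = [  # hypothetical snippets, not data from the study
        "No mammographic features of malignancy. Stable post-biopsy change.",
        "Calcifications 3 cm from the nipple, incompletely evaluated.",
    ]

    # Rows are reports; each column counts occurrences of one unique stem.
    vectorizer = CountVectorizer(tokenizer=stem_tokenizer, token_pattern=None)
    X = vectorizer.fit_transform(reports)
    print(vectorizer.get_feature_names_out())
    print(X.toarray())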
Feature Ranking

The most informative features were chosen using chi-squared attribute evaluation. The most informative stems were:

Stem        Most common context                           Occurrences per report, by class
                                                             0    1    2    3    4    5    6
breast      (many contexts)                                4.2  1.9  3.8  4.6  5.7  6.9  7.7
featur      "no mammographic features of malignancy"       0.1  1.1  1.2  0.1  0.1  0.1  0.1
nippl       "x cm from the nipple" (describing a mass)     1.2  0.1  0.2  0.8  2.3  4.6  2.8
malign      "no mammographic features of malignancy"       0.1  1.1  1.2  0.1  0.1  0.3  0.3
evalu       "incompletely evaluated"                       1.0  0.0  0.0  0.1  0.1  0.1  0.2
incomplet   "incompletely evaluated"                       0.9  0.0  0.0  0.0  0.0  0.0  0.0
mammograph  "no mammographic features of malignancy"       0.3  1.5  1.8  0.7  0.9  1.7  1.2
stabl       "stable post-biopsy change"                    0.2  0.3  1.5  0.7  0.4  0.2  0.5
calcif      "calcifications"                               0.6  0.1  0.7  1.3  1.5  1.9  2.0
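A minimal sketch of this ranking step, assuming the stem-count matrix X from the previous sketch and a vector y of BI-RADS codes (0-6), one per report. The poster's tooling is not stated; scikit-learn's chi2 computes the same kind of per-feature chi-squared statistic.

    import numpy as np
    from sklearn.feature_selection import chi2

    def rank_stems(X, y, stem_names, k=10):
        # One chi-squared statistic per stem; a high score means the
        # stem's counts depend strongly on the BI-RADS class.
        scores, _pvalues = chi2(X, y)
        order = np.argsort(scores)[::-1]  # most informative first
        return [(stem_names[i], scores[i]) for i in order[:k]]

    # e.g. rank_stems(X, y, vectorizer.get_feature_names_out())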
Classification

Technique                                          % Accuracy
Naive Bayes                                              76.4
Multinomial Naive Bayes                                  83.1
K-Nearest Neighbors (K=10)                               87.5
Support Vector Machines:
  LIBLINEAR (L2-norm, one-against-one)                   89.3
  LIBLINEAR (multiclass, Crammer and Singer)             89.3
  LIBLINEAR-POLY2 (polynomial kernel, degree 2)          90.1

Accuracy was determined using 10-fold cross-validation.

Misclassification error did not decrease significantly with more training data, indicating high bias; including more features, such as bigrams, did not improve performance.
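A minimal sketch of the classifier comparison under 10-fold cross-validation, again assuming X and y from above. scikit-learn's LinearSVC wraps LIBLINEAR, and multi_class="crammer_singer" selects the multiclass formulation from the table; the plain Naive Bayes row and the degree-2 polynomial variant are not reproduced here.

    from sklearn.model_selection import cross_val_score
    from sklearn.naive_bayes import MultinomialNB
    from sklearn.neighbors import KNeighborsClassifier
    from sklearn.svm import LinearSVC

    models = {
        "Multinomial Naive Bayes": MultinomialNB(),
        "K-Nearest Neighbors (K=10)": KNeighborsClassifier(n_neighbors=10),
        "Linear SVM (L2-regularized)": LinearSVC(),
        "Linear SVM (Crammer-Singer)": LinearSVC(multi_class="crammer_singer"),
    }

    for name, model in models.items():
        # Mean accuracy over 10 folds, as in the results table.
        acc = cross_val_score(model, X, y, cv=10, scoring="accuracy").mean()
        print(f"{name}: {acc:.1%}")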
The final confusion matrix (units are %) was:

True class   Classified as...
                 0     1     2     3      4     5     6
0             93.7   2.3   3.1   0.1    0.8   0.0   0.0
1              0.4  93.6   5.9   0.1    0.0   0.0   0.0
2              0.9  11.1  87.1   0.1    0.6   0.0   0.1
3              7.1  21.1  49.1   9.7   12.6   0.0   0.3
4              8.5   3.7  10.6   0.6   75.9   0.0   0.7
5              0.0   0.0   0.0   0.0  100.0   0.0   0.0
6              4.9   4.9  24.6   0.8   27.9   0.0  36.9
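A sketch of how such a row-normalized (percentage) confusion matrix can be computed from out-of-fold predictions; the choice of LinearSVC here is illustrative, not necessarily the model behind the table above.

    from sklearn.metrics import confusion_matrix
    from sklearn.model_selection import cross_val_predict
    from sklearn.svm import LinearSVC

    # Out-of-fold predictions from 10-fold cross-validation.
    y_pred = cross_val_predict(LinearSVC(), X, y, cv=10)

    # normalize="true" scales each row by the true-class total; multiply
    # by 100 to match the percentage units in the table above.
    cm = 100 * confusion_matrix(y, y_pred, labels=list(range(7)), normalize="true")
    print(cm.round(1))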
Conclusions

Radiologists' word choices are a good indicator of which BI-RADS class they choose, but the correspondence is not perfect, particularly for the higher BI-RADS values.

The development of training software for radiologists based on this approach could help them standardize their descriptions of images and learn to better describe which specific features of an image cause them to place it in a given class.
