Pattern Discovery 10.3

Session 3. Strategy 2: Post Topic Modeling Phrase Construction

TurboTopics [Blei & Lafferty 09]: phrase construction as a post-processing step to Latent Dirichlet Allocation (LDA)

  Perform Latent Dirichlet Allocation on the corpus to assign each token a topic label

  Merge adjacent unigrams with the same topic label, using a distribution-free permutation test on an arbitrary-length back-off model

  End recursive merging when all significant adjacent unigrams have been merged

KERT [Danilevsky et al. 14]: phrase construction as a post-processing step to Latent Dirichlet Allocation

  Perform frequent pattern mining on each topic

  Perform phrase ranking based on four different criteria
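The distribution-free permutation test named in the TurboTopics merge step can be illustrated with a much simpler statistic than the one the method uses. The sketch below is an assumption-laden simplification: it tests whether a bigram occurs adjacently more often than it would if the tokens were shuffled at random, and it does not implement the back-off language model. The function names `bigram_count` and `permutation_p_value` are hypothetical.

```python
import random

def bigram_count(tokens, w1, w2):
    """Count positions where w1 is immediately followed by w2."""
    return sum(1 for a, b in zip(tokens, tokens[1:]) if a == w1 and b == w2)

def permutation_p_value(tokens, w1, w2, n_perm=1000, seed=0):
    """Estimate how often a random shuffle of the tokens produces at least
    as many (w1, w2) adjacencies as the observed sequence does.
    No distributional assumption is made about the counts."""
    rng = random.Random(seed)
    observed = bigram_count(tokens, w1, w2)
    shuffled = list(tokens)
    at_least = 0
    for _ in range(n_perm):
        rng.shuffle(shuffled)
        if bigram_count(shuffled, w1, w2) >= observed:
            at_least += 1
    # Add-one correction keeps the estimate strictly positive.
    return (1 + at_least) / (1 + n_perm)

# Toy corpus (made up for illustration): "phase transition" always co-occurs.
tokens = ["phase", "transition"] * 20 + ["game", "theory", "model", "data"] * 10
p = permutation_p_value(tokens, "phase", "transition")
print(p < 0.05)
```

A small p-value suggests the two unigrams are worth merging. In a recursive scheme the merged unit would then be treated as a single token and tested against its neighbors again.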

Example of TurboTopics

Perform LDA on the corpus to assign each token a topic label

E.g., phase_11 transition_11 ... game_153 theory_127 (the subscript is the token's topic label)

Then merge adjacent unigrams with the same topic label
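The merge rule in this example can be sketched in a few lines. This is a minimal illustration that assumes topic labels are already available as `(word, topic)` pairs, and it merges every run of adjacent same-topic tokens without the significance test that TurboTopics applies. The function name `merge_same_topic_runs` is hypothetical, and the two fragments from the slide are handled separately because the text between them is elided.

```python
def merge_same_topic_runs(tagged_tokens):
    """Group maximal runs of adjacent tokens that share a topic label.

    tagged_tokens: list of (word, topic_label) pairs in document order.
    Returns a list of (phrase, topic_label) pairs.
    """
    phrases = []
    run_words, run_topic = [], None
    for word, topic in tagged_tokens:
        if run_words and topic == run_topic:
            run_words.append(word)  # same topic as the current run: extend it
        else:
            if run_words:
                phrases.append((" ".join(run_words), run_topic))
            run_words, run_topic = [word], topic  # start a new run
    if run_words:
        phrases.append((" ".join(run_words), run_topic))
    return phrases

fragment_a = [("phase", 11), ("transition", 11)]
fragment_b = [("game", 153), ("theory", 127)]
print(merge_same_topic_runs(fragment_a))  # [('phase transition', 11)]
print(merge_same_topic_runs(fragment_b))  # [('game', 153), ('theory', 127)]
```

With the labels shown on the slide, "phase transition" becomes one phrase, while "game" and "theory" stay separate because their topic labels differ.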

KERT: Topical Keyphrase Extraction & Ranking

[Danilevsky, et al. 2014]

Paper titles with unigram topic assignment (Topic 1 & Topic 2):

  knowledge discovery using least squares support vector machine classifiers
  support vectors for reinforcement learning
  a hybrid approach to feature selection
  pseudo conditional random fields
  automatic web page classification in a dynamic and hierarchical way
  inverse time dependency in convex regularized learning
  postprocessing decision trees to extract actionable knowledge
  variance minimization least squares support vector machines

Result of topical keyphrase extraction & ranking:

  learning
  support vector machines
  reinforcement learning
  feature selection
  conditional random fields
  classification
  decision trees

Framework of KERT

1. Run bag-of-words model inference and assign a topic label to each token
2. Extract candidate keyphrases within each topic
   Frequent pattern mining
3. Rank the keyphrases in each topic
   Popularity: "information retrieval" vs. "cross-language information retrieval"
   Discriminativeness: only frequent in documents about topic t
   Concordance: "active learning" vs. "learning classification"
   Completeness: "vector machine" vs. "support vector machine"

Comparability property: directly compare phrases of mixed lengths
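Step 2 and the completeness criterion can be sketched as follows. This is an illustrative simplification, not KERT's ranking function: it counts word sets per topic by brute-force enumeration instead of a dedicated frequent pattern mining algorithm, uses raw support as a stand-in for popularity, and applies a simple superset filter for completeness. The names `mine_topic_patterns` and `drop_incomplete`, the thresholds, and the toy documents are all assumptions.

```python
from collections import Counter, defaultdict
from itertools import combinations

def mine_topic_patterns(docs, min_support=2, max_len=3):
    """docs: list of documents, each a list of (word, topic) pairs.
    Returns {topic: Counter of word-set (sorted tuple) -> number of documents}."""
    support = defaultdict(Counter)
    for doc in docs:
        words_by_topic = defaultdict(set)
        for word, topic in doc:
            words_by_topic[topic].add(word)
        # Every word set of size <= max_len within one topic is a candidate.
        for topic, words in words_by_topic.items():
            for size in range(1, max_len + 1):
                for pattern in combinations(sorted(words), size):
                    support[topic][pattern] += 1
    return {
        topic: Counter({p: c for p, c in counts.items() if c >= min_support})
        for topic, counts in support.items()
    }

def drop_incomplete(patterns, ratio=1.0):
    """Drop a pattern when a strict superset has at least ratio * its support,
    e.g. 'vector machine' when 'support vector machine' is just as frequent."""
    kept = {}
    for p, c in patterns.items():
        subsumed = any(
            set(p) < set(q) and cq >= ratio * c for q, cq in patterns.items()
        )
        if not subsumed:
            kept[p] = c
    return kept

docs = [
    [("support", 1), ("vector", 1), ("machine", 1), ("classifiers", 1)],
    [("support", 1), ("vector", 1), ("machine", 1)],
    [("feature", 1), ("selection", 1)],
]
patterns = mine_topic_patterns(docs)[1]
print(drop_incomplete(patterns))  # {('machine', 'support', 'vector'): 2}
```

Because the patterns are word sets, word order is lost at this stage. The tuples are sorted alphabetically only to give each set a canonical key.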


KERT: Topical Phrases on Machine Learning

Top-Ranked Phrases by Mining Paper Titles in DBLP

Comparison of phrase ranking methods (the topic that represents the area of Machine Learning):

| kpRel [Zhao et al. 11] | KERT (-popularity) | KERT (-discriminativeness) | KERT (-concordance) | KERT [Danilevsky et al. 14] |
|------------------------|--------------------|----------------------------|---------------------|-----------------------------|
| learning               | effective          | support vector machines    | learning            | learning                    |
| classification         | text               | feature selection          | classification      | support vector machines     |
| selection              | probabilistic      | reinforcement learning     | selection           | reinforcement learning      |
| models                 | identification     | conditional random fields  | feature             | feature selection           |
| algorithm              | mapping            | constraint satisfaction    | decision            | conditional random fields   |
| features               | task               | decision trees             | bayesian            | classification              |
| decision               | planning           | dimensionality reduction   | trees               | decision trees              |
