EFFECTIVE PATTERN DISCOVERY FOR TEXT MINING (2012)

ABSTRACT
Many data mining techniques have been proposed for mining useful patterns in text
documents. However, how to effectively use and update discovered patterns is still
an open research issue, especially in the domain of text mining. Since most existing
text mining methods adopted term-based approaches, they all suffer from the
problems of polysemy and synonymy. Over the years, people have often held the
hypothesis that pattern (or phrase)-based approaches should perform better than the
term-based ones, but many experiments do not support this hypothesis. This paper
presents an innovative and effective pattern discovery technique which includes the
processes of pattern deploying and pattern evolving, to improve the effectiveness of
using and updating discovered patterns for finding relevant and interesting
information. Substantial experiments on the RCV1 data collection and TREC topics
demonstrate that the proposed solution achieves encouraging performance.

EXISTING SYSTEM
Existing approaches rely on sequential patterns, frequent itemsets, co-occurring
terms, and multiple grams. Many data mining techniques have been
proposed in the last decade. These techniques include
association rule mining, frequent itemset mining, sequential
pattern mining, maximum pattern mining, and closed pattern
mining. However, using this discovered knowledge (or these
patterns) in the field of text mining is difficult and ineffective.
The reason is that some useful long patterns with high
specificity lack support (i.e., the low-frequency problem).
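
The low-frequency problem can be illustrated with a small Java sketch. It is only an illustration under stated assumptions: patterns are treated as plain term sets (not ordered sequences), support is taken as the fraction of documents containing every term of the pattern, and the class name SupportSketch, its methods, and the toy documents are hypothetical.

```java
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

// Hypothetical helper: names and toy data are illustrative, not from the paper.
final class SupportSketch {

    // Relative support: fraction of documents that contain every term of the pattern.
    static double support(Set<String> pattern, List<Set<String>> documents) {
        if (documents.isEmpty()) {
            return 0.0;
        }
        int hits = 0;
        for (Set<String> doc : documents) {
            if (doc.containsAll(pattern)) {
                hits++;
            }
        }
        return (double) hits / documents.size();
    }

    // Convenience constructor for a term set.
    static Set<String> terms(String... words) {
        return new HashSet<>(Arrays.asList(words));
    }

    public static void main(String[] args) {
        // Assumed toy collection of four documents, each reduced to its set of terms.
        List<Set<String>> docs = Arrays.asList(
                terms("text", "mining", "pattern"),
                terms("text", "mining"),
                terms("text", "pattern", "discovery"),
                terms("text", "retrieval"));

        System.out.println(support(terms("text"), docs));                      // 1.0
        System.out.println(support(terms("text", "mining"), docs));            // 0.5
        System.out.println(support(terms("text", "mining", "pattern"), docs)); // 0.25
    }
}
```

In this toy collection the longest and most specific pattern has the lowest support, which is why a fixed minimum-support threshold tends to discard it.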

PROPOSED SYSTEM
The proposed system aims to improve effectiveness by making effective use of closed patterns
in text mining. In addition, a two-stage model that used both
term-based methods and pattern-based methods was introduced
to significantly improve the performance of information filtering.
Natural language processing (NLP) is a modern computational
technology that can help people understand the meaning of
text documents. For a long time, NLP struggled to deal
with uncertainties in human languages. Recently, a new concept-based
model was presented to bridge the gap between NLP and
text mining, which analyzed terms on the sentence and document levels. This model
included three components. The first component analyzed the semantic
structure of sentences; the second component constructed a conceptual
ontological graph (COG) to describe the semantic structures; and the last
component extracted top concepts based on the first two components to
build feature vectors using the standard vector space model. The
advantage of the concept-based model is that it can effectively
discriminate between unimportant terms and meaningful terms that
describe a sentence's meaning. Compared with the above methods, the
concept-based model usually depends on the NLP techniques it employs.
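
The closed patterns mentioned at the start of this section can be sketched in Java as follows. The sketch relies on the usual definition (a pattern is closed if no proper super-pattern has the same support) and assumes that patterns are term sets whose supports have already been counted. The class name ClosedPatternSketch, its methods, and the sample counts are hypothetical, and the quadratic scan is chosen for readability rather than efficiency.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.Set;

// Hypothetical helper: illustrates the closed-pattern definition, not the paper's algorithm.
final class ClosedPatternSketch {

    // Keeps only patterns for which no proper superset has the same support count.
    static List<Set<String>> closedPatterns(Map<Set<String>, Integer> supportByPattern) {
        List<Set<String>> closed = new ArrayList<>();
        for (Map.Entry<Set<String>, Integer> candidate : supportByPattern.entrySet()) {
            boolean isClosed = true;
            for (Map.Entry<Set<String>, Integer> other : supportByPattern.entrySet()) {
                boolean properSuperset = other.getKey().size() > candidate.getKey().size()
                        && other.getKey().containsAll(candidate.getKey());
                if (properSuperset && other.getValue().equals(candidate.getValue())) {
                    isClosed = false; // a longer pattern carries the same support
                    break;
                }
            }
            if (isClosed) {
                closed.add(candidate.getKey());
            }
        }
        return closed;
    }

    // Convenience constructor for a term set.
    static Set<String> terms(String... words) {
        return new HashSet<>(Arrays.asList(words));
    }

    public static void main(String[] args) {
        // Assumed support counts for three frequent patterns.
        Map<Set<String>, Integer> counts = new LinkedHashMap<>();
        counts.put(terms("text"), 3);
        counts.put(terms("mining"), 2);
        counts.put(terms("text", "mining"), 2);

        // {mining} is dropped because {text, mining} has the same support.
        System.out.println(closedPatterns(counts));
    }
}
```

Keeping only closed patterns removes shorter patterns that add no information beyond a longer pattern with the same support.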

ADVANTAGES
In this system, long sequences are given high priority among the evaluated patterns.

In this system, a term's weight is based on the occurrence of the term in long patterns (sequences).

In this system, a term's weight is reduced (a negative weight is assigned) if the term occurs in both negative and positive documents.
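
The three points above can be sketched in Java. The source gives no formulas, so the scheme here is purely an assumption for illustration: each pattern adds its own length to the weight of every term it contains (so longer patterns count for more), and the weight of any term that also appears in negative documents is multiplied by a caller-supplied factor. The class name TermWeightSketch, its methods, the factor, and the sample patterns are all hypothetical.

```java
import java.util.Arrays;
import java.util.HashMap;
import java.util.HashSet;
import java.util.List;
import java.util.Map;
import java.util.Set;

// Hypothetical helper: the weighting scheme is an assumption, not the paper's formula.
final class TermWeightSketch {

    // Assumption: each pattern adds its length to the weight of every distinct term it contains.
    static Map<String, Double> deploy(List<List<String>> positivePatterns) {
        Map<String, Double> weights = new HashMap<>();
        for (List<String> pattern : positivePatterns) {
            for (String term : new HashSet<>(pattern)) {
                weights.merge(term, (double) pattern.size(), Double::sum);
            }
        }
        return weights;
    }

    // Assumption: terms that also occur in negative documents have their weight scaled by 'factor'.
    static Map<String, Double> penalize(Map<String, Double> weights,
                                        Set<String> negativeTerms,
                                        double factor) {
        Map<String, Double> adjusted = new HashMap<>(weights);
        for (String term : negativeTerms) {
            adjusted.computeIfPresent(term, (t, w) -> w * factor);
        }
        return adjusted;
    }

    public static void main(String[] args) {
        // Assumed patterns discovered in positive documents.
        List<List<String>> patterns = Arrays.asList(
                Arrays.asList("text", "mining", "pattern"),
                Arrays.asList("text", "mining"),
                Arrays.asList("pattern"));
        Map<String, Double> weights = deploy(patterns);

        // Assumed terms seen in negative documents; "sport" has no weight and is ignored.
        Set<String> negativeTerms = new HashSet<>(Arrays.asList("mining", "sport"));
        System.out.println(penalize(weights, negativeTerms, 0.5));
    }
}
```

With these inputs, "text" and "mining" each start at 5.0 and "pattern" at 4.0; after the adjustment "mining" drops to 2.5. Passing a negative factor would turn the reduction into a negative weight.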

HARDWARE REQUIREMENTS

System : Pentium IV 2.4 GHz.
Hard Disk : 40 GB.
Monitor : 15 VGA Colour.
Mouse : 2 button (or) 3 button mouse.
RAM : 1 GB.

SOFTWARE REQUIREMENTS
Operating System : Windows XP / Windows 7
Coding Language : Java
