Professional Documents
Culture Documents
Abstract
Sentiment analysis deals with the computational treatment
of opinion, sentiment, and subjectivity in text, has attracted
a great deal of attention. Sentiment analysis has been
widely used across a wide range of domains in recent
years, such as information retrieval, question answering
systems and social network. This paper presents a new
method for improving the semantic knowledge base for
sentiment classification in social web applications. It
comprises the three steps. First, to identify sentiment
terms. Next, to provide the context information from
training corpus and ground this information to lexical
resources such as WordNet. This Work applies to a
transfer learning method called cross-domain sentiment
classification. In Sentiment Analysis, transfer learning can
be applied to transfer sentiment classification from one
domain to another or building a bridge between two
domains. This is achieved by learning the semantic
knowledge base across the different domains. A model
called AS_LDA is used for the sentiment classification.
The performance of the proposed system improves the
accuracy of the Sentiment Classifier to a significant extent.
Key terms: WordNet, Sentiment Analysis, Cross-Domain
sentiment Classification, Transfer Learning.
1. INTRODUCTION
Sentiment analysis is a technique to classify peoples
opinions in product reviews, blogs or social networks.
Large datasets are available on-line today, they can be
numerical or text file and they can be structured, semistructured or non-structured. Approaches and technique to
apply and extract useful information from these data have
been the major focuses of many researchers and
practitioners lately. Many different information retrieval
techniques and tools have been proposed according to
different data types. In addition to data and text mining,
there has seen a growing interest in non-topical text
analysis in recent years. Sentiment analysis is one of them.
Sentiment analysis, also known as opinion mining, is to
identify and extract subjective information in source
materials, which can be positive, neutral, or negative.
www.ijsret.org
109
International Journal of Scientific Research Engineering & Technology (IJSRET), ISSN 2278 0882
Volume 4, Issue 2, February 2015
2. RELATED WORK
Common Sentiment Analysis Task [5] proposed the basic
task of opinion mining is polarity classification and
Agreement detection. Polarity classification occurs when a
piece of text stating an opinion on a single issue is
classified as one of two opposing sentiments. Polarity
classifications also identify pro and con expressions in
online reviews. Agreement detection determines whether a
pair of text documents should receive the same or different
sentiment-related labels.
WORDNET relations [2] proposed WORDNET-AFFECT,
generates synsets that still represent affective concepts. If
the resulting synsets are members of WORDNETAFFECT, then the answer is trivially affirmative. For other
relations such as hyperonymy, entailment, causes, verbgroup it assumed the affective mean and it is necessary to
manually filter the synsets in order to select those affective
concepts.
NLP curves [6] proposed the automated analysis
techniques for extract and manipulate text meanings. A
NLP system must have access to a significant amount of
knowledge about the world and the domain of discourse.
3. PROPOSED SYSTEM
The proposed knowledge base can be used to fix the
existing context-aware approaches use vector space
address the problem of contextual polarity change. This
aims to increase the lexicons coverage and derive
information for subsequent sentiment analysis. We use
WordNet terms and their polarity values to generate a
baseline sentiment lexicon, identify sentiment terms, and
extract context information from training corpus and
ground this information to lexical resources such as
WordNet. This knowledge base is to make as domain
adaptation for cross Domain sentiment classification. It
provides a two-stage framework for cross-domain
sentiment classification. In the first stage they built a
bridge between the source domain and the target domain to
get some most confidently labeled documents in the target
domain. In the second stage they exploited the intrinsic
structure, revealed by these labeled documents to label the
target-domain data.
WordNet is a lexical database for English language that
groups English word into set of synonyms called synset.
WordNet distinguish between nouns, verbs, adjective as
major categories. At Concept level, WordNet which is
given in Figure 1 is used as a knowledge base for deriving
the semantic and lexical relations.
www.ijsret.org
Fig.1 WordNet
110
International Journal of Scientific Research Engineering & Technology (IJSRET), ISSN 2278 0882
Volume 4, Issue 2, February 2015
Domain
Specific
Keywords
Test Data
Word Splitter
Cross
Domain
Sentiment
Classification
Removing
Stop Words
POS Tagger
Feature
Extraction
Preprocessor
Preprocessing
Test
Data
Lexical Analyzer
Splitter
Accuracy
Detection
POS Tagger
Bag of
Words
www.ijsret.org
111
International Journal of Scientific Research Engineering & Technology (IJSRET), ISSN 2278 0882
Volume 4, Issue 2, February 2015
KBTL
SCL
0.8
0.84
0.86
0.88
5. CONCLUSION
Predicted Class
Actual Class
0.82
YES
NO
YES
TP
FN
NO
FP
TN
REFERENCES
[1] A. Das, B. Gambaeck, Sentimantics: conceptual
spaces for lexical sentiment polarity representation with
contextuality|, Proceedings of 3rd Workshop on
Computational Approaches to Subjectivity and Sentiment
www.ijsret.org
112
International Journal of Scientific Research Engineering & Technology (IJSRET), ISSN 2278 0882
Volume 4, Issue 2, February 2015
of
113