I. INTRODUCTION
RELATED WORK
PROPOSED SYSTEM
(-) Disadvantages
A. Data
Twitter is a popular microblogging service that allows users to
post tweets, status messages of up to 140 characters
[Davenport]. These tweets usually carry personal views or
emotions about the subject they mention.
For that reason, this paper uses Twitter as its data
source.
The system collects data from Twitter through its API.
An API (Application Programming Interface) is an interface
provided by a developer so that other applications and
developers can access the application more easily. In essence,
the API serves as a bridge between one application and
another.
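The collection step can be sketched as follows. This is a minimal illustration under stated assumptions, not the system's actual code: `search` is a hypothetical caller-supplied function standing in for the Twitter API call, `fake_search` returns canned data so the sketch runs offline, and the `id`/`text` keys and the de-duplication step are assumptions made here.

```python
from typing import Callable, Dict, Iterable, List

# Hypothetical type: a function that takes a query string and returns tweets
# as dictionaries with at least "id" and "text" keys. In a real system this
# would wrap a call to the Twitter API; it is an assumption of this sketch.
SearchFn = Callable[[str], Iterable[Dict[str, str]]]


def collect_tweets(search: SearchFn, keywords: List[str]) -> Dict[str, List[Dict[str, str]]]:
    """Collect tweets for each keyword, dropping duplicates by tweet id."""
    collected: Dict[str, List[Dict[str, str]]] = {}
    for keyword in keywords:
        seen_ids = set()
        unique = []
        for tweet in search(keyword):
            if tweet["id"] in seen_ids:
                continue  # the same tweet was already returned for this keyword
            seen_ids.add(tweet["id"])
            unique.append(tweet)
        collected[keyword] = unique
    return collected


def fake_search(query: str) -> List[Dict[str, str]]:
    """Stand-in for the API call; returns canned data for illustration only."""
    product = query.split()[0]
    return [
        {"id": "1", "text": f"My {product} is so slow"},
        {"id": "1", "text": f"My {product} is so slow"},  # duplicate on purpose
        {"id": "2", "text": f"{query}? Try this trick"},
    ]


if __name__ == "__main__":
    tweets = collect_tweets(fake_search, ["iphone slow", "samsung slow"])
    for keyword, items in tweets.items():
        print(keyword, len(items))
```

Passing the search function as a parameter keeps the sketch independent of any particular client library; a real implementation would replace `fake_search` with a function that queries the API.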
Note that Twitter's public API provides only
1% or less of its entire traffic, with no control over the
sampling procedure, which is likely insufficient for
accurate analysis of public opinion [5].
MOTIVATION
(+) Advantages
IV.
B. Preprocessing
The text of tweets differs from the text in articles, books,
or even spoken language. It includes many idiosyncratic
uses, such as emoticons, URLs, RT for re-tweet, @ for
user mentions, # for hashtags, and repetitions. It is
necessary to preprocess and normalize the text [6].
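A minimal sketch of such a normalization step is given below. The specific rules (dropping URLs, the RT marker, and user mentions, keeping hashtag words without the # sign, shortening repeated characters to two, lowercasing) are assumptions chosen for illustration; the paper does not specify its exact preprocessing. Emoticons are left untouched in this sketch.

```python
import re

URL_RE = re.compile(r"https?://\S+|www\.\S+")
MENTION_RE = re.compile(r"@\w+:?")      # user mention, with an optional trailing colon
RETWEET_RE = re.compile(r"\bRT\b:?")    # re-tweet marker (uppercase only)
REPEAT_RE = re.compile(r"(.)\1{2,}")    # any character repeated three or more times
SPACE_RE = re.compile(r"\s+")


def preprocess(tweet: str) -> str:
    """Normalize a raw tweet into lowercase text without Twitter-specific markup."""
    text = URL_RE.sub(" ", tweet)        # drop URLs
    text = RETWEET_RE.sub(" ", text)     # drop the RT marker
    text = MENTION_RE.sub(" ", text)     # drop @user mentions
    text = text.replace("#", "")         # keep the hashtag word, drop the sign
    text = REPEAT_RE.sub(r"\1\1", text)  # "sooooo" becomes "soo"
    text = text.lower()
    return SPACE_RE.sub(" ", text).strip()


if __name__ == "__main__":
    print(preprocess("RT @john: My #iPhone is sooooo slow http://t.co/abc"))
    # prints: my iphone is soo slow
```

Repetitions are shortened to two characters rather than one so that an emphasized word such as "soo" stays distinguishable from its plain form.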
C. Sentiment Model
The design of the sentiment model used in our system
was based on the assumption that the opinions expressed
would be highly subjective. Therefore, the data are classified
directly after preprocessing, as shown in Figure 4.
D. Opinion Classification
In opinion classification, tweets are categorized into three
classes: positive, negative, and neutral. For example, consider
the tweets
I like the iphone because bodycase, camera and also not
slow
and
Your iphone slow? This is the trick to make it faster
The first tweet contains "slow" together with "not", so
the system classifies it as positive,
while the second tweet has a question mark after "slow",
so the system classifies it as neutral.
If a tweet contains "slow" with no such word or mark before or after
it, the system classifies it as negative.
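The rules described above can be sketched as a small keyword-based classifier. This is an illustrative reading, not the system's actual implementation: it assumes that "not" must come directly before the keyword, that the question mark must come directly after it, and that tweets without the keyword are left unclassified (returned as `None`).

```python
import re
from typing import Optional

NEGATIONS = {"not"}  # the source only mentions "not"; other words would be an extension


def classify(tweet: str, keyword: str = "slow") -> Optional[str]:
    """Classify a tweet as positive, negative, or neutral around one keyword."""
    # Split into lowercase words and question marks, e.g. "slow?" -> "slow", "?"
    tokens = re.findall(r"[a-z']+|\?", tweet.lower())
    if keyword not in tokens:
        return None  # assumption: tweets without the keyword are not classified
    i = tokens.index(keyword)  # position of the first occurrence
    if i > 0 and tokens[i - 1] in NEGATIONS:
        return "positive"      # "not slow"
    if i + 1 < len(tokens) and tokens[i + 1] == "?":
        return "neutral"       # "slow?"
    return "negative"          # bare "slow"


if __name__ == "__main__":
    print(classify("I like the iphone because bodycase, camera and also not slow"))
    # prints: positive
    print(classify("Your iphone slow? This is the trick to make it faster"))
    # prints: neutral
```

Only the first occurrence of the keyword is examined, which keeps the sketch short; a tweet that mentions the keyword several times would need a rule for combining the individual decisions.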
For better performance, the system should also be able to assess
the level of satisfaction expressed in a tweet, for example a tweet
TABLE IV. TWEET CLASSIFICATION
Keyword        Positive   Negative   Neutral
Iphone Slow    12         84
Samsung Slow   16         61
After classification, the system computes a rating and visualizes the data
in a graph so that users can understand it easily. The formula
used is:
$$\text{Rating} = \frac{\text{Negative Tweet}}{\text{Positive Tweet} + \text{Negative Tweet}} \times 100 \qquad (1)$$
V. RESULT
TOTAL TWEET
Keyword        Total Tweet
Total          182
Iphone Slow    102
Samsung Slow   80
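As a rough illustration, the rating of Equation (1) can be computed from the counts in Table IV. The sketch assumes that Equation (1) is the number of negative tweets divided by the sum of positive and negative tweets, times 100, and that the two surviving values in each row of Table IV are the positive and negative counts, in that order. The printed values are computed here for illustration and are not results reported by the paper.

```python
def rating(positive: int, negative: int) -> float:
    """Share of negative tweets among positive and negative ones, in percent."""
    total = positive + negative
    if total == 0:
        raise ValueError("at least one positive or negative tweet is required")
    return negative / total * 100


# (positive, negative) counts as read from Table IV; the column order is an assumption
counts = {
    "Iphone Slow": (12, 84),
    "Samsung Slow": (16, 61),
}

if __name__ == "__main__":
    for keyword, (pos, neg) in counts.items():
        print(f"{keyword}: {rating(pos, neg):.1f}")
    # prints:
    # Iphone Slow: 87.5
    # Samsung Slow: 79.2
```

Neutral tweets do not enter the formula, so under this reading the rating reflects only the balance between positive and negative tweets.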
VI. CONCLUSION
[2]
W. Duan, B. Gu, and A.B. Whinston, The Dynamics of Online Word-of-Mouth and Product Sales - An Empirical Investigation of the Movie
Industry, J. Retailing, vol. 84, no. 2, 2008, pp. 233-242.
W. Zhou and Y. Liu, Online Product Rating Manipulation and Market
Performance, May 2015.
[3]
[4]
[5]
[6]
[7]
[8]
[9]