1PG Student, 2Professor,
Department of Computer Science and Engineering,
Kongunadu College of Engineering and Technology, Anna University, Tamilnadu.
ABSTRACT
Big data is an enticing technology that provides large
storage capacity and, by using a large number of
computers together, enables users to deploy applications
in a shared environment. Personal Health Records (PHR)
are a user-friendly online solution that helps patients
manage their health details. One of the major problems in
moving to cloud computing is its security and privacy
issues. The encryption techniques currently in use are not
suitable for data that is processed and shared frequently,
and anonymizing big data and managing anonymized
data sets remain challenges for securing patient details.
Thus, a scalable two-phase bottom-up generalization
(BUG) approach is proposed to anonymize large-scale
data sets using the dynamic MapReduce framework on
cloud. In both phases of the approach, a group of
innovative dynamic MapReduce algorithms is designed to
concretely accomplish the specialization computation in a
highly scalable way and to ensure efficiency.
Keywords: Electronic Health Records, BUG approach,
HDFS, Dynamic MapReduce
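The flow outlined in the abstract (partition the data, process each partition, merge the partial results, and generalize until k-anonymity holds) can be illustrated with a minimal Python sketch. This is not the paper's algorithm: MapReduce is only simulated in-process, the two hierarchies (AGE, CITY), the sample records, and all function names are hypothetical, and the rule for picking which attribute to generalize (the one with the most distinct values) is a simplifying assumption that stands in for a proper information-loss metric.

```python
from collections import Counter

# Hypothetical generalization hierarchies (child -> parent), one per quasi-identifier.
AGE = {"23": "20-29", "27": "20-29", "34": "30-39", "38": "30-39",
       "20-29": "Any", "30-39": "Any"}
CITY = {"Erode": "Tamilnadu", "Salem": "Tamilnadu",
        "Tamilnadu": "India"}
HIERARCHIES = [AGE, CITY]


def partition(records, n_parts):
    """Split the records into n_parts roughly equal partitions."""
    return [records[i::n_parts] for i in range(n_parts)]


def map_count(part):
    """Map step: count quasi-identifier combinations within one partition."""
    return Counter(part)


def reduce_counts(partials):
    """Reduce step: merge the per-partition counts into global counts."""
    total = Counter()
    for counts in partials:
        total.update(counts)
    return total


def is_k_anonymous(counts, k):
    """Every quasi-identifier combination must occur at least k times."""
    return all(n >= k for n in counts.values())


def generalize(records, attr):
    """Replace attribute attr of every record by its parent value, if it has one."""
    h = HIERARCHIES[attr]
    return [r[:attr] + (h.get(r[attr], r[attr]),) + r[attr + 1:] for r in records]


def anonymize(records, k, n_parts=2):
    """Generalize bottom-up, one level at a time, until k-anonymity holds."""
    while True:
        counts = reduce_counts(map_count(p) for p in partition(records, n_parts))
        if is_k_anonymous(counts, k):
            return records
        # Attributes that still have at least one value with a parent.
        candidates = [a for a in range(len(HIERARCHIES))
                      if any(r[a] in HIERARCHIES[a] for r in records)]
        if not candidates:
            raise ValueError("k-anonymity not reachable with these hierarchies")
        # Simplifying assumption: generalize the attribute with most distinct values.
        attr = max(candidates, key=lambda a: len({r[a] for r in records}))
        records = generalize(records, attr)


records = [("23", "Erode"), ("27", "Erode"), ("34", "Salem"), ("38", "Salem")]
result = anonymize(records, k=2)
print(result)
```

With the sample records above and k = 2, a single generalization of the age attribute is enough: the ages collapse into the bands 20-29 and 30-39, and each (age band, city) combination then occurs twice. If k exceeds the number of records, the sketch raises an error once every value has reached the top of its hierarchy.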
I. INTRODUCTION
The primary goal of big data analytics is to help
companies make more informed business decisions by
enabling data scientists, predictive modellers and other
analytics professionals to analyze large volumes of
transaction data, as well as other forms of data that may
be untapped by conventional business intelligence (BI)
programs. That could include Web server logs and
Internet clickstream data, social media content and social
network activity reports, text from customer emails and
survey responses, mobile-phone call detail records and
machine data captured by sensors connected to the
Internet of Things. Some people exclusively associate big
data with semi-structured and unstructured data of that
sort, but consulting firms like Gartner Inc. and Forrester
Research Inc. also
II. RELATED WORK AND PROBLEM ANALYSIS
www.ijsret.org
405
International Journal of Scientific Research Engineering & Technology (IJSRET), ISSN 2278-0882
Volume 4, Issue 4, April 2015
V. CONCLUSION
The scalability problem of data anonymization for big
data applications on cloud has been studied using Bottom
Up Generalization (BUG), and a scalable Advanced
Bottom Up Generalization approach has been proposed.
In the proposed BUG, the data is first partitioned and the
driver is executed on the partitions to produce
intermediate results. Then, the intermediate results are
merged and generalization is applied to produce
anonymized data without violating k-anonymity. The
MapReduce framework is effectively applied on cloud for
data anonymization, and the results show that the
scalability and efficiency of centralized BUG are
improved significantly over existing approaches. In future
REFERENCES
Journal paper:
[1] Chaudhuri.S (2012), What Next?: A Half-Dozen
Data Management Research Goals for Big Data and
the Cloud, Proc. 31st Symp. Principles of
Database Systems (PODS '12), pp. 1-4.
[2] Dean and Ghemawat (2008), MapReduce:
Simplified Data Processing on Large Clusters,
Communications of the ACM, Volume 51, Number 1,
pp. 107-113.
[3] HariKumar.R, UmaMaheswari.P (2014), Literature
Survey on Big Data and Preserving Privacy for the
Big Data in Cloud, International Journal of Technical
Research and Applications, e-ISSN: 2320-8163,
www.ijtra.com, Volume 2, Issue 6, pp. 128-133.
[4] Iwuchukwu and Naughton (2007), K-Anonymization
as Spatial Indexing: Toward Scalable and Incremental
Anonymization, Proceedings of 33rd International
Conference on Very Large Data Bases (VLDB '07),
pp. 746-757.
[5] Monali S. Bachhav, Manekar (2014), A Survey
on Two-Phase Top-Down Specialization for Data
Anonymization using Map Reduce on Cloud,
International Journal of Computer Applications
(0975-8887).
[6] Saranya.M, SenthamilSelvi.R (2015), A Survey on
Privacy Preservation for Anonymizing Data,
International Journal of Emerging Engineering
Research and Technology, Volume 3, Issue 1,
pp. 17-21.
[7] Sherifa.G (2015), Privacy-Preserving Data
Publishing for Two-Phase TDS approach using
MapReduce on Cloud, International Journal of
Advanced Research in Computer and
Communication Engineering, Volume 4, Issue 2.
[8] Sreedhar.R, Umamaheshwari.D (2014), Big-Data
Processing With Privacy Preserving Map-Reduce
Cloud, International Journal of Innovative
Research in Science, Engineering and Technology,
Volume 3, Special Issue 1.