You are on page 1of 5

e-ISSN (O): 2348-4470

Scientific Journal of Impact Factor (SJIF): 4.14


p-ISSN (P): 2348-6406

International Journal of Advance Engineering and Research


Development
Volume 3, Issue 12, December -2016

Real-Time Big Data Analytical Architecture for Remote Sensing Application


Poonam Shinde1, Rupali Mhase2, Sneha Pawar3, Nayan Soudagar4, Prof. Shubhangi vairagar5

Computer Engineering, Siddhant College of Engineering, Pune

Abstract The assets of remote senses digital world daily generate Big volume of period of time information (mainly
remarked the term Big Data), wherever insight data incorporates a potential significance if collected and mass
effectively. In todays era, there's an excellent deal additional to period of time remote sensing Big information than
it looks initially, Associate in Nursingd extracting the helpful data in an economical manner leads a system toward a
significant procedure challenges, like to investigate, aggregate, and store, wherever information area unit remotely
collected. Keeping visible the on top of mentioned factors, there's a desire for planning a system design that
welcomes each real-time, yet as offline processing. Therefore, during this paper, we tend to propose period of
time Big information analytical design for remote sensing satellite application. The planned design contains 3 main
units, like 1) remote sensing Big information acquisition unit (RSDU); 2) processing unit (DPU); and
3) information analysis call unit (DADU). First, RSDU acquires information from the satellite and sends this
information to the bottom Station, wherever initial process takes place. Second, DPU plays an important role in
design for economical process of period of time Big information by providing filtration, load leveling, and
multiprocessing. Third, DADU is that the higher layer unit of the planned design, that is answerable for compilation,
storage of the results, and generation of call supported the results received from DPU. The planned design has the
potential of dividing, load leveling, and multiprocessing of solely helpful information. Thus, it leads
to expeditiously analyzing period of time remote sensing Big information exploitation earth observatory system. What is
more, the planned design has the potential of storing incoming information to perform offline analysis on for the most
part hold on dumps, once needed. Finally, an in depth analysis of remotely detected earth
observatory Big information for land and ocean space area utit provided exploitation Hadoop and ocean space,
additionally, varied algorithms area unit planned for every level of RSDU, DPU, Associate in Nursingd DADU to
find land yet as ocean space to elaborate the operating of an design.

Keywords- Big Data, data analysis decision unit (DADU), data processing unit (DPU), land and sea area, offline, real-
time.

I. INTRODUCTION

Big data could be a term wont to describe a set of knowledge sets with the subsequent 3 characteristics:
Volume- massive amounts of knowledge generated.
Velocity-Frequency and speed of that information square measure generated, captured and shared
Variety-Diversity of knowledge sorts and formats from varied sources.

The size and quality of Big data makes it tough to use ancient direction and processing tools. Big data is being created
in abundant shorter cycles from hours to milliseconds. there's additionally a trend a foot to make larger information bases
by combining smaller information sets in order that data correlations are often discovered.

Big data has become the new frontier of data management given the number of knowledge todays systems square
measure generating and intense. it's driven the necessity for technological infrastructure and tools which will capture,
store, analyse and visualize large amounts of disparate structured and unstructured data. These data square measure being
generated at increasing volumes from data intensive technologies as well as, however not restricted to, the
employment of the net for activities like accesses to data, social networking, mobile computing and commerce. firms and
governments have begun to acknowledge that there square measure fallow opportunities to enhance their
enterprises which will be discovered from these data.

A. Big Data Analytics

Analytics once applied within the context of Big data is that the method of examining Big amounts of knowledge, from a
various variety of knowledge sources and in numerous formats, to deliver insights which will alter choices in real
or close to real time. Big data analytical approaches are often utilized to acknowledge inherent patterns, correlations and
anomalies which might be discovered as a results of desegregation large amounts of information from totally
different data sets. Big data analytics needs the employment of recent frameworks, technologies and processes to
@IJAERD-2016, All rights Reserved 447
International Journal of Advance Engineering and Research Development (IJAERD)
Volume 3, Issue 12, December -2016, e-ISSN: 2348 - 4470, print-ISSN: 2348-6406

manage it. Nonetheless its arrival within the enterprise software package area has created some confusion as business
leaders attempt to perceive the variations between it and ancient data deposition (DW) and business intelligence (BI)
tools. There square measure necessary distinctions and enough differentiating price between BDA and DW/BI systems
that build BDA distinctive. Gartner defines a knowledge warehouse as a storage design designed to carry data extracted
from dealing systems, operational data stores and external sources. The warehouse then combines that data in Associate
in Nursing mixture, outline type appropriate for enterprise-wide data analysis and coverage for predefined
business wants.

B. Big Data Computing

The rising importance of big-data computing stems from advances in many alternative technologies. Sensors: Digital
data square measure being generated by many alternative sources, as well as digital imagers (telescopes, video
cameras, MRI machines), chemical and biological sensors), and even the uncountable people and organizations
generating sites. pc networks: information from the numerous totally different sources are often collected
into Big data sets via localized detector networks, moreover because the net.

C. Big Data Analytics In Cloud Computing

Most company enterprises face vital challenges in absolutely investment their information. Frequently,
data is latched away in multiple databases and process systems throughout the enterprise, and also the
queries customers Associate in Nursingd analysts raise need an mixture read of all information, generally
totaling many terabytes. Cerri et al projected Knowledge within the cloud in situ of data within the cloud to
support cooperative tasks that square measure computationally intensive and facilitate distributed, heterogeneous
data. This can be termed as Utility Computing derived from needed information in and out of Cloud the utilities like
electricity, gas that we tend to solely buy what we tend to use from a shared resource. With the growing interest in cloud,
analytics could be a difficult task. In general, Business Intelligence applications like image process, net searches,
understanding customers and their shopping for habits, provide chains and ranking and Bio-informatics
(e.g. cistron structure prediction) square measure information intensive applications. Cloudare often an ideal match for
handling such analytical services. for instance, Googles Map Reduce are often leveraged for analytics because
it showing intelligence chunks the info into smaller storage units and distributes the computation among low-
priced process units.

D. Moving big data Into Cloud

Big data could be a data analysis methodology enabled by recent advances in technologies and design.
However, Big data entails a large commitment of hardware and process resources, creating adoption prices of
Big data technology prohibitory to tiny and medium sized businesses. Cloud computing offers the promise of
Big data implementation to tiny and medium sized businesses. Big processing is performed through a programming
paradigm referred to as MapReduce. Typically, implementation of the MapReduce
paradigm needs networked connected storage and data processing. The computing wants of MapReduce
programming square measure usually on the far side what tiny and medium sized business square measure able
to commit.

II. LITERATURE SURVEY

A. Big Data and Cloud Computing: Current State and Future Opportunities_
Authors: Divyakant Agrawal Sudipto Das Amr El Abbadi

Description:
Scalable management systems (DBMS)both for update intensive application workloads in addition as call support
systems for descriptive and deep analyticsare a essential a part of the cloud infrastructure and play a vital role in
making certain the graceful transition of applications from the standard enterprise infrastructures to next generation cloud
infrastructures. tho' ascendable information management has been a vision for quite 3 decadesand far analysis has
focused on giant scale information management in ancient enterprise setting, cloud computing brings its own set of novel
challenges that has got to be self-addressed to confirm the success of information management solutions within
the cloud setting. This tutorial presents Associate in Nursing organized image of the challenges sweet-faced by
application developers and DBMS designers in developing and deploying net scale applications. Our background study
encompasses each categories of systems: (i) for supporting update serious applic actions, and (ii) for ad-hoc analytics
and call support. we tend to then specialize in providing Associate in Nursing in-depth analysis of systems for supporting
update intensive web-applications and supply a survey of the state-of-theart during this domain. we tend to crystallize the

@IJAERD-2016, All rights Reserved 448


International Journal of Advance Engineering and Research Development (IJAERD)
Volume 3, Issue 12, December -2016, e-ISSN: 2348 - 4470, print-ISSN: 2348-6406

look selections created by some successful systems giant scale management systems, analyze the applying demands and
access patterns, and enumerate the desiderata for a cloud-bound DBMS.

B. Remote Sensing Processing: From Multicore to GPU


Authors: Emmanuel Christophe, Member, Julien Michel, and Jordi Inglada.

Description:
As the quantity of knowledge and therefore the quality of the process rise, the demand for process power in remote
sensing applications is increasing. The process speed may be a important side to modify a productive interaction between
the human operator and therefore the machine so as to realize ever a lot of advanced tasks satisfactorily.
Graphic process units (GPU) ar sensible candidates to hurry up some tasks. With the recent developments, programing
these devices became terribly straightforward. However, one supply of quality is on the frontier of this hardware: the way
to handle a picture that doesn't have a convenient size as an influence of two, the way to handle a picture that'stoo huge to
suit the GPU memory? This paper presents a framework that has established to
be economical with normalimplementations of image process algorithms and it's incontestable that
it conjointly allows a fast development of GPUdiversifications. many cases from the only to the a lot
of advanced ar elaborated and illustrate speedups of up to four hundred times.

C. A Big Data Architecture for Large Scale Security Monitoring


Authors: Samuel Marchal, Xiuyan Jiang, Radu State, Thomas Engel

Description:
Network traffic could be a made supply of data for security observation. but the increasing volume of information to treat
raises problems, rendering holistic analysis of network traffic troublesome. during this paper we tend to propose an
answer to deal with the tremendous quantity of information to analyse for security observation views. we tend
tointroduce associate design dedicated to security observation of native enterprise networks. the appliance domain of
such a system is especially network intrusion detection and interference, however are
often used also for rhetoricalanalysis. This design integrates 2 systems, one dedicated
to ascendible distributed information storage and managementand therefore the different dedicated
to information exploitation. DNS data, NetFlow records, communications protocoltraffic and Protea
cynaroides information square measure strip-mined and related to in an exceedingly distributed system that leverages
state of the art massive information resolution. information correlation schemes square measureplanned and their
performance square measure evaluated against many well-known massive information framework as well as Hadoop and
Spark.

III. SYSTEM ANALYSIS


A. Existing System

Most recently designed sensors based techniques used in the earth and the planetary observatory system are generating
continuous stream of data. Further, majority of work have been done in the various fields of remotely sensor satellite
image information, for example, change detection, gradient-based edge detection, region similarity based edge detection,
and intensity gradient technique for efficient intraprediction.

B. Disadvantages of Existing System:


1. Consequences of transformation of remotely sensed data to the scientific understanding are a critical task.
2. Normally, the data collected from remote areas are not in a format ready for analysis.
3. In remote access networks, where the data source such as sensors can produce an overwhelming amount of raw
data.

C. Proposed System:

The present the high speed continuous stream of data or the high volume offline data to Big Data, which is leading us
to a new world of challenges. This paper presents a remote sensing Big Data analytical architecture, which is used to
analyze real time, or offline data. At first, the data are remotely preprocessed, which is then readable by the machines.
Furthermore, this needful information is transmitted to the Earth Base Station for further data processing. Earth Base
Station performs two types of processing, such as processing of real-time and offline data. In case of the o ffline data, the
data are transmitted to offline data-storage device. The incorporation of offline data-storage device helps in later usage
of the data, whereas the real-time information is directly send or transmitted to the filtration and load balancer server,
where filtration algorithm is employed, which extracts the useful information from the Big Data.
Whereas, the load balancer balances the processing power by equal distribution of the real-time data to the servers.
The filtration and load-balancing server not only filters and balances load, but it is also used to increase the system

@IJAERD-2016, All rights Reserved 449


International Journal of Advance Engineering and Research Development (IJAERD)
Volume 3, Issue 12, December -2016, e-ISSN: 2348 - 4470, print-ISSN: 2348-6406

efficiency. The introduced architecture and the algorithms are implemented in Hadoop using MapReduce programming
by applying remote sensing earth observatory data. The introduced architecture is composed of three major units, such as
1) RSDU; 2) DPU; and 3) DADU. These units implement algorithms for each level of the architecture depending on the
required analysis.

Fig. Flow of Proposed System

D. Advantages of Proposed System:

1. With data acquisition, in which much of the data are of no interest that can be filtered or compressed by orders
of magnitude. With a view to using such filters, they do not discard useful information.

2. With data extraction, which drags out the useful information from the underlying sources and delivers it in a
structured formation suitable for analysis. For instance, the data set is reduced to single-class label to facilitate
analysis, even though the first thing that we used to think about Big Data as always describing the fact.

3. The incorporation of offline data-storage device helps in later usage of the data,

4. The load balancer balances the processing power by equal distribution of the real-time data to the servers.

@IJAERD-2016, All rights Reserved 450


International Journal of Advance Engineering and Research Development (IJAERD)
Volume 3, Issue 12, December -2016, e-ISSN: 2348 - 4470, print-ISSN: 2348-6406

IV. CONCLUSION AND FUTURE SCOPE

In this paper, we introduced architecture for real-time Big Data analysis for the remote sensing application. The
suggested the architecture efficiently and analyzed the real-time and offline remote sensing Big Data for decision-
making. The proposed architecture is designed of three major units, such as 1) RSDU; 2) DPU; and 3) DADU. These
units implement algorithms for each stage of architecture depending on the required analysis. The architecture of the real-
time Big is generic (application independent) that is used for any type of remote sensing Big Data analysis. Afterwords,
the capabilities of the parallel processing, filtering and dividing of only useful information are performed by discarding
all other extra data. These processes make a better selection for the real-time remote sensing Big Data analysis. The
algorithms proposed in this paper for each unit and subunits are used to analyze remote sensing data sets, which helps in
the better understanding of land and the sea area. The architecture proposed welcomes researchers and organizations for
any type of remote sensory Big Data analysis by developing algorithms for each level of the architecture depending on
their analysis requirement.
For future work, we are planning to extend the proposed architecture to make it compatible for Big Data analysis for all
applications, e.g., sensors and social networking. We are also planning to use the proposed architecture to perform
complex analysis on earth observatory data for decision making at realtime, such as earthquake prediction, Tsunami
prediction, fire detection, etc.

ACKNOWLEDGMENT

We might want to thank the analysts and also distributers for making their assets accessible. We additionally appreciative
to commentator for their significant recommendations furthermore thank the school powers for giving the obliged base
and backing.
REFERENCES

[1] M Mazhar, U Rathore, Anand Paul, A Ahmad, Bo-Wei Chen, Bormin Huang, and Wen Ji, Real-Time Big Data
Analytical Architecture for Remote Sensing Application, IEEE JOURNAL OF SELECTED TOPICS IN APPLIED
EARTH OBSERVATIONS AND REMOTE SENSING, 2015.

[2] D. Agrawal, S. Das, and A. E. Abbadi, Big Data and cloud computing: Current state and future opportunities, in
Proc. Int. Conf. Extending Database Technol. (EDBT), 2011, pp. 530533.

[3] S. Marchal, X. Jiang, R. State, and T. Engel, A Big Data architecture for large scale security monitoring, in Proc.
IEEE Int. Congr. Big Data, 2014, pp. 5663.

[4] X. Yi, F. Liu, J. Liu, and H. Jin, Building a network highway for Big Data: Architecture and challenges, IEEE
Netw., vol. 28, no. 4, pp. 513, Jul./Aug. 2014.

[5] E. Christophe, J. Michel, and J. Inglada, Remote sensing processing: From multicore to GPU, IEEE J. Sel. Topics
Appl. Earth Observ. Remote Sens., vol. 4, no. 3, pp. 643652, Aug. 2011.

[6] Y.Wang et al., Using a remote sensing driven model to analyze effect of land use on soil moisture in the Weihe
River Basin, China, IEEE J. Sel. Topics Appl. Earth Observ. Remote Sens., vol. 7, no. 9, pp. 38923902, Sep. 2014.

[7] K. Michael and K. W. Miller, Big Data: New opportunities and new challenges [guest editors introduction], IEEE
Comput., vol. 46, no. 6, pp. 2224, Jun. 2013.

[8] C. Eaton, D. Deroos, T. Deutsch, G. Lapis, and P. C. Zikopoulos, Understanding Big Data: Analytics for Enterprise
Class Hadoop and Streaming Data. New York, NY, USA: Mc Graw-Hill, 2012.

[9] R. D. Schneider, Hadoop for Dummies Special Edition. Hoboken, NJ, USA: Wiley, 2012.

@IJAERD-2016, All rights Reserved 451

You might also like