Professional Documents
Culture Documents
1302810019
Abstract
Big Data may well be the Next Big Thing in the IT world. Big data burst upon
the scene in the first decade of the 21st century. The first organizations to
embrace it were online and startup firms. Firms like Google, eBay, LinkedIn, and
Facebook were built around big data from the beginning. Like many new
information technologies, big data can bring about dramatic cost reductions,
substantial improvements in the time required to perform a computing task, or
new product and service offerings. Big Data is similar to small data, but bigger
in size but having data bigger it requires different approaches: Techniques, tools
and architecture an aim to solve new problems or old problems in a better way
Big Data generates value from the storage and processing of very large quantities
of digital information that cannot be analyzed with traditional computing
techniques.
Table of Contents
Certificate
Acknowledgement ... 1
Abstract ...2
Table of Contents...3
Lists of Figures..5
Chapter 1 : Introduction
10
10
3.1
11
3.2
Chapter 4 : 3d formats ,handling and reslicing
4.1
12
12
can be used
3D formats that 3D-DOCTOR support
14
15
4.2
4.3
4.3.1
15
16
4.3.2
16
4.3.3
Chapter 5 : 3d surface rendering
17
17
5.1
Creating 3D surface model from images
18
5.1.1
5.1.2
19
20
20
7.1
3D Model Examples
3D Volume Calculation,
Measurements and Quantitative
Analysis
Chapter 8 : 3d measurements
22
Object Measurements:
22
8.1
3D models
THE BASICS
24
25
9.1
Volume Rendering:
25
Interactive Segmentation
Auto Segmentation
25
9.1.1
9.1.2
26
9.2
27
9.3
28
9.4
28
9.5
29
9.6
30
9.6.1
30
9.6.2
30
9.6.3
30
9.7
31
9.8
Chapter 10 : advance 3d image processing
10.1 Virtual doctor trains patients in
32
32
3D
10.1.
32
1
10.1.2
32
33
34
3D DOCTOR
34
12.1.
34
12.1.2
34
1
12.
35
2
Chapter 13 : Conclusion
36
Chapter 14 : References
37
List of Figures
Figure No.
Title
Page No.
3.1
4.1
4.3
3 Vs Characteristics..........
Brainstorming Big Data Architecture......
Processing Big Data...
10
14
16
5.1
17
6.1
19
7.1
20
7.2
21
CHAPTER 1
INTRODUCTION TO BIG DATA
Big data is a broad term for data sets so large or complex that
traditional data processing applications are inadequate. Challenges
include analysis, capture, data curation, search, sharing, storage,
transfer, visualization, and information privacy. The term often refers
simply to the use of predictive analytics or other certain advanced
methods to extract value from data, and seldom to a particular size of
data set. Accuracy in big data may lead to more confident decision
making. And better decisions can mean greater operational efficiency,
cost reductions and reduced risk.
Analysis of data sets can find new correlations, to "spot business
trends, prevent diseases, combat crime and so on." Scientists,
practitioners of media and advertising and governments alike regularly
meet difficulties with large data sets in areas including Internet search,
finance and business informatics. Scientists encounter limitations in eScience work, including meteorology, genomics, connectomics,
complex physics simulations, and biological and environmental
research.
Data sets grow in size in part because they are increasingly being
gathered by cheap and numerous information-sensing mobile devices,
aerial (remote sensing), software logs, cameras, microphones, radiofrequency identification (RFID) readers, and wireless sensor networks.
7
CHAPTER 2
WHAT IS BIG DATA?
Big data usually includes data sets with sizes beyond the ability of
commonly used software tools to capture, curate, manage, and process
data within a tolerable elapsed time. Big data "size" is a constantly
moving target, as of 2012 ranging from a few dozen terabytes to many
petabytes of data. Big data is a set of techniques and technologies that
require new forms of integration to uncover large hidden values from
large datasets that are diverse, complex, and of a massive scale.
In a 2001 research report and related lectures, META Group (now
Gartner) analyst Doug Laney defined data growth challenges and
opportunities as being three-dimensional, i.e. increasing volume
(amount of data), velocity (speed of data in and out), and variety (range
of data types and sources). Gartner, and now much of the industry,
continue to use this "3Vs" model for describing big data. In 2012,
Gartner updated its definition as follows: "Big data is high volume,
high velocity, and/or high variety information assets that require new
forms of processing to enable enhanced decision making, insight
discovery and process optimization." Additionally, a new V "Veracity"
is added by some organizations to describe it.
If Gartners definition (the 3Vs) is still widely used, the growing
maturity of the concept fosters a more sound difference between big
data and Business Intelligence, regarding data and their use:
Business Intelligence uses descriptive statistics with data with
high information density to measure things, detect trends etc.;
10
CHAPTER 3
CHARACTERSTICS OF BIG DATA
3 VS
Characteristi
cs
VOLUME
VELOCITY
VARIETY
Figure : 3.1
Variety - The next aspect of Big Data is its variety. This means
that the category to which Big Data belongs to is also a very
essential fact that needs to be known by the data analysts. This
11
helps the people, who are closely analyzing the data and are
associated with it, to effectively use the data to their advantage
and thus upholding the importance of the Big Data.
Veracity - The quality of the data being captured can vary greatly.
Accuracy of analysis depends on the veracity of the source data.
12
13
CHAPTER 4
ARCHITECTURE
4.1 In 2000, Seisint Inc. developed C++ based distributed file sharing
framework for data storage and querying. Structured, semi-structured
and/or unstructured data is stored and distributed across multiple
servers. Querying of data is done by modified C++ called ECL which
uses apply scheme on read method to create structure of stored data
during time of query. In 2004 LexisNexis acquired Seisint Inc. and
2008 acquired ChoicePoint, Inc. and their high speed parallel
processing platform. The two platforms were merged into HPCC
Systems and in 2011 was open sourced under Apache v2.0 License.
Currently HPCC and Quantcast File Systemare the only publicly
available platforms capable of analyzing multiple exabytes of data.
In 2004, Google published a paper on a process called MapReduce
that used such an architecture. The MapReduce framework provides
a parallel processing model and associated implementation to process
huge amounts of data. With MapReduce, queries are split and
distributed across parallel nodes and processed in parallel (the Map
step). The results are then gathered and delivered (the Reduce step).
The framework was very successful, so others wanted to replicate the
algorithm. Therefore, an implementation of the MapReduce
framework was adopted by an Apache open source project named
Hadoop.
MIKE2.0 is an open approach to information management that
acknowledges the need for revisions due to big data implications in
an article titled "Big Data Solution Offering".
14
Figure: 4.1
15
models:
key
value,
graph,
document,
column-family
Hadoop Distributed File System
HBase
Hive
characteristics
Moving code to data
Implementing polyglot data store solutions
Aligning business goals to the appropriate data store.
4.3.3 PROCESSING
Integrating disparate data stores
Mapping data to the programming framework
Connecting and extracting data from storage
Transforming data for processing
Subdividing data in preparation for Hadoop MapReduce
Employing Hadoop MapReduce
Creating the components of Hadoop MapReduce jobs
Distributing data processing across server farms
Executing Hadoop MapReduce jobs
Monitoring the progress of job flows
17
Figure: 4.3
CHAPTER 5
WHY BIG DATA?
5.1 Standard table for Big Data size
18
Figure: 5.1
IBM claims 90% of todays stored data was generated in just the
last two years.
CHAPTER 6
HOW BIG DATA IS DIFFERENT?
1) Automatically generated by a machine (e.g. Sensor embedded in an
20
engine)
2) Typically an entirely new source of data (e.g. Use of the internet)
3) Not designed to be friendly (e.g. Text streams)
4) May not have much values
Need to focus on the important part
Figure: 6.1
CHAPTER 7
BIG DATA SOURCES
Users
21
Large and
Growing files
Applicati
on
Sensors
Systems
Figure: 7.1
Competitive advantage
Better business decisions: strategic and operational
Effective marketing, customer satisfaction, increased revenue
Figure: 7.2
CHAPTER 8
TOOLS USED IN BIG DATA
8.1 TOOLS
23
24
CHAPTER 9
APPLICATIONS
25
9.1 Government
26
31
CHAPTER 10
33
34
CHAPTER 11
BENEFITS OF BIG DATA
Real-time big data isnt just a process for storing petabytes or
exabytes of data in a data warehouse, its about the ability to make
better decisions and take meaningful actions at the right time.
Fast forward to the present and technologies like Hadoop give you
the scale and flexibility to store data before you know how you are
going to process it.
Technologies such as MapReduce, Hive and Impala enable you to
run queries without changing the data structures underneath.
Our newest research finds that organizations are using big data to
target customer-centric outcomes, tap into internal data and build a
better information ecosystem.
Big Data is already an important part of the $64 billion database
and data analytics market.
And the Internet boom of the 1990s, and the social media
explosion of today.
35
CHAPTER 12
BIG DATA IMPACTS & FUTURE
Gaining attraction
Huge market opportunities for IT services (82.9% of revenues)
and analytics firms (17.1 % ).
Current market size is $200 million. By 2015 $1 billion.
The opportunity for Indian service providers lies in offering
services around Big Data implementation and analytics for
global multinationals.
36
37
CHAPTER 13
CONCLUSION
The availability of Big Data, low-cost commodity hardware, and
new information management and analytic software have produced a
unique moment in the history of data analysis. The convergence of
these trends means that we have the capabilities required to analyze
astonishing data sets quickly and cost-effectively for the first time in
history. These capabilities are neither theoretical nor trivial. They
represent a genuine leap forward and a clear opportunity to realize
enormous gains in terms of efficiency, productivity, revenue, and
profitability.
The Age of Big Data is here, and these are truly
revolutionary times if both business and technology
professionals continue to work together and deliver on
the promise.
38
CHAPTER 14
REFERENCES
www.google.com
www.wikipedia.com
www.studymafia.org
39