Professional Documents
Culture Documents
productivity
Introduction
Data has become a torrent flowing into every area of
the global economy
Companies churn out a burgeoning volume of
transactional data, capturing trillions of bytes of
information about their customers, suppliers, and
operations
Social media sites, smart phones, and other consumer
devices including PCs and laptops have allowed
billions of individuals around the world to contribute to
the amount of big data available
Each second of high-definition video, for example,
generates more than 2,000 times as many bytes as
What do we mean by
"big data"?
Big data has been the buzz in public-sector circles for just a few years now, but its roots run
deep.
1983
IBM releases DB2, its latest relational database management system using structure query
language (both developed in the 1970s) that would become a mainstay in government.
1985
Object-oriented programming (OOP) languages, such as Eiffel, start to catch on. Although
OOP dates to the 1960s, it would over the next decade become the dominant programming
language.
1990
Archie, the first tool used for searching on the Internet, is created.
1991
The World Wide Web, using Hyper Text Transfer Protocol (HTTP) and the Hyper Text Markup
Language (HTML), appears as a publicly available service for sharing information.
95
Sun releases the Java platform, with the Java language first invented in
97
A paper on visualization is published which discusses the challenges of
with data sets too large for the computing resources at hand Big data
98
Carlo Strozzi develops an open-source relational database and calls it N
Google is founded.
01
Tim Berners-Lee, inventor of the World Wide Web, coins the term Sem
Web, a dream for machine-to-machine interactions in which compu
become capable of analyzing all the data on the Web.Wikipedia is lau
2002
2003
The amount of digital information created by computers
and other data
systems in this one year surpasses the amount of
information created in all
of human history prior to 2003, according to IDC and EMC
studies.
2005
Apache Hadoop, destined to become a foundation of
government big data efforts, is created.
2008
The number of devices connected to the Internet exceeds
the worlds
population.
1
IBM's Watson scans and analyzes 4 terabytes (200
million pages) of data in seconds to defeat two
human players on Jeopardy! Work begins in UnQL,a query language
for NoSQL databases.
2
The Obama administration announces the Big Data Research and Deve
Initiative, consisting of 84 programs in six departments. The National S
Foundation publishes Core Techniques and Technologies for Advancing
Big Data Science & Engineering.IDC and EMC estimate that 2.8 zettab
data will be created in 2012 .The report predicts that the digital world w
2020 hold 40 zettabytes.
Creating transparency
Policing
Premium costing
HR
Suspect tracking
Sport
Identifying leavers
Talent spotting
Gambling
Odds calculator
Crowd sourcing
Mobile record retrieval