An Introduction
DATA
What is Data?
Information in raw or unorganized form (such as letters, numbers, or symbols) that refers to, or represents, conditions, ideas, or objects.
Distinct pieces of information, usually formatted in a special way. A collection of facts, such as values or measurements.
Data can exist in a variety of forms -- as numbers or text on pieces of paper, as bits and bytes stored in electronic memory, or as facts stored in a person's mind.
A Few Examples
A Word file, an Excel sheet, an image, an e-mail, a Facebook status, a tweet, a YouTube video; the list is endless.
EVOLUTION OF COMPUTERS
1960s Mainframe Computers
Took up a large space, often an entire room. All data centralized and stored at a single location. To analyze the stored data, physicists had to travel from across the globe. No networks. Data was difficult to manage.
EVOLUTION OF COMPUTERS
Late 90s-2000 Laptop Computers
Computers became portable and handy. Commonly used in a variety of settings, including work, education, and personal multimedia. A laptop combines the components and inputs of a desktop computer, including a display, speakers, keyboard, and pointing device. Modern laptops are equipped with connectivity and networking technologies such as Bluetooth and Wi-Fi. In wide use across the globe.
EVOLUTION OF COMPUTERS
Mainframes
IBM Clones
Desktop PC
Laptop
Tablet
THE INTERNET
Technologists say the world has turned into a global village, and the credit for this goes to the Internet. The Internet has made distances shorter and the world smaller. The Internet is defined as the worldwide interconnection of individual networks operated by government, industry, academia, and private parties. Within a very few years, the Internet established itself as a powerful platform that has forever changed the way we do business and the way we communicate. The size of the Internet is expanding daily: every hour, every minute, indeed every second.
PRESENT SCENARIO
THE PROBLEM
The core problem in today's scenario is the rapidly growing volume of data.
The world's technological per-capita capacity to store information has roughly doubled every 40 months since the 1980s. From the beginning of human history until 2003, only 5 exabytes of data were generated. Since then, the situation has changed drastically. Every day we create 2.5 quintillion bytes of data; 90% of the data in the world today has been created in the last two years alone. One quintillion is 10^18, so 1 quintillion bytes = 1,000,000,000,000,000,000 bytes = 1 exabyte, and the daily figure is about 2.5 exabytes. Too much, right? Predictions say that the same amount of data will be generated in minutes in the days to come.
The Big Question: How can we manage and process this enormous data store using present technologies?
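The unit arithmetic behind these figures can be checked with a short Python sketch. It assumes decimal (SI) prefixes, which the slides do not state explicitly, and the variable names are illustrative only.

```python
# Decimal (SI) units are an assumption; the slides do not say whether
# decimal or binary prefixes are meant.
EXABYTE = 10**18  # bytes

daily_bytes = 2_500_000_000_000_000_000  # "2.5 quintillion bytes" per day
historical_bytes = 5 * EXABYTE           # "5 exabytes" generated up to 2003

# Express the daily volume in exabytes.
daily_exabytes = daily_bytes / EXABYTE

# Days of present-day output needed to equal everything produced up to 2003.
days_to_match_history = historical_bytes / daily_bytes

# Doubling every 40 months implies this growth factor over 12 months.
annual_growth_factor = 2 ** (12 / 40)

print(f"Daily volume: {daily_exabytes} EB")
print(f"Days to match the pre-2003 total: {days_to_match_history}")
print(f"Annual growth factor: {annual_growth_factor:.3f}")
```

Under these assumptions, two days of present-day output equal the whole pre-2003 total, and a 40-month doubling time corresponds to roughly 23% growth per year.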
THE PROBLEM
A Few Illustrations: How Much Data?
Google processes 20 PB a day
Wayback Machine has 3 PB + 100 TB/month
Facebook has 2.5 PB of user data + 15 TB/day
eBay has 6.5 PB of user data + 50 TB/day
CERN's Large Hadron Collider (LHC) generates 15 PB a year
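To put the growth rates above in perspective, the following Python sketch estimates how many days of new data would equal what is already stored. It assumes decimal prefixes (1 PB = 1000 TB) and a constant daily rate, neither of which the slides claim; `days_to_double` is a hypothetical helper name.

```python
TB_PER_PB = 1000  # decimal prefixes assumed

# (stored petabytes, growth in terabytes per day), as quoted on the slide
stores = {
    "Facebook": (2.5, 15),
    "eBay": (6.5, 50),
}


def days_to_double(stored_pb, growth_tb_per_day):
    """Days for new data to equal the stored volume, at a constant rate."""
    return stored_pb * TB_PER_PB / growth_tb_per_day


for name, (stored_pb, rate) in stores.items():
    print(f"{name}: about {days_to_double(stored_pb, rate):.0f} days")
```

At the quoted rates, both stores would double in well under half a year, which illustrates why the growth rate matters as much as the current volume.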
Thank You!