You are on page 1of 90

CLOUD COMPUTING

CORNERSTONE OF CONVERGING TECHNOLOGIES

LEEN BLOM
May, 2014

ABOUT ME

Leen Blom
Manager Research & Development at Centric
Background in software development, database management,
application design, head software development, architect, R&D
manager
Consultant, Manager consultancy, R&D manager

Now member of team Enterprise Innovation


Member of Contact Group Universities, guest lecturer at Rotterdam
University of applied sciences

STRUCTURE OF THIS LECTURE

Converging technologies
What is Cloud

The basics
Public cloud providers
State of art
Legacy applications and Cloud

Cloud and converging technologies

Discussion

Internet-of-Things
Big Data
Mobile

CONVERGING TECHNOLOGIES

WHAT ARE CONVERGING TECHNOLOGIES?

IDC: vision on IT

Gartner: vision on businesses

Science: second stage Convergence of knowledge and technology for the


benefit of society (CKTS)

3rd Platform" for IT growth and innovation, built on Mobile devices, Cloud services, Social
technologies and Big Data
Nexus of Forces mobile, social, cloud and information

NBIC convergence is connecting emerging technologies based on their shared elemental


components such as atoms, DNA, bits, and synapses (all with shared abstractions from
information science), integrated across scales
See: Converging of Knowledge, Technology and Society: Beyond Convergence of Nano-BioInfo-Cognitive Technologies (July 2013, Mihail C. Roco et al)

Wikipedia

Nanotechnology, Biotechnology, Information technology and Cognitive science (see


Emerging technologies#Acronyms)

WHAT ARE CONVERGING TECHNOLOGIES?

For this lecture we will look into the relevance for


Internet of Things
How will centralized Cloud match distributed IoT?

Big Data
Cloud is big, so Big Data matches Cloud?

Mobile
Apps for the enterprise: do they need Cloud or just Cloud principles?

CLOUD: THE BASICS

CLOUD COMPUTING BY NIST

Service Models

source: http://www.nist.gov/itl/cloud/index.cfm

CLOUD SERVICE MODELS


Capex

Opex
source: https://www.e-education.psu.edu/cloudGIS/cloud_introduction

CHARACTERISTICS OF CLOUD

On-demand self-service

Broad network access

Resource pooling

Rapid elasticity

Measured service

A consumer can unilaterally provision computing capabilities without requiring human


interaction with each service provider.
Capabilities are available over the network and accessed through standard
mechanisms
The providers computing resources are pooled to serve multiple consumers
Customer generally has no control or knowledge over the exact location of the
provided resources
Capabilities can be elastically provisioned and released, in some cases automatically,
often appear to be unlimited and can be appropriated in any quantity at any time.
Cloud systems automatically control and optimize resource use by leveraging a
metering

See also: http://arxiv.org/pdf/0901.0131

CLOUD PROMISE: ELASTICITY

PURE CLOUD IS PUBLIC

Google, Amazon, Microsoft and others like Rackspace


Randy Bias, Cloudscaling.com

PUBLIC, COMMUNITY, PRIVATE OR HYBRID

Public cloud

Private cloud (including on-premises with Cloud-principes)

Community cloud

Hybrid cloud

Cloud services are provided in a virtualized environment, constructed using pooled shared
physical resources, and accessible over a public network such as the internet
Is a particular model of cloud computing that involves a distinct and secure cloud based
environment in which only the specified client can operate
Is a collaborative effort in which infrastructure is shared between several organizations from a
specific community with common concerns (security, compliance, jurisdiction, etc.), whether
managed internally or by a third-party and hosted internally or externally

Is an integrated cloud service utilizing both private and public clouds to perform distinct
functions within the same organization.

CLOUD: PRIVACY AND SECURITY

What is more secure: Private Cloud or Public Cloud?


Private

Pro: easier to isolate


Con: need to keep knowledge up-to-date

Public

Pro: may expect top security measures


Con: dependent on provider

more than 77 million accounts


affected, 12 million had
unencrypted credit card
numbers

What would respect privacy more: Private Cloud or Public Cloud?


Private

Pro: no intrusion of people outside


Con: need to keep knowledge up-to-date

Public

Pro: easier to uniform policies


Con: conflicting laws and jurisdiction in some cases

National security electronic surveillance


program operated by the United States
National Security Agency (NSA) since
2007

WHY DO ENTERPRISES NEED CLOUD?

Is it just like outsourcing?


Using data center services?

How can businesses benefit from Cloud?


What do we do with legacy software?
Some Cloud characteristics can be fulfilled in outsourcing
scenario
Sharing resources means lower costs
But is contradicting to customization and unique selling points

CLOUD VERSUS CLOUD PRINCIPLES

Can we benefit from Cloud advantages without Cloud?


Answer is: Yes, by adopting Cloud principles

Characteristic

Cloud

Cloud principle

On-demand self-service Indispensible to be cost


effective

Possible and preferrable, by providing


users a provisioning portal

Broad network access

Indispensible

In your own hands

Resource pooling

Indispensible to be cost
effective

Possible and preferrable, by virtualisation

Rapid elasticity

Mandatory to fullfil Cloud


promise

Depending on business, f.i. seasonal


fluctuations

Measured service

Mandatory for billing

Not mandatory, internal charging

CLOUD REQUIREMENTS FOR ENTERPRISES

Pure Cloud

Outsourced Hybrid

Private Hybrid

Mixed Cloud

Note

Public Cloud
Public Cloud + Outsourced Private Cloud
Outsourced Private Cloud + On Premises Private Cloud
On Premises Private Cloud + Own data center
On Premises Private Cloud = using Cloud principles!
Own data center = traditional system management

CLOUD REQUIREMENTS FOR ENTERPRISES


pure cloud
outsourced hybrid
private hybdrid
mixed cloud
privatly owned

NIST

Requirement

Public cloud

Outsourced Private cloud On premise Private cloud

Own data center

On-demand self-service

Must

Wise

Wise

Depends

Broad network access

Must

Must

LAN

LAN

Resource pooling

Must

Wise

Difficult

Difficult

Rapid elasticity

Must

Must

Not possible

Not possbile

Measured service

Must

Depends

Internal charging

Internal charging

Ultimate

Ultimate

Depends

Depends

Depends on jurisdiction

Local jurisdiction

Local jurisdiction

Depends

SLA

Own hands

Own hands

More Lock-in

Lock-in

Less Lock-in

Less Lock-in

Security

Privacy Depends on jurisdiction


Availability
Freedom of choice

WHAT ABOUT CLOUD ADOPTION?

Research by Everest Group, March 2013


See: Enterprise Cloud Adoption Survey 2013: Summary of
Results

CLOUD VERSUS INTERNAL DATA CENTER

84% thinks Cloud will reduce


dependence from internal
data center provisioning over
the next 3 years
Issues currently with vendor
management, floor space,
cross-vendor incompatibility

CLOUD GROWTH

Source: Keala Consultancy, februari 2014

CLOUD LOCK-IN

IaaS: moving towards standardisation

PaaS: lock-in inevitable

SaaS: complicated

OpenStack, open source cloud infrastructure components


Support from HP, Cisco, Red Hat, Canonical (Ubuntu) and VMware.
Infrastructure components accessible via OpenStack API
Development: SalesForce One, Google Apps Engine, Microsoft
Azure: all proprietary
Deployment: Windows stack, LAMP: you are on your own
Data structures cause lock-in
Need regular and readable backup on premises

From my Blog (Dutch) 'Nummerbehoud in de cloud' http://t.co/NgdCI1sfmv

CLOUD:
PUBLIC CLOUD PROVIDERS

PUBLIC CLOUD PROVIDERS COMPARED

Research in progress

The Big Three

Google Cloud Platform

Started with PaaS and added IaaS in 2012

Amazon Web Services

Started with IaaS and added some PaaS services

Microsoft Azure

Started with PaaS and added IaaS in 2012

Not easy: not all concepts are equal

But we are seeing closer convergence

COMPUTE COMPARED
Cloud characteristic

Amazon Web Services

Google Cloud Platform

Microsoft Azure

Compute Engine

Compute

Amazon Machine Image (all OS-es)

Standard Images (mostly Linux)

Windows Server and Linux

Xen
AWS Marketplace

KVM
Public images

Hyper-V
Marketplace

App Engine

Visual Studio Online, SDK's

Java, Python, PhP, Go

.Net, Java, Python, PHP, Ruby, Node.js


Web Sites: automatic scaling
Mobile Services: authentication, push

Virtual Servers Amazon EC2 instances (general purpose)


Virtual Images
Hypervisor
Distribution of images

Application Development Specific images with development tools


Programming languages
Specific applications

High Performance
Computing

No limits
Micro-instances: low cost
GPU-instances
Memory-optimized

Cloud Services: reliable API provisioning

Storage-optimized

StorSimple

Enhanced Networking: SR-OIV


Cluster-networking: MPI standard

Open Source Grid Engine

Windows HPC pack (MPI)

STORAGE COMPARED
Cloud characteristic

Standard service
Organization of data
Persistent storage (f.i. files)
Optimized for IO

Amazon Web Services

Google Cloud Platform

Microsoft Azure

Amazon Elastic Blockstore

Cloud Storage

Block blobs

Block blobs

Blobs, access RESTful API

Simple structure, RESTful API

Standard volumes (EBS)

buckets and objects

Page Blobs and Disks in VHD format

Provisioned IOPS volumes

n/a

n/a

Cloud SQL

Azure SQL Server

MySQL

SQL Server (and Oracle RDBMS)

NoSQL

Cloud Datastore

NoSQL, Azure Tables

DynamoDB, EMR (Hadoop),


marketplace

unknown

MongoDB, CouchDB, etc

Amazon SimpleDB

Hadoop, Hive, Pig, BigQuery

Azure HDInsight, Hadoop

Eventually consistent, Consistent

Eventually consistent, Consistent

Eventually consistent

Public Datasets Program

BigQuery

Amazon Relational Database Service


(RDS)
MySQL, PostgreSQL, SQL Server or
Database type
Oracle

Relational SQL

Non-Relational
NoSQL-types
Big Data
ACID
Public datasets

SERVICES COMPARED
Cloud characteristic
Directory

DNS
Virtual Private Cloud
Load balancer

Amazon Web Services

Google Cloud Platform

AWS Identity and Access Management GITKit for OpenID, Google Apps Directory
(IAM)
Sync

Microsoft Azure
Azure Active Directory

Dynamic DNS Service

Cloud DNS

Windows Azure-provided name


resolution

Amazon VPC, VPN

Integrated in Google Compute Engine

Site-to-site VPN, ExpressRoute

Needs Amazon support

Automatically

Azure Load Balancer, Traffic Manager,


CDN

Regions US, EU, ASIA and regional locations


Management
AWS Management Console
Billing
Per hour
Messaging
Amazon Simple Queue Service
Service Bus
Notification Amazon Simple Notification Service
Scheduling
Workflow
Amazon Simple Workflow
Development

US, EU, ASIA and regional locations


Cloud Deployment Manager
Per minute, minimum 10 minutes
Task Queues
Google Cloud Messaging
Scheduled Tasks

US, EU, ASIA and regional locations


Azure Management Portal
Per minute
Azure Queues
Service Bus Queues, BizTalk Services
Notification Hub
Scheduler
Visual Studion Online

CLOUD WORKSPACE OR DAAS

Characteristics

Price

Lower prices mean

Unmanaged
No backups
No end user
support
Office functionality
only

Online desktop
Managed
Local jurisdiction
Rapid elasticity
Adaptive: legacy application exposure
Adaptive: integration other SaaS and Cloud services
Adaptive: Customizable
Private Cloud

Self service

Global players
standard services
only

Per user per month

Centric

Has its value!

Amazon still
missing

Unified communications
Two factor authentication
VPN
Back-up data and e-mail
End user support

Price (indicative>

50-75,-

75-100,-

50-75,-

< 20,-

< 5,-

MORE RESEARCH ON CLOUD PROVIDERS

What is the impact of PaaS on Independent Software


Vendors?
What is the impact of OpenId or OAuth on Identity & Access
Management in a Hybrid Cloud scenario?
How to avoid vendor lock-in in a SaaS scenario?

CLOUD: STATE OF ART

CLOUD TODAY

Moving to hybrid cloud


Many reasons

Focus from IaaS to PaaS

IaaS: modern outsourcing of data center services


PaaS: modern outsourcing of deployment and development
services

New authentication methods

Like Fast Identity Online (FIDO)

HYBRID CLOUD: IAAS

At this moment associated to IaaS combined with on premises


data center

HYBRID IAAS CLOUD: 5 REASONS

Ad hoc fluctuation because of unexpected success


Increase capacity instantly

Real-time analysis combined with data warehouse in the cloud


If in-house equipment produces high volumes of data

Isolating mobile app traffic from data center


To prevent high bandwidth access to own premises

Data center virtualization


Business continuity

Moving into the cloud step-by-step


To prevent big bang, taking time to learn
Taking personnel turnover into account

Source (my blog in Dutch): http://www.centric.eu/NL/Default/Themas/Blogs/2014/01/24/Nummerbehoud-in-de-cloud

HYBRID CLOUD: SAAS

Where is my data being stored?


Privacy

Which jurisdiction?
Geopolitics?

Security

Corporate espionage

Performance
Latency?

Analytics

Where do I consolidate?
Local or in the cloud?

Integration?

FOCUS ON PAAS

Not on PaaS providers


A lot of people want to say PaaS is DOA dead on arrival,
Tim Crawford of Avoa

PaaS is lock in by definition (IMHO)

Deployment
Automation: as an accelerator for DevOps
Standardisation: to provide Development and Test environments

Development
Online and offline development tools, deploying on standardized
platforms or Cloud
Productive and efficient application development and maintenance

PAAS GROWTH

See green bars


Top

Second best

Note growth of Management

Database management systems

Application Infrastructure &


Middleware
Application Development
(f.i. no. 4, yellow)
Needed for DevOps

PAAS MARKET FORECAST

65% of PaaS
Support development
But are not development
environments by itself

BPM services
More SaaS than PaaS

NEW AUTHENTICATION METHODS

Multi-factor authentication

Biometric security

If MFA becomes more secure

Like Google: username/password followed by text message


May add more than 2 factors

Face recognition
Fingerprints
Iris scan

It will be trusted by Cloud providers


De facto solution for Single Sign On
Source: http://alexbilbie.com/2013/02/a-guide-to-oauth-2-grants/

NEW AUTHENTICATION METHODS

FIDO Alliance

To develop a standards-based open approach that automatically


detects when a FIDO-enabled device is present and offers users the
option to replace passwords with more secure authentication
techniques such as biometrics
FIDO protocols are based on public key cryptography and are
strongly resistant to phishing
Universal Authentication Framework
Universal 2nd Factor protocol (by Google)

Samsung and PayPal recently announced adoption

Competitive projects: Google Authenticator

Includes implementations of one-time passcode generators for


several mobile platforms, as well as a pluggable authentication
module (PAM).
Source: http://fidoalliance.org/specifications

FIDO: ALL ABOUT STANDARDS


FIDO Registration

FIDO Login

CLOUD: WHAT ABOUT LEGACY?

LEGACY

How many apps did you install on your smartphone?


How many apps do you really use?
In NL smartphones average user has 28 apps installed
How many applications do companies use?
Harris Interactive (2012)

Surveyed 150 senior IT decision-makers from organizations with $500 million or more
in annual revenue.
50%: more than 500 applications deployed
(34%: more than 1,000 applications deployed).

57% of users use fewer than 249 applications on a typical day


28% of users use fewer than 50 apps a day.

Software is main inhibitor for moving to the Cloud!


Software vendors and in-house development

LEGACY AND THE CLOUD

How do we deal with this? The Centric-case


Infrastructure-as-a-Service in place

Adaptive Infrastructure Services


Standardisation on infrastructure
Fail-over, scalable, managed services

Platforms almost in place

Standardisation on deployment architecture


Isolation of customer data
Security measures

Software in transition
Web-enabled
Multi-tenant

ADAPTIVE INFRASTRUCTURE SERVICES


IT Stack

Content

Deployment
Services

Customer

Print Services

Apps &

PaaS
File Services

E-mail services

Business-

Publishing Services

Users & desktop configurations

ontwikkelplatform

Virtualization
Computing
Storage

Adaptive Infrastructure Services

Networking

AIS scope
Shared Processing Power
Shared Storage
System Managent services

PLATFORM STANDARDISATION

Example choices by Centric


Linux

Red Hat Linux 6, HP UX 11iv3, AIX 7, Oracle L6

Microsoft

.Net 4.0/4.5
SQL Server 2008 R2
IIS 7.5
Windows Server 2008 R2
Windows 7/8.1
Terminal Server 2008 R2

Oracle

Database Appliance X4-2


Forms & Reports 11.1.2.1.0
WLS 10.3.6

ON-PREMISES LEGACY TO THE CLOUD

From a Providers perspective!


What characteristics do applications lack when not built for the
cloud?

Applications have to be made accessible on Internet

Applications should support more tenants

Web-based
Security policies

Multi-tenancy is more than just adding metadata

APPLICATIONS FOR WEB

Techopedia.com

Any program that is accessed over a network connection using HTTP,


Web-based applications often run inside a Web browser.
There is a lot of confusion created by the use of terms like

Web-based, Internet-based and cloud-based


Web-based applications actually encompass all the applications that communicate
with the user via HTTP.

Centric has several Oracle Forms applications via Java-applet

Newer applications accessible via http.

Within definition, but not ideal because deployment needs Java runtime

MULTI-TENANCY

Research with Utrecht University

Definition

Faculty of Computer Science


Sharing is caring

Multi-tenancy is a property of a system


where multiple varying customers and
their end-users share the system's
services, applications, databases, or
hardware resources, with the aim of
lowering costs.

Scope

Infrastructure not in scope, IaaS


assumed

Download report: http://www.cs.uu.nl/research/techreps/repo/CS-2013/2013-015.pdf

12 MULTI-TENANT ARCHITECTURES

MTAS

At Centric
Variability (customization)
Less multitenancy

Maintainability of software
More multitenancy

Software Complexity
Less multitenancy

Deployment Time

More multitenancy

Matter of optimizing
No extremes

Less

More

MAKING APPLICATIONS MULTI-TENANT


Applicatie architectuur Key2Jongerenmonitor

What can be shared?


What can be modified to
enable sharing?

Applicatie server

Apps
Koppelvlak App
(WCF)
Jongerenmonitor App

MDU

New non-functional
requirements!

Landelijke voorziening
VSV

VSV Client

GWS koppelvlak

Oracle database

Which interfaces are singletenant?


Some governmental external
services are single-tenant!

Cognos

WebAPI
(Perimeter Service Routing)

Gebruiker
Key2Jongerenmonitor

Applicaties (lokaal of gehost)

GWS4all
UBM StUF client

Oracle WebLogic server

UBM StUF webservice

Key2Datadistributie

Key2Jongerenmonitor schema
Oracle Forms services

UBM GBA-V client


Key2Berichtenmodule schema

Key2GBA-V
GeneriekBevragingsComponent

Due to security constraints

E-Diensten

MWF client schema

GBA-V
MWF client service

Koppelvlak verlof & vrijstelling

Our PaaS should support all


MTAs

Adaptive Application Platform

MWF server schema

Module workflow webservices

Koppelvlak aanvraag vervoer

Statusupdates koppelvlakken

SQL*Net ADO.Net

Conductor

Auteur: Arjan van Bart


Versie 1.1 (21-06-2013)

Afhankelijkheid met ander schema

Dient meerdere keren geinstalleerd worden

SOAP of HTTP

Kan eenmaal geinstalleerd worden

Netwerkshare

Kan hergebruikt worden door x organisaties

Printerverbinding

ADAPTIVE APPLICATION PLATFORM

Share everything
Starting points

Share to cut cost


Guarantee Isolation
Secure access
Integration with onpremises

Calculations showed
cost of specific-per
application-ratio: 10:1

Per application sharing


Tenant specific

Tenant
Isolation

RESEARCH TOPICS

Legacy Application Modelling to find the most suitable MTAs

Legacy Application Modelling for right sizing deployment architecture


To answer the question: why should I go into the Cloud?

Customization vs standardisation: how to combine Operational


Excellence and Customer Intimicy (Treacy & Wiersma)
Cloud computing, especially PAAS (similar to Google App Engine)
In cooperation with Prof. Butincu, Technical University of Iasi

PART 2: CORNERSTONE?

CLOUD AND IOT

INTERNET OF THINGS

Sundmaeker H. Guillemin P. Friess P. and Woelffle S. (2010).


(Vision and Challenges for realising the Internet of Things)

in the nineteenth century, machines learned to do;


in the twentieth century, they learned to think;
and in the twenty-first century, they are learning to perceive they actually
sense and respond.

Some figures
OECD (2012)
50 billion devices connected to mobile networks at the end of this decade

Rafi Haladjian (2009)


number of objects to be connected to the Internet arises to 100,000 billion

Source: http://www.actif-europe.eu/index.php?option=com_docman&task=doc_details&gid=20&Itemid=68

IOT: PARADIGMA SHIFT

IT people are used to centrally managed systems

But what IoT needs is a Network Centric approach

Interoperability in a Network Centric approach

Like how armies were organized before the 2nd Gulf War
Based on doctrines from Network Centric Warfare
Ad hoc coalitions of systems

Temporary, not predetermined

Interoperability of information

Is more than technically connected!

Purposeful

Combination is more than just the sum

NETWORK CENTRIC APPROACH


Traditional

Network

Demand driven

Information driven

One way

Autonomy

Top down

Bottom up

One world fits all

Best of all worlds

Passive

Active

Massive information /
information overload

Just enough / just


in time

EXAMPLES DEVELOPED BY CENTRIC

Network Centric by design (IMHO)


Compact systems

Often small extension to existing systems

Multiple application of systems

Subscriptions on data instead of integration: API world

In control of your process without being the owner of the systems


No big data warehouses: access data directly from the source

One truth

No difference between administration and facts

Real-time information

Decisions on most recent information

System of systems: Ultra large systems

EXAMPLE: SMART CONNECTED CONTAINERS

Actual status of shipload not available real-time


Dangerous goods only, other goods 48 hours

In an emergancy situation information needed instantly


Our concept for container transport
Freight

Containers

Mobile HUBs

Authorities and
Businesses

Permanent HUBs

Decoupling private
data and public data

Decoupling wireless
and wired

Authority

Business

Very short distance


RFID (Optional)

Short distance
wireless

Standards to be developed
Lading information
Security
Extensibility

Medium/long distance
wireless

Standards to be developed
Information
Security / certification
Extensibility

Standards to be developed
Information
Security / certification
Extensibility

Any place
Internet

EXAMPLE: SMART CONNECTED CONTAINERS

Network of
Containers (atomic)
Passive component

Ships
Mobile hub
Active transmitter already available

Radar stations
Permanent hub
Connection to the cloud already in place

Solution
Low cost, decentralized, open
Situational awareness of emergency response within minutes

EXAMPLE: EMERGENCY RESPONSE

Proposal FP7, Security Research Call 6, SEC-2013.5.3-1


Definition of interoperability specifications for information and meta-data exchange
amongst sensors and control systems
Enabling interoperability of sensor and control systems
by context sharing on a technical level
to enable context awareness
for key stakeholders in the civil security field.

Development of an architecture

for sensors and control systems like cameras


for ad hoc participation in an emergency situation
making them accessible for mandated stakeholders like authorities or emergency
responders

Based on the Interoperability Framework of Van Lier (2009)

to achieve a sound basis for context awareness and situational awareness


to all parties involved in civil security.

EXAMPLE: EMERGENCY RESPONSE

CLOUD AND IOT

Cloud is a cornerstone for IoT?


Yes

To be solved

Clearly the size of IoT demands for cloud services


Cloud is everywhere and of low cost, it is flexible for data collection,
opprtunities to create overview and form new combinations
Ultra large systems need large backend systems
Cloud is centralized, IoT is decentralized

Cloud federation (competition is inhibitor), no one-size-fits-all

(Geo)Location of objects is important

But even IPv^6 doesnt include location in addressing scheme

RESEARCH TOPICS

Context and situational awereness

Connecting humans in near distance by using wearables


Ad hoc collaboration without knowing each other

Swarm intelligence (agents, boids, ants etc).


In cooperation with Prof. Butincu, Technical University of Iasi

CLOUD AND BIG DATA

BIG DATA

Looks like a no-brainer!


Cloud = Big so Big Data
needs Cloud?
Internet-of-Things
produces huge volumes
of data* so Big Data is
part of IoT, so Big Data
needs Cloud?
*aka exhaust data

WHAT IS BIG DATA ANYWAY

Data whose size forces us to look beyond the tried-and-true


methods that are prevalent at that time.
Adam Jacobs (ACM, 2009)

The next frontier for innovation, competition, and productivity

McKinsey, 2011

Big data is any data too overwhelming to mine for insight with naive
methods.
Daniel Hardman, Adaptive Computing, March 2014

Big Data is a new term primarily used to describe the data sets that
are so large and complex that they require advanced and unique
storage, management, analysis and visualization technologies
Chen et al., 2012, Georgia Fotaki, 2013

BIG DATA VS TRADITIONAL DATA

Source: Exploring Big Data opportunities for Online Customer Segmentation, Georgia Fotaki

THE VALUE OF BIG DATA FOR BUSINESSES

Creating transparency

Making data available cross-department

Enabling experimentation to discover needs, expose variability, and


improve performance
Root cause analysis of variability

Segmenting populations to customize actions


Real-time microsegmentation

Replacing/supporting human decision making with automated


algorithms
Not necessarily be automated but augmented

Innovating new business models, products, and services


Context awareness, location and time independent

Source: McKinsey 2011 http://www.mckinsey.com/insights/business_technology/big_data_the_next_frontier_for_innovation

BIG DATA MARKET FORECAST

Wikibon

Big Data-related services


revenue made up 40% of the
total market, followed by
hardware at 38% and software
at 22% in 2013
Less software revenue due to
open source frameworks like
Hadoop
Distribution will not change
significantly

Source: http://wikibon.org/wiki/v/Big_Data_Vendor_Revenue_and_Market_Forecast_2013-2017

BIG DATA PRODUCT TYPES

Main

Other

Application of Big Data

Hadoop software
NoSQL database software
Next-generation data warehouses/analytic database software
and related hardware and services
Non-Hadoop Big Data platforms
In-memory databases
Data integration and data quality platforms, tools and services
Advanced analytics and data science platforms, tools and services

Application development platforms, tools and services


Business intelligence and data visualization platforms, tools and services
Analytic and transactional applications and services

Cloud

Cloud-based Big Data services

Other Big Data support, training, and professional services.

WHAT IS HOT IN BIG DATA?

Come back of SQL?

Facebook Presto, Amazon RedShift, Stinger, IBM BigSQL, Clouderas Impala


Hadoop 2.0,

replaces the MapReduce code in Hadoop with YARN (Yet Another Resource Negotiator)
Brings Hadoop to the Enterprises

Learning from visualizations

Well known: Infographics


Startup: Visualisation based on cortex of the human eye
Maximizes pattern recognition

Topological data analysis

Predicting the future

Focuses on the shape of complex data


To identify clusters and any statistical significance that is present
Descriptive analytics: what happened in the past?
Predictive analytics: what is probably going to happen in the future?
Prescriptive analytics: recommendation for key decisions based on future outcomes
what will happen and when it will happen, but also why it will happen

BIG DATA AND CLOUD

Cloud is a cornerstone for Big Data?


Yes

To be solved

Big 3 support for Hadoop on bare-metal or low cost commodity


Support for multi-terabyte volumes of data
Fluctuating volumes need cloud scalability and flexibility
Enterprises struggle with Business Intelligence and Big Data will
not solve this. Comeback of SQL will help.
"A fool with a tool is still a fool"--words of wisdom in IT

Knowledge of new frameworks not wide-spread

RESEARCH TOPICS

Sentiment analysis on social networks (twitter feeds, facebook


feeds etc.)
In cooperation with Prof. Butincu, Technical University of Iasi

CLOUD AND MOBILE

MOBILE

We need to narrow the scope


Mobile applications for the enterprise
Mobile application development for enterprises

Mobile applications
On smartphones, tablets, wearables
Any functionality in relation to business processes
No consumer apps

Mobile application development


Different from normal application development?
Do we need cross platform development?

MOBILE APP DEVELOPMENT PARADIGM

Do not move applications simply to smartphone or tablet


Mobile working is different from behind the desktop work
User interaction is different from mouse/keyboard
Responsive is the buzzword today, new: behavioural design

Requirements for mobile

Context aware, like location aware

Future mobile apps: forget the screen

No typing, moderate typing, just hitting buttons


Start a business process, or complete it, no other tasks

Business apps: Low volumes of users

BYOD may require cross platform development

Do not use app stores unless large volumes or app used by


anonymous users
Which can be expensive

CROSS PLATFORM DEVELOPMENT

Which is better: native, web app or hybrid?


Despite html5 progress most development is
native, although this will change in future
Native

Web app

Full capabilities, best performance


My opinion: best for smartphone, smaller granular
tasks (click-and-go)

Cross platform
Sensor access needs wrapping (hybrid)
allways up-to-date
My opinion: good for tablets, complex tasks

And we have a iOS, Android vs Windows


Phone dilemma

API ECONOMY

Definition is informal

According to Steven Willmott, CEO of 3scale

Entering the API world

The economy where companies expose their (internal) business assets or


services in the form of (Web) APIs to third parties with the goal of unlocking
additional business value through the creation of new asset classes.
Also related to Big Data!
77% of top-50 free and top-50 paid apps connect to backend services of
some kind
23% are completely standalone
Not the Cloud Management APIs like OpenStack
But f.i. Google: Cloud Backend API on App Engine

Source: http://www.cutter.com/content-and-analysis/resource-centers/agile-project-management/sample-our-research/apmu1306/apmu1306.pdf

API TECHNOLOGY

Web API is described as

A set of Hypertext Transfer


Protocol (HTTP) request
messages, along with a definition
of the structure of response
messages, which is usually in an
Extensible Markup Language
(XML) or JavaScript Object
Notation (JSON) format.

Dominant API protocol appears to


be REST, while the most common
data formats are XML and JSON

Source: http://www.3scale.net/wp-content/uploads/2013/10/Winning-in-the-API-Economy-eBook-3scale.pdf

API STRATEGY

Architectural approach

Sounds familiair? Internet-of-Things and Interoperability!


APIs

Providing self-service, one-to-many, reusable interfaces.


Every software system may one day be addressed by multiple
different, as-yet-unknown groups for unknown purposes
Restructure and organize internal systems to support innovative
new projects in a uniform mannerreducing maintenance costs
and increasing agility.
They provide new opportunities to generate new ways to reach
customers, generate revenue and build partnerships.

APIS AND MOBILE

Yesterdays Enterprise

The API Driven Enterprise

Mobile applications for specific platforms


Separate code base and ad-hoc server back end.

A single backend API, addressable from clients on any platform and a


succession of different front-end clients.

Mobile and cloud are pushing increasing amounts of economic


transactions from HTML web over to API-driven interactions

B2B supplier relationships replacing human web-based flows


Multi-channel plays in retail, many interface points feeding a single
transaction process

CLOUD AND MOBILE

Cloud is a cornerstone for Mobile?


Yes
As internal systems open up via APIs served by Cloud providers
Isolation of internal domain from internet accessible data

To be solved
Most enterprises dont have millions of app users
Generic apps too limited for in house developed back end systems
Back end systems need APIs

SUMMARY

SUMMARY

Cloud for enterprises


IaaS is here to stay
PaaS is evolving into standardized development and deployment
SaaS lagging due to legacy applications

Cloud lock-in avoidable


IaaS by follwing open standards and open frameworks
PaaS no way
SaaS data is inhibitor

SUMMARY

Cloud and converged technology


Internet-of-Things
Yes, due to its scale, but inherent distributed

Big Data
Yes, fluctuating data sets suit Cloud scalability and flexibility
But: difficult to find real use cases

Mobile
Yes, following the API Economy
But, real benefits if enterprises provide API access

THANK YOU!
ANY QUESTIONS?
Leen Blom
+31-653562767
leen.blom@centric.eu

You might also like