Machines
AI Applications
Kernel Methods
Support Vector Machines
AI projects
Apple Amazon Google IBM Bioinformatics Startups ACM
Apple
https://techcrunch.com/2016/08/05/apple-acquires-turi-a-machine-learning-company/
Amazon
http://www.engadget.com/2016/03/23/amazon-secret-conference-of-the-future/
Google
http://techcrunch.com/2016/03/23/google-launches-new-machine-learning-platform/
http://blog.kubernetes.io/2016/03/scaling-neural-network-image-classification-using-Kubernetes-with-TensorFlow-Serving.html
IBM
Bioinformatics
http://futurism.com/ai-saves-womans-life-by-identifying-her-disease-when-other-methods-humans-failed/
Startups
https://www.cbinsights.com/blog/top-acquirers-ai-startups-ma-timeline/
ACM
Paper (topics) | Date
Can Artificial Intelligence Predict Earthquakes? (Machine learning and pattern recognition) | Feb17
Japan announces AI supercomputer (GPU, DNN, AI apps) | Feb17
China's first deep learning lab ... (DNN, national lab) | Feb17
China's Artificial-Intelligence Boom (AI labs: Baidu -> Google, Didi -> Uber, Tencent -> WeChat) | Feb17
AI method allows to diagnose Alzheimer's or Parkinson's (DNN) | Feb17
HPC Technique Propels Deep Learning at Scale (DNN, HPC: OpenMPI -> SVAIL) | Feb17
Thinking Deeply to Make Better Speech (DNN, Speech as humans, WaveNet-DeepMind) | Mar17
AI systems for air traffic controllers (Automatic speech recognition) | Mar17
AI system lets you control a robot with your mind (EEG signals, Machine learning) | Mar17
How to Upgrade Judges with Machine Learning (Machine learning) | Mar17
Kernel Methods
Kernel
$K(t', t) = \langle \phi(t'), \phi(t) \rangle$ (1)
[1] In operating systems, the concept of a kernel is distinct: it means the core of the system.
[2] C. Cortes and V. Vapnik. Support-vector networks. Machine Learning, 20:273-297, 1995.
[3] J. Shawe-Taylor and N. Cristianini. Kernel Methods for Pattern Analysis. Cambridge University Press, 2004.
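Equation (1) says that a kernel returns an inner product in a feature space without visiting that space explicitly. The following is a minimal MATLAB sketch, assuming a degree-2 polynomial kernel on 2-D inputs; this kernel is an illustrative choice that does not appear in the slides, and the names phi, K, u and v are hypothetical.

```matlab
% Illustrative assumption: homogeneous degree-2 polynomial kernel in 2-D.
% phi, K, u and v are hypothetical names, not taken from the slides.
phi = @(x) [x(1)^2; sqrt(2)*x(1)*x(2); x(2)^2];   % explicit feature map
K   = @(u, v) (u' * v)^2;                         % kernel in input space

u = [1; 2];
v = [3; -1];

lhs = K(u, v);              % kernel value, no feature map needed
rhs = phi(u)' * phi(v);     % inner product computed in feature space

fprintf('K(u,v) = %g, <phi(u),phi(v)> = %g\n', lhs, rhs);
```

Both quantities agree up to rounding, which is the property the kernel trick relies on: an algorithm that only needs inner products can work with K alone.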
Kernel trick
Figure from: Cuevas Tello, J.C., Hernández-Ramírez, Daniel, García-Sepúlveda, Christian A. (2013), Support Vector Machine algorithms in the search of KIR gene associations with disease, Computers in Biology and Medicine 43 (2013) 2053-2062
Gram matrix
Representer theorem
The larger the training set $T$, the larger the Gram matrix $K_{ij}$.
Kernel methods are therefore also considered to be memory-based methods [5].
Nevertheless, one is able to create complicated kernels from simple building blocks.
Moreover, there are other kernel constructions such as graph kernels, string kernels and P-kernels, among many others [6].
For regression, the model is written as a kernel expansion (known as the representer theorem [7]):
$f(t) = \sum_{j=1}^{n} \alpha_j K(t_j, t)$ (3)
[5] S. Haykin. Neural Networks: a Comprehensive Foundation. Prentice Hall, 1999.
[6] J. Shawe-Taylor and N. Cristianini. Kernel Methods for Pattern Analysis. Cambridge University Press, 2004.
[7] A. J. Smola and B. Schölkopf. On a kernel-based method for pattern recognition, regression, approximation and operator inversion. Algorithmica, 22:211-231, 1998. Technical Report 1064, GMD FIRST, April 1997.
$f(t) = \sum_{j=1}^{n} \alpha_j K(t_j, t)$
It is a linear combination of kernels (basis functions), and $\alpha_j \in \mathbb{R}$ are the kernel weights.
For example [8], let $T = \{10, 15, 25, 32, 38, 43\}$ be a training set of size six and $K(t_j, t) = \exp(-|t - t_j|^2 / \omega^2)$ a Gaussian kernel with $\omega = 3$.
The time $t$ goes from 0 to 50 with a resolution of 0.1, and $\alpha = [0.5, 1, 0.5, 1, 1.5, 1]$.
This gives a set of Gaussian kernels $K(t_j, \cdot)$ scaled by $\alpha_j$.
$T$ has the centres of the Gaussian functions and $\alpha$ contains the heights.
[8] Cuevas-Tello, J.: Estimating time delays between irregularly sampled time series. Ph.D. thesis, School of Computer Science, University of Birmingham (2007). http://etheses.bham.ac.uk/88/
[Figure: the scaled Gaussian kernels K(t_j,t) and their sum f(t); x-axis: Time (0 to 50), y-axis: K(t_j,t) ; f(t)]
MATLAB Example
% Plot a set of Gaussian kernels
% Aug06 jcct
clear all;
close all;
sigma = 3; % width of Gaussians
t_min = 0; % min time
t_max = 50; % max time
inc = 0.1; % resolution of time
ct = [10 15 25 32 38 43]; % centres of Gaussians
alpha = [0.5 1 0.5 1 1.5 1]; % heights (kernel weights)
n = size(ct,2); % number of kernels
m = size(t_min:inc:t_max,2); % number of time points
figure;
hold on;
g = zeros(1,m); % running sum of the scaled Gaussians
f = zeros(1,m); % current scaled Gaussian
for j=1:n,
    idx = 1;
    for i = t_min:inc:t_max,
        f(idx) = alpha(j)*exp(-abs(i-ct(j))^2/sigma^2); % Gaussian centred at ct(j), width sigma
        idx = idx + 1;
    end
    plot(t_min:inc:t_max,f,'k');
    g = g + f;
end
plot(t_min:inc:t_max,g+2,'k'); % sum f(t), shifted up by 2 for display
box;
xlabel('Time');
ylabel('K(t_j,t) ; f(t)');
title(['A set of ',num2str(n),' Gaussians of width \omega=',num2str(sigma)])
text(2,2.5,'f(t)');
% INPUT:
% x -> the time, according to the real data
% n -> Points per Gaussian function, quantity of points in real data
% c -> Centres of kernels
% m -> Number of kernels
% d -> Distance at each point with their neighbours (left-right), kernels' width
%
% OUTPUT
% K_c -> matrix (m x n) (kernels x points)
% JCCT Mar17
Gram_matrix = K1(t,n,ct,m,ones(1,n).*sigma);
figure;
plot(Gram_matrix);
xlabel('Samples');
ylabel('Magnitude');
figure;
hold on;
f = zeros(1,n); % weighted sum of the Gaussians
for i=1:m,
    f = f + alpha(i) .* Gram_matrix(i,:);
    plot(t,alpha(i) .* Gram_matrix(i,:),'k');
end
plot(t,f,'b');
xlabel('Time');
ylabel('Magnitude');
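The body of K1 is not listed in the slides. As a rough stand-in, the sketch below assumes it evaluates the Gaussian kernel $\exp(-|t - t_j|^2/\omega^2)$ for every centre and every sample and returns the (kernels x points) matrix described in the header comment. The name gauss_gram and its three-argument signature are hypothetical and are not the author's K1.

```matlab
% Hypothetical stand-in for K1 (assumption: Gaussian kernel, one row per kernel)
%   t  : sample times, n points (any orientation)
%   ct : kernel centres, m kernels
%   w  : kernel widths, one value per kernel
gauss_gram = @(t, ct, w) exp(-bsxfun(@rdivide, ...
    bsxfun(@minus, ct(:), t(:).').^2, w(:).^2));

t     = 0:0.1:50;                 % time axis of the earlier example
ct    = [10 15 25 32 38 43];      % centres of Gaussians
alpha = [0.5 1 0.5 1 1.5 1];      % kernel weights
sigma = 3;                        % width of Gaussians

G = gauss_gram(t, ct, sigma*ones(1, numel(ct)));   % m x n matrix
f = alpha * G;                    % f(t) = sum_j alpha_j K(t_j, t)
```

Writing the weighted sum as the product alpha * G replaces the explicit loop over kernels and gives the same curve as the plotting loop above.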
[9] Y. Engel and S. Mannor. The Kernel Recursive Least-Squares Algorithm. IEEE Transactions on Signal Processing, 52(8):2275-2285, Aug 2004.
[10] J. Shawe-Taylor and N. Cristianini. Kernel Methods for Pattern Analysis. Cambridge University Press, 2004.
[11] T. Hastie, R. Tibshirani, and J. Friedman. The Elements of Statistical Learning: Data Mining, Inference, and Prediction. Springer, 2001.
[12] K. Brown. Diversity in Neural Networks Ensembles. PhD thesis, School of Computer Science, University of Birmingham, UK, 2004.
[13] Y. Bengio, O. Delalleau, and N. Le Roux. The curse of dimensionality for local kernel machines. Technical Report 1258, Département d'Informatique et Recherche Opérationnelle, Université de Montréal, May 2005.
$K \alpha = x$, (5)
Hence,
$\alpha = K^{+} x$. (7)
$\alpha = K^{-1} x$ or $\alpha = K^{+} x$
We regularise the inversion in (7) through singular value decomposition (SVD).
$K = U W V^T$, and $K^{+} = V [\mathrm{diag}(1/w_i)] U^T$ is the pseudo-inverse [14, 15] (or Moore-Penrose inverse), where $w_i \in W$.
SVD has some interesting properties: $W$ is a diagonal matrix with positive or zero elements (known as singular values).
$U$ and $V$ are orthogonal, so $U^T U = V^T V = 1$ (the identity matrix), and $V$ is also square and row-orthonormal; i.e., $V V^T = 1$.
[14] G.H. Golub and C.F. Van Loan. Matrix Computations. The Johns Hopkins University Press, second edition, 1989.
[15] W.H. Press, S.A. Teukolsky, W.T. Vetterling, and B.P. Flannery. Numerical Recipes in C++: The Art of Scientific Computing. Cambridge University Press, 2nd edition, 2002.
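The pseudo-inverse above can be sketched in a few lines of MATLAB. This is a minimal illustration, assuming that "regularising" means replacing $1/w_i$ by zero for singular values below a cut-off; the cut-off tol and the observations x are made-up values, not taken from the slides.

```matlab
% Square Gram matrix of a Gaussian kernel on the six training times
t     = [10 15 25 32 38 43]';
sigma = 3;
K     = exp(-bsxfun(@minus, t, t').^2 / sigma^2);
x     = [17.2 17.4 17.5 17.3 17.6 17.4]';   % made-up observations

[U, W, V] = svd(K);              % K = U*W*V'
w    = diag(W);                  % singular values
tol  = 1e-6 * max(w);            % assumed cut-off for small singular values
winv = zeros(size(w));
winv(w > tol) = 1 ./ w(w > tol); % 1/w_i, or 0 where w_i is negligible

Kplus = V * diag(winv) * U';     % pseudo-inverse K+
alpha = Kplus * x;               % kernel weights, as in equation (7)

residual = norm(K*alpha - x);    % how well K*alpha reproduces x
```

For this small, well-conditioned matrix no singular value is discarded, so Kplus coincides with the ordinary inverse. With densely sampled data the Gram matrix becomes close to singular, and the cut-off is what keeps the weights from blowing up.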
Fitting
[Figure, two panels: "DS51G0N0.dat with error bars of 0.106%" and "DS51G51N0.dat with error bars of 0.466%", each showing curves A and B; x-axis: time (0 to 50), y-axis: mag]
clear all;
close all;
% Load and plot the first artificial data set (column 1: time, column 2: mag)
d1 = load('ArtificialData/DS-5-1-GAP-0-1-N-0_v2.dat');
figure;
plot(d1(:,1),d1(:,2),'.-k');
xlabel('time');
ylabel('mag');
title('DS-5-1-GAP-0-1-N-0');
% Load and plot the second artificial data set
d2 = load('ArtificialData/DS-5-1-GAP-1-1-N-1_v2.dat');
figure;
plot(d2(:,1),d2(:,2),'.-k');
xlabel('time');
ylabel('mag');
title('DS-5-1-GAP-1-1-N-1');
[Figure, two panels: a) DS-5-1-G-0-1-N-0, b) DS-5-1-G-1-1-N-1; x-axis: time (0 to 50), y-axis: mag (17 to 17.9)]
% Perform regression on d2
% ct, m, h and error are assumed to be in the workspace already: the kernel
% centres and their number, the fitted model evaluated at t, and its MSE
sigma = 3; % width of Gaussians
t = d2(:,1); % time
n = size(t,1); % number of samples/points
Gram_matrix = K1(t,n,ct,m,ones(1,m).*sigma);
figure;
hold on;
plot(d1(:,1),d1(:,2),'*-g');
plot(d2(:,1),d2(:,2),'.-k');
plot(t,h,'.-b');
legend({'DS-5-1-GAP-0-1-N-0','DS-5-1-GAP-1-1-N-1','Kernel-based model'},'Location','best');
xlabel('time');
ylabel('mag');
title(['Observed data, kernel-based model MSE = ',num2str(error)]);
box on;
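The listing above relies on values computed elsewhere (the weights, the model h and its error). A self-contained sketch of the whole regression step might look as follows. It assumes one Gaussian kernel per sample and uses synthetic data in place of the .dat files, so the numbers it produces are not those reported in the slides; the names G, G1 and mse are hypothetical.

```matlab
% Synthetic, irregularly sampled series (assumption: stands in for d2)
t = [0 3 5 9 12 16 21 24 29 33 38 41 46 50]';
x = 17.5 + 0.2*sin(2*pi*t/25);

sigma = 3;                        % width of Gaussians
ct    = t';                       % assumed: one kernel centred on each sample

% Gram matrix, kernels x points
G = exp(-bsxfun(@minus, ct(:), t(:).').^2 / sigma^2);

alpha = x' * pinv(G);             % weights solving alpha*G = x' (SVD-based pinv)
h     = alpha * G;                % model evaluated at the sample times
mse   = mean((h' - x).^2);        % mean squared error on the observed points

% Reconstruction: evaluate the same model on a denser time grid
t1 = (0:0.5:50)';
G1 = exp(-bsxfun(@minus, ct(:), t1(:).').^2 / sigma^2);
h1 = alpha * G1;

fprintf('MSE on observed samples: %g\n', mse);
```

Because there are as many kernels as samples here, the model passes through every observation and the error is at rounding level. A non-zero MSE such as the one reported on the slides suggests fewer kernels than samples or a stronger regularisation, neither of which is specified in the text.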
Figures regression
[Figure, two panels: "Observed data, kernel-based model MSE = 1.6874e-005"; legend: DS-5-1-GAP-0-1-N-0, DS-5-1-GAP-1-1-N-1, Kernel-based model; left panel: full series, time 0 to 50; right panel: zoom on time 14 to 32; y-axis: mag]
[Figure: enlarged view of the zoomed regression panel, time 14 to 32; y-axis: mag]
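% Reconstruction of missing samples
% t1, n1, h1 and error1 are assumed to be in the workspace already: the full
% time axis and its length, the model evaluated on it, and its MSE
miss = [];
figure;
hold on;
plot(d1(:,1),d1(:,2),'*-g');
plot(d2(:,1),d2(:,2),'.-k');
plot(t1,h1,'.-b');
for i=1:n1,
    if sum(t1(i)==t)==0, % t1(i) is not among the observed times
        plot(t1(i),h1(i),'ob'); % mark the reconstructed point
    end
end
%plot(t,h,'*-k');
legend({'DS-5-1-GAP-0-1-N-0','DS-5-1-GAP-1-1-N-1','Kernel-based model'},'Location','best');
xlabel('time');
ylabel('mag');
title(['Observed data, kernel-based model MSE = ',num2str(error1)]);
box on;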
Figures reconstruction
[Figure, two panels: "Observed data, kernel-based model MSE = 2.2963e-005"; legend: DS-5-1-GAP-0-1-N-0, DS-5-1-GAP-1-1-N-1, Kernel-based model; left panel: full series, time 0 to 50; right panel: zoom on time 14 to 32; y-axis: mag]
[Figure: enlarged view of the zoomed reconstruction panel, time 14 to 32; y-axis: mag]
SVM
Definition
[1] C. Cortes and V. Vapnik. Support-vector networks. Machine Learning, 20:273-297, 1995.
[2] Cristianini, N. and Shawe-Taylor, J. (2000). Support Vector Machines and other kernel-based learning methods. Cambridge University Press.
[3] T. Hastie, R. Tibshirani, and J. Friedman. The Elements of Statistical Learning: Data Mining, Inference, and Prediction. Springer, 2001.
Linear classifier
Gunn, S. (1998). Support vector machines for classification and regression. Technical report, University of
Southampton. http://www.isis.ecs.soton.ac.uk/resources/svminfo/
The margin
Non-linear classifier
The constraints are unchanged: $\alpha_i \ge 0$, $i = 1, \ldots, l$ and $\sum_{j=1}^{l} \alpha_j y_j = 0$
The classifier implementing the optimal separating hyperplane in the feature space is given by
$f(x) = \mathrm{sgn}\left(\sum_{i \in SVs} \alpha_i y_i K(x_i, x) + b\right)$ (3)
where
$\mathrm{sgn} : \mathbb{R} \to \{-1, 0, 1\}$, $x \mapsto y = \mathrm{sgn}(x)$ (4)
The Support Vectors (SVs) will have non-zero Lagrange multipliers, $\alpha_i$.
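The decision function (3) can be evaluated directly once the multipliers and the bias are known. The sketch below does so for the XOR problem with a Gaussian kernel. It does not solve the quadratic programme: the multipliers are set by symmetry to one common value that places every point on the margin, and sigma = 1 and b = 0 are assumptions made for illustration. The names kern, a and pred are hypothetical.

```matlab
% XOR data: one row per point, labels in {-1, +1}
X = [0 0; 1 1; 0 1; 1 0];
y = [-1; -1; 1; 1];

sigma = 1;                        % assumed kernel width
% Gaussian kernel between the rows of A and the rows of B
kern = @(A, B) exp(-(bsxfun(@plus, sum(A.^2, 2), sum(B.^2, 2)') ...
                     - 2*A*B') / sigma^2);

% Assumed multipliers: equal by symmetry, scaled so that y_i*f(x_i) = 1
a     = 1 / (1 + exp(-2) - 2*exp(-1));
alpha = a * ones(4, 1);           % all four points act as support vectors
b     = 0;                        % assumed bias

% Equation (3): f(x) = sgn( sum_i alpha_i y_i K(x_i, x) + b )
f = @(Xnew) sign(kern(Xnew, X) * (alpha .* y) + b);

pred = f(X);                      % predicted labels for the training points
disp([y pred]);                   % true labels next to predictions
```

The multipliers satisfy the constraint $\sum_j \alpha_j y_j = 0$, and the classifier separates the four XOR points, which no linear classifier in the input space can do.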
SVM classifier
C parameter
Gunn, S. (1998). Support vector machines for classification and regression. Technical report, University of
Southampton. http://www.isis.ecs.soton.ac.uk/resources/svminfo/
[Figure: XOR problem in the (x1, x2) plane, both axes from 0 to 1, drawn with svcplot(X,Y,ker,alpha,b0,sigma)]
Outline Astronomy Computer Vision Speech Recognition Bioinformatics Fault diagnosis
Outline
In collaboration with
Dr. Peter Tino, School of Computer Science, University of
Birmingham, UK
Dr. Ilya Mandel, School of Physics and Astronomy,
University of Birmingham, UK
Sultanah AL Otaibi; Peter Tino; Juan C. Cuevas-Tello; Ilya Mandel; Somak Raychaudhury (2016) Kernel regression
estimates of time delays between gravitationally lensed fluxes, Monthly Notices of the Royal Astronomical Society,
doi: 10.1093/mnras/stw510 ISSN: 0035-8711 http://arxiv.org/abs/1508.03439
Gonzalez-Grimaldo, R.A., Cuevas-Tello, J.C., (2008) Analysis of Time Series with Neural Networks.
Mexican International Conference on Artificial Intelligence, IEEE Computer Society Proceedings,
pp. 131-137
Cuevas-Tello, J.: Estimating time delays between irregularly sampled time series. Ph.D. thesis,
School of Computer Science, University of Birmingham (2007).
A.J. Chipperfield, P.J. Fleming, H. Pohlheim, C.M. Fonseca, Genetic Algorithm Toolbox for use with MATLAB, first ed., Automatic Control and Systems Engineering, University of Sheffield, 1996
Cuevas-Tello, J., Tino, P., Raychaudhury, S., Yao, X., Harva, M.: Uncovering delayed patterns in noisy and irregularly sampled time series: an astronomy application. Pattern Recognition 43(3), 1165-1179 (2010)
Computer Vision
In collaboration with
Dr. Cesar A. Puente Montejano, UASLP
Dr. J. Ignacio Nunez Varela, UASLP
[Figure: data layout with inputs x1 to x16 and samples Fig1 to Fig30, each with a two-element target, e.g. Fig1 -> (1, 0) and Fig30 -> (0, 1)]
Speech Recognition
In collaboration with
Dr. Manuel Valenzuela, ITESM Mty
Dr. Juan Nolazco, ITESM Mty
Backpropagation FFNN
Probabilistic Neural Networks (PNN)
Deep Neural Networks (DNN)
Gaussian Mixture Models (GMM)
Speech Processing
  Analysis or Synthesis
  Coding
  Recognition
    Speech Recognition
    Language Identification
    Speaker Recognition
      Detection
      Verification
      Identification
Speaker Recognition
Speaker Verification
Verify a person's claimed identity from their voice.
The identity claim includes entering an employee number, smart card and others.
Speaker Identification
Deciding whether the speaker is a specific person, belongs to a group of persons, or is unknown.
There is no a priori identity claim.
(Campbell 1997, Cieri et al. 2014)
MFCCs
[Figure, two panels: speech signal (y-axis: amplitude) and the same signal with the MFCC coefficients C1, ... marked]
Classification
$\text{Data} = \begin{pmatrix} P = & P_1 & P_2 & \cdots & P_5 \\ & [3000 \times 48] & [3000 \times 48] & \cdots & [3000 \times 48] \\ & + & + & \cdots & + \\ T_c = & \text{class} = 1 & \text{class} = 2 & \cdots & \text{class} = 2 \\ & [3000 \times 5] & [3000 \times 5] & \cdots & [3000 \times 5] \end{pmatrix}$ (1)
Bioinformatics
In collaboration with
Dr. Christian A. Garcia Sepulveda, Fac. Medicina, UASLP
MSc. J. Salomon Altamirano, PhD Student, UASLP
MSc. D. Alejandro Glz. Bandala, PhD Student, UASLP
Bioinformatics: classification
Cuevas Tello, J.C., Hernandez-Ramirez, Daniel, Garcia-Sepulveda, Christian A. (2013) Support Vector Machine algorithms in the search of KIR gene associations with disease, Computers in Biology and Medicine 43 (2013) 2053-2062
J. Gilberto Rodriguez-Escobedo, Christian A. Garcia-Sepulveda, and Juan C. Cuevas-Tello (2015) KIR Genes and
Patterns Given by the A Priori Algorithm: Immunity for Haematological Malignancies, Computational and
Mathematical Methods in Medicine, vol. 2015, Article ID 141363, 11 pages, 2015. doi:10.1155/2015/141363
Fault diagnosis
In collaboration with
Dr. Ciro A. Nunez Gtz, Electrical Engineering, UASLP
Dr. Nancy Visairo Cruz, Electrical Engineering, UASLP
M.Sc. Eugenio Camargo Trigueros, PhD student, Electrical
Engineering, UASLP
Eng. Juan Jose Acosta E. (2016)
Manuel Alejandro Gomez Vazquez, B.Tech student,
Informatics Engineering
Cristian Garcia Huerta, B.Tech student, Computer
Engineering
Backpropagation FFNN
General Regression Neural Networks (GRNN)
Fault diagnosis
Questions?