Professional Documents
Culture Documents
DL 12 July
Pauli Miettinen
Chapter 3
Column
Decompositions
Pauli Miettinen
Motivation
Pauli Miettinen
The CX decomposition
is either F or 2
Pauli Miettinen
Why CX?
Feature selection
Pauli Miettinen
X = C+A
Pauli Miettinen
How to select C?
subsets of columns
m
k
But how?
Pauli Miettinen
of matrix A is
n-by-n orthogonal
R11
= QR = Q
0
Permutation matrix
that satisfies
k (A)
p1 (k, m)
k+1 (A)
DMM, summer 2015
n-by-m
min (R11 )
m x (R22 )
R12
R22
k-by-(mk)
(nk)-by-(mk)
k (A)
p2 (k, m)
Pauli Miettinen
k+1 (A)
CX and RRQR
Pauli Miettinen
10
Computing CX by sampling
pj = k(V Tk )j k22 / k
2
Pauli Miettinen
11
Notes on sampling
Pauli Miettinen
12
13
CX with exact k
1/2
From A = UVT
T
return C = A12
Pauli Miettinen
14
Pauli Miettinen
15
Non-Negative CX
Strauch 2014
Pauli Miettinen
16
Motivation
If data is non-negative, so is C
Pauli Miettinen
17
The non-negative CX
decomposition
Pauli Miettinen
18
Geometry of NNCX
Columns in C
Columns not in C
Convex cone
Projections
1
0.9
0.8
0.7
0.6
0.5
0.4
0.3
0.2
0.1
1.5
0
1.5
1
1
0.5
0.5
0
Pauli Miettinen
19
Otherwise it is internal
Pauli Miettinen
20
Pauli Miettinen
21
The convex_cone
algorithm
Set R A
repeat
Set R R cx
Pauli Miettinen
22
Or truncated-to-zero pseudo-inverse
Pauli Miettinen
23
Application: Neuroimaging
182
time
Human expert
images
Strauch 2014
Algorithm
Pauli Miettinen
24
suitable algorithm for the task. For signal selection from honeybee imaging
movies we can thus compute [T := C, S := X 0+ ] = Convex cone(A, c).
Then, T contains c time series that can be used as input for higher-level
data analysis as demonstrated later in Section 9.3. Matrix S (see Figure 9.1 for
an example) contains information about glomerulus shape, overlap and where
in space the signals from T are located. Successful selection of pure signals into
T leads to sparse images in S (cp. Section 7.4.5) that allow for interpretation
of the time series and images as the signal and shape of glomeruli.
Further, Ac = T S is a rank-c reconstruction of the original movie A that
can be used for visualisation purposes.
Application contd
Pauli Miettinen
25
Column-Row
Decompositions
Pauli Miettinen
26
Often c = r = k
Pauli Miettinen
27
Why CUR?
Pauli Miettinen
28
CUR is two-sided CX
U = C AR
Pauli Miettinen
29
Pauli Miettinen
30
Set U = X*Z AR
Pauli Miettinen
31
Pauli Miettinen
32
Pauli Miettinen
33
Literature
Pauli Miettinen
34