Uploaded by Adi Suseno

principal


The idea: find which features correlate strongly with each other; if the correlation between two features is high, then (at least) one of those features can be eliminated.

Features with the highest variances are the most interesting.

Example: calculating covariance between two features X and Y, where mx = avg(X) and my = avg(Y).

Sample     X      Y      X-mx     Y-my
1          2.5    2.4     0.69     0.49
2          0.5    0.7    -1.31    -1.21
3          2.2    2.9     0.39     0.99
4          1.9    2.2     0.09     0.29
5          3.1    3       1.29     1.09
6          2.3    2.7     0.49     0.79
7          2      1.6     0.19    -0.31
8          1      1.1    -0.81    -0.81
9          1.5    1.6    -0.31    -0.31
10         1.1    0.9    -0.71    -1.01

avg=       1.81       1.91
stdev=     0.785211   0.846496

Cov(X,Y) = (1/n)*[(x1-mx)(y1-my) + (x2-mx)(y2-my) + ... + (xn-mx)(yn-my)]

Notice that Cov(X,X) is the same thing as variance.

Cov(X,Y) = 0.5539

Cov(X,X) = 0.5549

Cov(Y,Y) = 0.6449

Cov(Y,X) = Cov(X,Y)
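The hand calculation above can be reproduced in a few lines. This is a minimal sketch in plain Python; the notes do not name a language, so Python and the helper name `cov` are assumptions made for illustration. It uses the same 1/n formula as above. Note that the stdev row of the table matches the n-1 (sample) standard deviation, which is what `statistics.stdev` computes.

```python
import statistics

# Example data from the table above.
X = [2.5, 0.5, 2.2, 1.9, 3.1, 2.3, 2.0, 1.0, 1.5, 1.1]
Y = [2.4, 0.7, 2.9, 2.2, 3.0, 2.7, 1.6, 1.1, 1.6, 0.9]


def cov(a, b):
    """Covariance with the 1/n normalisation used in these notes."""
    n = len(a)
    ma = sum(a) / n  # average of the first feature
    mb = sum(b) / n  # average of the second feature
    return sum((ai - ma) * (bi - mb) for ai, bi in zip(a, b)) / n


cov_xy = cov(X, Y)
cov_xx = cov(X, X)  # same thing as the (1/n) variance of X
cov_yy = cov(Y, Y)

# Rounded to 4 decimals these are 0.5539, 0.5549 and 0.6449.
print(round(cov_xy, 4), round(cov_xx, 4), round(cov_yy, 4))

# The table's stdev row uses the n-1 divisor instead.
print(statistics.stdev(X), statistics.stdev(Y))
```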

For easy reference, tabulate the result by putting all the covariances into a covariance matrix Cov.

Elements of Cov are Cij, where Cij = Cov(Xi, Xj). In our case, X1 = X and X2 = Y.

Cov =
  0.5549   0.5539
  0.5539   0.6449

At this point, we could program this manually. However, what if we have many features?

Since the covariances can be arranged in a matrix, we can use matrix math.

If we can come up with covariance formulas that use matrices, that makes our life easier because there is tried-and-tested matrix code available.

We would feed matrices into a software package/library function, which would do the matrix math.

So let's work out the matrix math.

In general, the input data is a flat file, i.e. a matrix of n samples (i.e. rows) and f features (i.e. columns):

X =
  x11   x12   x13   ...   x1f
  x21   x22   x23   ...   x2f
  ...
  xn1   xn2   xn3   ...   xnf

To transpose a matrix means to switch rows and columns; i.e. if the original matrix had elements Xij,

the transposed matrix has elements Xji.

Transpose(X) =
  x11   x21   ...   xn1
  x12   x22   ...   xn2
  x13   x23   ...   xn3
  ...
  x1f   x2f   ...   xnf

Cov = 1/n* [Transpose(X-mx) * (X-mx)]

Here mx is the matrix of size n x f that contains the averages of all columns:

Each row of matrix mx is the same and contains [avg(1st column) avg(2nd column) avg(3rd column) ... avg(last column)], so X-mx subtracts from each element the average of its column.

Cov is a square matrix of dimension f x f (because we are comparing each feature to all other features).

Cov has elements Cij, with i = 1, ..., f and j = 1, ..., f.
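The matrix formula Cov = 1/n * [Transpose(X-mx) * (X-mx)] can be sketched without any library, just to make the steps visible. The helper names `transpose`, `matmul` and `covariance_matrix` are assumptions made for this illustration; in practice a library routine such as NumPy's `numpy.cov` (called with `rowvar=False, bias=True`) should give the same matrix.

```python
def transpose(m):
    """Switch rows and columns of a list-of-rows matrix."""
    return [list(col) for col in zip(*m)]


def matmul(a, b):
    """Plain matrix product of two list-of-rows matrices."""
    b_cols = transpose(b)
    return [[sum(x * y for x, y in zip(row, col)) for col in b_cols]
            for row in a]


def covariance_matrix(data):
    """Cov = (1/n) * Transpose(X - mx) * (X - mx) for n rows, f columns."""
    n = len(data)
    f = len(data[0])
    # Column averages: one row of the matrix mx.
    means = [sum(row[j] for row in data) / n for j in range(f)]
    # X - mx: subtract each column's average from its elements.
    centered = [[row[j] - means[j] for j in range(f)] for row in data]
    product = matmul(transpose(centered), centered)  # f x f
    return [[value / n for value in row] for row in product]


# The two-feature example from above, one sample per row.
X_col = [2.5, 0.5, 2.2, 1.9, 3.1, 2.3, 2.0, 1.0, 1.5, 1.1]
Y_col = [2.4, 0.7, 2.9, 2.2, 3.0, 2.7, 1.6, 1.1, 1.6, 0.9]
data = [list(pair) for pair in zip(X_col, Y_col)]

cov_matrix = covariance_matrix(data)
for row in cov_matrix:
    print([round(value, 4) for value in row])
```

The result is an f x f matrix (2 x 2 here) and it is symmetric, since Cov(Y,X) = Cov(X,Y).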

--------------------------------------------------------------------------------
At this point, we can just let the software calculate the covariance. But let's go one more step and calculate Cij.

We can calculate Cij using column vectors.

PS - this may look "upside down", but it states the problem correctly: we are comparing columns (i.e. features).

A (column) vector is a matrix of only 1 column.

For example, if we have a row V = [1 2], then its transpose is the column vector:

Transpose(V) =
  1
  2

i.e. just V "flipped over".

Obviously, transposing a transpose gives back the original: Transpose(Transpose(V)) = V.

Let us assume that we label column vectors Xk, where each Xk is a vector and represents the k-th feature.

For example, in the example above:

X1 = Transpose[2.5, 0.5, 2.2, 1.9, 3.1, 2.3, 2, 1, 1.5, 1.1]

X2 = Transpose[2.4, 0.7, 2.9, 2.2, 3.0, 2.7, 1.6, 1.1, 1.6, 0.9]

Cij can be obtained as the scalar (dot) product of two centered column vectors, divided by n as in the matrix formula for Cov:

Cij = (1/n) * [Transpose(Xi - mi) * (Xj - mj)] = (1/n) * SUM[k=1,n] (Xki - mi)*(Xkj - mj)

where Xki and Xkj are the elements of the input matrix as shown above,

and mi and mj are the averages of the ith and jth column, respectively.
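A single element Cij can also be computed directly from two columns. This is a minimal sketch; it assumes the same 1/n normalisation as the matrix formula for Cov, and the helper names `column` and `c_ij` are hypothetical. Note that the code counts columns from 0, while the text counts features from 1.

```python
def column(data, k):
    """Return the k-th column (feature) of a list-of-rows matrix, 0-based."""
    return [row[k] for row in data]


def c_ij(data, i, j):
    """(1/n) * Transpose(Xi - mi) * (Xj - mj) for columns i and j (0-based)."""
    n = len(data)
    xi, xj = column(data, i), column(data, j)
    mi = sum(xi) / n  # average of column i
    mj = sum(xj) / n  # average of column j
    # Dot product of the two centered column vectors, divided by n.
    return sum((a - mi) * (b - mj) for a, b in zip(xi, xj)) / n


# Rows are samples; column 0 is X1 (= X) and column 1 is X2 (= Y).
data = [
    [2.5, 2.4], [0.5, 0.7], [2.2, 2.9], [1.9, 2.2], [3.1, 3.0],
    [2.3, 2.7], [2.0, 1.6], [1.0, 1.1], [1.5, 1.6], [1.1, 0.9],
]

c12 = c_ij(data, 0, 1)  # C12 in the text's 1-based numbering
c21 = c_ij(data, 1, 0)  # C21, equal to C12 by symmetry
print(round(c12, 4), round(c21, 4))
```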

--------------------------------------------------------------------------------
So far, we calculated covariance but we are not done yet with PCA: we need to find the eigenvalues of the covariance matrix Cov.

The eigenvalues are the solutions to the following equation:

Cov * Ei = Li*Ei

Ei are eigenvectors (each Ei has dimension f x 1).

Li are the corresponding eigenvalues (i.e. the variance of the data along the direction of each Ei), i = 1, .., f.

I is the identity matrix (square matrix with all 1's on the diagonal and 0's otherwise).

We will ask the software to do that for us. Conceptually, the eigenvalues Li are the solutions L of the following equation:

determinant(Cov - L*I) = 0
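For the 2 x 2 example, the determinant equation can be solved by hand: it expands to L^2 - (C11 + C22)*L + (C11*C22 - C12^2) = 0, a quadratic in L. The sketch below solves it with the quadratic formula. It is only an illustration for a symmetric 2 x 2 matrix with a non-zero off-diagonal element, and the function name `eigen_symmetric_2x2` is an assumption. Library eigen-solvers typically use iterative numerical methods rather than this formula.

```python
import math


def eigen_symmetric_2x2(a, b, d):
    """Eigenvalues and unit eigenvectors of [[a, b], [b, d]], assuming b != 0."""
    trace = a + d
    det = a * d - b * b
    # determinant(Cov - L*I) = L^2 - trace*L + det = 0
    root = math.sqrt(trace * trace - 4.0 * det)
    values = [(trace + root) / 2.0, (trace - root) / 2.0]  # decreasing order
    vectors = []
    for lam in values:
        # (a - lam)*v1 + b*v2 = 0 is satisfied by v = (b, lam - a).
        v1, v2 = b, lam - a
        norm = math.hypot(v1, v2)
        vectors.append((v1 / norm, v2 / norm))
    return values, vectors


# Covariance matrix of the example: C11, C12 (= C21), C22.
a, b, d = 0.5549, 0.5539, 0.6449
values, vectors = eigen_symmetric_2x2(a, b, d)

for lam, (e1, e2) in zip(values, vectors):
    # Check Cov * E = L * E by printing both sides.
    left = (a * e1 + b * e2, b * e1 + d * e2)
    right = (lam * e1, lam * e2)
    print(lam, left, right)
```

The two eigenvalues add up to C11 + C22, the total variance of the two features.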

Once you find the eigenvalues, sort them in decreasing order. Hopefully, the first few values are clearly the highest, and can be considered the most important. The smallest values correspond to directions (combinations of features) that carry little variance and can be discarded.

To find out how many to keep, pick the m<=f highest eigenvalues and calculate:

R = SUM[i=1,m]Li / SUM[i=1,f]Li

If R > Threshold, keeping only those m directions and discarding the others would be a good representation of the f-dimensional data set.
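The selection rule can be sketched as a small loop that increases m until R exceeds the threshold. The function name `components_to_keep` and the threshold value 0.95 are assumptions made for illustration; the eigenvalues are the approximate ones of the example covariance matrix [[0.5549, 0.5539], [0.5539, 0.6449]].

```python
def components_to_keep(eigenvalues, threshold):
    """Return (m, R): the smallest m whose ratio R exceeds the threshold."""
    ordered = sorted(eigenvalues, reverse=True)  # decreasing order
    total = sum(ordered)
    running = 0.0
    for m, value in enumerate(ordered, start=1):
        running += value
        ratio = running / total  # R = SUM[i=1,m]Li / SUM[i=1,f]Li
        if ratio > threshold:
            return m, ratio
    # No m satisfied R > threshold (e.g. threshold >= 1): keep everything.
    return len(ordered), 1.0


# Approximate eigenvalues of the example covariance matrix (any order).
eigenvalues = [0.044175, 1.155625]

m, ratio = components_to_keep(eigenvalues, threshold=0.95)
print(m, round(ratio, 3))
```

In this example the first eigenvalue alone accounts for roughly 96% of the total, so with the assumed threshold one direction is enough.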
