You are on page 1of 15

Dr.

Kartika Fithriasari
EDA adalah suatu pendekatan untuk analisis data yang
menggunakan berbagai teknik (terutama grafis) untuk
memaksimalkan wawasan kumpulan data;
mengungkap struktur yang mendasari;
ekstrak variabel penting;
mendeteksi outlier dan anomali;
uji asumsi yang mendasari;
mengembangkan parsimonious models.
Pendekatan EDA tepatnya adalah suatu pendekatan
bukan seperangkat teknik, tapi sikap/ filosofi tentang
bagaimana sebuah analisis data harus dilakukan
Discover the structure
Find pattern
Indentify relationship
EDA helps to prevent common statistics
prolem
Most Statistical Techniques requires
special assumption before they could
be employed, and EDA can investigate
these assumption
Untuk analisis klasik,urutan adalah:
Masalah => Data => Model => Analisis => Kesimpulan

Untuk EDA, urutan adalah :


Masalah => Data = Analisis> = Model> => Kesimpulan
CDA is Confirmatory Data Analysis
Confirmatory
Formulate model before seeing the data
Analyze the data
Asses significance (inference) based on model

EDA & CDA are important


EDA is tools for exploring and investigating data
CDA is tools for validating hypothesis
Visual data eksploratory (VDE) is the core of EDA
Given 4 datasets and
Analyze the data X1 Y1 X2 Y2 X3 Y3 X4 Y4
10 8.04 10 9.14 10 7.46 8 6.58
8 6.95 8 8.14 8 6.77 8 5.76
13 7.58 13 8.74 13 12.74 8 7.71
9 8.81 9 8.77 9 7.11 8 8.84
11 8.33 11 9.26 11 7.81 8 8.47
14 9.96 14 8.1 14 8.84 8 7.04
6 7.24 6 6.13 6 6.08 8 5.25
4 4.26 4 3.1 4 5.39 19 12.5
12 10.84 12 9.13 12 8.15 8 5.56
7 4.82 7 7.26 7 6.42 8 7.91
5 5.68 5 4.74 5 5.73 8 6.89
X1 Y1 X2 Y2 X3 Y3 X4 Y4
10 8.04 10 9.14 10 7.46 8 6.58
8 6.95 8 8.14 8 6.77 8 5.76
13 7.58 13 8.74 13 12.74 8 7.71
9 8.81 9 8.77 9 7.11 8 8.84
11 8.33 11 9.26 11 7.81 8 8.47
14 9.96 14 8.1 14 8.84 8 7.04
6 7.24 6 6.13 6 6.08 8 5.25
4 4.26 4 3.1 4 5.39 19 12.5
12 10.84 12 9.13 12 8.15 8 5.56
7 4.82 7 7.26 7 6.42 8 7.91
5 5.68 5 4.74 5 5.73 8 6.89
Mean 9 7.50 9 7.50 9 7.5 9 7.50
Std Dev 3.32 2.03 3.32 2.03 3.32 2.03 3.32 2.03
Correlation 0.82 0.82 0.82 0.82
Date Source compound Extraction method Weight observed
29.11.93 NO hot iron 2.30143
5.12.93 NO hot iron 2.29816
6.12.93 NO hot iron 2.30182
8.12.93 NO hot iron 2.29890
12.12.93 Air hot iron 2.31017
14.12.93 Air hot iron 2.30986
19.12.93 Air hot iron 2.31010
22.12.93 Air hot iron 2.31001
26.12.93 N2O hot iron 2.29889
28.12.93 N2O hot iron 2.29940
9.1.94 NH4NO2 hot iron 2.29849
13.1.94 NH4NO2 hot iron 2.29889
27.1.94 Air ferrous hydrate 2.31024
30.1.94 Air ferrous hydrate 2.31030
1.2.94 Air ferrous hydrate 2.31028
Find a graph that
shows clearly that
the data can be
divided into two
different groups

You might also like