You are on page 1of 1

Exercise-1 (MA)

Data Preprocessing and Visualization

1. In the excel sheet “depression.xlsx”, find the following


a. How many elements are there in the data set?
b. How many variables are there in the data set?
c. Which variables are qualitative and which are quantitative?
d. What type of measurement scale is used for each variable?
e. What is the average age of male and female?
f. What is the average overall age?
g. Draw pie/bar chart of Outcome-wise distribution?
h. Draw appropriate bi-variate chart for treat and outcome
i. What are your observations about the data?

2. Summarize the data in the excel sheet “cereal.xlsx”.

a. Draw a pie chart of brand.


b. Find brand-wise average weight
c. Identify the type of variables
3. A survey of age of holywood actors earned is given in the excel sheet “actor.xlsx”. Answer
the following questions?
a. What are lowest and highest age?
b. Draw appropriate chart of age
c. What proportion of age is 40 or less?
d. What % of salaries is more than 50?

4. Draw appropriate bi-variate chart for the data given in the excel sheet “graduation.xlsx”.

5. Pre-process the data in the excel sheet “student_survey.xlsx”. Impute missing value and find
the following.

a. Run summary and comment on the result.

b. Calculate average age gender-wise

c. Calculate average score in verbal course-wise

d. Find the number of students who are left handed and right handed gender-wise.

e. Find average score based on the variable “handed”.

You might also like