You are on page 1of 4

Assignment 1 Part 2

1.) Compute basic descriptive statistics for your variables, such as


the mean, median, and variance.

Descriptive Statistics: Age

Variable N N* Mean SE Mean StDev Minimum Q1 Median Q3


Age 7269 0 42.793 0.0278 2.373 -4.000 41.000 43.000 45.000

Variable Maximum
Age 49.000

Descriptive Statistics: Salary

Variable N N* Mean SE Mean StDev Minimum Q1 Median Q3


Salary 7269 0 35344 487 41546 0.000000000 10000 28000 47000

Variable Maximum
Salary 265933

Descriptive Statistics: Number of Children

Variable N N* Mean SE Mean StDev Minimum Q1


Number of Childr 7269 0 1.9857 0.0171 1.4546 0.000000000 1.0000

Variable Median Q3 Maximum


Number of Childr 2.0000 3.0000 11.0000

2.) Choose one variable to describe in more detail. Investigate


whether the variable is skewed or approximately symmetric, and
check whether it has any apparent outliers.

Salary: It is skewed to the left, because the median is near the minimum
rather than the maximum. Outliers occur to People who earned more than
$105,000 in the previous year.
Boxplot of Salary

0 50000 100000 150000 200000 250000 300000


Salary

3.) Construct a relative frequency distribution, and graph the


relative frequency histogram.
Relative Frequency Histogram

Histogram of Salary
25

20

15
Percent

10

0
0 45000 90000 135000 180000 225000 270000
Salary

30.7333% 0 TO 15000
21.9150% 15000 TO 30000
19.7964% 30000 TO 45000
11.5009% 45000 TO 60000
6.89228% 75000 TO 90000
3.13661% 90000 TO 105000
2.11859% 105000 TO 120000
.962994% 120000 TO 135000
.715367% 135000 TO 150000
.398954% 150000 TO 165000
0 165000 TO 180000
0 180000 TO 195000
0 195000 TO 210000
0 210000 TO 225000
0 225000 TO 240000
0 240000 TO 255000
1.82969% 255000 TO 270000
0 270000 TO 285000
0 285000 TO 300000
Cumulative Frequency Histogram

Histogram of Salary

100

80
Cumulative Percent

60

40

20

0
0 35000 70000 105000 140000 175000 210000 245000
Salary

4.) What do you think the data- generating process is for this
variable?

The United States economy and the census.

5.) Print and document the data set (give your data source(s),
including URL(s), and the definitions of your variables, including
units).

The National Longitudinal Surveys: http://www.bls.gov/nls/

AGE OF RESPONDENT: Years

NUMBER OF CHILDREN EVER BORN: Children

TOTAL INCOME FROM WAGES AND SALARY IN PAST CALENDAR YEAR: Dollars

You might also like