Professional Documents
Culture Documents
Submitted to
Skander Lazrak
Submitted by
6229934
Brock University
Abstract
This paper intends to show descriptive statistics derived by means of SAS Studio using data retrieved from
Wharton Research Data Services (WRDS) using CUSIPs over the period ranging from 2012 to 2016. The
data on the selected annual financials of the North American companies were filtered to generate descriptive
statistics later.
Table of Contents
1. Introduction ........................................................................................................................................... 5
2. Data Extraction and Filtration ............................................................................................................... 5
3. Description of the Sample ..................................................................................................................... 6
4. Analysis of the Descriptive Statistics.................................................................................................... 6
5. Conclusion ............................................................................................................................................ 7
6. Appendix ............................................................................................................................................... 8
4
List of Tables
1. Introduction
Descriptive statistics facilitates in describing and understanding the characteristics of a data set by way of
short summaries about the sample and measures of the data. The recognized types of descriptive statistics
are mean, median, mode, standard deviation, kurtosis and skewness. They are either measures of central
tendency or measures of variability. And we can use graph or tables to help us understand meaning of the
data being analyzed. Measures of central tendency indicates the central position of a distribution for a data
set, whereas measures of variability give an idea as to how spread-out the distribution is.
On WRDS, at first, a list of CUSIPs of all North American index constituents over a period of 2001-2018
was generated. Using these CUSIPs, six annual financials (total assets, total dividend, net income, total
revenue, total shareholders’ equity, total market value), and economic sector codes of the companies were
queried on WRDS. It is worth mentioning that while querying, it was made sure that the currency was USD,
industry format industrial (INDL), population source domestic, and company status active. After receiving
the queried data, for the sake of uniformity and precision of the analysis, the companies which had both
actual period end date and data date as December 31 in any of the year between 2012 and 2016 were taken
into consideration. In the final dataset, all the missing values (blanks in .csv file) were inputted as dots (.)
automatically by SAS. Hence, the missing values did not pollute the descriptive statistics.
6
The query on WRDS generated data of 6210 rows on financials of 1324 companies in North America spread
across 11 economic sectors: basic materials, capital goods, communication services, consumer cyclicals,
consumer staples, energy, financials, health care, technology, transportation and utilities. In addition to that,
the companies are diverse in terms of their financials. Over 2012-2016, the sample consists of total assets
of the companies ranging from USD 0.09 million to USD 3.28 trillion having median USD 4.37 billion.
Their net incomes range from USD -14.68 billion to USD 83.96 billion, and market values from USD 0.96
million to USD 540.65 billion having the medians USD 110.34 million and USD 2.50 billion. Please refer
The descriptive statistics concerning the six financials (variables) of the companies were analyzed over the
period 2012-2016. The means of total assets, net income and total revenue (please refer to table 1) have
steadily decreased in the period, whereas the means of total dividend, total shareholders’ equity and total
market value have increased. The distributions of all the variables over the years have remained positively
skewed, meaning asymmetry owing to their means being greater than their medians. On the other hand, the
kurtosis of the distributions of all those variables is many times greater than 3 indicating that they are all
Conducting normality tests on the variables corroborates the fact that the distributions of the variables are
not normal (please refer to table 3). There have been 3 tests of normality on the variables: Kolmogorov-
Smirnov Test, Cramer-von Mises Test and Anderson- Darling Test. All 3 tests equivocally lead to the same
conclusion for all the variables. Kolmogorov- Smirnov Test statistics have p values <0.010 across all the
7
variables, whereas Cramer-von Mises Test and Anderson- Darling Test statistics have p values <0.005, all
Some interesting facts arise when the variables are analyzed with respect to the economic sectors the
companies belong to (please refer to table 4). Companies in the financial sector has, as expected, the highest
average total assets, and companies in technology, which does not require physical assets much, has the
lowest average. Though companies in energy sector earn the highest average revenue, and companies in
technology sector do the lowest, however, companies in communication services sector are the ones which
have the highest average net income, and basic materials sector has experienced the lowest average net
income. It is the companies in communication services sector which also have given the highest average
dividend, whereas consumer cyclicals sector has given the lowest average dividend. With respect to average
total market value, companies in communication services sector hold the highest average. Companies in
consumer cyclicals have the lowest average total market value. Last but not least, energy sector is backed
by highest average total shareholders’ equity, whereas transportation is backed by the least. Across all the
sectors the skewness of the distributions of the variables is still positive. The kurtosis of all distributions of
the variables across the sectors is leptokurtic except the one of total market value and total revenue in
5. Conclusion
Analyzing the descriptive statistics of the variables of the companies over the period 2012-2016 help reveal
insights on central tendency and variability of the financials. Over the years, though skewness and kurtosis
of the variables have remained similar in nature, a little difference has been observed when the variables
6. Appendix
The table shows year-wise (2012-2016) moments of the company financials (total assets, total dividend, net income, total revenue, total
shareholders’ equity and total market value). All the financials are in USD million when extracted from WRDS.
The table shows the year-wise (2012-2016) quantiles of the financials (total assets, total dividend, net income, total revenue, total
shareholders’ equity and total market value). 100% Maximum = maximum value, 75% Q3 = 3rd quantile, 50% Median = 2nd quantile or
median, 25% Q1 = 1st quantile, and 0% Minimum = minimum value. All the financials are in USD million when extracted from WRDS.
The table shows the output of the normality test on the financials (total assets, total dividend, net income, total revenue, total shareholders’ equity and total
market value). D, W-Sq and A-Sq are statistics from Kolmogorov- Smirnov Test, Cramer-von Mises Test and Anderson- Darling Test.
Total Assets Total Dividend Net Income Total Revenue Total Shareholders’ Equity Total Market Value
Kolmogorov-Smirnov Test
Statistic (D) 0.42 0.41 0.31 0.36 0.34 0.35
P Value (Pr>D) <0.010 <0.010 <0.010 <0.010 <0.010 <0.010
Cramer-von Mises Test
Statistic (W-Sq) 377.46 323.45 252.18 260.83 262.09 206.66
P Value (Pr>W-Sq) <0.005 <0.005 <0.005 <0.005 <0.005 <0.005
Anderson-Darling Test
Statistic (A-Sq) 1797.89 1546.59 1217.77 1276.18 1286.36 1016.67
P Value (Pr>A-Sq) <0.005 <0.005 <0.005 <0.005 <0.005 <0.005
Table 3: Normality test of the financials
11
The table shows moments of the financials total assets, total dividend, net income, total revenue, total shareholders’ equity and total market value) with respect to the
economic sectors the companies belong to. The numbers in the column indicate the code for a specific sector. Here, 600 = transportation, 700 = utilities, 800 = financials,
905 = health care, 925 = capital goods, 935 = energy, 940 = technology, 970 = basic materials, 974 = communication services, 976 = consumer cyclicals, 978 = consumer
staples. All the financials are in USD million when extracted from WRDS.
600 700 800 905 925 935 940 970 974 976 978
Total Assets
No. of observations 191 270 1146 424 498 315 503 349 155 580 311
Sum of observations 2237926.78 8165631.70 173082613 7697389.66 8270749.60 13633308.10 3541588.71 5012804.75 8606529.58 7120342.94 6212832.05
Mean 11716.89 30243.08 151031.94 18154.22 16607.93 43280.34 7040.93 14363.33 55525.99 12276.45 19976.95
Standard Deviation 15107.71 30751.09 450706.17 32868.00 60802.74 81927.59 15948.19 21667.15 80733.48 35228.78 35144.19
Variance 228242994.00 945629768.00 2.03e11 1080305826.00 3696973641.00 6712130746.00 254344983.00 469465781.00 6517895346.00 1241067042.00 1235114326.00
Skewness 1.52 1.76 4.24 2.72 8.80 2.66 4.61 2.68 2.13 5.07 3.13
Kurtosis 1.11 3.42 19.32 7.67 85.69 6.45 26.51 7.49 4.52 26.48 11.92
Total Dividend
No. of observations 191 258 1111 424 498 315 502 349 153 579 311
Sum of observations 35841.92 127997.81 636416.95 213089.70 148529.55 308130.03 98492.44 99256.62 224804.05 105013.43 209658.34
Mean 187.65 496.11 572.83 502.57 298.25 978.19 196.20 284.40 1469.30 181.37 674.14
Standard Deviation 481.40 565.40 3305.25 1427.65 949.03 2420.95 655.42 558.82 2629.72 475.32 1308.99
Variance 231747.90 319677.56 10924741.10 2038186.61 900669.07 5860999.49 429584.57 312288.40 6915477.72 225931.22 1713471.30
Skewness 3.81 2.27 19.22 3.30 6.99 3.45 5.17 3.88 2.41 4.60 2.59
Kurtosis 14.73 8.12 446.01 10.16 57.30 12.33 30.19 18.10 4.97 23.91 6.85
Net Income
No. of observations 191 270 1146 424 498 315 503 349 155 580 311
Sum of observations 127092.42 154677.42 1148570.38 470447.24 275841.13 377028.59 277988.14 57482.90 291687.07 288727.69 416154.78
Mean 665.40 572.87 1002.24 1109.54 553.89 1196.91 552.66 164.70 1881.85 497.80 1338.11
Standard Deviation 1497.02 1243.63 3997.51 2770.87 1732.48 5541.66 1833.24 1480.44 4420.08 1295.83 2344.56
Variance 2241080.48 1546637.94 15980136.70 7677729.75 3001510.66 30710105.70 3360777.64 2191714.20 19537146.30 1679181.04 5496985.93
Skewness 3.37 1.25 10.96 3.84 3.10 3.69 5.52 -2.04 2.71 4.81 2.36
Kurtosis 14.44 28.73 186.27 18.44 26.54 21.25 36.81 21.55 7.41 26.48 6.90
Total Revenue
No. of observations 191 270 1146 424 498 315 503 349 155 580 311
Sum of observations 1785442.61 2387290.47 9292484.57 4890753.30 5035093.19 10671252.60 2372512.46 3055366.01 4047383.73 5633186.69 4065431.86
Mean 9347.86 8841.81 8108.62 11534.79 10110.62 33876.99 4716.72 8754.63 26112.15 9712.39 13072.12
Standard Deviation 13018.73 81680519.50 19401.54 22395.61 21418.48 83975.15 11372.84 13731.98 37570.29 26461.59 21942.81
Variance 169487575.00 9037.72 376419987.00 501563575.00 458751557.00 7051827420.00 129341627.00 188567281.00 1411527132.00 700216217.00 481487119.00
Skewness 2.17 2.19 3.50 3.70 4.75 3.66 5.38 2.96 1.84 4.78 4.10
Kurtosis 4.20 6.69 12.49 17.86 26.84 13.85 36.46 9.68 2.46 22.62 22.02
Total Shareholders’
Equity
No. of observations 191 270 1146 424 498 315 503 349 155 580 311
Sum of observations 551026.15 2450405.59 13611323.10 3075050.15 2413246.21 6664900.04 1644092.41 2217979.40 2717035.41 1799283.70 2104743.53
Mean 2884.95 9075.57 11877.24 7252.47 4845.87 21158.41 3268.57 6355.24 17529.26 3102.21 6767.66
Standard Deviation 4208.48 10148.12 30727.17 15496.89 13091.03 40787.53 6853.46 10282.61 30762.73 7756.99 12655.48
Variance 17711367.70 102984387 944159223.00 240153631.00 171375237.00 1663623346.00 46969979.50 105732181.00 946345986.00 60170967.90 160161263.00
Skewness 2.33 2.42 4.90 3.29 7.28 2.79 3.85 2.96 2.75 5.00 2.77
Kurtosis 6.88 7.05 27.26 10.78 63.77 7.21 20.83 9.23 7.16 28.37 8.25
Total Market Value
No. of observations 176 230 754 373 465 270 442 270 65 533 261
Sum of observations 2033677.36 3148894.63 8494364.60 6780717.48 6061971.04 6134670.61 3579673.74 2207275.94 2335832.05 3672781.63 5755282.81
Mean 11554.98 13690.84 11265.73 18178.86 13036.49 22721.00 8098.80 8175.09 35935.87 6890.77 22050.89
Standard Deviation 20253.30 13760.07 31351.92 36101.84 32384.61 59993.26 22687.09 11132.27 67884.60 11627.51 38936.79
Variance 410196299.00 189339734.00 982943347.00 1303343473.00 1048763196.00 3599191648.00 514704063.00 123927589.00 4608319179.00 135199194.00 1516074001.00
Skewness 3.06 1.89 6.38 3.03 5.72 4.82 5.98 2.29 2.05 3.81 2.40
Kurtosis 9.55 4.67 46.38 9.65 40.38 25.30 41.89 6.16 2.82 18.95 5.55
Table 4: Moments of the financials with respect to economic sectors
12
The following code snippets generate the output shown in the tables.