You are on page 1of 6

Excel 2013 with XLSTAT

Introduction to Excel 2013 with XLSTAT


XLSTAT is an Excel Add-in program that provides a set of extended data analysis functions beyond the built-in functions of Excel.
In order to run XLSTAT, Macros must be enabled and the security level must be set to Medium. Once XLSTAT is installed, the
functionality can be found under the XLSTAT tab (A). XLSTAT
functions can be accessed from the Menu Commands, Custom
Toolbars, or the XLSTAT ribbon. Functions are categorized into
eight groups whose icons (B) can be accessed on the XLSTAT

ribbon. Note: All instructions on this study card will require you
to first open an Excel 2013 Worksheet.
A

Descriptive Statistics
3. Click the Options tab and check the Descriptive statistics
box.
4. Click the Outputs tab to select any desired statistics in the
Quantitative data box. Click OK.

1. Select XLSTAT>Describing Data>Descriptive Statistics.

2. Under the General tab, check the Quantitative data box. Place
your cursor inside the Quantitative data box. Then, select the
column(s) of quantitative data to be analyzed.

Scatterplot
1. Select XLSTAT>Visualizing Data>Scatter plots.
2. Under the General tab, click inside the X and Y boxes to select
two (or more) columns of quantitative data. Check the box
next to Variable labels, if appropriate.
3. Click the Options tab, and select the box for Matrix of plots
and the radial option Histograms. Click OK.

ISBN 13: 978-0-321-92579-4


ISBN 10: 0-321-92579-3
90000
EAN

9 780321 925794

Correlation, Simple Regression and Residual Plots


FROM AN OPEN EXCEL 2010 WORKSHEET.

5. Click the Outputs tab and check the Analysis of variance box.

1. Select Modeling data>Linear Regression.


2. Under Y / Dependent variables, click in the box and select a
single column of quantitative data.
3. Under X / Explanatory variables, check the Quantitative box
and select a single column of quantitative data for a simple
linear regression.
4. To exclude an intercept parameter in a regression model, click
the Options tab and select the Fixed Intercept box. Enter 0
into the Fixed Intercept box.
6. To generate residuals plots, check the Correlations and
Predictions and residuals boxes under the Outputs tab.
7. To generate residual plots (ex. standardized residuals against the
predicted values) and the tted regression line with condence
bands (for a simple linear regression), click the Charts tab
and select Regression Charts, Predictions and residuals and
Condence intervals.
8. Click OK. The output will include a correlation matrix,
Goodness of Fit statistics, model parameters and the equation of
the model, a scatterplot and residual plots.

Displaying Categorical Data Using Frequency Counts, Bar Charts, and Pie Charts
Use the following procedure to generate a frequency table, a bar
chart, and a pie chart for a categorical variable.

2. Place your cursor inside the Qualitative data: box. Click on


the rst row of the variable to be analyzed and highlight the
entire column of data. The Options and Outputs tabs will
default to the required statistics and plots. Select the Charts
(2) tab and check that you want to generate both the Bar
Chart and the Pie Chart. Select OK and Continue.

1. Select XLSTAT>Describing data>Descriptive statistics.

Histogram
3. Typical output is provided:

1. Select XLSTAT>Visualizing Data>Histogram. The following


screen will appear:

2. Place your cursor inside the Data: box. Click on the rst row
of the variable to be analyzed and highlight the entire column
of data. The Options and Missing data, etc. tabs will default
to the required statistics and plots. Select OK and Continue.

Boxplot
4. To create side-by-side boxplots of a quantitative variable by
different values of a qualitative value (such as gender), after
selecting the quantitative
variable in step 2, check the
Subsamples: box. Place your
cursor in the larger box and
select the qualitative variable
to be included. Select the
Outputs tab and check the
Group plots box. Select OK
and Continue.
5. Typical output is
provided:

1. Select XLSTAT>Visualizing Data>Univariate plots.


2. Place your cursor inside
the Quantitative data: box.
Click on the rst row of the
variable to be analyzed and
highlight the entire column
of data. The Options and
Outputs tabs will default
to the required statistics
and plots. Select OK and
Continue.
3. Typical output is provided:

Normal Probability Plots and Normality Tests


1. Select XLSTAT>Describing data>Normality tests.

5. To generate a Normal probability plot, click on Charts tab and


check the Normal P-P plot and/or the Normal Q-Q plot box.
If the data follow a Normal distribution, they will fall on an
approximately straight line.

2. Under the General tab, place your cursor inside the Data box
to select a single or multiple columns of quantitative data.
3. In the Signicance level (%) box, enter a level of signicance
for the selected test(s).
4. Select one or more of the normality tests available. Each test
result includes a p-value corresponding to its test statistic. A
sufciently small p-value (less than a level of signicance)
indicates that the distribution of the data is not normal.

Sampling
sample, the third optionRandom without replacement
may be most appropriate. Review all options prior to making a
selection.
3. Enter the required number of observations into the sample in
the Sample size: box. Select OK and Continue. The resulting
output will be a random subset of the original dataset.

1. Select XLSTAT>Preparing data>Data sampling. Place your


cursor inside the Data: box. Highlight the entire range of the
dataall variables and all observations.
2. The Sampling: box includes several options. If the data have
been sorted in any way, the rst two options N rst rows and
N last rows may not be appropriate. For a simple random

Hypothesis Tests and Condence Intervals for a Proportion


1. Select XLSTAT>Parametric tests>Tests for one proportion.

4. Under the Options tab:


4.1 Select a method under Variance (condence interval)
to calculate the standard error of the distribution of the
sample proportions. A two-sided condence interval
will be calculated based on the standard error calculated
using one of these methods:
(a) Select the Sample option to calculate the standard
error based on either the Frequency or (sample)
Proportion specied under the General tab.
(b) Select the Test proportion option to calculate the
standard error based on the population proportion
specied in the Test proportion box under the
General tab.
4.2 Select the Wald option under Condence interval.
5. Click OK. An example of a z-test result and its interpretation
with a two-sided condence interval for a proportion is shown.

2. Under the General tab, follow these steps:


2.1 Enter a whole number into the Sample size box.
2.2 Enter a value between 0 and 1 for the population
proportion into the Test proportion box.
2.3 Check the z test box.
2.4 Follow one of the following procedures:
(a) After selecting the Frequency option under Data
format, click inside the Frequency box and enter
the number of Success outcomes for the sample.
(b) After selecting the Proportion option under Data
format, the Frequency box will be changed to the
Proportion box. Click inside the Proportion box
and enter the value (between 0 and 1) of the sample
proportion.
3. Under the Options tab, follow these steps:
3.1 Enter a value between 0 and 1 into the Hypothesized
difference box.
3.2 Enter a desired signicance level into the Signicance
level (%) box.
3.3 Select one of the Alternative hypothesis options from
the drop-down menu.

Hypothesis Test and Condence Interval for the Difference between Proportions
1. Select XLSTAT>Parametric tests>Tests for two proportions.
2. Select either the Frequencies or Proportions radial button
under Data Format, and enter the appropriate data into the
boxes on the top left (either whole numbers for Frequency 1
and Frequency 2 or proportions between 0 and 1).
3. Follow step 3 of Hypothesis Tests and Condence Intervals
for a Proportion.
4. Under the Options tab, select one of the options for Variance.
The two variance options represent an unpooled (unequal
variance) approach and a pooled (equal variance) approach,
respectively. Review the differences and select the option most
appropriate for your data. If you are unsure, select the more

conservative unpooled approach, which is the default. Select


OK and Continue.
5. Typical output is provided:

Hypothesis Test and Condence Interval for One Sample Mean


6. Typical output is provided:

1. Select XLSTAT>Parametric tests>One sample t-test and


z-test.
2. Place your cursor inside the Data: box. Click inside the rst
row of the variable to be analyzed and highlight the full range
of the variable. Check the box next to Students t test.
3. Select the Options tab.
4. Follow step 3 of Hypothesis Tests and Condence Intervals
for a Proportion.
5. The Missing data tab provides options for what to do with
missing data. If there are any missing values, select the second
optionRemove the observations. The Outputs tab will
default to the required statistics. Select OK and Continue.

Hypothesis Tests and Condence Intervals for Difference between Two Means
(Two Independent Samples and Paired Samples)
3.2 Select z test if the population variances are known or
estimated by large-sample variances. Then, click the
Options tab. Under Variances for the z-test, follow one
of two options:
(a) Select Estimated using samples. This option will
allow XLSTAT to automatically calculate the large
sample variances that are used as the estimates for
the population variances.
(b) Select User dened and enter the population
variance for the rst and second samples into the
Variance 1 and Variance 2 boxes.
4. Click the Options tab.
4.1 Enter the value from your null hypothesis into the
Hypothesized difference box.
4.2 Enter a desired signicance level into the Signicance
level (%) box.
4.3 Select one of the Alternative hypothesis options from
the drop-down menu.
5. To include the Summary statistics table output, click the
Outputs tab and check the Descriptive statistics box.
Click OK. An example of a t-test result and its interpretation
together with a two-sided condence interval is shown.

1. Select XLSTAT>Parametric tests>Two sample t-test and


z-test.
2. Under the General tab, follow one of three procedures to
select the appropriate data format:
2.1 Under Data format, select One column per variable.
(a) Click inside the Data box and select a single column
of quantitative data.
(b) Click inside the Sample identiers box and select
a single column of qualitative data that contains the
labels for the quantitative data selected in (a).
2.2 Under Data format, select the One column per sample
option. This will change the Data and Sample identiers
boxes to the Sample 1 and the Sample 2 boxes. Click
inside each box and select two single columns of
quantitative data.
2.3 Under the Data format tab, select the Paired samples
option. Follow step 2.2 to select data from paired
samples.

3. Follow one of two procedures:


3.1 Select Students t-test if the population variances are
unknown. Then, click the Options tab. Under Sample
variances for the t-test, check the Assume equality box
if the population variances are assumed to be equal.

Frequency Tables, Contingency Tables, Chi-square Test of Independence and Homogeneity


1. Select Preparing data>Create a contingency table.
2. Under the General tab, click inside the Row variable(s)
and Column variable(s) boxes and select a single column
of qualitative data for each box.

3.3 Theoretical frequencies: Generates a table with an


expected count in each cell under the null hypothesis.
3.4 Proportions/Total: Generates a table that contains a total
percentage in each cell, row, and column.
4. Click the Options tab and check the Chi-square test box.
Click inside the Signicance level (%) box and enter a desired
value (0 to 100) for the level of signicance.
5. Click OK. An example of the Chi-square test of independence
table output and the Test interpretation is given.

3. Click the Outputs tab and select one or more option(s):


3.1 Contingency table: Generates a basic contingency table
that contains only the cell count for a combination of
values of the two variables selected in step 2.
3.2 Observed frequencies: Generates the basic contingency
table that also includes a count total in each row and
count total in columns.

Note: Since the Chi-square test of independence is mechanically identical to


a test of homogeneity, both tests are executed the same way in XLSTAT. The
null hypothesis is, however, set up differently. For a test of homogeneity, the
same small p-value (relative to an alpha value) provided in the Chi-Square
test of independence table output indicates evidence that distributions of the
categorical variable are different across the independent populations.

Generating Random Numbers


4. You may have noticed that the rst value in row 2 changed.
RAND is a volatile function in Excel, meaning that the result
will change whenever a change is made in the spreadsheet. To
resolve this, simply highlight the entire RANDOM column,
select Copy, and then under the Paste options, select Paste
Values.

1. Create a new column on the right of your dataset titled


RANDOM.
2. Inside the rst open cellrow 2type the following
function: =RAND().
3. After you click Enter, a random number, following a uniform
distribution between 0 and 1, will be generated. Once your
random number has been generated, from the Home tab click
Copy. Highlight the remainder of the column to the end of
the data and click Paste.

You might also like