Practical assignment evaluation; hoe je iets wilt studeren, welk gat er in de
kennis is, hoeveel geld je nodig hebt. Presenteren in 10 minuten. Wat je wilt doen en wat de beste technieken zijn. Hoe gebruik je de technieken. Basic statistics Most critical point is to have a good design. You need to see the differences and afterwards statistics. There is always noise/variability. It is important to find the signal, pattern in this data. Probability calculations: See the sheets. He tested the positive relationship. With one experiment, use randomization, you can virtually repeat all the experiments a lot of times. So is the result signal or noise. Random sampling; In order to address research questions with data, we must first consider how data are to be gathered Gathering data: good identification of the population (e.g. all subjects - animals - type of neurons - brain areas) We typically are unable to observe the entire population. Thus, we gathering data from a subset of the population, a sample of size n From the sample, we make inferences about the complete population Definitions Estimation is the process of inferring an unknown quantity of a population using sample data A parameter is a quantity describing the population, whereas an estimate is a related quantity calculated from a sample TED TALK! Shape of distributions; normal distribution is very common. Central limit theorem. Distrubution average, you have some sort of shape, random sampeling, average every sampeling normal distribution to explain the population. You have a probability distribution mean is mainly the same as in the beginning. Sampeling distribution of Y. Because Y is normally distributed you can do an inference about means. Samepling distribution about the standard error of the mean. One sample T-test. One of the easiest ones. You can see the mean as the signal, central tendency you can see the probability signal to noise ratio. How much signal you have on youre noise. You want to see that in a normal distribution. That happens with a t-test but also with ANOVA. After the break. Try to correct the 0-hypothesis, type II error. Bonferroni correction; member of family-wise error rate FWER. Assumes that test are independent. At some point (fmri study) you have a lot of independent examples. You will solve the multiple comparison, you have the type 2 errors than. .. Computer Intensive Methods; no assumptions but still a lot of power. Simulation, randomazitaion and boorstrapping.
Randomization are at the core of statisitical inference processing. Is really
powerfull. Example sage crickets; randomization because data was not easy. Bootstrapping en randomization you can obtain confidence intervals, boorstrap one sample each conditions, calculate differences, histrogram powerfull method, estimation coherence. MEG/EEG data they have spatiotemporal structure. High density you have a lot of channels spatial. Than you have the temporal condition; every sample in time. Two conditions than you have a problem. Mutiple comparison problem. EXAMPLE: magnetic N400 visual evoked potential. Semantic processing of sentences can be studied by manipulating semantic congruity. He tooked 600 is the number of samples. Bonferoni works with one sensor. Muliplte than the value will be very small, than you have a restricted version. Than you do a nonparametrical statistical test. Steps to perform the nonparametrical statistical test 1. Collect the trials of the two experimental conditions in a single set. 2. Randomly draw as many trials from this combined data set as there were trials in condition 1 and place those trials into subset 1. Place the remaining trials in subset 2. The result of this procedure is called a random partition. 3. Calculate the test statistic on this random partition. 4. Repeat steps 2 and 3 a large number of times and construct a histogram of the test statistics. 5. From the test statistic that was actually observed and the histogram in step 4, calculate the proportion of random partitions that resulted in a larger test statistic than the observed one. This proportion is called the pvalue. 6. If the p-value is smaller than the critical alpha-level (typically, 0.05), then conclude that the data in the two experimental conditions are significantly different. The permutation distribution is based on an infinite numbers of permutations. However: - If you divide 30 observations in groups of 15, there are 155 million ways of do it - In 3 groups of 10: 5.5 trillion ways!!!! A solution is to use Monte Carlo sampling from the permutation estimation to obtain exact p-values if M test statistics ti, i = 1,.....,M are randomly sampled from the permutation distribution, a one-sided Monte Carlo p value for a test that rejects for large values of t is: p = 1 + PM i=1 I(ti ! t ) M + 1 The estimated p-value corresponds to the number of tests (ti) that are bigger than the observed t (t*), divided by the total number of tests Cluster Mass Test 1. For every sample, compare the MEG- signal on the two types of trials by means of a t-value (or some other number that quantifies the effect at this sample). 2. Select all samples whose t-value is larger than some threshold. 3. Cluster the selected samples in connected sets on the basis of temporal or spatial adjacency.
4. Calculate cluster-level statistics by taking the sum of the t-values within a
cluster. 5. Take the largest of the cluster-level statistics Cluster Mass Test is a way to perform multiple comparisons avoiding a high False Alarm rate Clusters (plaatje multi-sensor analyse) onderste rij komt door de test van spatio/temporal. Using only bonferii nothing will past the test. Cluster makes significant when Lines are close togheter. Bonferoni is to conservative. Read paper; eric Nijmegen.