2.2 What is an appropriate set of hypotheses for this task? What kind of statistical
test do you expect to perform? Justify your choices.
Null hypothesis: the population means of the Congruent ($\mu_C$) and Incongruent ($\mu_I$) cases are equal.
$H_0: \mu_C - \mu_I = 0$
Alternate hypothesis: the population means of the Congruent ($\mu_C$) and Incongruent ($\mu_I$) cases are different.
$H_A: \mu_C - \mu_I \neq 0$
Since the sample size is n < 30, a two-tailed paired-samples t-test (that is, a one-sample
t-test on the per-subject differences) with $\alpha = 0.05$ is proposed. This will determine
whether there is a significant difference between the two samples, namely the Congruent and
Incongruent cases. Because the population standard deviation is unknown, the
Bessel-corrected standard deviation of the sample should be used.
Assumptions made:
We assume the dependent samples and their differences are normally distributed
(Gaussian).
We assume the samples are randomly selected.
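The proposed paired test can be sketched as a one-sample t-test on the per-subject differences. The following is a minimal illustration using only the Python standard library; the function name `paired_t` and the sample times are made up for illustration and are not the study's data.

```python
import math
import statistics


def paired_t(first, second):
    """Return (t statistic, degrees of freedom) for two paired samples."""
    if len(first) != len(second):
        raise ValueError("paired samples must have the same length")
    # Reduce the two dependent samples to one sample of differences.
    diffs = [a - b for a, b in zip(first, second)]
    n = len(diffs)
    mean_diff = statistics.mean(diffs)
    # statistics.stdev divides by n - 1, i.e. it applies Bessel's correction.
    sd_diff = statistics.stdev(diffs)
    sem = sd_diff / math.sqrt(n)
    return mean_diff / sem, n - 1


# Made-up times, for illustration only (not the Stroop dataset).
congruent = [12.0, 15.5, 11.2, 14.8, 13.1]
incongruent = [19.3, 22.1, 18.0, 25.4, 20.6]
t_stat, df = paired_t(congruent, incongruent)
print(f"t = {t_stat:.3f}, df = {df}")
```

The sign of the statistic depends only on the order of subtraction; a two-tailed test compares its absolute value with the critical value.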
2.2.1 Report some descriptive statistics regarding this dataset. Include at least one measure of
central tendency and at least one measure of variability.
The mean and standard deviation for both cases are given.
For the congruent case (n = 24):
$\bar{x}_C = 14.051$, $SD = 3.559$
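As a rough sketch of how such descriptive statistics can be computed, the helper below returns one measure of central tendency each for mean and median, plus the Bessel-corrected standard deviation. The name `describe` and the input values are hypothetical; with a pandas DataFrame, `DataFrame.describe()` reports comparable figures.

```python
import statistics


def describe(sample):
    """Return central tendency and variability measures for one sample."""
    return {
        "n": len(sample),
        "mean": statistics.mean(sample),
        "median": statistics.median(sample),
        # Sample standard deviation (divides by n - 1).
        "sd": statistics.stdev(sample),
    }


# Made-up times, for illustration only (not the Stroop dataset).
summary = describe([12.0, 15.5, 11.2, 14.8, 13.1])
print(summary)
```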
In [4]: fig = plt.figure(figsize=(7, 5.5))
# Top left: histogram of the Congruent times
plt.subplot(221)
plt.hist(data["Congruent"], color="#D86E3F")
plt.xlabel('Time Scores for Congruent', fontsize=10)
plt.ylabel('Frequency', fontsize=10)
# Top right: histogram of the Incongruent times
plt.subplot(222)
plt.hist(data["Incongruent"], color="#2088B2")
plt.xlabel('Time Scores for Incongruent', fontsize=10)
plt.ylabel('Frequency', fontsize=10)
# Bottom left: both histograms overlaid for comparison
plt.subplot(223)
plt.hist(data["Congruent"], color="#D86E3F", alpha=0.75,
         label="Congruent")
plt.hist(data["Incongruent"], color="#2088B2", alpha=0.75,
         label="Incongruent")
plt.xlabel('Time Scores', fontsize=10)
plt.ylabel('Frequency', fontsize=10)
plt.legend(loc=1, prop={'size': 9})
# Bottom right: box plots of both conditions
plt.subplot(224)
data[["Congruent", "Incongruent"]].boxplot(return_type='dict', grid=False)
plt.ylabel('Time Scores', fontsize=10)
plt.xlabel('Type', fontsize=10)
# Adjust spacing once all four panels exist
fig.tight_layout()
plt.show()
2.3 Provide one or two visualizations that show the distribution of the sample data.
Write one or two sentences noting what you observe about the plot or plots.
The distributions of the Congruent and Incongruent data are shown above.
Observations from the frequency distributions:
Most of the time scores for the Congruent case are lower than those for the Incongruent
case, with some overlap.
Both distributions have their highest frequency, 6, around the middle of the distribution,
and the mode of Congruent < the mode of Incongruent.
The boxplot shows that the median of the congruent case is lower than that of the incongruent
case, with some outliers in the congruent case, i.e. median of Congruent < median of Incongruent.
2.4 Now, perform the statistical test and report your results. What is your confidence
level and your critical statistic value? Do you reject the null hypothesis or fail to
reject it? Come to a conclusion in terms of the experiment task? Did the results
match up with your expectations?
Measuring the sample differences as $x_{D_i} = x_{C_i} - x_{I_i}$, we can report:
mean $\bar{x}_D = -7.965$
standard deviation $s_D = 4.865$
degrees of freedom $df = 23$
standard error of the mean $SEM = 0.993$
$t_{statistic} = -8.021$
For a two-tailed test at $\alpha = 0.05$, the critical t-value is $t_{critical} = \pm 2.0687$
Effect size (coefficient of determination) $r^2 = 0.737$
p-value $< 0.0001$
95% confidence interval $CI = (-10.019, -5.910)$
Since the t-statistic falls outside the critical values $\pm t_{critical}$ for $\alpha = 0.05$, the difference
between the two samples (congruent and incongruent) is significant, i.e. not likely due to random chance.
Equivalently, if the population means were equal, the probability of observing a difference at least
this large would be less than 0.01%. Hence the null hypothesis is rejected.
We can say with 95% confidence that subjects require around 6 to 10 time units
less to identify congruent words than incongruent words.
Around 73.7% of the variation in the time differences is accounted for by the difference between the two conditions.
Since these are experimental data, we can conclude that the time taken by subjects to identify
the ink color of a word was significantly influenced by whether the color matched or mismatched
the word's meaning.
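The reported figures can be cross-checked from the summary values alone. The sketch below assumes the mean difference is negative (congruent minus incongruent, with congruent times being the shorter ones) and copies the critical value from the text rather than computing it; SciPy's `scipy.stats.t.ppf(0.975, 23)` is one way to obtain it. The formula $r^2 = t^2 / (t^2 + df)$ is the usual effect-size expression for a t-test.

```python
import math

# Summary values as reported in the text.
n = 24
mean_diff = -7.965   # mean of (congruent - incongruent), sign assumed negative
sd_diff = 4.865      # Bessel-corrected standard deviation of the differences
t_critical = 2.0687  # two-tailed, alpha = 0.05, df = 23 (copied from the text)

df = n - 1
sem = sd_diff / math.sqrt(n)              # standard error of the mean
t_stat = mean_diff / sem                  # paired t statistic
margin = t_critical * sem                 # half-width of the 95% interval
ci = (mean_diff - margin, mean_diff + margin)
r_squared = t_stat**2 / (t_stat**2 + df)  # effect size
reject_null = abs(t_stat) > t_critical

print(f"SEM = {sem:.3f}, t = {t_stat:.3f}")
print(f"95% CI = ({ci[0]:.3f}, {ci[1]:.3f}), r^2 = {r_squared:.3f}")
print("reject H0:", reject_null)
```

Small differences in the last decimal place are expected, since the inputs here are already rounded.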
2.5 Optional: What do you think is responsible for the effects observed? Can you
think of an alternative or similar task that would result in a similar effect? Some
research about the problem will be helpful for thinking about these two questions!
The verbal and visual centers of cognition in the brain seem to be linked. When there is a
contradiction between them, the brain seems to take longer to process information. It would be
interesting to see whether there is a difference in the time taken to identify words with swapped letters.