Documenti di Didattica
Documenti di Professioni
Documenti di Cultura
1. Introduction
Each language has its own distinct grammar. "English grammar" is the
set of rules within the English language itself. "An English grammar" is a
specific study or analysis of these rules. The English Grammar may be
separated into two common broad categories: descriptive and prescriptive. A
descriptive grammar tries to look at the grammar of any spoken language or
dialect as it actually exists, judging whether a sentence is grammatical or not
based on the rules of the speech group in which it is spoken, rather than an
arbitrary set of rules. A prescriptive grammar looks at the norms of speech as
given by authoritative sources, such as an upper-class or academic subculture,
and creates strict rules by which all speech within that language must abide to
be considered grammatical. Few linguists take a prescriptive approach to
Language Testing Page 1
grammar in the modern age, preferring to describe language as it exists in a
given speech community.
There are twenty students taking this test and all of them are seated as the
final test does where each student occupies one table. The students were
instructed to answer the test on a separate answer sheet provided and to choose
one correct answer by crossing or circling letter A, B, C, D, or E on the answer
sheet. Before doing the test, the students are instructed to empty the tables and
close their books. Furthermore, no one is allowed to cheat or peep other
students’ work. The students have to answer the test fairly and individually
based on their knowledge.
3.1. Validity
The test is considered to have good face validity if the test item looks
right to other testers, teachers, moderators, and testees (Heaton, 1989). He
argues that a test which has good face validity can maintain and heighten the
students’ motivation and vice versa. It is necessary to show the test to other
teachers or friends to have their point of view regarding the test whether or not
it is valid. It is important in order to avoid problems once the test is examined
by other people. In this multiple choice grammar test, the test items will be
overviewed by the lecturer to have high face validity.
Each item in this test is typed using Times New Roman font style with 14
in size and the line spacing applied is 1.5 lines which help the students read the
test easily. The test items are clearly and briefly written on A4 plain paper with
2.54 cm for all margins. By applying those criteria above, this test is intended to
have high face validity and the students will get motivated.
Face validity in most test designers’ point of view is regarded as the most
important of all types of test validity. It provides not only a quick and
reasonable guide but also a balance to too great a concern with statistical
Content Validity needs a careful analysis of the language being tested and
of the particular course objectives. The test is considered to have content
validity if the test items represent all the materials that have been taught or
discussed by the teacher. In this multiple choice grammar test, all the items have
been made through critical and careful selection of the materials discussed
previously. The content of the test is entirely taken from there and presented in
various percentage of each grammatical area. As nestled in the table of test
specification above, phrasal verbs occupy 10 per cent, conditionals and
subjunctives 10 per cent, adjectives and adverbs 10 per cent, prepositions 17 per
cent, comparisons 10, relative pronouns 10 per cent, gerunds and infinitives 13
per cent, quantifiers 10 per cent, and causatives 10 per cent. In this way, this
grammar test is intended to have good content validity on each item
representing each of the grammatical areas which have been discussed earlier.
3.2. Reliability
In order to find the reliability coefficient of test, we must first find the
mean of all testees’ score using this formula by referring to data of table 1
above:
M= = 21,85
B. Standard Deviation
After we get the mean score of all testees, then we have to find out the
standard deviation of the test. Look at the table 2 below for the spread of
testees’ scores.
Based on the data from table 2 above, we will use the formula below to find the
standard deviation of the test.
s.d =
= 3,93
C. K-R21
All the data we need to get the reliability coefficient of the test has been
obtained. Using the K-R21 formula below, we are going to calculate our data to
find the reliability coefficient of the grammar test.
K-R21 = (1 - )
= 1,03 (1 - )
= 1,03 x 0,62
= 0,64
The result of the objective grammar test has been obtained, but it is not
finished yet; otherwise, it still needs further analysis since the result of the test
can be used to provide valuable information concerning the performance of the
student as a group, of individual students, and of each of the items comprising
the test. The information concerning the students‘performance as a whole and of
individual students is very important for teaching purposes, especially as much
result can show not only the types of errors most frequently made but also the
actual reasons for the errors being made. The performance of the test items is
fairly importance in compiling future test. All items which have been tested
should be examined from the view of their difficulty level (facility value), and
their discrimination level (discrimination value).
The facility value of an item shows how easy or difficult the particular
item is. It is generally expressed the percentage of the correct answer per test
item. The formula used in calculating the facility value is:
The multiple choice grammar test items which have been tested to the students
consist of 30 items and the facility value of each item is shown in table 3 below.
The discrimination value of an item indicates the extent to which the item
discriminates between the testees separating the high achiever students from the
low achiever students. There are various methods off obtaining the
discrimination value of each item of the text, one of them is by comparing the
top 27 per cent with the bottom 27 per cent as shown by the formula below. In
order to that, we will first assign the top group and the bottom group to get a
number of students occupying each group based on the data in table 4 below.
In assigning the upper and lower groups, we will apply this formula:
A. Upper Group
27% x 20 = 5.4
B. Lower Group
27% x 20 = 5.4
There will be 5 students in lower group. At table 4 below, the members of lower
group are labelled in orange. The rest of those two groups are called middle
group.
Table
4
21,85
Teste Score (x-m)
NO d²
es (x) =d
1 S 29 7,15 51,12
2 P 27 5,15 26,52
3 O 26 4,15 17,22
4 K 25 3,15 9,92
5 T 25 3,15 9,92
6 B 24 2,15 4,62
7 G 24 2,15 4,62
8 J 24 2,15 4,62
9 H 23 1,15 1,32
10 R 23 1,15 1,32
11 I 22 0,15 0,02
12 M 22 0,15 0,02
13 A 21 -0,85 0,72
14 Q 21 -0,85 0,72
15 E 20 -1,85 3,42
16 C 18 -3,85 14,82
17 N 18 -3,85 14,82
18 F 16 -5,85 34,22
19 L 15 -6,85 46,92
20 D 14 -7,85 61,62
∑d² 308,
N=20 = 55
Language Testing Page 18
From the data above, now we will find the discrimination value per item using
this formula:
Look at the table 5 below to see the discrimination value of each test item
between Upper and Lower groups:
As the data shown in the table of discrimination value that some items
cannot discriminate between high achiever students and low achiever students;
thus, those items need revising. The analysis has revealed that there 2 items out
of 30 are not acceptable for the next use of the text due to lack in discrimination
index. Those test items are item no 18 and 28. Those items need revising
whether it be the stem or the options of those items. After it has been revised, it
has to be retest again to the students and reanalysed in order to know whether or
not the revision makes the quality of the items better.
4.1. Conclusion
The critical analysis of teacher - made multiple choice grammar tests has
been scrutinised; therefore, there are some points to be noted as follows:
Firstly, this grammar test items, which are entirely taken from teacher’s
guide book, consist of 30 items. 28 items have been revealed to have good
discriminating power since it has been able to discriminate between high
achieving students and low achieving students. Those 28 items can therefore be
applied for the upcoming use. Otherwise, the other 2 items which do not have
good discrimination powers should be revised. After revising, the test should be
retried out to measure its validity and reliability.
4.2. Suggestions
APPENDICES