Documenti di Didattica
Documenti di Professioni
Documenti di Cultura
INTRODUCTION
The paper draws on those foundations and tools to begin the process of
designing test or revising ting tests. To start that process we need some critical
questions: what is the purpose of the test, what are the objectives of the test, how will
the test specifications reflect both the purpose and the objectives, how will the test tasks
be selected and the separate items arranged, and what kind of scoring, grading, and
feedback.
Establishing appropriate objectives involves a number of issues, ranging from
relatively simple ones about forms and functions covered in a course unit to much more
complex ones about constructs to be operationalized in the test. Included here are
decisions about what language abilities are to be assessed.
To evaluate or design a test, we must make sure that the objectives are
incorporated into a structure that appropriately weights the various competencies being
assessed. The tasks that the test-takers must perform need to be practical. They should
also achieve content validity by presenting tasks that mirror those of the course or
segment there of being assessed. Further, they should be able to be evaluated reliably
by the teacher or scorer. The tasks themselves should strive for authenticity, and the
progression of tasks ought to be biased for best performance.
Test vary in the form and function of feedback, depending on their purpose. Or
every test, the way results are reported is an important consideration. Under some
circumstances may require that a teacher offer substantive wash back to the learners.
In this written, you will draw on the foundation and tools to begin the process
of designing test or revising existing test. To start that process, you need to ask some
critical questions:
1. What is the purposes of the test? Why am I creating this rest or why was it
created by someone else? For an evaluation of overall proficiency? To places
1
2
CHAPTER II
DISCUSSION
A. Test Types
There are many kinds of tests; each test has specific purpose and a particular
criterion to be measured. This written will explain about kinds of tests based on specific
purpose, response, orientation and the way to test, and score interpretation.
b. Diagnostic Test
Diagnostic test is used to identify students' strengths and weaknesses.
Another purpose is to diagnose specific aspects of a language. A test in
pronunciation, for example, might diagnose the phonological features of
English that are difficult for learners and should therefore become part of
curriculum. Usually, such tests offer a checklist of features for the administrator
(often the teacher) to use in pinpointing difficulties. For example, a writing
diagnostic test would first elicit a writing sample of the students. Then the
teacher would identify the organization, content, spelling, grammar, or
vocabulary of their writing. Based on that identification, teacher would know
the needs of students that should have special focus.
c. Placement Test
The purpose of placement test is to place a student into a particular level
or section of a language curriculum or school. It usually includes a sampling of
the material to be covered in the various courses in a curriculum. A student's
performance on the test should indicate the point at which the student will find
material neither too easy nor too difficult. Placement tests come in many
varieties: assessing comprehension and production, responding through written
and oral performance, multiple choice, and gap filling formats. One of the
5
This test can also be used to diagnose the grammar of the Beginner level
that your students need clarification on.
Grammar
1. …. is your name?
A. How
B. Who
C. What
D. Where
2. ….? I’m from Italy.
A. Where are you from?
B. Where you are from?
C. Where from you are?
D. From where you are?
d. Achievement Test
The purpose of achievement tests is to determine whether course
objectives have been met with skills acquired by the end of a period of
instruction. Achievement tests should be limited to particular material
addressed in a curriculum within a particular time frame. Achievement tests
belong to summative because they are administered at the end on a unit/term of
6
study. It analyzes the extent to which students have acquired language that have
already been taught.
Achievement tests range from five or ten-minute quizzes to three hour final
examinations, with an almost in finite variety of item types and formats.
Here is the outline for a mid-term examination offered at the high
intermediate level of an intensive English program in the US.
Section A Vocabulary
Part 1 (5 items) : Match words and definitions
Part 2 (5 items) : use the words in a sentence
Section B Grammar
(10 sentences) : error detection (Underline or circle the error)
Section C Reading Comprehension
(2 one paragraph passage) : Four short - answer items for each
Section D Writing
Respond to a two-paragraph article on Native American culture
b. Phonetic Script
c. Spelling clues
d. Words in sentences
e. Paired associates
2. Based on response
There are two kinds of tests based on response. They are objective and
subjective tests.
a. Objective Test
Objective test is a test in which learners ability of performance are measure using
specific set of answer, means there are only two possible answer, right and
wrong. In other word, the score is according to right answer. Type of objective
test includes multiple choice tests, true or false test, matching and problem based
questions.
b. Subjective Test
Subjective test is a test in which the learners ability or performance are judged
by examiners’ opinion and judgment. The example of subjective test is essay
and short answer.
order cognitive skills. Can be scored skills, but most items assess only
quickly. knowledge. Some correct answer
can be guesses.
Matching Items can be written quickly. A broad Higher order cognitive skills
range of content can be assessed. difficult to assess.
Scoring can be done efficiently.
competences uses basic skill, such as listening, speaking, reading or writing while
indirect language testing is a test that the process to elicit students competences
does not use basic skills.
From the explanation above, language testing can be divided into four types based
on orientation and the way to test. They are direct competences test, indirect
competences test, direct performance test, and indirect performance test
1.1 Direct competences tests
The direct competences test is a test that focuses on measuring students’
knowledge about language component, like grammar or vocabulary, which the
elicitation uses one of the basic skills, listening, speaking, reading or writing.
For the example, a teacher wants to know about students’ grammar knowledge.
The teacher asks the students to write a letter to elicit students’ knowledge in
grammar.
1.2 Indirect competence test
The indirect competence test is a test that focuses on measuring students’
knowledge about language component, like grammar or vocabulary, which the
elicitation does not use one of the basic skills, listening, speaking, reading, or
writing. The elicitation in this test uses other ways, such as multiple choice. For
example , the teacher wants to know about students’ grammar knowledge. The
teacher gives a multiple choice test to measure students knowledge in grammar.
2.1 Direct performance test
Direct performance test is a test that focuses on measuring students’ skill
in listening, speaking, reading and writing that the elicitation is through
direct communication for example, the teacher wants to know the students’
skill in writing, the teacher asks the students to write a letter a short story.
2.2 Indirect performance test
Indirect performance test is a test that focuses on measuring students’ skill
in listening, speaking, reading, and writing that the elicitation does not use
the basic skill. For example, the teacher wants to measure the students’
10
skill in listening. The teacher gives some pictures and asks the student to
arrange the picture into correct order based on the story that they listen to.
criterian-referenced test when they wish to see how well students have learned
the knowledge and skills which they are expected to have mastered. This
information of learning the desired curriculum and how well the school is
teaching that curriculum. The content of a criterion-referenced test is
determined by how well it matches the learning outcomes deemed most
important. In other words, the content selected for the criterion-standard tests
is selected on the basis of its significance in the curriculum. Criterion-
referenced tests give detailed information about how well a student has
performed on each of the educational goals or outcomes included on that test.
The descriptions of types of tests in the preceding section are intended to help
you understand how to answer the first question posed in this chapter. What is the
purpose of the test? It is unlikely that you would be asked to design an aptitude test or
a proficiency test, but for the purposes of interpreting those tests, it is important that
you understand their nature. However, your opportunities to design placement,
diagnostic, and achievement tests -especially the latter- will be plentiful. In the
remainder of this chapter, we will explore the four remaining questions posed at the
outset, and the focus will be on equipping you with the tools you need to create such
classroom-oriented tests.
You may think that every test you devise must be a wonderfully innovative
instrument that will garner the accolades of your colleagues and the admiration of your
students. Not so. First, new and innovative testing formats take a lot of effort to design
and a long time to refine through trial and error. Second, traditional testing techniques
can, with a little creativity, conform to the spirit of an interactive, communicative
language curriculum. Your best tack as a new teacher is to work within the guidelines
of accepted, known, traditional testing techniques. Slowly, with experience, you can
12
get bolder in your attempts. In that spirit, then, let us consider some practical steps in
constructing classroom tests.
Students will
10. write a one-paragraph story about a simple event in the past.
11. use conjunctions so and because in a statement of opinion.
You may find, in reviewing the objectives of unit or a course, that you
cannot possibly test each one. You will then need to choose a possible subset of the
objectives to test.
2. Drawing up test specifications
In the unit discussed above, your specifications will simply comprise (a) a
broad outline of the test, (b) what skills you will test, and (c) what the items will
look like.
You might argue that the first two sentences of this item give it some
authenticity and accomplish a bit of schema setting. But if you simply want a
student to identify the type of medical professional who deals with eyesight
issues, those sentences are superfluous. Moreover, by lengthening the stem, you
have introduced a potentially confounding lexical item, deteriorate, that could
distract the student unnecessarily.
4.3 Make certain that the intended answer is clearly the only correct one.
In the proposed unit test described earlier, the following item appeared in the
original draft:
items. And very difficult item can provide a challenge to the highest-
ability students.
4.4.2 Item Discrimination (ID) is the extent to which an item
differentiates between high-and low-ability test takers. An item on
which high-ability students (who did well in the test) and low-ability
students (who didn't) score equally well would have poor ID
because it did not discriminate between the two groups. Conversely
the high-ability group and incorrect responses from the most of the
low-ability an item that garners correct responses from most of
group has good discrimination power.
Suppose your class of 30 students has taken a test. Once you have
calculated final scores for all 30 students, divide them roughly into
thirds-that is, create three rank-ordered ability groups including the
top 10 scores, the middle 10, and the lowest 10. To find out which
of your 50 or so test items were most "powerful" in discriminating
between high and low ability, eliminate the middle group, leaving
two groups with results that might look something like this on a
particular item:
Using the ID formula (7- 2 = 5 : 10 = 50), you would find that this
item has an ID of .50, or a moderate level.
The formula for calculating ID is
20
The result of this example item tells you that the item has a moderate
level of ID. High discriminating power would approach a perfect
1.0, and no discriminating power at all would be zero. In most cases,
you would want to discard an item that scored near zero. As with IF,
no absolute rule governs the establishment of acceptable and
unacceptable ID indices.
Choices A B *C D E
High-ability Ss (10) 0 1 7 0 2
Low-ability Ss (10) 3 5 2 0 0
*Note: C is the correct response
low group, and therefore the latter students don't even consider it.
The other two distractors (A and B) seem to be fulfilling their
function of attracting some attention from lower- ability students.
1. Scoring
As you design a classroom test, you must consider how t test will be scored and
graded. Your scoring plans reflect the relative weight that you place on each section
and items in each section.
Here are your decisions about scoring your test:
2. Grading
Your first thought might be that assigning grades to student’s performance on
this test would be easy: just give an “A” for 90-100 percent, a “B” for 80-89 percent,
and the topic. How you assign letter grades to this test is a product of
The country, culture, and context of this English classroom,
Institutional expectations (most of them unwritten),
Explicit and implicit definitions of grades that you have set forth,
The relationship you have established with this class, and
22
Student expectations that have been engendered in previous tests and quizzes in this
class.
3. Giving Feedback
A section on scoring and grading would not be complete without some
consideration of the forms in which you will offer feedback to your students, feedback
that you want to become beneficial wash back. In the example test that we have been
referring to here – which is not unusual in the universe of possible formats for periodic
classroom test- consider the multitude of options. You might choose to return the test
to the student with one of, or a combination of, any of the possibilities below:
1. A letter grade
2. A total score
3. Four sub scores (speaking, listening, reading, writing)
4. For the listening and reading sections
a. An indication of correct/incorrect responses
b. Marginal comments
5. For the oral interview
a. Scores for each element being rated
b. A checklist of areas needing work
c. Oral feedback after the interview
d. A post-interview conference to go over the results
6. On the essay
a. Scores for each element being rated
b. A checklist of areas needing work
c. Marginal and end-of-essay comments, suggestions
d. A post-test conference to go over work
e. A self-assessment
7. On all or selected parts of the test, peer checking of results
8. A whole-class discussion of results of the test
23
CHAPTER III
CONCLUSION
There are five kinds of test types: Language aptitude tests, proficiency tests,
placement tests, diagnostic tests, and achievement tests. Every test must be a
wonderfully innovative instrument that will garner the accolades of the colleagues and
the admiration of the students.
In the test, we have some practical steps to test construction, they are: assessing
clear and unambiguous objectives, drawing up test specifications, devising test tasks,
and designing multiple-choice test items.
Evaluation can fulfill two functions: assessment and feedback. Assessment is a
matter of measuring what the learners already know. Any assessment should also
provide positive feedback to inform teachers and learners about what is still not known,
thus providing important input to the content and methods of future works.