Verification and Validation

Verification and Validation
John Morris
Computer Science/
Electrical and Computer Engineering
28 Dec 2015
A hard days work ensuring that some Japanese colleagues

understand why Auckland is The City of Sails!
1
Terms
Validation
Ensuring that the specification is correct
Determine that the software to be built is actually
what the user wants!
Verification
Ensuring that the software runs correctly
28 Dec 2015
Validation or Verification?
Validation
Building the right software
Make sure its what the user wants
Verification
Building the software right
Make sure it works
Accurate, complete specification

essential!
28 Dec 2015
Specifications
Functional
Define actions and operations of system
eg
Each transaction shall be stored in a database
GST at the current rate is applied to invoice
Can be verified by software tests

Apply an input data set
Compare output state to expected state
Expected state is defined in specifications
28 Dec 2015
Specifications
Functional
Non-functional
Performance
eg
Searches will take <2 seconds
Messages will be compressed by 60%
Usability
eg
An trained monkey shall be able to run this software
Require special tests

28 Dec 2015
Specifications
Functional
Non-functional
Performance
eg
Searches will take <2 seconds
Messages will be compressed by 60%
Usability
eg
An trained monkey shall be able to run this software
Require special tests

28 Dec 2015
Testing
Aim
Locate and repair defects
Axiom
Testing only reveals the presence

of defects,
it never proves their absence!!
No matter how much testing you do, you cant be
sure that there isnt an error waiting to bite you!
28 Dec 2015
Testing
The alternative?
Formal verification
Uses formal logic to prove that software is correct
Currently:
Prohibitively expensive
Little automated support
Mainly manual techniques
Error prone
Only feasible when cost of failure is extreme

Usually when failure leads to loss of life
Air and space craft control
Medical systems
Nuclear plants
28 Dec 2015
Testing - Motivation
Definitely the least glamorous part of software
development
Possibly the most expensive!
If not carried out thoroughly!
Estimates of the economic cost of software failure
produce astronomic numbers
US: $59.5 billion in 2002
http://www.nist.gov/public_affairs/releases/n02-10.htm
~10% of projects are abandoned entirely

Including some very large ones
28 Dec 2015
Famous software failures

July 28, 1962 Mariner I space probe
A bug in the flight software for the Mariner 1 causes the rocket to divert
from its intended path on launch. Mission control destroys the rocket over
the Atlantic Ocean. The investigation into the accident discovers that a
formula written on paper in pencil was improperly transcribed into
computer code, causing the computer to miscalculate the rocket's trajectory.
28 Dec 2015
10

1982 -- Soviet gas pipeline.
Operatives working for the Central Intelligence Agency allegedly plant a

bug in a Canadian computer system purchased to control the trans-Siberian
gas pipeline. The Soviets had obtained the system as part of a wide-ranging
effort to covertly purchase or steal sensitive U.S. technology. The CIA
reportedly found out about the program and decided to make it backfire
with equipment that would pass Soviet inspection and then fail once in
operation. The resulting event is reportedly the largest non-nuclear
explosion in the planet's history.
28 Dec 2015
11

1985-1987 -- Therac-25 medical accelerator
Based upon a previous design, the Therac-25 was an "improved" therapy
system that could deliver two different kinds of radiation: either a low-power
electron beam or X-rays. The Therac-25's X-rays were generated by
smashing high-power electrons into a metal target positioned between the
electron gun and the patient. A second "improvement" was the replacement
of the older Therac-20's electromechanical safety interlocks with software
control, a decision made because software was perceived to be more reliable.
What engineers didn't know was that both the 20 and the 25 were built upon
an operating system that had been kludged together by a programmer with no
formal training. Because of a subtle bug called a "race condition," a quickfingered typist could accidentally configure the Therac-25 so the electron
beam would fire in high-power mode but with the metal X-ray target out of
position. At least five patients die; others are seriously injured.
28 Dec 2015
12

June 4, 1996 -- Ariane 5 Flight 501
Working code for the Ariane 4 rocket is reused in the Ariane 5, but the Ariane 5's
faster engines trigger a bug in an arithmetic routine inside the rocket's flight
computer. The error is in the code that converts a 64-bit floating-point number to a
16-bit signed integer. The faster engines cause the 64-bit numbers to be larger in
the Ariane 5 than in the Ariane 4, triggering an overflow condition that results in
the flight computer crashing.
First Flight 501's backup computer crashes, followed 0.05 seconds later by a crash
of the primary computer. As a result of these crashed computers, the rocket's
primary processor overpowers the rocket's engines and causes the rocket to
disintegrate 40 seconds after launch.
More stories
http://www.wired.com/software/coolapps/news/2005/11/69355
or
Software testing failures in Google!
28 Dec 2015
13
Approach
1.
2.
3.
4.
Codings finished
Run a few tests
System passes
Release
Result: Disaster
Inadequate design or poor coding
produced many timebombs in the system!
28 Dec 2015
14
Approach
1.
2.
3.
4.
Codings finished
Run a few tests
System passes
Release
Heres the problem ..

Errors are inevitable (were human!)
Testing did not reveal them
Passing a few tests was assumed to mean
that the system was error-free
See the first axiom!!
28 Dec 2015
15
Why testing is hard

Lets take a trivial example
Test the addition operation on a 32-bit machine
c = a + b
How many tests needed?
28 Dec 2015
16
Why testing is hard

Trivial example
Test the addition operation on a 32-bit machine
c = a + b
How many tests needed?
Nave strategy
But simple and easily understood!
How many values for a?

2 32
How many values for b?
2 32
Total possible input combinations? 232 x 232 = 264
Assume:
One addition test/10 instructions = 3x108 test/sec
28 Dec 2015
17
Why testing is hard (2)

Total possible input combinations? 232 x 232 = 264
Assume:
3GHz machine
One addition test/~10 cycles = 3x108 test/sec
Time = 264 / 3x108 = 1.6x1019/3x108 = 0.5x1011 sec
= several years!!
Clearly need a smarter technique!!
28 Dec 2015
18
Testing strategies
Exhaustive testing - Try all possible inputs
Nave
Simple (easy to implement)
Easy to justify and
Argue for completeness!
Works for very small input sets only!!

For inputs, ai, i= 0, n-1
If Ai = {ai0,ai1,.,aik-1} is the set of all possible values of a i
and |Ai| = k is the cardinality of Ai
then
Tests required = |Ai|
Clearly only useful when all |Ai| are small!!
28 Dec 2015
19
Exhaustive Testing
Inefficient, naive?
Never forget the KISS principle
An automated test system can do a very large
numbers of tests in a reasonable time
and do them while youre designing the next test!
Analysis needed is trivial
whereas
Analysis for a more efficient test regime may be
quite complex and error-prone
Its easy to convince someone that an
exhaustively tested system is reliable
28 Dec 2015
20
Efficient testing
Many tests are redundant
In the adder example, most tests are equivalent
They dont exercise any new part of the underlying
circuit!
For example, you might argue that
all additions of +ve numbers without overflow are equivalent
Addition of 0 to a +ve number is the same for all +ve numbers
Similarly for 0 + -ve number
etc
This divides the tests into equivalence classes

Only one representative of each class need be tested!
28 Dec 2015
21
Equivalence Classes
Key concept:
Only one representative of each class needs
to be tested!
All other tests of inputs in the same equivalence
class just repeat the first one!
Dramatic reduction in total number of tests
No loss of coverage or satisfaction that tests are
complete
28 Dec 2015
22
Adder example
Test
+ve, no
overflow
20
40
60
+ve, overflow
2^31
2^20
overflow
+ve, 0
34
34
-ve, no overflow
-100
-30
-130
-ve, 0
-30
30
-ve, overflow
-2^31
-2^31
underflow
Result 0
-30
30
?
?
Clearly, weve achieved a dramatic reduction in

number of required tests!
Disclaimer: A more careful analysis would look at

the circuitry needed to implement an adder!
28 Dec 2015
23
Equivalence classes formal definition

A set of equivalence classes is a partition of a
set such that
Each element of the set is a member of exactly
one equivalence class
For a set, S, and a set of equivalence classes, Ci
U Ci = S
Ci Cj = (null set) unless I = j
28 Dec 2015
24
Equivalence classes formal definition

A set of equivalence classes is a partition of a
set such that
The elements of an equivalence class, C, are
classified by an equivalence relation, ~
If a C and b C , then a ~ b
The equivalence relation is

Reflexive
Transitive
Symmetric
a~a
if a ~ b and b ~ c, then a ~ c
if a ~ b then b ~ a
A Representative of each class is an arbitrary

member of the class
Theyre all equivalent so choose any one!
28 Dec 2015
25
Equivalence classes verification

Equivalence relation
In the verification context, the elements of the set
are the sets of input values for a function under
test
eg we are verifying a function
f( int a, int b )
The 2-tuples (1,1), (1,2), (1,3) .. (and many more!)
are the elements of the set of all possible inputs for f
The equivalence relation is

behaves the same way under testing
One common interpretation of this is:
follows the same path through the code
28 Dec 2015
26
Equivalence classes verification

Equivalence relation
Consider this function
int max( int a, int b ){
if( a > b ) return a;
else return b;
}
There are two paths through this code, so the inputs fall into
two classes
Those for which a > b and
the rest
This implies that we have only two tests to make:

(a=5, b=3) and
(a=4, b=6)
28 Dec 2015
27
Black Box and White Box Verification

There are two scenarios for developing
equivalence classes
Black Box
Specification is available but no code
Equivalence classes are derived from rules in the
specification
eg admission price: if age < 6, then free
if age < 16, then 50%
else full price
would lead to 3 equivalence classes:
age < 6; age 6 age < 16; age 16
28 Dec 2015
28
Black Box and White Box Verification

Black Box
Specification is available but no code
White Box
Code is available and can be analyzed
specification and the code
28 Dec 2015
29
White Box Verification

White Box
specification and the code
These are not always the same
eg a database stored on a disc
Specification might say,
if record exists, then return it
Black Box Testing
Two equivalence classes
Record exists and
record does not exist
28 Dec 2015
30
White Box Verification

White Box
However, the code reveals that an m-way tree
(matched to disc block size for efficiency) Is used
Many additional classes
Disc block full
Block split needed
Only one record
Record at start of block
Record in middle of block
Record at end of block
Record in root block
Record in leaf
.
28 Dec 2015
31
Generating the Equivalence Classes

Specification
admission price: if age < 6, then free
if age < 16, then 50%
else full price
would lead to 3 equivalence classes:
age < 6; age 6 age < 16; age 16
Choose representatives 3, 9 and 29
(or many other sets)
3
28 Dec 2015
5 6
15 16
29
32

Formally
Choose representatives 3, 9 and 29 is sufficient
However
An experienced tester knows that a very common
error is writing
< for or
> for
or vice versa
So include class limits too!
3
28 Dec 2015
5 6
15 16
29
33

Other special cases
Nulls
Identity under addition: x + 0 = x
Unity
Identity under multiplication: x 1 = x
Range Maxima and Minima

May have (or need!) special code
Illegal values
Should raise exceptions or return errors
Read the specification to determine behaviour!
-5
-1 0 1
28 Dec 2015
5 6
15 16
29
999
34

Illegal values
Should raise exceptions or return errors

Read the specification to determine behaviour!
Particularly important!
Typical commercial code probably has as much
code handling illegal or unexpected input as
working code!
Treat every possible exception as an output!
-5
-1 0 1
28 Dec 2015
5 6
15 16
29
999
35

Other special cases
This caused the set of representatives to expand
from 3 to 12
Some
are not reallytesters
needed routinely
Experienced
eg code does process 1 in just the same way as 3
include these special cases!
However, this is a small price to pay for robust

software!
The cost of proving that a unity is not needed is more
than the cost of testing it!
-5
-1 0 1
28 Dec 2015
5 6
15 16
29
999
36

Outputs
Find equivalence classes that cover outputs also!

Never
neglect
the inputs
null case!
Same general rules
apply
as for
Its very easy to neglect at
One representative
of each class
specification
stage plus
Required behaviour may be obvious
Boundaries
No need to write it down!
Null output
It will require coding
eg No items in a report
does theprogrammers
user want a confirming
Experienced
know that
report anyway? its a very common source of error!
Just one output

eg Reports often have header and trailer sections - are
these correctly generated for a short (<1 page) report?
28 Dec 2015
37
Coverage in White Box Testing

Black Box testing will not usually cover all
the special cases required to test data
structures
Often, the functional goals of the specification
could be met by one of several data structures
Specification may deliberately not prescribe the
data structure used
Allows developers to choose one meeting performance goals
Permits substitution of an alternative with better
performance (vs non functional specifications)
Coverage defines the degree to which white

box testing covers the code
Measurement of completeness of testing
28 Dec 2015
38

Usually, at least some white box coverage goals
will have been met by executing test cases
designed using black-box strategies
How would you know if this were the case or
not?
In simple modules, which dont use internal data structures,
black box classes may be adequate
This is not the general case though!
Various coverage criteria exist

Every statement at least once
Every branch taken in true and false directions
Every path through the code
28 Dec 2015
39

Coverage criteria
Logic coverage
Statement: each statement executed at least once
Branch: each branch traversed (and every entry point taken)
at least once
Condition: each condition True at least once and False at least
once
Branch/Condition: both Branch and Condition coverage
Compound Condition: all combinations of condition values at
every branch statement covered (and every entry point taken)
Path: all program paths traversed at least once
28 Dec 2015
40
Pseudocode and Control Flow Graphs

input(Y)
if (Y<=0) then
Y = Y
end
while (Y>0) do
input(X)
Y = Y-1
end
28 Dec 2015
nodes
edges
41
Statement Coverage
Statement Coverage requires that
each statement is executed at least once
Simplest form of logic coverage
Also known as Node Coverage
What is the minimum number of test cases
required to achieve statement coverage for
the program segment given next?
28 Dec 2015
42
Pseudocode and Control Flow Graphs

input(Y)
if (Y<=0) then
Y = Y
end
while (Y>0) do
input(X)
Y = Y-1
end
28 Dec 2015
nodes
edges
43
Branch coverage
Branch Coverage requires that each branch
will have been traversed, and that every
program entry point will have been taken, at
least once
Also known as Edge Coverage
28 Dec 2015
44
Branch Coverage Entry points

Why include
and that every
program entry point
will have been taken,
at least once.
Not common
in HLLs (eg Java) now
Common in scripting
languages
Any language that
allows a goto and a
statement label!
28 Dec 2015
45
Procedure Module Verification

Steps
Obtain precise specification
Should include definitions of exception or illegal input
behaviour
For each input of module

Determine equivalence classes (inc special cases)
Choose representatives
Determine expected outputs
Repeat for outputs

Many output equivalence classes are probably covered
by input equivalence classes
Build test table

28 Dec 2015
46
Procedure Module Verification

Steps
Write test program
Tests in tables is usually the best approach
Easily maintained
Test programs need to be retained and run when any
change is made
To make sure that something that worked isnt
broken now!!
Tables are easily augmented
When you discover the case that you didnt test for!
28 Dec 2015
47

Verification and Validation

Caricato da

Informazioni sul documento

Copyright

Formati disponibili

Condividi questo documento

Condividi o incorpora il documento

Opzioni di condivisione

Hai trovato utile questo documento?

Questo contenuto è inappropriato?

Copyright:

Formati disponibili

Verification and Validation

Caricato da

Copyright:

Formati disponibili

Verification and Validation

A hard days work ensuring that some Japanese colleagues

Accurate, complete specification

Can be verified by software tests

Require special tests

Require special tests

Testing only reveals the presence

Only feasible when cost of failure is extreme

~10% of projects are abandoned entirely

Famous software failures

Famous software failures

Operatives working for the Central Intelligence Agency allegedly plant a

Famous software failures

Famous software failures

Heres the problem ..

Why testing is hard

Why testing is hard

How many values for a?

Why testing is hard (2)

Clearly need a smarter technique!!

Works for very small input sets only!!

Clearly only useful when all |Ai| are small!!

This divides the tests into equivalence classes

Clearly, weve achieved a dramatic reduction in

Disclaimer: A more careful analysis would look at

Equivalence classes formal definition

Equivalence classes formal definition

The equivalence relation is

A Representative of each class is an arbitrary

Equivalence classes verification

The equivalence relation is

Equivalence classes verification

This implies that we have only two tests to make:

Black Box and White Box Verification

Black Box and White Box Verification

White Box Verification

White Box Verification

Generating the Equivalence Classes

Generating the Equivalence Classes

Generating the Equivalence Classes

Range Maxima and Minima

Generating the Equivalence Classes

Should raise exceptions or return errors

Generating the Equivalence Classes

include these special cases!

However, this is a small price to pay for robust

Generating the Equivalence Classes

Find equivalence classes that cover outputs also!

Just one output

Coverage in White Box Testing

Coverage defines the degree to which white

Coverage in White Box Testing

Various coverage criteria exist

Coverage in White Box Testing

Pseudocode and Control Flow Graphs

Pseudocode and Control Flow Graphs

Branch Coverage Entry points

Procedure Module Verification

For each input of module

Repeat for outputs

Build test table

Procedure Module Verification

Potrebbero piacerti anche