IA169 System Veriﬁcation and Assurance
Course Intro &
Fundaments of Testing
Jiří Barnat
Section
Course Organization
IA169 System Veriﬁcation and Assurance – 01 str. 2/50
Course Coverage
Topics to be covered ...
Introduction to Testing
Symbolic Execution
Model Checking (4 lectures)
Deductive Veriﬁcation
Bounded Model Checking
Veriﬁcation of Real-Time and Hybrid Systems
Veriﬁcation of Probabilistic Systems
CEGAR and Abstract Interpretation
Assurance, Threat Models, Relevant Security Standards
IA169 System Veriﬁcation and Assurance – 01 str. 3/50
Prerequisites and Follow-Up
Prerequisites
Formally none, but we expect ...
... capability of basic math reasoning and abstractions.
... some experience with coding.
... you can handle Unix as a user.
Mutual Exclusion with
IV113 and IV101
Possible Follow-Up
IA159 Formal Veriﬁcation Methods
IA169 System Veriﬁcation and Assurance – 01 str. 4/50
Course Structure and Marking
Structure
2/2/2 + 2 ECTS credits
Lecture/Tutorials/Homework
Marking
Final exam 70% (written test)
Assignments 30% (ﬁve evaluated tasks)
50% for E or Colloquy or Credit
60% for D
70% for C
80% for B
90% for A
Seminar schedule
https://is.muni.cz/auth/el/1433/jaro2019/IA169/
op/IA169_semester_schedule_2019.pdf
IA169 System Veriﬁcation and Assurance – 01 str. 5/50
Section
Fundaments of Testing
http://www.testingeducation.org/BBST/
IA169 System Veriﬁcation and Assurance – 01 str. 6/50
Testing
Testing is an empirical technical investigation conducted to
provide stakeholders with information about the quality of
the product or service under test.
Empirical Technical
Conduct experimental measurements.
Logic and math.
Modelling.
Employs SW tools.
Investigation
Organised and thorough.
Self-reﬂecting.
Challenging.
IA169 System Veriﬁcation and Assurance – 01 str. 7/50
Testing
Product or Service
Software.
Hardware.
Data.
Documentation and speciﬁcation.
... other parts that are delivered.
Information
Not know before.
Has some value.
Stakeholders
Who has interest in the success of testing eﬀort.
Who has interest in the success of the product.
IA169 System Veriﬁcation and Assurance – 01 str. 8/50
Fundamental Questions of Testing
Mission
Why do we test? What we want to achieve?
Strategy
How to proceed to fulﬁl the mission eﬃciently?
Oracle
How to recognise success of the test.
Incompletness
Do we realise that testing cannot prove absence of error?
Measure
How much of of our testing plan has been completed?
How far we are to complete the mission?
IA169 System Veriﬁcation and Assurance – 01 str. 9/50
Mission of Testing
Most Typical Mission
Bug hunting.
Identiﬁcation of factors that reduce quality.
Other Missions
Collect data to support manager decisions, such as: Is the
product good enough to be released?
How much diﬀerent is the product from product available on
market?
Is the product complete with respect to speciﬁcation?
Are individual components logically and ergonomically
connected.
. . .
IA169 System Veriﬁcation and Assurance – 01 str. 10/50
Other Missions
Other Missions – continued
Support manager decision with empirical results.
Evaluate the cost of support after release.
To check compatibility with other products.
Conﬁrm accordance with the speciﬁcation.
Find safe scenarios of product usage.
To acquire certiﬁcation.
Minimise consequences of low quality.
Evaluate the product for third party.
. . .
IA169 System Veriﬁcation and Assurance – 01 str. 11/50
Section
Strategy
IA169 System Veriﬁcation and Assurance – 01 str. 12/50
Strategy
Strategy is a plan, how to fulﬁl the mission in the given
context.
Example: Consider spreadsheet computation in four diﬀerent
contexts.
a) Computer game.
b) Early stage of development of database product.
c) Late stage of development of database product.
d) Driver for medical X-ray scanning device.
Question:
Will you proceed with the same strategy?
IA169 System Veriﬁcation and Assurance – 01 str. 13/50
Example – continued
What factors inﬂuence strategy selection
What is the mission?
How aggressively we need to detect bugs.
What bugs are less important than others?
How thoroughly testing will be documented?
Discussion
Assume that a program has an enter ﬁeld that is expecting
numerical values. Is is meaningful to test the product for
situation when we enter non-numeric value? (Not mentioned
in speciﬁcation at all.)
IA169 System Veriﬁcation and Assurance – 01 str. 14/50
Section
Oracle
IA169 System Veriﬁcation and Assurance – 01 str. 15/50
Deﬁnition of Oracle
Oracle (in the context of testing) is a detection mechanism
or principle to learn that the product passed or failed a test.
Facts
If tester claims that the program passed a test, it does not
mean the program is correct with respect to the tested
property. It depends on the oracle used.
Basically, any test may fail or pass with a suitable oracle.
Example
Does font sizes work properly in OpenOﬃce, WordPad, and
MS Word text editors?
IA169 System Veriﬁcation and Assurance – 01 str. 16/50
Example – OpenOﬃce 1.0
IA169 System Veriﬁcation and Assurance – 01 str. 17/50
Example – WordPad
IA169 System Veriﬁcation and Assurance – 01 str. 18/50
Example – WordPad versus MS Word
IA169 System Veriﬁcation and Assurance – 01 str. 19/50
Example – WordPad versus MS Word (highlighted)
IA169 System Veriﬁcation and Assurance – 01 str. 20/50
Example – Decisions
Questions
Is the observed diﬀerence in font sizes a bug in WordPad?
Is the observed diﬀerence in font sizes a bug in MS Word?
Is the observed diﬀerence in font sizes a bug at all?
Possible Conclusions
We do not know if sizes are correct, but we have tendency to
believe MS Word rather than to WordPad.
For WordPad it is not necessary to stick precisely to
typographic standards.
For WordPad it is possibly a bug, but deﬁnitely it is not a
problem.
IA169 System Veriﬁcation and Assurance – 01 str. 21/50
Example – Risk-Based Testing
Possible (Pragmatic) Position
It is/isn’t a bug? =⇒ It is/isn’t a problem?
It is necessary to know the context, to guess the metrics that
the ﬁnal consumer will use to judge the issue.
With some risk we can achieve simpliﬁcation of the decision.
Simpliﬁcation in Testing Process
Avoid tests that obviously does not reveal any problems.
Avoid tests that obviously reveal only uninteresting problems.
IA169 System Veriﬁcation and Assurance – 01 str. 22/50
Example – Judge Criteria
How much do we actually know about typography?
Point deﬁnition is unclear.
(http://www.oberonplace.com/dtp/fonts/point.htm)
Absolute sizes are diﬃcult to measure.
(http://www.oberonplace.com/dtp/fonts/fontsize.htm)
From Uncertainty to Heuristics
How precisely must the sizes agree in order to declare that the
sizes are correct?
Obtaining complete information and evaluation of all the facts
is too complicated and costly.
Heuristics are used instead.
IA169 System Veriﬁcation and Assurance – 01 str. 23/50
Oracle Problem – Heuristics
Decision Heuristics
Allows for simpliﬁcation of decision problem.
Advice, recommendation, or procedure to be used within the
given context.
Should not build on any hidden knowledge.
Does not guarantee a good decision.
Various heuristics may lead to contradictory decisions.
Disadvantages
Heuristics might be subjective.
If misused, may cause more harm than good.
IA169 System Veriﬁcation and Assurance – 01 str. 24/50
Consistency as Heuristics
Consistency
Good heuristics for decision making.
Consistency with ...
other functions of the product, similar products, history,
producer image, speciﬁcations, standards, user expectations,
the purpose of the product, etc.
Advantages
Consistency is objective enough.
Easily described in bug report.
IA169 System Veriﬁcation and Assurance – 01 str. 25/50
Imperfection in Decision Making
Unintentional Blindness
Human tester does not consider any test outputs that he/she
does not pay attention to.
Similarly, mechanical tester does not consider test outputs
that it is not told to include into decision.
Uncertainty Principle
The presence of observer may aﬀect what is observed.
Consequence
It is impossible to observe all possible outputs from a single
test.
IA169 System Veriﬁcation and Assurance – 01 str. 26/50
Oracle and Automation Process
Motivation
Automation process eliminates human errors.
Automation leads to repeatable procedures.
Automation allows faster test evaluation.
Problems of Automation
It is necessary to automate the decision making (oracle)
principle.
Can we do it? Only partially.
Standard Way of Oracle Automation
A ﬁle of expected outputs, which is required to match
precisely with the outputs of a test being executed.
Example: MS Word could be used to deﬁne a the ﬁle of
expected outputs for testing WordPad.
IA169 System Veriﬁcation and Assurance – 01 str. 27/50
Problems of Automated Oracle
Amount of Agreement
Assume MS Word to serve as the ﬁle of expected outputs for
testing WordPad.
How exactly is the expected output stored?
Is 99% agreement still agreement?
How is the percentage of agreement deﬁned?
False Alarms
Using outdated expected output.
Consequence of simpliﬁcation of decision making.
Undiscovered Errors
Expected ﬁle exhibits the same error as test output.
Unintentional Blindness.
IA169 System Veriﬁcation and Assurance – 01 str. 28/50
Section
Measure Methods in Testing
IA169 System Veriﬁcation and Assurance – 01 str. 29/50
Coverage as a Measure
Coverage
A set of source code entities that has been checked with at
least one test.
Source-code entities: lines of code, conditions, function calls,
branches, etc.
Used to identify parts that have not been tested yet.
Coverage as a Measure
Possible test plan is to achieve a given percentage of coverage.
The percentage than expresses how much of the ﬁnal product
has been tested.
Numeric expression for managers to see how much of the
product remains to be tested.
IA169 System Veriﬁcation and Assurance – 01 str. 30/50
Coverage as a Measure – Disadvantages
Problems
Could avoid testing of interesting input data.
Does not properly test parts of the product that rely on
external services.
Using Coverage as a Measure
The mission is to test all entities of the product, is that OK?
Complete coverage does not guarantee quality of the product.
Stimulates to prefer quantity rather than quality.
Misleading satisfaction (shouldn’t feel safe).
IA169 System Veriﬁcation and Assurance – 01 str. 31/50
Coverage as a Measure – Disadvantages
Example
Input A // program accepts any
Input B // integer into A and B
Print A/B
Observation
Complete coverage is easy achievable.
For example:
input: 2,1
output: 2
There is of course a hidden bug in the program!
IA169 System Veriﬁcation and Assurance – 01 str. 32/50
Coverage Criteria for Control-Flow Graphs
y:=y+1
x=y and z>w
x:=x−1
true false
There are various criteria for control-ﬂow graph coverage.
IA169 System Veriﬁcation and Assurance – 01 str. 33/50
Coverage Criteria for Control-Flow Graphs
y:=y+1
x=y and z>w
x:=x−1
true false
Statement coverage
Every statement (assignment, input, output, condition) is
executed in at least one test.
Set of tests to achieve full coverage:
(x = 2, y = 1, z = 4, w = 3)
IA169 System Veriﬁcation and Assurance – 01 str. 33/50
Coverage Criteria for Control-Flow Graphs
y:=y+1
x=y and z>w
x:=x−1
true false
Edge coverage
Every edge of CFG is executed in at least one test.
Set of tests to achieve full coverage:
(x = 2, y = 1, z = 4, w = 3), (x = 3, y = 3, z = 5, w = 7)
IA169 System Veriﬁcation and Assurance – 01 str. 33/50
Coverage Criteria for Control-Flow Graphs
y:=y+1
x=y and z>w
x:=x−1
true false
Condition coverage
Every condition is a Boolean combination of elementary
conditions, for example x < y or even(x).
If it is possible, every elementary condition is evaluated in at
least one test to TRUE and in at least one test to FALSE.
IA169 System Veriﬁcation and Assurance – 01 str. 33/50
Coverage Criteria for Control-Flow Graphs
y:=y+1
x=y and z>w
x:=x−1
true false
Condition coverage
Set of tests to achieve full coverage:
(x = 3, y = 2, z = 5, w = 7), (x = 3, y = 3, z = 7, w = 5)
In both cases, only the FALSE branch of IF statement is taken.
IA169 System Veriﬁcation and Assurance – 01 str. 33/50
Coverage Criteria for Control-Flow Graphs
y:=y+1
x=y and z>w
x:=x−1
true false
Edge/Condition coverage
Edge and Condition coverage at the same time.
Set of tests to achieve full coverage:
(x = 2, y = 1, z = 4, w = 3), (x = 3, y = 2, z = 5, w = 7),
(x = 3, y = 3, z = 7, w = 5)
Is the set the smallest possible one?
IA169 System Veriﬁcation and Assurance – 01 str. 33/50
Coverage Criteria for Control-Flow Graphs
y:=y+1
x=y and z>w
x:=x−1
true false
Multiple condition coverage
Every Boolean combination of TRUE/FALSE values that may
appear in some decision condition must occur in at least one
test.
IA169 System Veriﬁcation and Assurance – 01 str. 33/50
Coverage Criteria for Control-Flow Graphs
y:=y+1
x=y and z>w
x:=x−1
true false
Multiple condition coverage
Set of tests to achieve full coverage:
(x = 2, y = 1, z = 4, w = 3), (x = 3, y = 2, z = 5, w = 7),
(x = 3, y = 3, z = 7, w = 5), (x = 3, y = 3, z = 5, w = 6)
Exponential grow in the number of tests.
IA169 System Veriﬁcation and Assurance – 01 str. 33/50
Coverage Criteria for Control-Flow Graphs
y:=y+1
x=y and z>w
x:=x−1
true false
Path coverage
Every executable path is executed in at least one test.
The number of paths is big, even inﬁnite in case there is an
unbounded cycle in the control-ﬂow graph.
IA169 System Veriﬁcation and Assurance – 01 str. 33/50
Hierarchy of Coverage Criteria
Criterion A includes criterion B, denoted with A → B, if after
full coverage of type A we guarantee full coverage of type B.
IA169 System Veriﬁcation and Assurance – 01 str. 34/50
Hierarchy of Coverage Criteria
Criterion A includes criterion B, denoted with A → B, if after
full coverage of type A we guarantee full coverage of type B.
path
coverage

multiple condition
coverage

edge/condition
coverage
uu

edge
coverage

condition
coverage
statement
coverage
IA169 System Veriﬁcation and Assurance – 01 str. 34/50
Cycle Coverage
Coverage and Number of Cycle Iterations
All criteria except the path criterion does not reﬂect the
number of iterations over a cycle body.
In case of nested cycles, systematic testing of all possible
executable paths become complicated.
Ad hoc Strategy for Testing Cycles
Check the case when the cycle is completely skipped.
Check the case when the cycle is executed exactly once.
Check the case when the cycle is executed the expected
number of times.
If a boundary n is known for the number of cycle executions,
try to design tests where the cycle is executed n − 1, n, and
n + 1 times.
IA169 System Veriﬁcation and Assurance – 01 str. 35/50
Coverage for Data-Flow Graphs
Motivation
Detect usage of undeﬁned variables.
On some paths, a variable may be set for a speciﬁc purpose
and later on its value misused for other purpose.
Control Flow criteria do not guarantee inclusion of tests for
above mentioned or likewise situations.
Data Flow Coverage
Cover paths through control ﬂow graph that go through a
location where a variable is used but it is not deﬁned along all
incoming paths through control-ﬂow graph.
IA169 System Veriﬁcation and Assurance – 01 str. 36/50
Support for Code Coverage
C/C++, Linux
Tools gcov and lcov.
Example: lcov
gcc -fproﬁle-arcs -ftest-coverage foo.c -o foo
lcov -d . -z
lcov -c -i -d . -o base.info
./foo
lcov -c -d . -o collect.info
lcov -d . -a base.info -a collect.info -o result.info
genhtml result.info
IA169 System Veriﬁcation and Assurance – 01 str. 37/50
Statistics on Found/Fixed Bugs per Time Unit
Week statistics
The number of newly discovered errors.
The number of ﬁxed errors.
The ratio of found versus ﬁxed errors.
Visualisation
IA169 System Veriﬁcation and Assurance – 01 str. 38/50
Weibull Distribution
Observation
The number of discovered errors per time unit exhibits
Weibull Distribution.
Can be used as a measure for the remaining amount of testing.
Software Engineering Method to set the release date.
Using Weibull Distribution
At the moment the curve reaches the peak, the remaining
part of the curve may be predicted, hence, a moment in time
may be set, when expected number of errors discovered per
week drops below a given threshold.
Parameters of Weibull distribution inﬂuence the “width” and
“height/slope” of the peak.
F(x) = 1 − e−ax−b
for x > 0
IA169 System Veriﬁcation and Assurance – 01 str. 39/50
Weibull Distribution – Imperfections
Vague Precision
Testing does not follow the typical usage of the product.
The probability of error discovery is diﬀerent for diﬀerent
errors.
Fix may cause other new errors.
Bugs are dependent.
The number of errors in the product changes over time.
Error insertion exhibits Weibull distribution itself.
Testing epochs (various testing procedures) are independent.
...
Conclusions
Weibull Distribution is not very reliable.
Can be used only with large projects for very rough estimation.
IA169 System Veriﬁcation and Assurance – 01 str. 40/50
Impact of following Weibull Distribution
Assumption
Software developers are aware of being measured.
Phase one
Reach the peak as quickly as possible.
Double reporting of errors.
Avoid ﬁxing known errors.
...
Phase two
Stick to expected shape of the curve.
Delay reporting of errors.
Reporting outside bug-tracking system.
...
IA169 System Veriﬁcation and Assurance – 01 str. 41/50
Section
Incompleteness of Testing
IA169 System Veriﬁcation and Assurance – 01 str. 42/50
Deﬁnition
Observation
The amount of tests to be run is extremely large.
Resources for testing are always limited.
What Is Not Complete Testing
Complete Coverage
Every line of code.
Every branching point.
...
When testers do not ﬁnd more errors.
Testing plan is ﬁnished.
What Is Complete Testing
There are no hidden or unknown errors in the product.
If new issue is reported, testing could not be complete.
IA169 System Veriﬁcation and Assurance – 01 str. 43/50
Reasons for Incompletness of Testing
The number of tests is too large (inﬁnitely many).
To perform all tests means:
To test all possible input values of every input variable.
To test all combinations of input variables.
To test every possible run of a system.
To test every combination of HW and SW, including future
technology.
To test every way a user may use the product.
IA169 System Veriﬁcation and Assurance – 01 str. 44/50
Impossibility to Test All Possible Inputs
Data Bus-Width
The number of tests grows exponentially with respect to bit
used for data representation.
n-bits requires 2n tests.
Other Reasons
Timing of actions.
Invalid or unexpected inputs (buﬀer overﬂow).
Edited inputs
Easter egg [http://j-walk.com/ss/excel/eastereg.htm]
Common Argumentation
“This is not what the customer would do with our product.”
IA169 System Veriﬁcation and Assurance – 01 str. 45/50
Incapability to Test All Runs
Assume the following system
Questions
How many diﬀerent ways it is possible to reach EXIT ?
How many diﬀerent ways it is possible to reach EXIT , if A
can be visited at most n-times?
IA169 System Veriﬁcation and Assurance – 01 str. 46/50
Incapability to Test All Runs
Example
In [F] is a memory leak, in [B] garbage collector.
System will reach an invalid state, if [B] is avoided long
enough.
Observation
Simpliﬁed testing of paths may not discover the error.
The error manifests in circumstances that cannot be achieved
with a simple test.
IA169 System Veriﬁcation and Assurance – 01 str. 47/50
Summary for Measure and Incompleteness
Incompleteness
Testing cannot prove absence of error.
It is impossible to test all valid inputs.
Existence of testing plan inhibits testing creativity.
Measure
There are methods to measure progress in testing phase.
These are unreliable.
Focusing strongly on a selected measure may inﬂuence the
eﬀectiveness of testing.
IA169 System Veriﬁcation and Assurance – 01 str. 48/50
Section
Homework
IA169 System Veriﬁcation and Assurance – 01 str. 49/50
Homework
Reading on MC/DC:
http://www.faa.gov/aircraft/air_cert/design_approvals/air_
software/cast/cast_papers/media/cast-10.pdf
List, and brieﬂy describe as many black-box testing
approaches as you can ﬁnd or are aware of.
http://www.testingeducation.org/BBST/
Optional: Learn about CMAKE and CTEST systems.
IA169 System Veriﬁcation and Assurance – 01 str. 50/50