Unit 11 - Product quality: verification, metrics and testing Flashcards

Question

What are the limitations of TDD?

Answer 1

TDD is not a panacea and has many limitations and possible pitfalls, including the following:

- User interface: TDD does not readily apply to user interface testing, for which it is better to apply techniques such as usability testing.
- Testing of applications integrated with databases: TDD alone is not adequate for the comprehensive testing of databases.
- Multithreaded systems: TDD is not generally suitable for the testing of multithreaded systems, as results may depend on the vagaries of timing.
- Customer acceptance: TDD cannot take the place of customer testing.
- Legacy systems and systems reusing large code components: If large amounts of code are being reused and TDD was not applied when they were being coded, then at best unit tests can be added retrospectively. Such a retrofit would not constitute TDD, since the testing would not guide the evolution of the code. In many cases, retrofitting unit tests to large bodies of existing code may be impractical.
- Management support: If management chooses to penalise time spent on writing tests during development, TDD is unlikely to flourish.
- Code whose requirements are liable to change greatly: Sometimes, for example during scientific research or product development research, the purpose of developing software may be to find out what the requirements should be in some little-understood area or what happens in some poorly understood situation. This is known as exploratory programming. In these and other situations, requirements may change so extensively and frequently that there is repeated wholesale invalidation of existing unit tests. This would render TDD a hindrance rather than a help.
- Integration testing: TDD cannot take the place of integration testing.

Answer 2

(a) If the test unexpectedly already passes at this point, this demonstrates that it is not a good test of the next increment. (b) If it fails in an unexpected way, this demonstrates a faulty or incomplete understanding of the test that needs to be addressed in order to have a good grip on the code and test.

Answer 3

(a) The precondition verifies that the arguments aBalance and anOverdraftLimit are both greater than or equal to zero. The postcondition verifies that the variables balance and overdraftLimit have been correctly initialised with the values of the corresponding arguments.
(b) A value of −50 for the overdraft limit will mean the boolean expression in the precondition assertion evaluates to false and an assertion error will be thrown.
(c) The precondition is met and so a new Account will be created with a zero balance and an overdraft limit of 200.
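
The constructor described in this answer might look roughly like the sketch below. Only the names aBalance, anOverdraftLimit, balance and overdraftLimit come from the answer; the int types, the getters and the check helper are assumptions made for illustration. The helper stands in for Java's assert statement, which behaves the same way but is only evaluated when the JVM is started with -ea.

```java
// Hypothetical Account class illustrating a precondition and a postcondition.
class Account {
    private final int balance;        // int is an assumption; the course code may use another type
    private final int overdraftLimit;

    Account(int aBalance, int anOverdraftLimit) {
        // Precondition: both arguments must be greater than or equal to zero.
        check(aBalance >= 0 && anOverdraftLimit >= 0, "precondition violated");

        balance = aBalance;
        overdraftLimit = anOverdraftLimit;

        // Postcondition: the fields were initialised with the argument values.
        check(balance == aBalance && overdraftLimit == anOverdraftLimit,
              "postcondition violated");
    }

    int getBalance() { return balance; }

    int getOverdraftLimit() { return overdraftLimit; }

    // Illustrative stand-in for "assert condition : message;".
    private static void check(boolean condition, String message) {
        if (!condition) {
            throw new AssertionError(message);
        }
    }

    public static void main(String[] args) {
        // Case (c): precondition met, so the object is created.
        Account account = new Account(0, 200);
        System.out.println(account.getBalance() + " " + account.getOverdraftLimit());

        // Case (b): an overdraft limit of -50 violates the precondition.
        try {
            new Account(0, -50);
        } catch (AssertionError e) {
            System.out.println("AssertionError: " + e.getMessage());
        }
    }
}
```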

Answer 4

- Requirements-based testing draws on previously gathered or formulated testable requirements to check that a system meets the customer’s requirements. The final stage in this form of testing is acceptance testing.
- Usability testing refers to testing of the user interface.
- Developmental testing is a term that refers to all of the testing carried out by the team developing the software. It is useful to distinguish between developmental testing at three different levels of scope – unit testing, integration or component testing and system testing.
- Regression testing is any form of testing during development or system maintenance that systematically checks that fixing one bug has not introduced others.

Answer 5

(a) m × n
(b) max(m, n)
(c) m + n − 1

m + n − 1 is not much bigger than max(m, n), so the balanced approach might as well be used in preference to the minimal approach. However, m × n is generally much bigger than both m + n − 1 and max(m, n). So if the safe approach were used in preference to the balanced approach the testing load could increase dramatically.
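
To get a feel for the difference, the three formulas can be evaluated for sample values. The sketch below assumes that (a), (b) and (c) correspond to the safe, minimal and balanced approaches respectively, as the comparison in the answer suggests, and that m and n are the sizes of the two sets of cases being combined; the flashcard does not state the question, and the class name and the values 4 and 5 are purely illustrative.

```java
// Illustrative comparison of the three test-case counts.
class TestCaseCounts {
    // (a) Safe approach: m × n.
    static int safe(int m, int n) { return m * n; }

    // (b) Minimal approach: max(m, n).
    static int minimal(int m, int n) { return Math.max(m, n); }

    // (c) Balanced approach: m + n − 1.
    static int balanced(int m, int n) { return m + n - 1; }

    public static void main(String[] args) {
        int m = 4, n = 5; // hypothetical sizes
        System.out.println("safe     = " + safe(m, n));     // 20
        System.out.println("minimal  = " + minimal(m, n));  // 5
        System.out.println("balanced = " + balanced(m, n)); // 8
    }
}
```

Here the balanced count (8) is close to the minimal count (5), while the safe count (20) is already more than double the balanced one, and the gap widens as m and n grow.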

Answer 6

- User-command testing (or operator testing) tests all user commands from the point of view of tolerance of syntax errors and data input errors.
- Interface and protocol testing tests the system’s interaction with the communication system, if the system communicates with other systems in the outside world.
- Start-up and initialisation testing tests the system’s ability to be started in a working hardware/software configuration – in the case where there are many combinations of hardware/software, all configurations should be system tested individually.
- Restart testing tests the ability of the system to recover from errors of internal state.
- Performance testing tests that the system meets all specified operating requirements for speed, number of concurrent users permitted, and so on.
- Stress testing tests that the system can operate reliably at the limits of each of its resources – for example, simulating the accesses of hundreds or thousands of users to a web server all at the same time to see if it can cope with the load.
- Security testing tests that the system does not offer opportunities to breach security.
- Acceptance testing is performed by the customer, after which, all being well, the system is accepted.

Answer 7

Probably the only situation where this is appropriate is when the project team is small. In small teams, one person might play the part of requirements engineer, designer, implementer, tester and maintenance engineer.

Answer 8

In general, the same tests will be carried out during acceptance testing and system testing. System testing is an in-house activity and a customer need never know how system testing went – any bugs can be dealt with before the customer sees them. Acceptance testing, on the other hand, is conducted with much more at stake – the customer can accept or reject a system based on its performance at acceptance testing.

Answer 9

Acceptance testing is the process of showing that the software meets the customer’s requirements, not that there aren’t bugs in the code. In fact, given that a system is put into use, bugs that require fixing are almost certain to be found after acceptance testing. In addition, the system will be maintained, with functionality added and changed, leading to a requirement for regression testing.

Answer 10

- TDD and DbC are valuable but not comprehensive tools for requirements testing.
- TDD has regression testing built into it.
- DbC and TDD cannot substitute for thorough usability testing or security testing.

Answer 11

In black-box testing, test cases are designed by looking at the specification (that is, requirements, high-level design and external interfaces) of the system to be tested.

Answer 12

In white-box testing, test cases are designed by looking at the detail of the implementation of the system to be tested.

Answer 13

The techniques that can be used in black-box testing are characterised by the fact that they can use only information available from the specification in order to develop test cases. This means that to produce test data for a system or subsystem only the defined relationships between inputs and outputs can be scrutinised.

A quintessential black-box technique is partitioning (also known as equivalence partitioning) combined with boundary testing (also known as fence testing), which focuses on producing test data at the boundaries between partitions of the input data space (or input domain). The input data space is partitioned into subdomains, where a subdomain is a set of input values that require the same type of processing to be performed. Subdomains are obtained by the technique of case analysis, which determines, for each user-perceived function of the (sub)system, the subdomain that results in that function being performed.

Boundary testing is based on the observation that common errors are often caused by a data item being just one out, or, for example, a loop being executed one too many or one too few times – such errors are most visible at the boundaries of the input data space.
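
As a rough illustration of boundary testing, the sketch below generates candidate test inputs around the edges of a single integer subdomain: each boundary value plus its neighbours just inside and just outside. The class, the method names and the range 0 to 110 are hypothetical, and the choice of exactly these six values is one common convention rather than the only one.

```java
import java.util.ArrayList;
import java.util.List;

// Illustrative boundary-value generator for an integer subdomain [low, high].
class BoundaryValues {
    // Returns the values at, just inside and just outside each boundary.
    // Assumes high - low >= 2 so that the six values are distinct.
    static List<Integer> around(int low, int high) {
        List<Integer> values = new ArrayList<>();
        values.add(low - 1);  // just outside the lower boundary
        values.add(low);      // on the lower boundary
        values.add(low + 1);  // just inside the lower boundary
        values.add(high - 1); // just inside the upper boundary
        values.add(high);     // on the upper boundary
        values.add(high + 1); // just outside the upper boundary
        return values;
    }

    // Hypothetical unit under test: accepts whole numbers from 0 to 110.
    static boolean accepts(int value) {
        return value >= 0 && value <= 110;
    }

    public static void main(String[] args) {
        for (int value : around(0, 110)) {
            System.out.println(value + " -> " + accepts(value));
        }
    }
}
```

An off-by-one mistake in accepts, such as writing value < 110, would go unnoticed for most inputs but is exposed by the input 110.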

Answer 14

Combining black-box and white-box testing.

Answer 15

Because black-box testing takes its test cases from the specification, it is likely to pick up the following sorts of errors that white-box testing would miss (this is not an exhaustive list):

- operations required by the specification but not provided for by the implementation
- errors between the interfaces of two classes
- errors in the transformations between internal states of a class, if these affect input/output behaviour
- performance errors, in which system performance is found to be wanting
- system initialisation and termination errors.

On the other hand, in looking inside the implementation, white-box testing will pick up the following sorts of errors that black-box testing would miss:

- the sequences of method calls in the body of a method that fail to perform the correct function
- boolean conditions in if statements, while loops, etc. incorrectly expressed
- loops that terminate incorrectly
- relationships between methods and attributes of classes that are incorrect.

Answer 16

The input data space is the set of triples of values taken from the sets {0, 1, …, 110}, {0.0, 0.1, …, 150.0} and {800, 801, …, 1200}.
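
The size of this input data space can be estimated by counting the members of each set and multiplying. The sketch below assumes the middle set advances in steps of 0.1, as the listing 0.0, 0.1, … suggests, and counts it in tenths to avoid floating-point rounding; the class and method names are illustrative.

```java
// Illustrative count of the triples in the input data space.
class InputSpaceSize {
    // Number of integers from first to last inclusive.
    static long count(long first, long last) {
        return last - first + 1;
    }

    static long totalTriples() {
        long firstSet = count(0, 110);     // {0, 1, ..., 110}
        long secondSet = count(0, 1500);   // {0.0, 0.1, ..., 150.0}, counted in tenths
        long thirdSet = count(800, 1200);  // {800, 801, ..., 1200}
        return firstSet * secondSet * thirdSet;
    }

    public static void main(String[] args) {
        System.out.println(totalTriples()); // 111 * 1501 * 401 = 66811011
    }
}
```

With roughly 66.8 million triples, testing every input is unlikely to be practical, which is the usual motivation for partitioning the input data space into subdomains.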

Answer 17

Because the cyclomatic-complexity metric is based on decision points, which are present only in methods, it is ‘blind’ to the class-structuring mechanisms that are available in object-oriented system descriptions. As much of the complexity of an object-oriented system is held in the class structure, applying the cyclomatic-complexity metric to a whole system would therefore not be appropriate.

Answer 18

Chidamber and Kemerer’s metrics measure complexity, and API classes add to complexity. Hence they should be counted in all metric calculations. For this reason, the Java documentation will be very useful when calculating these metrics.

Answer 19

For individual methods, a cyclomatic complexity of 10 or more should be regarded as a hint that the method is too complex. In the case of classes, a little more thought is needed.

(a) For a class with ten methods, a value for the WMPC metric of 40 would typically suggest acceptably low class complexity. However, although nearly all of the ten methods might be acceptably simple, one or two might be unacceptably complex.
(b) By contrast, a complexity of greater than 10 × 10 = 100 is a fair indication that the behaviour of the class is too complex.
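
Point (a) can be made concrete with a small calculation. The sketch below assumes WMPC is taken as the sum of the cyclomatic complexities of a class's methods, which is what the 10 × 10 = 100 figure implies; the class name and the per-method values are hypothetical.

```java
// Illustrative WMPC calculation for a class with ten methods.
class WmpcSketch {
    static final int METHOD_THRESHOLD = 10; // "10 or more" hints at a too-complex method

    // WMPC taken as the sum of the methods' cyclomatic complexities (assumption).
    static int wmpc(int[] methodComplexities) {
        int sum = 0;
        for (int c : methodComplexities) {
            sum += c;
        }
        return sum;
    }

    // Counts methods whose own complexity reaches the threshold.
    static int tooComplex(int[] methodComplexities) {
        int count = 0;
        for (int c : methodComplexities) {
            if (c >= METHOD_THRESHOLD) {
                count++;
            }
        }
        return count;
    }

    public static void main(String[] args) {
        // Hypothetical class: nine simple methods and one complex one.
        int[] complexities = {3, 3, 3, 3, 3, 3, 3, 3, 3, 13};
        System.out.println("WMPC = " + wmpc(complexities));                      // 40
        System.out.println("methods at or over 10 = " + tooComplex(complexities)); // 1
    }
}
```

The class total of 40 looks acceptable, yet one method still exceeds the per-method guideline, so the class-level figure is best read alongside the individual method values.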