It is well recognised that validation of dynamic building simulation programs is a longterm complex task. There have been many large national and international efforts that have led to a well-established validation methodology comprising analytical, interprogram comparison and empirical validation elements, and a significant number of tests have been developed. As simulation usage increases, driven by such initiatives as the European Energy Performance of Buildings Directive, such tests are starting to be incorporated into national and international standards. Although many program developers have run many of the developed tests, there does not appear to have been a systematic attempt to incorporate such tests into routine operation of the simulation programs. This paper reports work undertaken to address this deficiency. The paper summarizes the tests which have been applied to the simulation program ESP-r. These tests have been developed within International Energy Agency Annexes, within CEN standards, within various large-scale national projects, and by the UK’s Chartered Institution of Building Services Engineers. The structure used to encapsulate the tests allows developers to ensure that recent code modifications have not resulted in unforeseen impacts on program predictions, and allows users to check for themselves against benchmarks.