Bernard Goossens

We introduce and describe PerPI, a software tool that analyzes the instruction-level parallelism (ILP) of a program. ILP measures the best potential of a program to run in parallel on an ideal machine, that is, a machine with infinite resources. PerPI is a programmer-oriented tool whose function is to improve the understanding of how the algorithm and the…
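To make the ILP definition above concrete, here is a minimal sketch that is not taken from PerPI itself. It assumes a hypothetical trace format of `(destination, sources)` pairs and counts only true (read-after-write) dependencies, as an ideal machine with unlimited resources and register renaming would. Under those assumptions, ILP is the number of instructions divided by the length of the longest dependency chain.

```python
def ilp(trace):
    """Estimate ILP of a trace on an ideal machine.

    trace: list of (dest, sources) tuples in program order (hypothetical
    format chosen for this sketch). Only read-after-write dependencies
    are honored; unlimited renaming is assumed, so name reuse costs nothing.
    """
    ready = {}   # register name -> step at which its latest value is produced
    depth = 0    # length of the longest dependency chain seen so far
    for dest, sources in trace:
        # An instruction runs one step after its latest-produced source.
        # Sources never written in the trace are available at step 0.
        step = 1 + max((ready.get(s, 0) for s in sources), default=0)
        ready[dest] = step
        depth = max(depth, step)
    return len(trace) / depth if trace else 0.0


# Sequential accumulation: every add waits for the previous one.
chain = [("s", ["a0", "a1"]), ("s", ["s", "a2"]), ("s", ["s", "a3"])]

# Pairwise accumulation: the first two adds are independent.
tree = [("t0", ["a0", "a1"]), ("t1", ["a2", "a3"]), ("s", ["t0", "t1"])]

print(ilp(chain))  # 3 instructions over 3 steps
print(ilp(tree))   # 3 instructions over 2 steps
```

The two traces compute the same four-term sum, yet the pairwise version exposes more parallelism, which is the kind of algorithm-level difference such a measure is meant to reveal.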
This paper presents a parallel execution model and a manycore processor design to run C programs in parallel. The model automatically builds parallel sections of machine instructions from the run trace. It parallelizes instruction fetches, renamings, executions and retirements. Predictor-based fetch is replaced by a fetch-decode-and-partly-execute stage…
Several accurate algorithms to sum IEEE-754 floating-point numbers have recently been published. The recent contributions by Rump, Ogita and Oishi and the newest ones proposed by Zhu and Hayes are examples of accurate summation algorithms. Some of these even compute the faithful or the correct rounding of the exact sum, i.e. the most accurate value with…
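As background for the summation algorithms mentioned above, the following sketch shows classic compensated summation built on the error-free TwoSum transformation. It is an illustration of the general idea only, not one of the specific algorithms the abstract refers to, and it does not guarantee a faithfully or correctly rounded result. The function names are chosen for this example.

```python
import math


def two_sum(a, b):
    """Error-free transformation: return (s, err) with s = fl(a + b)
    and a + b == s + err exactly (barring overflow)."""
    s = a + b
    b_virtual = s - a
    err = (a - (s - b_virtual)) + (b - b_virtual)
    return s, err


def compensated_sum(values):
    """Sum floats while accumulating the rounding error of each addition."""
    total = 0.0
    compensation = 0.0
    for x in values:
        total, err = two_sum(total, x)
        compensation += err  # errors are summed in ordinary floating point
    return total + compensation


values = [1e16, 1.0, -1e16]
print(compensated_sum(values))
print(math.fsum(values))  # correctly rounded reference from the standard library
```

In this input the 1.0 is lost when added to 1e16 in plain double-precision arithmetic; the compensation term recovers it. `math.fsum` is used as the reference because it returns the correctly rounded sum.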
Tipi is a proposed implementation of the SMT architecture. This implementation is very close to a multiprocessor architecture, sharing only the second-level caches and the functional units. It is designed to achieve high performance with both a high IPC and a very short cycle (estimated at 10 gate levels). Functional unit sharing and very short cycle…
We measure and analyze the instruction-level parallelism that conditions the running-time performance of core numerical subroutines. We propose PerPI, a programmer-oriented tool that fills the gap between high-level algorithm analysis and machine-dependent profiling tools and provides reproducible results.