• Publications
  • Influence
The L-TAGE Branch Predictor
  • 59
  • 15
Practical data value speculation for future high-end processors
Dedicating more silicon area to single thread performance will necessarily be considered as worthwhile in future - potentially heterogeneous - multicores. In particular, Value prediction (VP) was…
  • 51
  • 14
A new case for the TAGE branch predictor
  • André Seznec
  • Computer Science
  • 44th Annual IEEE/ACM International Symposium on…
  • 3 December 2011
The TAGE predictor is often considered as state-of-the-art in conditional branch predictors proposed by academy. In this paper, we first present directions to reduce the hardware implementation cost…
  • 87
  • 13
Tarantula: a vector extension to the alpha architecture
Tarantula is an aggressive floating point machine targeted at technical, scientific and bioinformatics workloads, originally planned as a follow-on candidate to the EV8 processor [6, 5]. Tarantula…
  • 99
  • 12
A case for (partially) TAgged GEometric history length branch prediction
  • 151
  • 12
Zero-content augmented caches
It has been observed that some applications manipulate large amounts of null data. Moreover these zero data often exhibit high spatial locality. On some applications more than 20% of the data…
  • 93
  • 11
Practical and secure PCM systems by online detection of malicious write streams
Phase Change Memory (PCM) may become a viable alternative for the design of main memory systems in the next few years. However PCM suffers from limited write endurance. Therefore future adoption of…
  • 85
  • 11
Choosing representative slices of program execution for microarchitecture simulations: a preliminary
Microarchitecture simulations are aimed at providing results representative of the behavior of a processor running an application. Due to CPU time constraints, only a few execution slices of a large…
  • 122
  • 10
Improving cache behavior of dynamically allocated data structures
Poor data layout in memory may generate weak data locality and poor performance. Code transformations such as loop blocking or interchanging and array padding have addressed this issue for scientific…
  • 112
  • 8