Mapping applications for high performance on multithreaded, NUMA systems

@inproceedings{Cong2013MappingAF,
  title={Mapping applications for high performance on multithreaded, NUMA systems},
  author={Guojing Cong and Hui-Fang Wen},
  booktitle={Conf. Computing Frontiers},
  year={2013}
}
The communication latency and available resources for a group of logical processors are determined by their relative position in the hierarchy of chips, cores, and threads on modern shared-memory systems. Multithreaded applications exhibit different performance behavior depending on the mapping of software threads to logical processors. We observe the…