• Corpus ID: 14063547

Nsort: a Parallel Sorting Program for NUMA and SMP Machines

@inproceedings{Koester2000NsortAP,
  title={Nsort: a Parallel Sorting Program for NUMA and SMP Machines},
  author={Charles Koester and Jim Gray},
  year={2000}
}
Introduction: Ordinal™ Nsort™ is a high-performance sort program for SGI IRIX, Sun Solaris and HP-UX servers. Nsort allows its users to realize the full processing potential of their multi-processor, multi-disk Unix systems for sorting data. Unlike the slow, single-threaded Unix sort utility or other third-party sort programs that achieve only minor processing parallelism, only Nsort can use tens of processors and hundreds of disks to quickly sort and merge data.
Energy-efficient sorting using solid state disks
TLDR
Using a low-power processor, solid state disks, and efficient algorithms, this work beats the current records in the JouleSort benchmark for 10 GB to 1 TB of data by factors of up to 5.1.
The record-breaking terabyte sort on a compaq cluster
TLDR
A 1-terabyte commercial-quality scalable sort is run on a 72-node Windows NT cluster, which Sandia utilizes for production work contracted by the U.S. government and yields supercomputer-class performance.
Asynchronous parallel disk sorting
We develop an algorithm for parallel disk sorting, whose I/O cost approaches the lower bound and that guarantees almost perfect overlap between I/O and computation. Previous algorithms have either
Performance / Price Sort
TLDR
This paper documents this and proposes that the PennySort benchmark be revised to Performance/Price sort: a simple GB/$ sort metric based on a two-pass external sort.
TritonSort: A Balanced Large-Scale Sorting System
We present TritonSort, a highly efficient, scalable sorting system. It is designed to process large datasets, and has been evaluated against as much as 100 TB of input data spread across 832 disks in
TritonSort: A Balanced and Energy-Efficient Large-Scale Sorting System
TLDR
This article describes the hardware and software architecture necessary to operate TritonSort, a highly efficient, scalable sorting system that is designed to process large datasets and is able to sort data at approximately 80% of the disks’ aggregate sequential write speed.
Performance / Price Sort and PennySort
TLDR
This paper documents this and proposes that the PennySort benchmark be revised to Performance/Price sort: a simple GB/$ sort metric based on a two-pass external sort.
Energy-efficient sorting using solid state disks
TLDR
Conclusions are drawn on how to design scalable energy-efficient systems for processing large data sets, such as combining relatively weak computing power with high-bandwidth storage devices.
A Minute with Nsort on a 32P NEC Windows Itanium2 Server
In March 2004, the Nsort program was able to sort 34 GB of data (340,000,000 100-byte records) in 58 seconds on a 32 processor Itanium® 2 NEC® Express5800/1320Xd running Microsoft® Windows® Server
Stxxl: Standard Template Library for XXL Data Sets
TLDR
Stxxl is an implementation of the C++ standard template library STL for external memory computations that supports parallel disks, overlapping between I/O and computation, and a pipelining technique that can save more than half of the I/Os.

References

AlphaSort: a RISC machine sort
TLDR
A new sort algorithm, called AlphaSort, demonstrates that commodity processors and disks can handle commercial batch workloads and proposes two new benchmarks: MinuteSort: how much can you sort in a minute, and DollarSort: how much can you sort for a dollar.
High-performance sorting on networks of workstations
We report the performance of NOW-Sort, a collection of sorting implementations on a Network of Workstations (NOW). We find that parallel sorting on a NOW is competitive to sorting on the large-scale
The Art in Computer Programming
TLDR
Here the authors haven’t even started the project yet, and already they’re forced to answer many questions: what will this thing be named, what directory will it be in, what type of module is it, how should it be compiled, and so on.
SyncSort Announces World Record Set on Digital's AlphaServer System
  • Press Release
  • 1996
The Data Warehouse Toolkit
  • The Data Warehouse Toolkit
  • 1996
Silicon Graphics Sets New Data Sorting Record
  • Press Release
  • 1995