Using many-core hardware to correlate radio astronomy signals

@inproceedings{Nieuwpoort2009UsingMH,
  title={Using many-core hardware to correlate radio astronomy signals},
  author={Rob van Nieuwpoort and John W. Romein},
  booktitle={ICS '09},
  year={2009}
}
A recent development in radio astronomy is to replace traditional dishes with many small antennas. The signals are combined to form one large, virtual telescope. The enormous data streams are cross-correlated to filter out noise. This is especially challenging, since the computational demands grow quadratically with the number of data streams. Moreover, the correlator is not only computationally intensive, but also very I/O intensive. The LOFAR telescope, for instance, will produce over 100… 
Correlating Radio Astronomy Signals with Many-Core Hardware
TLDR
This work analyzes the correlator algorithm on multi-core CPUs and many-core architectures, such as NVIDIA and ATI GPUs, and the Cell/B.E. processor, and identifies several important architectural problems which cause architectures to perform suboptimally.
Building Correlators with Many-Core Hardware
TLDR
This article explains how to implement and optimize signal-processing applications on multicore CPUs and many-core architectures, such as the Intel Core i7, NVIDIA and ATI graphics processor units (CPUs, and the Cell/BE), and discusses several important architectural problems which cause architectures to perform suboptimally.
Software Spectral Correlator for the 44-Element Ooty Radio Telescope
A Spectral Correlator is the main component of the real time signal processing for a Radio Telescope array. The correlation of signals received at each element with every other element of the array
The LOFAR correlator: implementation and performance analysis
TLDR
To meet the real-time requirements, the application is highly optimized, and reaches exceptionally high computational and I/O efficiencies, and the scalability of the system is studied, and it is shown that it scales well beyond the requirements.
Processing Real-Time LOFAR Telescope Data on a Blue Gene/P
LOFAR is the first of a new generation of radio telescopes. Rather than using expensive dishes, it forms a distributed sensor network that combines the signals from many thousands of simple antennas.
Accelerating radio astronomy cross-correlation with graphics processing units
We present a highly parallel implementation of the cross-correlation of time-series data using graphics processing units (GPUs), which is scalable to hundreds of independent inputs and suitable for
Adaptive Real-Time Imaging Synthesis Telescopes
  • M. Wright
  • Computer Science, Physics
    Int. J. High Perform. Comput. Appl.
  • 2012
TLDR
This work proposes to build a heterogeneous computing platform for real-time processing of radio telescope array data that comprises a flexible correlator, beam former, and imager that is based on state-of-the-art digital signal processing closely coupled with a computing cluster.
Analyzing LOFAR station processing on multi-core platforms
The LOw-Frequency ARray (LOFAR) is a phasedarray radio telescope in the Netherlands. Currently, the astronomical community is embarking upon designing a new telescope, the Square Kilometer Array
Implementation of a Correlation Algorithm on the Cyclops-64 Architecture
TLDR
The tools in use by radio astronomers are beginning to change, a shift towards the use of large arrays of much smaller, omnidirectional antennas is now underway, and a great deal of signal processing must be done to pull useful signals from the noise.
An efficient real-time data pipeline for the CHIME Pathfinder radio telescope X-engine
  • A. Recnik, K. Bandura, +5 authors K. Vanderlinde
  • Computer Science, Physics
    2015 IEEE 26th International Conference on Application-specific Systems, Architectures and Processors (ASAP)
  • 2015
TLDR
The CHIME Pathfinder is a new interferometric radio telescope that uses a hybrid FPGA/GPU FX correlator that processes over 819 Gb/s of 4+4-bit complex astronomical data from N=256 inputs across a 400MHz radio band.
...
1
2
3
4
...

References

SHOWING 1-10 OF 29 REFERENCES
Processing Real-Time LOFAR Telescope Data on a Blue Gene/P
LOFAR is the first of a new generation of radio telescopes. Rather than using expensive dishes, it forms a distributed sensor network that combines the signals from many thousands of simple antennas.
Astronomical real-time streaming signal processing on a Blue Gene/L supercomputer
TLDR
This paper describes a novel approach to process realtime, streaming telescope data in software, using a supercomputer, and focuses on the LOFAR CEntral Processing facility (CEP), that combines the signals of all LOFar stations.
A Radio Astronomy Correlator Optimized for the Xilinx Virtex-4 SX FPGA
TLDR
A correlator that is optimized for the Xilinx Virtex-4 SX FPGA, and its application in the SKAMP radio telescope at the Molonglo Radio Observatory is described.
Building high-resolution sky images using the Cell/B.E.
TLDR
The Cell/B.E.E can provide high performance for data-intensive applications at the price of increased programming efforts and with a significant aid from aggressive application-specific optimizations.
The Square Kilometre Array
TLDR
This paper outlines how the scientific challenges are translated into technical challenges, how the application of recent technology offers the potential of affordably meeting these challenges, and how the choices of technology will ultimately be made.
Efficient computation of sum-products on GPUs through software-managed cache
TLDR
A GPU-based MPF solver achieves up to 2700-fold speedup on random data and 270-fold on real-life genetic analysis datasets on GeForce 8800GTX GPU from NVIDIA over the optimized CPU version on an Intel 2.4GHz Core 2 with a 4MB L2 cache.
Brook for GPUs: stream computing on graphics hardware
TLDR
This paper presents Brook for GPUs, a system for general-purpose computation on programmable graphics hardware that abstracts and virtualizes many aspects of graphics hardware, and presents an analysis of the effectiveness of the GPU as a compute engine compared to the CPU.
PERI - auto-tuning memory-intensive kernels for multicore
TLDR
This work presents an auto-tuning approach to optimize application performance on emerging multicore architectures, and develops a code generator for each kernel that allows us to identify a highly optimized version for each platform, while amortizing the human programming effort.
Programming the Intel 80-core network-on-a-chip Terascale Processor
TLDR
The instruction set, the programming environment, the kernels written for the chip, and the experiences programming this microprocessor are described, which implies for future message passing, network-on-a-chip processors.
A Survey of General-Purpose Computation on Graphics Hardware
TLDR
The techniques used in mapping general-purpose computation to graphics hardware will be generally useful for researchers who plan to develop the next generation of GPGPU algorithms and techniques.
...
1
2
3
...