Achieving Efficient Realization of Kalman Filter on CGRA Through Algorithm-Architecture Co-design

@inproceedings{Merchant2018AchievingER,
  title={Achieving Efficient Realization of Kalman Filter on CGRA Through Algorithm-Architecture Co-design},
  author={Farhad Merchant and Tarun Vatwani and Anupam Chattopadhyay and Soumyendu Raha and S. K. Nandy and Ranjani Narayan},
  booktitle={ARC},
  year={2018}
}
  • Farhad Merchant, Tarun Vatwani, +3 authors Ranjani Narayan
  • Published in ARC 2018
  • Computer Science
  • In this paper, we present efficient realization of Kalman Filter (KF) that can achieve up to 65% of the theoretical peak performance of underlying architecture platform. KF is realized using Modified Faddeeva Algorithm (MFA) as a basic building block due to its versatility and REDEFINE Coarse Grained Reconfigurable Architecture (CGRA) is used as a platform for experiments since REDEFINE is capable of supporting realization of a set algorithmic compute structures at run-time on a Reconfigurable… CONTINUE READING

    Create an AI-powered research feed to stay up to date with new papers like this posted to ArXiv

    2
    Twitter Mentions

    Citations

    Publications citing this paper.
    SHOWING 1-3 OF 3 CITATIONS

    Applying Modified Householder Transform to Kalman Filter

    VIEW 6 EXCERPTS
    CITES BACKGROUND & METHODS

    A Systematic Approach for Acceleration of Matrix-Vector Operations in CGRA through Algorithm-Architecture Co-Design

    VIEW 1 EXCERPT
    CITES METHODS

    References

    Publications referenced by this paper.
    SHOWING 1-10 OF 24 REFERENCES

    Micro-architectural Enhancements in Distributed Memory CGRAs for LU and QR Factorizations

    VIEW 5 EXCERPTS

    An FPGA Based High throughput Discrete Kalman Filter Architecture for Real-Time Image Denoising

    VIEW 3 EXCERPTS
    HIGHLY INFLUENTIAL

    Efficient Realization of Table Look-Up Based Double Precision Floating Point Arithmetic

    VIEW 1 EXCERPT

    Co-exploration of NLA kernels and specification of Compute Elements in distributed memory CGRAs

    VIEW 1 EXCERPT

    Efficient QR Decomposition Using Low Complexity Column-wise Givens Rotation (CGR)

    VIEW 1 EXCERPT