Corpus ID: 210177618

Coded Data Rebalancing: Fundamental Limits and Constructions

@article{Krishnan2020CodedDR,
  title={Coded Data Rebalancing: Fundamental Limits and Constructions},
  author={Prasad Krishnan and V. Lalitha and L. Natarajan},
  journal={ArXiv},
  year={2020},
  volume={abs/2001.04939}
}
  • Prasad Krishnan, V. Lalitha, L. Natarajan
  • Published in ArXiv 2020
  • Computer Science, Mathematics
  • Distributed databases often suffer unequal distribution of data among storage nodes, which is known as `data skew'. Data skew arises from a number of causes such as removal of existing storage nodes and addition of new empty nodes to the database. Data skew leads to performance degradations and necessitates `rebalancing' at regular intervals to reduce the amount of skew. We define an $r$-balanced distributed database as a distributed database in which the storage across the nodes has uniform… CONTINUE READING

    Create an AI-powered research feed to stay up to date with new papers like this posted to ArXiv

    Figures and Topics from this paper.

    References

    Publications referenced by this paper.
    SHOWING 1-6 OF 6 REFERENCES

    A Fundamental Tradeoff Between Computation and Communication in Distributed Computing

    VIEW 4 EXCERPTS
    HIGHLY INFLUENTIAL

    The Hadoop Distributed File System

    VIEW 4 EXCERPTS
    HIGHLY INFLUENTIAL

    Coded MapReduce

    VIEW 2 EXCERPTS

    Fundamental Limits of Caching

    VIEW 2 EXCERPTS

    A Piggybacking Design Framework for Read-and Download-Efficient Distributed Storage Codes

    VIEW 1 EXCERPT