Shyamala Doraimani

The analysis of data usage in a large set of real traces from a high-energy physics collaboration revealed the existence of an emergent grouping of files that we coined "filecules". This paper presents the benefits of using this file grouping for prestaging data and compares it with previously proposed file grouping techniques along a range of performance…
Grid computing has reached the stage where deployments are mature and many collaborations run in production mode. Mature grid deployments offer the opportunity for revisiting and perhaps updating traditional beliefs related to workload models, which in turn leads to the re-evaluation of traditional resource management techniques. This paper analyzes usage…
This paper revisits a basic question in data management, namely whether locality of reference is an important factor for the performance of caches in grid workloads. We answer this question by experimental evaluations using more than two years of real workloads from a science collaboration. Our results show that: (1) locality of reference is significant…