• Publications
  • Influence
Logan: A Distributed Online Log Parser
TLDR
A data-driven log parser is trained on the authors' new Apache Spark dataset, the largest application log dataset yet, and a distributed online algorithm is implemented to accommodate for the large volume of data.
Delog: A Privacy Preserving Log Filtering Framework for Online Compute Platforms
TLDR
A privacy preserving framework which can be employed by Platform as a Service (PaaS) providers to utilize the user logs generated on the platform while protecting the potentially sensitive logged data and a distributed log parsing algorithm which leverages Locality Sensitive Hashing.
Singularity: Planet-Scale, Preemptive and Elastic Scheduling of AI Workloads
TLDR
This work presents Singularity, Microsoft’s globally distributed scheduling service for highly-efficient and reliable execution of deep learning training and inference workloads, and shows that the resulting efficiency and reliability gains are achieved with negligible impact on the steady-state performance.
Delog: A High-Performance Privacy Preserving Log Filtering Framework
TLDR
A privacy-preserving framework that can be employed by Platform as a Service (PaaS) providers to utilize the user logs generated on the platform while protecting the potentially sensitive logged data is described.
Learning Digital Circuits: A Journey Through Weight Invariant Self-Pruning Neural Networks
TLDR
This work uses the existing framework of binarized networks to find performant topologies by constraining the weights to be either, zero or one, and shows that such topologies achieve performance similar to standard networks while pruning more than 99% weights.