Automatic Application Identification from Billions of Files

@article{Soska2017AutomaticAI,
  title={Automatic Application Identification from Billions of Files},
  author={Kyle Soska and Christopher S. Gates and Kevin A. Roundy and N. Christin},
  journal={Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining},
  year={2017}
}
  • Kyle Soska, Christopher S. Gates, +1 author N. Christin
  • Published 2017
  • Computer Science
  • Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining
  • Understanding how to group a set of binary files into the piece of software they belong to is highly desirable for software profiling, malware detection, or enterprise audits, among many other applications. Unfortunately, it is also extremely challenging: there is absolutely no uniformity in the ways different applications rely on different files, in how binaries are signed, or in the versioning schemes used across different pieces of software. In this paper, we show that, by combining… CONTINUE READING
    HAC-T and Fast Search for Similarity in Security
    Endpoint Detection and Response: Why Use Machine Learning?

    References

    Publications referenced by this paper.
    SHOWING 1-3 OF 3 REFERENCES
    BPTree: An ℓ2 Heavy Hitters Algorithm Using Constant Memory
    • 44
    • Highly Influential
    • PDF
    An improved data stream summary: the count-min sketch and its applications
    • 1,527
    • Highly Influential
    • PDF
    Engineering ecient and e‚ective non-metric space library
    • 2013