Mining Statistically Significant Substrings using the Chi-Square Statistic

@article{Sachan2012MiningSS,
  title={Mining Statistically Significant Substrings using the Chi-Square Statistic},
  author={Mayank Sachan and Arnab Bhattacharya},
  journal={ArXiv},
  year={2012},
  volume={abs/1207.0144}
}
  • Mayank Sachan, Arnab Bhattacharya
  • Published in Proc. VLDB Endow. 2012
  • Computer Science, Mathematics
  • ArXiv
  • The problem of identification of statistically significant patterns in a sequence of data has been applied to many domains such as intrusion detection systems, financial models, web-click records, automated monitoring systems, computational biology, cryptology, and text analysis. An observed pattern of events is deemed to be statistically significant if it is unlikely to have occurred due to randomness or chance alone. We use the chi-square statistic as a quantitative measure of statistical… CONTINUE READING

    References

    Publications referenced by this paper.
    SHOWING 1-10 OF 28 REFERENCES

    Intrusion detection using the chi-square goodness-of-fit test for information assurance

    • R. Goonatilake, A. Herath, S. Herath, J. Herath
    • network, forensics and software security. J. Computing Sciences, 23(1):255–263
    • 2007
    VIEW 16 EXCERPTS
    HIGHLY INFLUENTIAL

    Red Sox vs. Yankees: The Great Rivalry

    VIEW 5 EXCERPTS
    HIGHLY INFLUENTIAL

    On finding the most statistically significant substring using the chi-square measure

    • S. Agarwal
    • Master’s thesis, Indian Institute of Technology, Kanpur
    • 2009
    VIEW 1 EXCERPT

    Comparison of Statistical Significance Criteria

    VIEW 1 EXCERPT

    Detection of significant sets of episodes in event sequences

    VIEW 1 EXCERPT

    LECTURE 23: OR

    VIEW 1 EXCERPT