Corpus ID: 54469647

Bridging the Generalization Gap: Training Robust Models on Confounded Biological Data

@article{Liu2018BridgingTG,
  title={Bridging the Generalization Gap: Training Robust Models on Confounded Biological Data},
  author={Tzu-Yu Liu and Ajay Kannan and Adam Drake and Marvin Bertin and Nathan Wan},
  journal={ArXiv},
  year={2018},
  volume={abs/1812.04778}
}
  • Tzu-Yu Liu, Ajay Kannan, +2 authors Nathan Wan
  • Published 2018
  • Mathematics, Computer Science
  • ArXiv
  • Statistical learning on biological data can be challenging due to confounding variables in sample collection and processing. Confounders can cause models to generalize poorly and result in inaccurate prediction performance metrics if models are not validated thoroughly. In this paper, we propose methods to control for confounding factors and further improve prediction performance. We introduce OrthoNormal basis construction In cOnfounding factor Normalization (ONION) to remove confounding… CONTINUE READING

    Figures, Tables, and Topics from this paper.

    References

    Publications referenced by this paper.
    SHOWING 1-10 OF 22 REFERENCES

    Domain-Adversarial Training of Neural Networks

    VIEW 2 EXCERPTS

    Generative Adversarial Nets

    VIEW 1 EXCERPT

    Bayesian Canonical correlation analysis

    Machine learning applications in genetics and genomics

    VIEW 1 EXCERPT