A Data-Driven Approach to the Fragile Families Challenge: Prediction through Principal-Components Analysis and Random Forests

@article{Compton2019ADA,
  title={A Data-Driven Approach to the Fragile Families Challenge: Prediction through Principal-Components Analysis and Random Forests},
  author={Ryan Compton},
  journal={Socius},
  year={2019},
  volume={5}
}
  • Ryan Compton
  • Published 2019
  • Computer Science
  • Socius
  • Sociological research typically involves exploring theoretical relationships, but the emergence of “big data” enables alternative approaches. This work shows the promise of data-driven machine-learning techniques involving feature engineering and predictive model optimization to address a sociological data challenge. The author’s group develops improved generalizable models to identify at-risk families. Principal-components analysis and decision tree modeling are used to predict six main… CONTINUE READING

    References

    Publications referenced by this paper.
    SHOWING 1-10 OF 20 REFERENCES
    Random Forests
    • 47,129
    • PDF
    Scikit-learn: Machine Learning in Python
    • 25,471
    • PDF
    SMOTE: Synthetic Minority Over-sampling Technique
    • 9,212
    • PDF
    XGBoost: A Scalable Tree Boosting System
    • 5,630
    • PDF
    Best practices for missing data management in counseling psychology.
    • 1,210
    • PDF
    Random Search for Hyper-Parameter Optimization
    • 3,582
    • PDF