Connecting Devices to Cookies via Filtering, Feature Engineering, and Boosting

  title={Connecting Devices to Cookies via Filtering, Feature Engineering, and Boosting},
  author={Michael Sungjun Kim and Jiwei Liu and Xiaozhou Wang and Wei Yang},
  journal={2015 IEEE International Conference on Data Mining Workshop (ICDMW)},
We present a supervised machine learning system capable of matching internet devices to web cookies through filtering, feature engineering, binary classification, and post processing. The system builds a reasonably sized training and testing data set through filtering and feature engineering. We build 415 features in total. Some of these features were engineered to be O(n) time, stand alone classifiers for this problem. Other features use various natural language processing (NLP) techniques… CONTINUE READING
3 Citations
9 References
Similar Papers


Publications citing this paper.


Publications referenced by this paper.
Showing 1-9 of 9 references

A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria

  • R R Core Team
  • URL
  • 2015
1 Excerpt

xgboost: eXtreme Gradient Boosting.

  • Chen, Tianqi, Tong He
  • 2015

Schapire . ” A decision - theoretic generalization of on - line learning and an application to boosting

  • Yoav Freund, E. Robert
  • Journal of computer and system sciences
  • 2010

Bagging predictors

  • Breiman, Leo
  • Machine Learning
  • 1996

Similar Papers

Loading similar papers…