Distribution-based aggregation for relational learning with identifier attributes

  title={Distribution-based aggregation for relational learning with identifier attributes},
  author={Claudia Perlich and Foster J. Provost},
  journal={Machine Learning},
Identifier attributes—very high-dimensional categorical attributes such as particular product ids or people's names—rarely are incorporated in statistical modeling. However, they can play an important role in relational modeling: it may be informative to have communicated with a particular set of people or to have purchased a particular set of products. A key limitation of existing relational modeling techniques is how they aggregate bags (multisets) of values from related entities. The… CONTINUE READING
Highly Cited
This paper has 120 citations. REVIEW CITATIONS


Publications citing this paper.
Showing 1-10 of 49 extracted citations

Automatic generation of relational attributes: An application to product returns

2016 IEEE International Conference on Big Data (Big Data) • 2016
View 8 Excerpts
Highly Influenced

Learning Classifiers from Distributional Data

2013 IEEE International Congress on Big Data • 2013
View 5 Excerpts
Highly Influenced

Learning classifiers from linked data

View 4 Excerpts
Highly Influenced

Corporate residence fraud detection

View 17 Excerpts
Method Support
Highly Influenced

121 Citations

Citations per Year
Semantic Scholar estimates that this publication has 121 citations based on the available data.

See our FAQ for additional information.


Publications referenced by this paper.
Showing 1-10 of 58 references