Sally: a tool for embedding strings in vector spaces

@article{Rieck2012SallyAT,
  title={Sally: a tool for embedding strings in vector spaces},
  author={Konrad Rieck and Christian Wressnegger and Alexander Bikadorov},
  journal={Journal of Machine Learning Research},
  year={2012},
  volume={13},
  pages={3247--3251}
}

Strings and sequences are ubiquitous in many areas of data analysis. However, only a few learning methods can be applied directly to this form of data. We present Sally, a tool for embedding strings in vector spaces, which allows a wide range of learning methods to be applied to string data. Sally implements a generalized form of the bag-of-words model, where strings are mapped to a vector space that is spanned by a set of string features, such as words or n-grams of words. The implementation of…
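
To make the bag-of-words idea in the abstract concrete, the following is a minimal Python sketch of an n-gram embedding. It is not Sally's implementation or interface; the function names `ngrams` and `embed`, the `delimiter` parameter, and the choice of raw counts as vector entries are assumptions made only for illustration.

```python
from collections import Counter


def ngrams(tokens, n):
    """Return all contiguous n-grams of a token list as tuples."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def embed(strings, n=2, delimiter=None):
    """Map each string to a count vector over the n-grams seen in the corpus.

    Hypothetical helper for illustration only.
    delimiter=None splits on whitespace (n-grams of words);
    delimiter="" treats every character as a token (character n-grams).
    """
    bags = []
    for s in strings:
        tokens = list(s) if delimiter == "" else s.split(delimiter)
        bags.append(Counter(ngrams(tokens, n)))

    # The vector space is spanned by every distinct feature in the corpus;
    # sorting fixes one dimension per feature in a reproducible order.
    features = sorted(set().union(*bags))

    # A Counter returns 0 for features that do not occur in a string.
    vectors = [[bag[f] for f in features] for bag in bags]
    return features, vectors


features, vectors = embed(["a b a b", "b a c"], n=2)
print(features)  # [('a', 'b'), ('a', 'c'), ('b', 'a')]
print(vectors)   # [[2, 0, 1], [0, 1, 1]]
```

Each string becomes one row whose dimensions correspond to the word bigrams found in the corpus, so any learning method that operates on numeric vectors can be applied to the result. Dense Python lists are used here only for readability; for large feature sets a sparse representation would be the more practical choice.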
