Khashayar Khosravi

We don’t have enough information about this author to calculate their statistics. If you think this is an error let us know.
Learn More
We provide a unifying view of statistical information measures, multi-class classification problems, multi-way Bayesian hypothesis testing, and loss functions, elaborating equivalence results between all of these objects. In particular, we consider a particular generalization of f-divergences to multiple distributions, and we show that there is a(More)
Recurrent neural networks have been very successful at predicting sequences of words in tasks such as language modeling. However, all such models are based on the conventional classification framework, where model is trained against one-hot targets, and each word is represented both as an input and as an output in isolation. This causes inefficiencies in(More)
  • 1