Improving the performance of Naive Bayes multinomial in e-mail foldering by introducing distribution-based balance of datasets

@article{Bermejo2011ImprovingTP,
  title={Improving the performance of Naive Bayes multinomial in e-mail foldering by introducing distribution-based balance of datasets},
  author={Pablo Bermejo and Jos{\'e} A. G{\'a}mez and Jose Miguel Puerta},
  journal={Expert Syst. Appl.},
  year={2011},
  volume={38},
  pages={2072-2080}
}
E-mail foldering or e-mail classification into user predefined folders can be viewed as a text classification/categorization problem. However, it has some intrinsic properties that make it more difficult to deal with, mainly the large cardinality of the class variable (i.e. the number of folders), the different number of e-mails per class state and the fact that this is a dynamic problem, in the sense that e-mails arrive in our mail-forders following a time-line. Perhaps because of these… CONTINUE READING

Citations

Publications citing this paper.
Showing 1-10 of 11 extracted citations

References

Publications referenced by this paper.
Showing 1-10 of 49 references

I

  • E. Montañés, E. F. Combarro
  • Dı́az, J. Ranilla, Towards automatic and optimal…
  • 2005
Highly Influential
1 Excerpt

Similar Papers

Loading similar papers…