Word Sequences as Features in Text-learning

  title={Word Sequences as Features in Text-learning},
  author={Marko GrobelnikJ and Stefan InstituteJamova and Marko. Grobelnik},
Dunja Mladeni c & Marko Grobelnik J.Stefan Institute Jamova 39, 1000 Ljubljana, Slovenia Dunja.Mladenic@ijs.si, Marko.Grobelnik@ijs.si Abstract This paper proposes an e cient algorithm for the generation of new features that enrich the known bagof-words document representation. New features are generated based on word sequences of di erent length. Learning is performed using Naive Bayesian classi er on feature-vectors, where only highly scored features are used. The performance of enriched… CONTINUE READING
Highly Influential
This paper has highly influenced 11 other papers. REVIEW HIGHLY INFLUENTIAL CITATIONS
Highly Cited
This paper has 149 citations. REVIEW CITATIONS

From This Paper

Topics from this paper.


Publications citing this paper.
Showing 1-10 of 94 extracted citations

Text and Hypertext Categorization

Artificial Intelligence: An International Perspective • 2009
View 5 Excerpts
Highly Influenced

Using phrases as features in email classification

Journal of Systems and Software • 2009
View 4 Excerpts
Highly Influenced

149 Citations

Citations per Year
Semantic Scholar estimates that this publication has 149 citations based on the available data.

See our FAQ for additional information.


Publications referenced by this paper.
Showing 1-9 of 9 references

Fast Discovery of Association Rules

Advances in Knowledge Discovery and Data Mining • 1996
View 10 Excerpts
Highly Influenced

Machine Learning for Better Web Browsing

J. Stefan InstituteJamova, SloveniaDunja. Mladenic
View 1 Excerpt

E cient text cat- egorization

M. Grobelnik, D. Mladeni c
Proceedings of the ECML-98 Text Mining workshop • 1998

Turning Yahoo into an Automatic Web-Page Classi er

D. Mladeni c
Proceedings of the 13th European Conference on Arit cial Intelligence • 1998
View 1 Excerpt

Machine Learning. The McGraw- Hill Companies, Inc

T. M. Mitchell

Similar Papers

Loading similar papers…