Learn More
Problem transformation and algorithm adaptation are the two main approaches in machine learning to solve multilabel classification problem. The purpose of this paper is to investigate both approaches in multilabel classification for Indonesian news articles. Since this classification deals with a large number of features, we also employ some feature(More)
The rapid growth of social media, especially Twitter in Indonesia, has produced a large amount of user generated texts in the form of tweets. Since Twitter only provides the name and location of its users, we develop a classification system that predicts latent attributes of Twitter user based on his tweets. Latent attribute is an attribute that is not(More)
Electrical energy plays an important role in Indonesia's economic development. In Indonesia, many islands other than Java, Bali, Madura, still depend on Diesel Engine as the source of electrical energy. Biodiesel as an alternative of diesel engine also makes diesel engine as one possible solution for producing electrical energy. In order to be able to use(More)
Learning to rank is a technique in machine learning for ranking problem. This paper aims to investigate this technique to classify the responsible agencies of each complaint text of LAPOR, which is our government complaint management system. Since this categorization problem is multilabel one and the latest work using learning to rank for multilabel(More)
Indonesia still used diesel engine to produce electricity in isolated area because of its efficiency. Many attempts were made to keep it efficient, one of them by monitor and fix the problem quickly. Current monitoring system can inform if the engine get any problem, the expert then analyze and give advice to fix the problem. In the other side, lack of(More)
In various real case, imbalanced datasets problems are inevitable, such as in metal detecting security or diagnosis of disease. With the limitations of existing learning algorithms when faced with imbalanced datasets, the prediction error is caused by the dominance of the majority against the minority class. Various techniques have been made to address the(More)
Twitter is one of the very popular micro-blogging platforms for people to share content and information. Information propagates through the interaction between users with many different ways, such as retweet, mention or reply. With those abilities, Twitter has become one of the medium for advertisers to perform the marketing campaign. Sometimes in their(More)
The development of online news media grew in number in Indonesia. One technique of news articles summarization is guided summarization where the summary should contain important aspect information. Guided summarization techniques have been developed in the Text Analysis Conference (TAC) 2011 and one of the best methods is SWING by Jun-ping, et al. The(More)
Diesel Engine is still required in Indonesia, since diesel engine is the most appropriate engine to produce electricity in isolated area. Indonesia comprises of a large number of isolated area, and at the moment it is not the priority of the government to install cable connecting those isolated areas. The diesel engines can be used in the next 15-20 years,(More)
Time constraints often lead a reader of scientific paper to read only the title and abstract of the paper, but reading these parts is often ineffective. This study aims to extract information automatically in order to help the readers get structured information from a scientific paper. The information extraction is done by rhetorical classification of each(More)