Learn More
In recent years, mining with imbalanced data sets receives more and more attentions in both theoretical and practical aspects. This paper introduces the importance of imbalanced data sets and their broad application domains in data mining, and then summarizes the evaluation metrics and the existing methods to evaluate and solve the imbalance problem.(More)
Due to name abbreviations, identical names, name misspellings, and pseudonyms inpublications or bibliographies (citations), an author may have multiple names and multiple authors may share the same name. Such name ambiguity affects the performance of document retrieval, web search, database integration, and may cause improper attribution to authors. This(More)
Automatic metadata generation provides scalability and usability for digital libraries and their collections. Machine learning methods offer robust and adaptable automatic metadata extraction. We describe a Support Vector Machine classification-based method for metadata extraction from header part of research papers and show that it outperforms other(More)
An author may have multiple names and multiple authors may share the same name simply due to name abbreviations, identical names, or name misspellings in publications or bibliographies <sup>1</sup>. This can produce name ambiguity which can affect the performance of document retrieval, web search, and database integration, and may cause improper attribution(More)
Because of name variations, an author may have multiple names and multiple authors may share the same name. Such name ambiguity affects the performance of document retrieval, web search, database integration, and may cause improper attribution to authors. This paper presents a hierarchical naive Bayes mixture model, an unsupervised learning approach, for(More)
Cyclooxygenase-2 (COX-2) is known to mediate the cardioprotective effects of the late phase of ischemic preconditioning (PC); however, the signaling pathways involved in COX-2 induction following ischemic PC are unknown. In addition, although inducible nitric oxide synthase (iNOS) has been identified as a co-mediator of late PC together with COX-2, the(More)
Acknowledgements in research publications, like citations, indicate influential contributions to scientific work; however, large-scale acknowledgement analyses have traditionally been impractical due to the high cost of manual information extraction. In this paper we describe a mixture method for automatically mining acknowledgements from research documents(More)
The retina mainly contains ganglion, bipolar and photoreceptor cells which are distributed in the ganglion cell layer (GCL), inner nuclear layer (INL) and outer nuclear layer (ONL), respectively. Whether there is an age-related loss of these retinal cells remains not well understood. Cell density and the total number of cells were two commonly used measures(More)
BACKGROUND A deficiency of mesolimbic dopamine (DA) is a leading candidate for the etiology of certain symptoms of depression (e.g., anhedonia and loss of motivation). Here we show amounts of dopaminergic proteins in the amygdala, a key brain structure involved in the integration of emotions and stress, in subjects with major depression and in(More)
Intracranial venous structures have received increasing attention due to improved neuroimaging techniques and increased awareness of cerebral venous disease. To date, few studies have attempted to investigate the dural entrance of the cerebral bridging vein (BV). The aim of this study was to use the superior sagittal sinus (SSS) as an example to identify(More)