Learn More
There has been an explosive growth of data-mining models involving latent structure for clustering and classification. While having related objectives these models use different parameter-izations and often very different specifications and constraints. Model choice is thus a major methodological issue and a crucial practical one for applications. In this(More)
PNAS article classification is rooted in long-standing disciplinary divisions that do not necessarily reflect the structure of modern scientific research. We reevaluate that structure using latent pattern models from statistical machine learning, also known as mixed-membership models, that identify semantic structure in co-occurrence of words in the(More)
A review focused in its variants, computation and standardization for different scientific fields. Good practices for a literature survey are not followed by authors while preparing scientific manuscripts. ArXiv e-prints, May 2010. Three-feature model to reproduce the topology of citation networks and the effects from authors' visibility on their h-index.(More)
Internet health forums are a rich textual resource with content generated through free exchanges among patients and, in certain cases, health professionals. We tackle the problem of retrieving clinically relevant information from such forums, with relevant topics being defined from clinical auto-questionnaires. Texts in forums are largely unstructured and(More)
There has been an explosive growth of data-mining models involving latent structure for clustering and classification. While having related objectives these models use different parameter-izations and often very different specifications and constraints. Model choice is thus a major methodological issue and a crucial practical one for applications. In this(More)
  • 1