Shuheng Wu

Learn More
The advent of big science has brought a dramatic increase in the amount of data generated as part of scientific investigation. The ability to capture and prepare such data for reuse has brought about an increased interest in data curation practices within scientific fields and venues such as national laboratories. This study employs semi-structured(More)
To be effective and at the same time sustainable, a community data curation model has to be aligned with the community's current work organization: practices and activities; divisions of labor; data and collaborative relationships; and the community's value structure, norms, and conventions for data, quality assessment, and data sharing. This poster(More)
While there are increased efforts to extend existing controlled vocabularies through harvesting socially created image metadata from content creation communities (e.g., Flickr), questions remain about the quality and reuse value of this metadata. Data from a controlled experiment was used to examine relationships among categories of image tags, tag(More)
Few studies have examined bibliographic records enhancement in library catalogs. The purpose of this study is to identify the types and sources of bibliographic enhancement data used by libraries, online booksellers, and social cataloging sites. Based on a content analysis of 210 bibliographic records collected from six bibliographic systems, this study(More)
With the increasing number of multilingual webpages on the Internet, cross-language information retrieval has become an important research issue. Using Activity Theory as a theoretical framework, this study employs semi-structured interviews with key informants who are frequent users of Chinese-English mixed language queries in web searching. The findings(More)
(2015). Research project tasks, data, and perceptions of data quality in a condensed matter physics community. Abstract To be effective and at the same time sustainable, a community data curation model needs to be aligned with the community's current data practices, including research project activities, data types, and perceptions of data quality. Based on(More)
With the recent interest in socially created metadata as a potentially complementary resource for image description in relation to established tools such as thesauri and other forms of controlled vocabulary, questions remain about the quality and reuse value of these metadata. This study describes and examines a set of tags using quantitative and(More)
Team-based scientific collaborations play a key role in the discovery and distribution of scientific knowledge. In order to determine the social and organizational factors that help support a scientific team's successful transition from short-term experiments to long-term programs of ongoing scientific research, this study used observations of teams(More)
Entity and instance determination, disambiguation, and referencing, referred to as authority control in libraries, are essential for scientific research. This study examines the authority control practices and issues in molecular biology using literature and scenario analyses. The analyses imply that the concept of authority control in molecular biology is(More)
  • 1