Shuheng Wu

Learn More
team diversity and the impact of scientific publications: Evidence from physics research at a national science lab. ABSTRACT In the second half of the twentieth century, scientific research in physics, chemistry, and engineering began to focus on the use of large government funded laboratories. This shift toward so‐called big science also brought about a(More)
There have been ample suggestions in the literature that terms added to documents from Flickr and Wikipedia can complement traditional methods of indexing and controlled vocabularies. At the same time, adding new metadata to existing metadata objects may not always add value to those objects. This research examines the potential added value of using(More)
The advent of big science has brought a dramatic increase in the amount of data generated as part of scientific investigation. The ability to capture and prepare such data for reuse has brought about an increased interest in data curation practices within scientific fields and venues such as national laboratories. This study employs semi-structured(More)
This paper analyzes the authority control practices in molecular biology using literature review and scenario analysis and makes a comparison with bibliographic authority control. The analysis indicates the absence of conceptual authority control model in molecular bioinformatics. In addition to traditional knowledge organization tools, authority control in(More)
While there are increased efforts to extend existing controlled vocabularies through harvesting socially created image metadata from content creation communities (e.g., Flickr), questions remain about the quality and reuse value of this metadata. Data from a controlled experiment was used to examine relationships among categories of image tags, tag(More)
To be effective and at the same time sustainable, a community data curation model has to be aligned with the community's current work organization: practices and activities; divisions of labor; data and collaborative relationships; and the community's value structure, norms, and conventions for data, quality assessment, and data sharing. This poster(More)
With the increasing number of multilingual webpages on the Internet, cross-language information retrieval has become an important research issue. Using Activity Theory as a theoretical framework, this study employs semi-structured interviews with key informants who are frequent users of Chinese-English mixed language queries in web searching. The findings(More)
Few studies have examined bibliographic records enhancement in library catalogs. The purpose of this study is to identify the types and sources of bibliographic enhancement data used by libraries, online booksellers, and social cataloging sites. Based on a content analysis of 210 bibliographic records collected from six bibliographic systems, this study(More)
(2015). Research project tasks, data, and perceptions of data quality in a condensed matter physics community. Abstract To be effective and at the same time sustainable, a community data curation model needs to be aligned with the community's current data practices, including research project activities, data types, and perceptions of data quality. Based on(More)
With the recent interest in socially created metadata as a potentially complementary resource for image description in relation to established tools such as thesauri and other forms of controlled vocabulary, questions remain about the quality and reuse value of these metadata. This study describes and examines a set of tags using quantitative and(More)