Learn More
The Enron email corpus is appealing to researchers because it is a) a large scale email collection from b) a real organization c) over a period of 3.5 years. In this paper we contribute to the initial investigation of the Enron email dataset from a social network analytic perspective. We report on how we enhanced and refined the Enron corpus with respect to(More)
We introduce and evaluate a novel network-based approach for determining individual credit of coauthors in multi-authored papers. In the proposed model, coauthorship is conceptualized as a directed, weighted network, where authors transfer coauthorship credits among one another. We validate the model by fitting it to empirical data about authorship credits(More)
To facilitate the analysis of real and simulated data on groups, organizations and societies, tools and measures are needed that can handle relational or network data that is multi-mode, multi-link and multi-time period in which nodes and edges have attributes with possible data errors and missing data. The integrated CASOS dynamic network analysis toolkit(More)
Texts can be coded and analyzed as networks of concepts often referred to as maps or semantic networks. In such networks, for many texts, there are elements of social structure – the connections among people, organizations, events, and so on. Within organizational and social network theory an approach called the meta-matrix is used to describe social(More)
Herein we demonstrate how to get from sets of texts to network representations of covert networks represented in texts. We report on the application of a computer-supported technique that combines network analysis of texts with classifying social and organizational systems into an ontology called the meta-matrix model. The resulting combinatory method is(More)
Scholars have often relied on name initials to resolve name ambiguities in large-scale coauthorship network research. This approach bears the risk of incorrectly merging or splitting author identities. The use of initial-based disambiguation has been justified by the assumption that such errors would not affect research findings too much. This paper tests(More)
There is a pressing need to automatically collect data on social systems as rich network data, analyze such systems to find hidden relations and groups, prune the datasets to locate regions of interest, locate key actors, characterize the structure, locate points of vulnerability, and simulate change in a system as it evolves naturally or in response to(More)