Jana Diesner

Anticoagulation with intravenous heparin has been the standard treatment for the management of gestational thromboembolic complications. Catheter-directed thrombolysis is an encouraging approach for the treatment of thromboembolic disease and has not been previously reported during pregnancy. One gravid woman with pulmonary embolism, critically ill, and …
To facilitate the analysis of real and simulated data on groups, organizations and societies, tools and measures are needed that can handle relational or network data that is multi-mode, multi-link and multi-time period in which nodes and edges have attributes with possible data errors and missing data. The integrated CASOS dynamic network analysis toolkit …
Texts can be coded and analyzed as networks of concepts often referred to as maps or semantic networks. In such networks, for many texts, there are elements of social structure – the connections among people, organizations, events, and so on. Within organizational and social network theory an approach called the meta-matrix is used to describe social …
The Enron email corpus is appealing to researchers because it is a) a large scale email collection from b) a real organization c) over a period of 3.5 years. In this paper we contribute to the initial investigation of the Enron email dataset from a social network analytic perspective. We report on how we enhanced and refined the Enron corpus with respect to …
Scholars have often relied on name initials to resolve name ambiguities in large-scale coauthorship network research. This approach bears the risk of incorrectly merging or splitting author identities. The use of initial-based disambiguation has been justified by the assumption that such errors would not affect research findings too much. This paper tests …
Previous research has shown that one field with a strong yet unsatisfied need for automated extraction of instances of various entity classes from text data is the analysis of socio-technical systems (Carley, 2002; Diesner & Carley, 2005). Domain-specific entity classes and the relations between them are often specified in ontologies or taxonomies. We …
Anaphora resolution (AR) identifies the entities that pronouns refer to. Coreference resolution (CR) associates the various instances of an entity with each other. Given our data, our findings suggest that deduplicating and normalizing text data by using AR and CR impacts the literal mention, frequency, identity, and existence of about 75% of the entities …
This paper shows empirically how the choice of certain data pre-processing methods for disambiguating author names affects our understanding of the structure and evolution of co-publication networks. Thirty years of publication records from 125 Information Systems journals were obtained from DBLP. Author names in the data were pre-processed via algorithmic …