• Corpus ID: 215814484

Knowledge Scientists: Unlocking the data-driven organization

@article{Fletcher2020KnowledgeSU,
  title={Knowledge Scientists: Unlocking the data-driven organization},
  author={G. Fletcher and Paul T. Groth and Juan Sequeda},
  journal={ArXiv},
  year={2020},
  volume={abs/2004.07917}
}
Organizations across all sectors are increasingly undergoing deep transformation and restructuring towards data-driven operations. The central role of data highlights the need for reliable and clean data. Unreliable, erroneous, and incomplete data lead to critical bottlenecks in processing pipelines and, ultimately, service failures, which are disastrous for the competitive performance of the organization. Given its central importance, those organizations which recognize and react to the need… 

Tables from this paper

Making the collective knowledge of chemistry open and machine actionable
Large amounts of data are generated in chemistry labs—nearly all instruments record data in a digital form, yet a considerable proportion is also captured non-digitally and reported in ways
Advancement Of Deep Learning In Big Data And Distributed Systems
TLDR
The main contributions are summarized in a comparison table as detailed in Table 1, like the objectives, challenges, and novelty of each paper are clarified, the architecture or model and applications used, and the recommendations for each.
Designing a Visual Tool for Property Graph Schema Extraction and Refinement: An Expert Study
TLDR
The design space of visual tools that aim to help people create schemas for property graphs is explored and insights are used to establish design requirements and design a UI prototype, which are then relayed back to experts.

References

SHOWING 1-10 OF 19 REFERENCES
Data scientist: the sexiest job of the 21st century.
TLDR
Harvard Business School's Davenport and Greylock's Patil take a deep dive on what organizations need to know about data scientists: where to look for them, how to attract and develop them, and how to spot a great one.
How Data Science Workers Work with Data: Discovery, Capture, Curation, Design, Creation
TLDR
This paper building on the work of other CSCW and HCI researchers in describing the ways that scientists, scholars, engineers, and others work with their data, through analyses of interviews with 21 data science professionals sets five approaches to data along a dimension of interventions.
Data Stewardship addressing disciplinary data management needs
TLDR
The principles behind the Data Stewardship project at TU Delft, the progress so far, identify the key challenges and explain the plans for the future are described.
Big Data, Little Data, No Data: Scholarship in the Networked World
"Big Data" is on the covers of Science, Nature, the Economist, and Wired magazines, on the front pages of the Wall Street Journal and the New York Times. But despite the media hyperbole, as Christine
Data Integration: The Current Status and the Way Forward
TLDR
This work uses multiple real customer examples to highlight the technical difficulties around building a deployable and usable data integration software that tackles the data silos problem and the practical aspects involved in using machine learning to enable automating manual or rule-based processes for data integration tasks.
Human-Centered Study of Data Science Work Practices
TLDR
This workshop invites researchers to share their observations, experiences, hypotheses, and insights, in the hopes of developing a taxonomy of work practices and open issues in the behavioral and social study of data science and data science workers.
Dynamo: amazon's highly available key-value store
TLDR
D Dynamo is presented, a highly available key-value storage system that some of Amazon's core services use to provide an "always-on" experience and makes extensive use of object versioning and application-assisted conflict resolution in a manner that provides a novel interface for developers to use.
Knowledge Engineering: Principles and Methods
Bigtable: A Distributed Storage System for Structured Data
TLDR
The simple data model provided by Bigtable is described, which gives clients dynamic control over data layout and format, and the design and implementation of Bigtable are described.
Motivating Salespeople: What Really Works
TLDR
The key is to treat sales compensation not as an expense to rein in but as a portfolio of investments to manage, and companies that do this will be rewarded with much higher returns.
...
1
2
...