Corpus ID: 59553604

OODIDA: On-board/Off-board Distributed Data Analytics for Connected Vehicles

@article{Ulm2019OODIDAOD,
  title={OODIDA: On-board/Off-board Distributed Data Analytics for Connected Vehicles},
  author={Gregor Ulm and Emil Gustavsson and Mats Jirstrand},
  journal={ArXiv},
  year={2019},
  volume={abs/1902.00319}
}
Connected vehicles may produce gigabytes of data per hour, which makes centralized data processing impractical at the fleet level. [...] Key Method Its message-passing infrastructure has been implemented in Erlang/OTP, while the end points are language-agnostic. OODIDA is highly scalable and able to process a significant volume of data on resource-constrained clients.Expand
Active-Code Replacement in the OODIDA Data Analytics Platform
TLDR
The ability to execute user-defined Python modules on clients as well as the server is added, which facilitates use cases such as iterative A/B testing of machine learning algorithms or modifying experimental algorithms on-the-fly. Expand
Facilitating Rapid Prototyping in the OODIDA Data Analytics Platform via Active-Code Replacement.
TLDR
The ability to execute user-defined Python modules on clients as well as the server is added, which facilitates use cases such as iterative A/B testing of machine learning algorithms or modifying experimental algorithms on-the-fly. Expand
Querying Large Vehicular Networks: How to Balance On-Board Workload and Queries Response Timeƒ
TLDR
These tunable algorithms, which are evaluated on two large sets of real-world vehicular data, outperform baseline solutions and are able to balance the trade-off between the overall on-board workload and the response time needed to receive all answers for a set of queries. Expand
Collaborative Learning on the Edges: A Case Study on Connected Vehicles
TLDR
This work proposes CLONE, a collaborative learning setting on the edges based on the real-world dataset collected from a large electric vehicle (EV) company, built on top of the federated learning algorithm and long shortterm memory networks, and demonstrates the effectiveness of driver personalization, privacy serving, latency reduction (asynchronous execution), and security protection. Expand
Time- and Computation-Efficient Data Localization at Vehicular Networks’ Edge
TLDR
This work is the first to study how to localize, efficiently and distributively, relevant data in a vehicular fleet for analysis applications, and its algorithms are both efficient and configurable. Expand
Functional Federated Learning in Erlang (ffl-erl)
The functional programming language Erlang is well-suited for concurrent and distributed applications. Numerical computing, however, is not seen as one of its strengths. The recent introduction ofExpand
CLONE: Collaborative Learning on the Edges
TLDR
This work proposes a collaborative learning framework on the edges, named CLONE, which is steered by the real-world data sets collected from a large electric vehicle company and a grocery store of a shopping mall, respectively, to support real-time training and inference for connected vehicles and marketing intelligence services. Expand
Federated Learning for Healthcare Informatics
  • Jie Xu, F. Wang
  • Computer Science, Medicine
  • J. Heal. Informatics Res.
  • 2021
TLDR
The goal of this survey is to provide a review for federated learning technologies, particularly within the biomedical space, and summarize the general solutions to the statistical challenges, system challenges, and privacy issues in federation, and point out the implications and potentials in healthcare. Expand
Facilitating rapid prototyping in the distributed data analytics platform OODIDA via active-code replacement
TLDR
The main contribution of this paper is the description of a relatively straightforward approach to active-code replacement that is very user-friendly and enables a data analyst to quickly execute custom code on the cloud server as well as on client devices. Expand

References

SHOWING 1-10 OF 25 REFERENCES
Disco: a computing platform for large-scale data analytics
TLDR
Disco is a distributed computing platform for MapReduce style computations on large-scale data that provides both a fault-tolerant scheduling and execution layer as well as a distributed and replicated storage layer. Expand
Federated Optimization: Distributed Machine Learning for On-Device Intelligence
We introduce a new and increasingly relevant setting for distributed optimization in machine learning, where the data defining the optimization are unevenly distributed over an extremely large numberExpand
Functional Federated Learning in Erlang (ffl-erl)
The functional programming language Erlang is well-suited for concurrent and distributed applications. Numerical computing, however, is not seen as one of its strengths. The recent introduction ofExpand
Distributed Data Processing
TLDR
The factors that relate to organizations and their data processing requirements and to the various possible data processing configurations are considered, including price-performance ratio, organizational needs, and other factors that recommend the flexibility of distributed data processing. Expand
Distributed deep learning on edge-devices: Feasibility via adaptive compression
TLDR
It is shown that distributed deep learning computation on WAN connected devices feasible, in spite of the traffic caused by learning tasks, and that such a setup rises some important challenges, most notably the ingress traffic that the servers hosting the up-to-date model have to sustain. Expand
Spark: Cluster Computing with Working Sets
TLDR
Spark can outperform Hadoop by 10x in iterative machine learning jobs, and can be used to interactively query a 39 GB dataset with sub-second response time. Expand
MapReduce: Simplified Data Processing on Large Clusters
TLDR
This paper presents the implementation of MapReduce, a programming model and an associated implementation for processing and generating large data sets that runs on a large cluster of commodity machines and is highly scalable. Expand
Storm@twitter
TLDR
The architecture of Storm and its methods for distributed scale-out and fault-tolerance are described, how queries are executed in Storm is described, and some operational stories based on running Storm at Twitter are presented. Expand
A survey of stream processing
TLDR
An analysis of the development of each of these specialized topics to determine if a general theory of stream processing has emerged is presented, including a comparison of the semantic models that are used to formalize stream based computation. Expand
Parallel and Concurrent Programming in Haskell
TLDR
This tutorial covers the basic concepts involved in writing parallel and concurrent programs in Haskell, and takes a deliberately practical approach: most of the examples are real Haskell programs that you can compile, run, measure, modify and experiment with. Expand
...
1
2
3
...