Arpan Gujarati

Learn More
Contemporary multiprocessor real-time operating systems, such as VxWorks, LynxOS, QNX, and real-time variants of Linux, allow a process to have an arbitrary processor affinity, that is, a process may be pinned to an arbitrary subset of the processors in the system. Placing such a hard constraint on process migrations can help to improve cache performance of(More)
Contemporary multiprocessor real-time operating systems, such as VxWorks, LynxOS, QNX, and real-time variants of Linux, allow a process to have an arbitrary processor affinity, that is, a process may be pinned to an arbitrary subset of the processors in the system. Placing such a hard constraint on process migrations can help to improve cache performance of(More)
A method to bound the Failures In Time (FIT) rate of a CAN-based real-time system, i.e., the expected number of failures in one billion operating hours, is proposed. The method leverages an analysis, derived in the paper, of the probability of a correct and timely message transmission despite host and network failures due to electromagnetic interference(More)
We derive a sound lower bound on the mean time to failure of periodic systems with (m,k) constraints. We assume that upper bounds on the failure probabilities of each system iteration, e.g., a job or a runtime activation of a periodic task, or a single actuation cycle of a control loop, are known and that they satisfy the IID assumption. Our analysis(More)
Contemporary multiprocessor real-time operating systems, such as VxWorks, LynxOS, QNX, and real-time variants of Linux, allow a process to have an arbitrary processor affinity, that is, a process may be pinned to an arbitrary subset of the processors in the system. Placing such a hard constraint on process migrations can help to improve cache performance of(More)
Developers use Machine Learning (ML) platforms to train ML models and then deploy these ML models as web services for inference (prediction). A key challenge for platform providers is to guarantee response-time Service Level Agreements (SLAs) for inference workloads while maximizing resource e�ciency. Swayam is a fully distributed autoscaling framework that(More)
Networked control systems (NCS) [5]—where sensors, controllers, and actuators belonging to different control loops are connected through a shared network—are highly susceptible to both internal and external sources of electromagnetic interference (EMI), e.g., engine movements, TV towers, etc. [7]. System engineers thus use active replication (or static(More)
In time-sensitive, safety-critical systems that must be fail-operational, active replication is commonly used to mitigate transient faults that arise due to electromagnetic interference. However, designing an effective and well-performing active replication scheme is challenging since replication conflicts with the size, weight, power, and cost constraints(More)
From modern cars to airplanes to industrial plants, many applications that must execute in a timely manner are deployed on distributed systems. In case of safety-critical applications, like the anti-lock braking system of a car, the underlying system must tolerate inadvertent environmentally-induced faults to guarantee user safety. Since such systems often(More)