Share This Author
DaDianNao: A Machine-Learning Supercomputer
- Yunji Chen, Tao Luo, O. Temam
- Computer Science47th Annual IEEE/ACM International Symposium on…
- 13 December 2014
This article introduces a custom multi-chip machine-learning architecture, showing that, on a subset of the largest known neural network layers, it is possible to achieve a speedup of 450.65x over a GPU, and reduce the energy by 150.31x on average for a 64-chip system.
RCFile: A fast and space-efficient data placement structure in MapReduce-based warehouse systems
- Yongqiang He, Rubao Lee, Zhiwei Xu
- Computer ScienceIEEE 27th International Conference on Data…
- 11 April 2011
This paper presents a big data placement structure called RCFile (Record Columnar File) and its implementation in the Hadoop system and shows the effectiveness of RCFile in satisfying the four requirements.
Anycast Routing in Delay Tolerant Networks
Three semantics models of anycast in DTNs are defined, namely CM (Current Membership), TIM (Temporal Interval Membership) and TPM (Tem temporal Point Membership Model), which unambiguously define the intended receivers of a message in the anycast routing.
DaDianNao: A Neural Network Supercomputer
A custom multi-chip machine-learning architecture containing a combination of custom storage and computational units, with electrical and optical inter-chip interconnects separately is introduced, and it is shown that, on a subset of the largest known neural network layers, it is possible to achieve a speedup of 656.63× over a GPU, and reduce the energy by 184.05× on average for a 64-chip system.
IMR: High-Performance Low-Cost Multi-Ring NoCs
- Shaoli Liu, Tianshi Chen, Yunji Chen
- Computer ScienceIEEE Transactions on Parallel and Distributed…
- 1 June 2016
This paper presents a novel type of multi-ring NoC called isolated multi- ring (IMR), which can even support chip multiprocessors (CMPs) with 1,024 cores, and observes from experiments that IMR significantly outperforms its competitors in both saturation throughput and latency across all scenarios considered.
GridIS: an incentive-based grid scheduling
- Lijuan Xiao, Yanmin Zhu, L. Ni, Zhiwei Xu
- Computer Science19th IEEE International Parallel and Distributed…
- 4 April 2005
This work proposes an incentive-based grid scheduling, GridIS, which is composed of a P2P decentralized scheduling framework and incentive- based scheduling algorithms and simulation results show that GridlS guarantees the incentive of every participant to a satisfying extent.
Incentive-Based Scheduling for Market-Like Computational Grids
- Lijuan Xiao, Yanmin Zhu, L. Ni, Zhiwei Xu
- BusinessIEEE Transactions on Parallel and Distributed…
- 1 July 2008
This paper presents an incentive-based scheduling scheme, which utilizes a peer-to-peer decentralized scheduling framework, a set of local heuristic algorithms, and three market instruments of job announcement, price, and competition degree, and shows that this approach outperforms other scheduling schemes in optimizing incentives for both consumers and providers.
ELDA: Towards efficient and lightweight detection of cache pollution attacks in NDN
- Zhiwei Xu, Bo Chen, Ninghan Wang, Yujun Zhang, Zhongcheng Li
- Computer ScienceIEEE 40th Conference on Local Computer Networks…
- 26 October 2015
This work proposes ELDA, an Efficient and Lightweight Detection scheme against cache pollution attacks, in which a Lightweight Flajolet-Martin sketch is designed to monitor the interest traffic and demonstrates that, by consuming a few computation and memory resources, ELDA can effectively and efficiently detect CPA attacks.
Cloud-Sea Computing Systems: Towards Thousand-Fold Improvement in Performance per Watt for the Coming Zettabyte Era
- Zhiwei Xu
- Computer ScienceJournal of Computer Science and Technology
- 23 March 2014
This special section contains 12 papers produced by the Cloud-Sea Computing Systems project team, presenting research results relating to sensing and REST 2.0, the elastic processor, the hyperparallel server, and the cloud-sea storage.
DataMPI: Extending MPI to Hadoop-Like Big Data Computing
- Xiaoyi Lu, Fan Liang, Bing Wang, L. Zha, Zhiwei Xu
- Computer ScienceIEEE 28th International Parallel and Distributed…
- 19 May 2014
This paper abstracts the characteristics of key-value communication patterns into a bipartite communication model, which reveals four distinctions from MPI: Dichotomic, Dynamic, Data-centric, and Diversified features, and proposes the specification of a minimalistic extension to MPI.