First-principles calculations of electron states of a silicon nanowire with 100,000 atoms on the K computer

  title={First-principles calculations of electron states of a silicon nanowire with 100,000 atoms on the K computer},
  author={Yukihiro Hasegawa and Jun-ichi Iwata and Miwako Tsuji and Daisuke Takahashi and Atsushi Oshiyama and Kazuo Minami and Taisuke Boku and Fumiyoshi Shoji and Atsuya Uno and Motoyoshi Kurokawa and Hikaru Inoue and Ikuo Miyoshi and Mitsuo Yokokawa},
  journal={2011 International Conference for High Performance Computing, Networking, Storage and Analysis (SC)},
  • Yukihiro Hasegawa, J. Iwata, +10 authors M. Yokokawa
  • Published 2011
  • Computer Science
  • 2011 International Conference for High Performance Computing, Networking, Storage and Analysis (SC)
Real space DFT (RSDFT) is a simulation technique most suitable for massively-parallel architectures to perform first-principles electronic-structure calculations based on density functional theory. We here report unprecedented simulations on the electron states of silicon nanowires with up to 107,292 atoms carried out during the initial performance evaluation phase of the K computer being developed at RIKEN. The RSDFT code has been parallelized and optimized so as to make effective use of the… Expand
Performance evaluation of ultra-large-scale first-principles electronic structure calculation code on the K computer
Real-space density functional theory code performs first-principles electronic structure calculations and enables analysis of the behavior of a silicon nanowire with a diameter of 10–20 nm in a scale of simulation. Expand
Metascalable Quantum Molecular Dynamics Simulations of Hydrogen-on-Demand
  • K. Nomura, R. Kalia, +6 authors N. A. Romero
  • Computer Science
  • SC14: International Conference for High Performance Computing, Networking, Storage and Analysis
  • 2014
A novel nanostructural design for on-demand hydrogen production from water, advancing renewable energy technologies is revealed and metascalable (or "design once, scale on new architectures") algorithm is used for broader applications within a recently proposed divide-conquer-recombine paradigm. Expand
RSDFT-NEGF Quantum Transport Simulation of Ultra-Small Field-Effect Transistors
We describe our recent progress in developing a non-equilibrium Green's function (NEGF) quantum transport simulator based on the real-space density functional theory (RSDFT). The simulator isExpand
Extremely Scalable Algorithm for 108-atom Quantum Material Simulation on the Full System of the K Computer
  • T. Hoshi, H. Imachi, +4 authors F. Shoji
  • Materials Science, Computer Science
  • 2016 7th Workshop on Latest Advances in Scalable Algorithms for Large-Scale Systems (ScalA)
  • 2016
The present paper shows that an innovative scalable algorithm for a real research can appear by the co-design among application, algorithm and architecture. Expand
O ct 2 01 6 Extremely scalable algorithm for 10 8-atom quantum material simulation on the full system of the K computer
An extremely scalable linear-algebraic algorithm was developed for quantum material simulation (electronicstate calculation) with 10 atoms or 100-nm-scale materials. The mathematical foundation isExpand
Modeling Dilute Solutions Using First-Principles Molecular Dynamics: Computing more than a Million Atoms with over a Million Cores
Using a robust new algorithm, this work has developed an O(N) complexity solver for electronic structure problems with fully controllable numerical error, allowing for very accurate FPMD simulations of more than a million atoms on over a million cores. Expand
Quantum algorithms for predicting the properties of complex materials
Recently, a method was presented, applicable to any large Hermitian eigenproblem, by which the spectrum is partitioned among distinct groups of processors, making it compatible with existing schemes that parallelize at a physical level and at the level of primitive operations, e.g., matrix-vector multiplication. Expand
Pushing the Limit of Molecular Dynamics with Ab Initio Accuracy to 100 Million Atoms with Machine Learning
For 35 years, ab initio molecular dynamics (AIMD) has been the method of choice for modeling complex atomistic phenomena from first principles. However, most AIMD applications are limited byExpand
Electron Dynamics Simulation with Time-Dependent Density Functional Theory on Large Scale Symmetric Mode Xeon Phi Cluster
This study implements a real scientific code named Ab-initio Real-time Electron Dynamics simulator (ARTED), which is an electron dynamics simulator on (Time-Dependent Density Functional Theory (TDDFT) using the Symmetric mode, where all CPU and Intel Xeon Phi resources contribute to the computation. Expand
Magic angle and height quantization in nanofacets on SiC(0001) surfaces
We report on the density-functional calculations that provide microscopic mechanism of the facet formation on the SiC (0001) surface. We first identify atom-scale structures of single-, double-, andExpand


A massively-parallel electronic-structure calculations based on real-space density functional theory
A first-principles density functional program that efficiently performs large-scale calculations on massively-parallel computers and obtains a self-consistent electronic-structure in a few hundred hours is developed. Expand
High-performance computing for exact numerical approaches to quantum many-body problems on the earth simulator
From large-scale calculations, the calculations reveal that a change from Schrodinger's cat to classical like one can be controlled by tuning the interaction and is a marked contrast to the general concept. Expand
Quantum Transport Length Scales in Silicon-based Semiconducting Nanowires: Surface Roughness Effects
We report on a theoretical study of quantum charge transport in atomistic models of silicon nanowires with surface roughness disorder, using an efficient real-space, order N Kubo-Greenwood approachExpand
Self-interaction correction to density-functional approximations for many-electron systems
exchange and correlation, are not. We present two related methods for the self-interaction correction (SIC) of any density functional for the energy; correction of the self-consistent one-electronExpand
PACS-CS: a large-scale bandwidth-aware PC cluster for scientific computation
Under the preliminary performance evaluation, it is confirmed that the computation part in practical Lattice QCD code will be able to achieve 30% of peak performance, and up to 600 Mbyte/sec of bandwidth at single directed neighboring communication will be achieved. Expand
Pseudopotential plane-wave calculations for ZnS.
Ces calculs sont compares a ceux d'onde plane amelioree linearisee et la bande interdite fondamentale dans les deux cas sous-estimee and the bande d de Zn est moins liee que dans l'experience. Expand
Performance Evaluation of OpenMP and MPI Hybrid Programs on a Large Scale Multi-core Multi-socket Cluster, T2K Open Supercomputer
This work explores the performance of the OpenMP/MPI hybrid programming model on a large scale multi-core multi-socket cluster called T2K Open Supercomputer and shows that the multi- core multi- socket cluster can take advantage of the hybrid programmingmodel when it uses MPI across sockets and OpenMP within sockets. Expand
Efficient Parallel Implementation of Classical Gram-Schmidt Orthogonalization Using Matrix Multiplication
Extended Abstract The Gram-Schmidt orthogonalization process is one of the fundamental algorithms in linear algebra that implements the QR decomposition of a matrix into the factorization A = QR.Expand
T2K Open Supercomputer: Inter-university and Inter-disciplinary Collaboration on the New Generation Supercomputer
  • H. Nakashima
  • Computer Science
  • International Conference on Informatics Education and Research for Knowledge-Circulating Society (icks 2008)
  • 2008
The specifications are not only the result of the new concept and procurement principle, but also the conclusion of the discussion between computer scientists/engineers in the center and computational scientists of a wide spectrum of research field in the university. Expand
Sparc64 VIIIfx: A New-Generation Octocore Processor for Petascale Computing
The Sparc64 VIIIfx eight-core processor, developed for use in petascale computing systems, runs at speeds of up to 2 GHz and achieves a peak performance of 128 gigaflops while consuming as little asExpand