AVX-512
Known as: AVX3, AVX512, Advanced Vector Extensions 512
AVX-512 is a set of 512-bit extensions to the 256-bit Advanced Vector Extensions SIMD instructions for the x86 instruction set architecture (ISA), proposed by…
Wikipedia
Related topics
24 relations
256-bit
APL
Addressing mode
Advanced Vector Extensions
Papers overview
2019
Designing efficient SIMD algorithms for direct Connected Component Labeling
A. Hennequin, I. Masliah, L. Lacassagne
WPMVP'19
2019
Corpus ID: 59337271
Connected Component Labeling (CCL) is a fundamental algorithm in computer vision, and is often required for real-time…
2019
Optimization of the N-Body Simulation on Intel's Architectures Based on AVX-512 Instruction Set
Enzo Rucci, Ezequiel Moreno, Adrián Pousa, Franco Chichizola
Argentine Congress of Computer Science
2019
Corpus ID: 219481785
The N-body simulations have become a powerful tool to test the gravitational interaction among particles, ranging from a few…
2019
An Efficient Convolutional Neural Network Computation using AVX-512 Instructions
Hiroki Kataoka, Kohei Yamashita, K. Nakano, Yasuaki Ito, Akihiko Kasagi, T. Tabaru
2019
Corpus ID: 198342690
Recently, Convolutional Neural Networks (CNNs) are widely used for image processing. Since the computation cost is high, it is…
2019
Parallel Fully Vectorized Marsa-LFIB4: Algorithmic and Language-Based Optimization of Recursive Computations
Przemysław Stpiczyński
Parallel Processing and Applied Mathematics
2019
Corpus ID: 204790791
The aim of this paper is to present a new high-performance implementation of Marsa-LFIB4 which is an example of high-quality…
2017
Practical Implementation of Lattice QCD Simulation on Intel Xeon Phi Knights Landing
I. Kanamori, H. Matsufuru
International Symposium on Computing and…
2017
Corpus ID: 13771669
We investigate implementation of lattice Quantum Chromodynamics (QCD) code on the Intel Xeon Phi Knights Landing (KNL). The most…
2017
The Tersoff many-body potential: Sustainable performance through vectorization
M. Höhnerbach, A. Ismail, P. Bientinesi
arXiv.org
2017
Corpus ID: 5682994
Molecular dynamics models materials by simulating each individual particle's trajectory. Many-body potentials lead to a more…
2016
Chapter 13 – Performance libraries
Jim Jeffers, J. Reinders, Avinash Sodani
2016
Corpus ID: 63781881
2016
A new SIMD iterative connected component labeling algorithm
L. Lacassagne, Laurent Cabaret, D. Etiemble, Farouk Hebache, Andrea Petreto
WPMVP '16
2016
Corpus ID: 15140428
This paper presents a new multi-pass iterative algorithm for Connected Component Labeling. The performance of this algorithm is…
2015
Optimizing Total Energy–Mass Flux (TEMF) Planetary Boundary Layer Scheme for Intel’s Many Integrated Core (MIC) Architecture
Jarno Mielikäinen, Bormin Huang, Hung-Lung Huang
IEEE Journal of Selected Topics in Applied Earth…
2015
Corpus ID: 9035362
In order to make use of the ever-improving microprocessor performance, the applications must be modified to take advantage of the…
2015
Optimizing MAKWA on GPU and CPU
T. Pornin
IACR Cryptology ePrint Archive
2015
Corpus ID: 18518468
We present here optimized implementations of the MAKWA password hashing function on an AMD Radeon HD 7990 GPU, and compare its…