My Account Log in

3 options

Parallel computing : technology trends / edited by Ian Foster [and four others].

EBSCOhost Academic eBook Collection (North America) Available online

View online

EBSCOhost eBook Community College Collection Available online

View online

Ebook Central Academic Complete Available online

View online
Format:
Book
Contributor:
Foster, Ian, 1959- editor.
Series:
Advances in parallel computing ; Volume 36.
Advances in parallel computing ; Volume 36
Language:
English
Subjects (All):
Parallel processing (Electronic computers).
Physical Description:
1 online resource (xvii, 785 pages) : illustrations.
Edition:
1st ed.
Place of Publication:
Amsterdam ; Berlin ; Washington, District of Columbia : IOS Press, [2020]
Summary:
The year 2019 marked four decades of cluster computing, a history that began in 1979 when the first cluster systems using Components Off The Shelf (COTS) became operational.This achievement resulted in a rapidly growing interest in affordable parallel computing for solving compute intensive and large scale problems.
Contents:
Intro
Title Page
Preface
Conference Organisation
Contents
Opening
Four Decades of Cluster Computing
Invited Talks
Will We Ever Have a Quantum Computer?
Empowering Parallel Computing with Field Programmable Gate Arrays
Main Track
Deep Learning Applications
First Experiences on Applying Deep Learning Techniques to Prostate Cancer Detection
Deep Generative Model Driven Protein Folding Simulations
Economics
A Scalable Approach to Econometric Inference
Cloud vs On-Premise HPC: A Model for Comprehensive Cost Assessment
GPU Computing Methods
GPU Architecture for Wavelet-Based Video Coding Acceleration
GPGPU Computing for Microscopic Pedestrian Simulation
High Performance Eigenvalue Solver for Hubbard Model: Tuning Strategies for LOBPCG Method on CUDA GPU
Parallel Smoothers in Multigrid Method for Heterogeneous CPU-GPU Environment
Load Balancing Methods
Progressive Load Balancing in Distributed Memory. Mitigating Performance and Progress Variability in Iterative Asynchronous Algorithms
Learning-Based Load Balancing for Massively Parallel Simulations of Hot Fusion Plasmas
Load-Balancing for Large-Scale Soot Particle Agglomeration Simulations
On the Autotuning of Task-Based Numerical Libraries for Heterogeneous Architectures
Parallel Algorithms
Batched 3D-Distributed FFT Kernels Towards Practical DNS Codes
On Superlinear Speedups of a Parallel NFA Induction Algorithm
A Domain Decomposition Reduced Order Model with Data Assimilation (DD-RODA)
Predicting Performance of Classical and Modified BiCGStab Iterative Methods
Parallel Applications
Gadget3 on GPUs with OpenACC
Exploring High Bandwidth Memory for PET Image Reconstruction
Parallel Architecture
The Architecture of Heterogeneous Petascale HPC RIVR.
Design of an FPGA-Based Matrix Multiplier with Task Parallelism
Application Performance of Physical System Simulations
Parallel Methods
A Hybrid MPI+Threads Approach to Particle Group Finding Using Union-Find
Parallel Performance
Improving the Scalability of the ABCD Solver with a Combination of New Load Balancing and Communication Minimization Techniques
Characterization of Power Usage and Performance in Data-Intensive Applications Using MapReduce over MPI
Feedback-Driven Performance and Precision Tuning for Automatic Fixed Point Exploitation
Parallel Programming
A GPU-CUDA Framework for Solving a Two-Dimensional Inverse Anomalous Diffusion Problem
Parallelization Strategies for GPU-Based Ant Colony Optimization Applied to TSP
DBCSR: A Blocked Sparse Tensor Algebra Library
Acceleration of Hydro Poro-Elastic Damage Simulation in a Shared-Memory Environment
BERTHA and PyBERTHA: State of the Art for Full Four-Component Dirac-Kohn-Sham Calculations
Prediction-Based Partitions Evaluation Algorithm for Resource Allocation
Unified Generation of DG-Kernels for Different HPC Frameworks
Invasive Computing for Power Corridor Management
Enforcing Reference Capability in FastFlow with Rust
Performance
AITuning: Machine Learning-Based Tuning Tool for Run-Time Communication Libraries
Towards Benchmarking the Asynchronous Progress of Non-Blocking MPI Operations
Power Management
Acceleration of Interactive Multiple Precision Arithmetic Toolbox MuPAT Using FMA, SIMD, and OpenMP
Dynamic Runtime and Energy Optimization for Power-Capped HPC Applications
Programming Paradigms
Paradigm Shift in Program Structure of Particle-in-Cell Simulations
Backus FP Revisited: A Parallel Perspective on Modern Multicores
Multi-Variant User Functions for Platform-Aware Skeleton Programming.
Scalability Analysis
POETS: Distributed Event-Based Computing - Scaling Behaviour
Towards High-End Scalability on Biologically-Inspired Computational Models
Scientific Visualization
GraphiX: A Fast Human-Computer Interaction Symmetric Multiprocessing Parallel Scientific Visualization Tool
When Parallel Performance Measurement and Analysis Meets In Situ Analytics and Visualization
Stream Processing
Seamless Parallelism Management for Video Stream Processing on Multi-Cores
High-Level Stream Parallelism Abstractions with SPar Targeting GPUs
Mini-Symposia
Energy-Efficient Computing on Parallel Architectures (ECOPAR)
Energy-Efficiency Evaluation of FPGAs for Floating-Point Intensive Workloads
GPU Acceleration of Four-Site Water Models in LAMMPS
Energy Consumption of MD Calculations on Hybrid and CPU-Only Supercomputers with Air and Immersion Cooling
Direct N-Body Application on Low-Power and Energy-Efficient Parallel Architectures
Performance and Energy Efficiency of CUDA and OpenCL for GPU Computing Using Python
Computational Performances and Energy Efficiency Assessment for a Lattice Boltzmann Method on Intel KNL
Performance, Power Consumption and Thermal Behavioral Evaluation of the DGX-2 Platform
On the Performance and Energy Efficiency of Sparse Matrix-Vector Multiplication on FPGAs
Evaluation of DVFS and Uncore Frequency Tuning Under Power Capping on Intel Broadwell Architecture
ELPA - A Parallel Dense Eigensolver for Symmetric Matrices with Applications in Computational Chemistry
ELPA: A Parallel Solver for the Generalized Eigenvalue Problem
ParaFPGA 2019. Parallel Computing with FPGAs
Parallel Totally Induced Edge Sampling on FPGAs
An Implementation of Non-Local Means Algorithm on FPGA.
Accelerating Binarized Convolutional Neural Networks with Dynamic Partial Reconfiguration on Disaggregated FPGAs
Porting a Lattice Boltzmann Simulation to FPGAs Using OmpSs
A Processor Architecture for Executing Global Cellular Automata as Software
Crossbar Implementation with Partial Reconfiguration for Stream Switching Applications on an FPGA
Tools and Infrastructure for Reproducibility in Data-Intensive Applications
Cryptographic Methods with a Pli Cachet?. Towards the Computational Assurance of Integrity
Replicating Machine Learning Experiments in Materials Science
Documenting Computing Environments for Reproducible Experiments
Toward Enabling Reproducibility for Data-Intensive Research Using the Whole Tale Platform
Subject Index
Author Index.
Notes:
Includes index.
Description based on print version record.
Description based on publisher supplied metadata and other sources.
ISBN:
1-64368-071-4
OCLC:
1194445378

The Penn Libraries is committed to describing library materials using current, accurate, and responsible language. If you discover outdated or inaccurate language, please fill out this feedback form to report it and suggest alternative language.

Find

Home Release notes

My Account

Shelf Request an item Bookmarks Fines and fees Settings

Guides

Using the Find catalog Using Articles+ Using your account