Parallel computing : technology trends / edited by Ian Foster [and four others].
- Format:
- Book
- Series:
- Advances in parallel computing ; Volume 36.
- Language:
- English
- Subjects (All):
- Parallel processing (Electronic computers).
- Physical Description:
- 1 online resource (xvii, 785 pages) : illustrations.
- Edition:
- 1st ed.
- Place of Publication:
- Amsterdam ; Berlin ; Washington, District of Columbia : IOS Press, [2020]
- Summary:
- The year 2019 marked four decades of cluster computing, a history that began in 1979 when the first cluster systems using Components Off The Shelf (COTS) became operational. This achievement resulted in rapidly growing interest in affordable parallel computing for solving compute-intensive and large-scale problems.
- Contents:
- Intro
- Title Page
- Preface
- Conference Organisation
- Contents
- Opening
- Four Decades of Cluster Computing
- Invited Talks
- Will We Ever Have a Quantum Computer?
- Empowering Parallel Computing with Field Programmable Gate Arrays
- Main Track
- Deep Learning Applications
- First Experiences on Applying Deep Learning Techniques to Prostate Cancer Detection
- Deep Generative Model Driven Protein Folding Simulations
- Economics
- A Scalable Approach to Econometric Inference
- Cloud vs On-Premise HPC: A Model for Comprehensive Cost Assessment
- GPU Computing Methods
- GPU Architecture for Wavelet-Based Video Coding Acceleration
- GPGPU Computing for Microscopic Pedestrian Simulation
- High Performance Eigenvalue Solver for Hubbard Model: Tuning Strategies for LOBPCG Method on CUDA GPU
- Parallel Smoothers in Multigrid Method for Heterogeneous CPU-GPU Environment
- Load Balancing Methods
- Progressive Load Balancing in Distributed Memory. Mitigating Performance and Progress Variability in Iterative Asynchronous Algorithms
- Learning-Based Load Balancing for Massively Parallel Simulations of Hot Fusion Plasmas
- Load-Balancing for Large-Scale Soot Particle Agglomeration Simulations
- On the Autotuning of Task-Based Numerical Libraries for Heterogeneous Architectures
- Parallel Algorithms
- Batched 3D-Distributed FFT Kernels Towards Practical DNS Codes
- On Superlinear Speedups of a Parallel NFA Induction Algorithm
- A Domain Decomposition Reduced Order Model with Data Assimilation (DD-RODA)
- Predicting Performance of Classical and Modified BiCGStab Iterative Methods
- Parallel Applications
- Gadget3 on GPUs with OpenACC
- Exploring High Bandwidth Memory for PET Image Reconstruction
- Parallel Architecture
- The Architecture of Heterogeneous Petascale HPC RIVR
- Design of an FPGA-Based Matrix Multiplier with Task Parallelism
- Application Performance of Physical System Simulations
- Parallel Methods
- A Hybrid MPI+Threads Approach to Particle Group Finding Using Union-Find
- Parallel Performance
- Improving the Scalability of the ABCD Solver with a Combination of New Load Balancing and Communication Minimization Techniques
- Characterization of Power Usage and Performance in Data-Intensive Applications Using MapReduce over MPI
- Feedback-Driven Performance and Precision Tuning for Automatic Fixed Point Exploitation
- Parallel Programming
- A GPU-CUDA Framework for Solving a Two-Dimensional Inverse Anomalous Diffusion Problem
- Parallelization Strategies for GPU-Based Ant Colony Optimization Applied to TSP
- DBCSR: A Blocked Sparse Tensor Algebra Library
- Acceleration of Hydro Poro-Elastic Damage Simulation in a Shared-Memory Environment
- BERTHA and PyBERTHA: State of the Art for Full Four-Component Dirac-Kohn-Sham Calculations
- Prediction-Based Partitions Evaluation Algorithm for Resource Allocation
- Unified Generation of DG-Kernels for Different HPC Frameworks
- Invasive Computing for Power Corridor Management
- Enforcing Reference Capability in FastFlow with Rust
- Performance
- AITuning: Machine Learning-Based Tuning Tool for Run-Time Communication Libraries
- Towards Benchmarking the Asynchronous Progress of Non-Blocking MPI Operations
- Power Management
- Acceleration of Interactive Multiple Precision Arithmetic Toolbox MuPAT Using FMA, SIMD, and OpenMP
- Dynamic Runtime and Energy Optimization for Power-Capped HPC Applications
- Programming Paradigms
- Paradigm Shift in Program Structure of Particle-in-Cell Simulations
- Backus FP Revisited: A Parallel Perspective on Modern Multicores
- Multi-Variant User Functions for Platform-Aware Skeleton Programming
- Scalability Analysis
- POETS: Distributed Event-Based Computing - Scaling Behaviour
- Towards High-End Scalability on Biologically-Inspired Computational Models
- Scientific Visualization
- GraphiX: A Fast Human-Computer Interaction Symmetric Multiprocessing Parallel Scientific Visualization Tool
- When Parallel Performance Measurement and Analysis Meets In Situ Analytics and Visualization
- Stream Processing
- Seamless Parallelism Management for Video Stream Processing on Multi-Cores
- High-Level Stream Parallelism Abstractions with SPar Targeting GPUs
- Mini-Symposia
- Energy-Efficient Computing on Parallel Architectures (ECOPAR)
- Energy-Efficiency Evaluation of FPGAs for Floating-Point Intensive Workloads
- GPU Acceleration of Four-Site Water Models in LAMMPS
- Energy Consumption of MD Calculations on Hybrid and CPU-Only Supercomputers with Air and Immersion Cooling
- Direct N-Body Application on Low-Power and Energy-Efficient Parallel Architectures
- Performance and Energy Efficiency of CUDA and OpenCL for GPU Computing Using Python
- Computational Performances and Energy Efficiency Assessment for a Lattice Boltzmann Method on Intel KNL
- Performance, Power Consumption and Thermal Behavioral Evaluation of the DGX-2 Platform
- On the Performance and Energy Efficiency of Sparse Matrix-Vector Multiplication on FPGAs
- Evaluation of DVFS and Uncore Frequency Tuning Under Power Capping on Intel Broadwell Architecture
- ELPA - A Parallel Dense Eigensolver for Symmetric Matrices with Applications in Computational Chemistry
- ELPA: A Parallel Solver for the Generalized Eigenvalue Problem
- ParaFPGA 2019. Parallel Computing with FPGAs
- Parallel Totally Induced Edge Sampling on FPGAs
- An Implementation of Non-Local Means Algorithm on FPGA
- Accelerating Binarized Convolutional Neural Networks with Dynamic Partial Reconfiguration on Disaggregated FPGAs
- Porting a Lattice Boltzmann Simulation to FPGAs Using OmpSs
- A Processor Architecture for Executing Global Cellular Automata as Software
- Crossbar Implementation with Partial Reconfiguration for Stream Switching Applications on an FPGA
- Tools and Infrastructure for Reproducibility in Data-Intensive Applications
- Cryptographic Methods with a Pli Cachet? Towards the Computational Assurance of Integrity
- Replicating Machine Learning Experiments in Materials Science
- Documenting Computing Environments for Reproducible Experiments
- Toward Enabling Reproducibility for Data-Intensive Research Using the Whole Tale Platform
- Subject Index
- Author Index.
- Notes:
- Includes index.
- Description based on print version record.
- Description based on publisher supplied metadata and other sources.
- ISBN:
- 1-64368-071-4
- OCLC:
- 1194445378