Publications in collaboration with researchers from the Norwegian University of Science and Technology (22)

2023

  1. Near-optimal multi-accelerator architectures for predictive maintenance at the edge

    Future Generation Computer Systems, Vol. 140, pp. 331-343

2021

  1. Do Not Predict - Recompute! How Value Recomputation Can Truly Boost the Performance of Invisible Speculation

    Proceedings - 2021 International Symposium on Secure and Private Execution Environment Design, SEED 2021

2020

  1. Clearing the shadows: Recovering lost performance for invisible speculative execution through HW/SW Co-design

    Parallel Architectures and Compilation Techniques - Conference Proceedings, PACT

  2. Scalability analysis of AVX-512 extensions

    Journal of Supercomputing, Vol. 76, No. 3, pp. 2082-2097

  3. Understanding Selective Delay as a Method for Efficient Secure Speculative Execution

    IEEE Transactions on Computers, Vol. 69, No. 11, pp. 1584-1595

2019

  1. Efficient invisible speculative execution through selective delay and value prediction

    Proceedings - International Symposium on Computer Architecture

  2. Ghost Loads: What is the Cost of Invisible Speculation?

    ACM International Conference on Computing Frontiers 2019, CF 2019 - Proceedings

2018

  1. A vectorized k-means algorithm for compressed datasets: design and experimental analysis

    Journal of Supercomputing, Vol. 74, No. 6, pp. 2705-2728

  2. SWOOP: Software-hardware co-design for non-speculative, execute-ahead, in-order cores

    ACM SIGPLAN Notices, Vol. 53, No. 4, pp. 328-343

  3. SWOOP: Software-hardware co-design for non-speculative, execute-ahead, in-order cores

    Proceedings of the ACM SIGPLAN Conference on Programming Language Design and Implementation (PLDI)

  4. Static Instruction Scheduling for High Performance on Limited Hardware

    IEEE Transactions on Computers, Vol. 67, No. 4, pp. 513-527

2017

  1. Clairvoyance: Look-ahead compile-time scheduling

    CGO 2017 - Proceedings of the 2017 International Symposium on Code Generation and Optimization

  2. Energy efficiency effects of vectorization in data reuse transformations for many-core processors—a case study

    Journal of Low Power Electronics and Applications, Vol. 7, No. 1

  3. Transcending hardware limits with software out-of-order processing

    IEEE Computer Architecture Letters, Vol. 16, No. 2, pp. 162-165

2016

  1. Transient Temperature Prediction for Aging Thermal Sensors Using Artificial Neural Network

    Proceedings - 24th Euromicro International Conference on Parallel, Distributed, and Network-Based Processing, PDP 2016

2015

  1. ParVec: vectorizing the PARSEC benchmark suite

    Computing, Vol. 97, No. 11, pp. 1077-1100

  2. V-PFORDelta: Data Compression for Energy Efficient Computation of Time Series

    Proceedings - 22nd IEEE International Conference on High Performance Computing, HiPC 2015

2014

  1. Optimized hardware for suboptimal software: The case for SIMD-aware benchmarks

    ISPASS 2014 - IEEE International Symposium on Performance Analysis of Systems and Software

  2. Performance and energy impact of parallelization and vectorization techniques in modern microprocessors

    Computing, Vol. 96, No. 12, pp. 1179-1193

2013

  1. Energy-efficient sparse matrix autotuning with CSX - A trade-off study

    Proceedings - IEEE 27th International Parallel and Distributed Processing Symposium Workshops and PhD Forum, IPDPSW 2013