Publicaciones en colaboración con investigadores/as de Norwegian University of Science and Technology (22)

2023

  1. Near-optimal multi-accelerator architectures for predictive maintenance at the edge

    Future Generation Computer Systems, Vol. 140, pp. 331-343

2021

  1. Do Not Predict - Recompute! How Value Recomputation Can Truly Boost the Performance of Invisible Speculation

    Proceedings - 2021 International Symposium on Secure and Private Execution Environment Design, SEED 2021

2020

  1. Clearing the shadows: Recovering lost performance for invisible speculative execution through HW/SW Co-design

    Parallel Architectures and Compilation Techniques - Conference Proceedings, PACT

  2. Scalability analysis of AVX-512 extensions

    Journal of Supercomputing, Vol. 76, Núm. 3, pp. 2082-2097

  3. Understanding Selective Delay as a Method for Efficient Secure Speculative Execution

    IEEE Transactions on Computers, Vol. 69, Núm. 11, pp. 1584-1595

2019

  1. Efficient invisible speculative execution through selective delay and value prediction

    Proceedings - International Symposium on Computer Architecture

  2. Ghost Loads: What is the Cost of Invisible Speculation?

    ACM International Conference on Computing Frontiers 2019, CF 2019 - Proceedings

2018

  1. A vectorized k-means algorithm for compressed datasets: design and experimental analysis

    Journal of Supercomputing, Vol. 74, Núm. 6, pp. 2705-2728

  2. SWOOP: Software-hardware co-design for non-speculative, execute-ahead, in-order cores

    Proceedings of the ACM SIGPLAN Conference on Programming Language Design and Implementation (PLDI)

  3. SWOOP: Software-hardware co-design for non-speculative, execute-ahead, in-order cores

    ACM SIGPLAN Notices, Vol. 53, Núm. 4, pp. 328-343

  4. Static Instruction Scheduling for High Performance on Limited Hardware

    IEEE Transactions on Computers, Vol. 67, Núm. 4, pp. 513-527

2017

  1. Clairvoyance: Look-ahead compile-time scheduling

    CGO 2017 - Proceedings of the 2017 International Symposium on Code Generation and Optimization

  2. Energy efficiency effects of vectorization in data reuse transformations for many-core processors—a case study†

    Journal of Low Power Electronics and Applications, Vol. 7, Núm. 1

  3. Transcending hardware limits with software out-of-order processing

    IEEE Computer Architecture Letters, Vol. 16, Núm. 2, pp. 162-165

2016

  1. Transient Temperature Prediction for Aging Thermal Sensors Using Artificial Neural Network

    Proceedings - 24th Euromicro International Conference on Parallel, Distributed, and Network-Based Processing, PDP 2016

2015

  1. ParVec: vectorizing the PARSEC benchmark suite

    Computing, Vol. 97, Núm. 11, pp. 1077-1100

  2. V-PFORDelta: Data Compression for Energy Efficient Computation of Time Series

    Proceedings - 22nd IEEE International Conference on High Performance Computing, HiPC 2015

2014

  1. Optimized hardware for suboptimal software: The case for SIMD-aware benchmarks

    ISPASS 2014 - IEEE International Symposium on Performance Analysis of Systems and Software

  2. Performance and energy impact of parallelization and vectorization techniques in modern microprocessors

    Computing, Vol. 96, Núm. 12, pp. 1179-1193

2013

  1. Energy-efficient sparse matrix autotuning with CSX-A trade-off study

    Proceedings - IEEE 27th International Parallel and Distributed Processing Symposium Workshops and PhD Forum, IPDPSW 2013