Publications (48) Juan Manuel Cebrian Gonzalez publications

filter_list

2024

  1. Temporarily Unauthorized Stores: Write First, Ask for Permission Later

    Proceedings of the Annual International Symposium on Microarchitecture, MICRO

2023

  1. Near-optimal multi-accelerator architectures for predictive maintenance at the edge

    Future Generation Computer Systems, Vol. 140, pp. 331-343

2022

  1. Compiler-Assisted Compaction/Restoration of SIMD Instructions

    IEEE Transactions on Parallel and Distributed Systems, Vol. 33, Núm. 4, pp. 779-791

  2. Free Atomics: Hardware Atomic Operations without Fences

    Proceedings - International Symposium on Computer Architecture

  3. Splash-4: A Modern Benchmark Suite with Lock-Free Constructs

    Proceedings - 2022 IEEE International Symposium on Workload Characterization, IISWC 2022

2021

  1. Efficient, distributed, and non-speculative multi-address atomic operations

    Proceedings of the Annual International Symposium on Microarchitecture, MICRO

  2. Evaluation of clustering algorithms on hpc platforms

    Mathematics, Vol. 9, Núm. 17

2020

  1. Boosting store buffer efficiency with store-prefetch bursts

    Proceedings of the Annual International Symposium on Microarchitecture, MICRO

  2. Efficiency analysis of modern vector architectures: vector ALU sizes, core counts and clock frequencies

    Journal of Supercomputing, Vol. 76, Núm. 3, pp. 1960-1979

  3. High-throughput fuzzy clustering on heterogeneous architectures

    Future Generation Computer Systems, Vol. 106, pp. 401-411

  4. Improving predication efficiency through compaction/restoration of SIMD instructions

    Proceedings - 2020 IEEE International Symposium on High Performance Computer Architecture, HPCA 2020

  5. Offloading strategies for Stencil kernels on the KNC Xeon Phi architecture: Accuracy versus performance

    International Journal of High Performance Computing Applications, Vol. 34, Núm. 2, pp. 199-207

  6. Scalability analysis of AVX-512 extensions

    Journal of Supercomputing, Vol. 76, Núm. 3, pp. 2082-2097

  7. Semi-automatic validation of cycle-accurate simulation infrastructures: The case for gem5-x86

    Future Generation Computer Systems, Vol. 112, pp. 832-847

  8. Using Arm’s scalable vector extension on stencil codes

    Journal of Supercomputing, Vol. 76, Núm. 3, pp. 2039-2062

2019

  1. Poster: An optimized predication execution for simd extensions

    Parallel Architectures and Compilation Techniques - Conference Proceedings, PACT

2018

  1. A vectorized k-means algorithm for compressed datasets: design and experimental analysis

    Journal of Supercomputing, Vol. 74, Núm. 6, pp. 2705-2728

  2. Performance and energy effects on task-based parallelized applications: User-directed versus manual vectorization

    Journal of Supercomputing, Vol. 74, Núm. 6, pp. 2627-2637

  3. Stencil codes on a vector length agnostic architecture

    Parallel Architectures and Compilation Techniques - Conference Proceedings, PACT

2017

  1. A dedicated private-shared cache design for scalable multiprocessors

    Concurrency and Computation: Practice and Experience