Investigador Ramon y Cajal

Investigador Ramón y Cajal

Foto de Investigador Ramon y Cajal

Foto de Manuel Eugenio

Manuel Eugenio
Acacio Sanchez

Catedraticos de Universidad

Publicaciones en las que colabora con Manuel Eugenio Acacio Sanchez (28)

2023

CELLO: Compiler-Assisted Efficient Load-Load Ordering in Data-Race-Free Regions
Parallel Architectures and Compilation Techniques - Conference Proceedings, PACT
Flexagon: A Multi-dataflow Sparse-Sparse Matrix Multiplication Accelerator for Efficient DNN Processing
International Conference on Architectural Support for Programming Languages and Operating Systems - ASPLOS
STIFT: A Spatio-Temporal Integrated Folding Tree for Efficient Reductions in Flexible DNN Accelerators
ACM Journal on Emerging Technologies in Computing Systems, Vol. 19, Núm. 4

2022

Analysing software prefetching opportunities in hardware transactional memory
Journal of Supercomputing, Vol. 78, Núm. 1, pp. 919-944
Understanding the Design-Space of Sparse/Dense Multiphase GNN dataflows on Spatial Accelerators
Proceedings - 2022 IEEE 36th International Parallel and Distributed Processing Symposium, IPDPS 2022

2021

A novel network fabric for efficient spatio-temporal reduction in flexible DNN accelerators
Proceedings - 2021 15th IEEE/ACM International Symposium on Networks-on-Chip, NOCS 2021
STONNE: Enabling Cycle-Level Microarchitectural Simulation for DNN Inference Accelerators
IEEE Computer Architecture Letters, Vol. 20, Núm. 2, pp. 122-125
STONNE: Enabling Cycle-Level Microarchitectural Simulation for DNN Inference Accelerators
Proceedings - 2021 IEEE International Symposium on Workload Characterization, IISWC 2021

2020

CNN-SIM: A Detailed Arquitectural Simulator of CNN Accelerators
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

2019

InsideNet: A tool for characterizing convolutional neural networks
Future Generation Computer Systems, Vol. 100, pp. 298-315

2018

Photonic-based express coherence notifications for many-core CMPs
Journal of Parallel and Distributed Computing, Vol. 113, pp. 179-194

2017

A dedicated private-shared cache design for scalable multiprocessors
Concurrency and Computation: Practice and Experience

2015

Efficient hardware-supported synchronization mechanisms for manycores
Handbook on Data Centers (Springer New York), pp. 753-803
Fast and efficient commits for Lazy-Lazy hardware transactional memory
Journal of Supercomputing, Vol. 71, Núm. 12, pp. 4305-4326

2013

Deploying hardware locks to improve performance and energy efficiency of hardware transactional memory
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Design of an efficient communication infrastructure for highly contended locks in many-core CMPs
Journal of Parallel and Distributed Computing, Vol. 73, Núm. 7, pp. 972-985
ECONO: Express coherence notifications for efficient cache coherency in many-core CMPs
Proceedings - 2013 International Conference on Embedded Computer Systems: Architectures, Modeling and Simulation, IC-SAMOS 2013
Efficient Dir0B cache coherency for many-core CMPs
Procedia Computer Science

2012

Design of a collective communication infrastructure for barrier synchronization in cluster-based nanoscale MPSoCs
Proceedings -Design, Automation and Test in Europe, DATE
Efficient hardware barrier synchronization in many-core CMPs
IEEE Transactions on Parallel and Distributed Systems, Vol. 23, Núm. 8, pp. 1453-1466