Publications (58) by Jose Luis Abellan Miguel

2024

  1. AXI4MLIR: User-Driven Automatic Host Code Generation for Custom AXI-Based Accelerators

    CGO 2024 - Proceedings of the 2024 IEEE/ACM International Symposium on Code Generation and Optimization

  2. NeuraChip: Accelerating GNN Computations with a Hash-based Decoupled Spatial Accelerator

    Proceedings - International Symposium on Computer Architecture

  3. Scalability Limitations of Processing-in-Memory using Real System Evaluations

    Performance Evaluation Review, Vol. 52, No. 1, pp. 63-64

  4. Scalability Limitations of Processing-in-Memory using Real System Evaluations

    SIGMETRICS/PERFORMANCE 2024 - Abstracts of the 2024 ACM SIGMETRICS/IFIP PERFORMANCE Joint International Conference on Measurement and Modeling of Computer Systems

  5. Scalability limitations of processing-in-memory using real system evaluations

    Proceedings of the ACM on Measurement and Analysis of Computing Systems, Vol. 8, No. 1

2023

  1. Accelerating Finite Field Arithmetic for Homomorphic Encryption on GPUs

    IEEE Micro, Vol. 43, No. 5, pp. 55-63

  2. Flexagon: A Multi-dataflow Sparse-Sparse Matrix Multiplication Accelerator for Efficient DNN Processing

    International Conference on Architectural Support for Programming Languages and Operating Systems - ASPLOS

  3. GME: GPU-based Microarchitectural Extensions to Accelerate Homomorphic Encryption

    Proceedings of the 56th Annual IEEE/ACM International Symposium on Microarchitecture, MICRO 2023

  4. STIFT: A Spatio-Temporal Integrated Folding Tree for Efficient Reductions in Flexible DNN Accelerators

    ACM Journal on Emerging Technologies in Computing Systems, Vol. 19, No. 4

2022

  1. Accelerating Polynomial Multiplication for Homomorphic Encryption on GPUs

    Proceedings - 2022 IEEE International Symposium on Secure and Private Execution Environment Design, SEED 2022

  2. NaviSim: A Highly Accurate GPU Simulator for AMD RDNA GPUs

    Parallel Architectures and Compilation Techniques - Conference Proceedings, PACT

  3. Puppeteer: A Random Forest Based Manager for Hardware Prefetchers Across the Memory Hierarchy

    ACM Transactions on Architecture and Code Optimization, Vol. 20, No. 1

  4. The Challenge of Classification Confidence Estimation in Dynamically-Adaptive Neural Networks

    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

  5. Understanding the Design-Space of Sparse/Dense Multiphase GNN dataflows on Spatial Accelerators

    Proceedings - 2022 IEEE 36th International Parallel and Distributed Processing Symposium, IPDPS 2022

2021

  1. A novel network fabric for efficient spatio-temporal reduction in flexible DNN accelerators

    Proceedings - 2021 15th IEEE/ACM International Symposium on Networks-on-Chip, NOCS 2021

  2. GNNMark: A Benchmark Suite to Characterize Graph Neural Network Training on GPUs

    Proceedings - 2021 IEEE International Symposium on Performance Analysis of Systems and Software, ISPASS 2021

  3. METADOCK 2: a high-throughput parallel metaheuristic scheme for molecular docking

    Bioinformatics, Vol. 37, No. 11, pp. 1515-1520

  4. STONNE: Enabling Cycle-Level Microarchitectural Simulation for DNN Inference Accelerators

    IEEE Computer Architecture Letters, Vol. 20, No. 2, pp. 122-125

  5. STONNE: Enabling Cycle-Level Microarchitectural Simulation for DNN Inference Accelerators

    Proceedings - 2021 IEEE International Symposium on Workload Characterization, IISWC 2021

  6. Spartan: A Sparsity-Adaptive Framework to Accelerate Deep Neural Network Training on GPUs

    IEEE Transactions on Parallel and Distributed Systems, Vol. 32, No. 10, pp. 2448-2463