Juan Gómez Luna
Senior Researcher and Lecturer

Processing-In-Memory | Heterogeneous computing | Memory Systems | Bioinformatics | Medical imaging

Twitter | LinkedIn | Personal Website | GitHub

I’m a senior researcher and lecturer in the SAFARI Research Group at ETH Zurich. I received BS and MS degrees in Telecommunication Engineering from the University of Sevilla in 2001 and a PhD degree in Computer Science from the University of Córdoba in 2012. Between 2005 and 2017, I was a faculty member of the University of Córdoba. My research interests focus on GPU and heterogeneous computing, processing-in-memory, memory systems, and hardware and software acceleration of medical imaging and bioinformatics. I am the lead author of PrIM, the first publicly-available benchmark suite for a real-world processing-in-memory architecture, and Chai, a benchmark suite for heterogeneous systems with CPU/GPU/FPGA.

I’m currently a leading collaborator in the EU project BioPIM.


Teaching

Courses 2023:
Seminar in Computer Architecture (Co-Lecturer)
Projects & Seminars: Data-Centric Architectures: Fundamentally Improving Performance and Energy (Main Lecturer)
Projects & Seminars: Programming Heterogeneous Computing Systems with GPUs (Main Lecturer)

PUMPS+AI summer school at Barcelona Supercomputing Center

Courses 2022: 
Seminar in Computer Architecture (Co-Lecturer)
Projects & Seminars: Data-Centric Architectures: Fundamentally Improving Performance and Energy (Main Lecturer)
Projects & Seminars: Programming Heterogeneous Computing Systems with GPUs (Main Lecturer)

Previous Teaching
(see our SAFARI courses page)

Upcoming Tutorials
“Real-world Processing-in-Memory Architectures” at HPCA’23
[HPCA Real-world PIM Tutorial website]

“Real-world Processing-in-Memory Systems for Modern Workloads” at ASPLOS’23
[ASPLOS Real-world PIM Tutorial website]


Publications (recent)

Juan Gómez Luna, Yuxin Guo, Sylvan Brocard, Julien Legriel, Remy Cimadomo, Geraldo F. Oliveira, Gagandeep Singh, and Onur Mutlu, “Evaluating Machine Learning Workloads on Memory-Centric Computing Systems” Proceedings of the 2023 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), Raleigh, North Carolina, USA, April 2023. [Extended arXiv version] | [Slides (pptx) (pdf)] | [PIM-ML Source Code] Best paper session.
[Talk premiere on YouTube May 8 2023 3PM CEST]

Maurus Item, Juan Gómez Luna, Yuxin Guo, Geraldo F. Oliveira, Mohammad Sadrosadati, and Onur Mutlu, “TransPimLib: Efficient Transcendental Functions for Processing-in-Memory Systems” Proceedings of the 2023 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), Raleigh, North Carolina, USA, April 2023. [arXiv version] | [Slides (pptx) (pdf)] | [TransPimLib Source Code]
[Talk premiere on YouTube May 9 2023 3PM CEST]

Joël Lindegger, Damla Senol Cali, Mohammed Alser, Juan Gómez-Luna, Nika Mansouri Ghiasi, and Onur Mutlu, “Scrooge: A Fast and Memory-Frugal Genomic Sequence Aligner for CPUs, GPUs, and ASICs” Bioinformatics, [published online on] 24 March 2023. [Online link at Bioinformatics Journal] | [arXiv preprint] | [Scrooge Source Code]

Safaa Diab, Amir Nassereldine, Mohammed Alser, Juan Gómez Luna, Onur Mutlu, and Izzat El Hajj, “A Framework for High-throughput Sequence Alignment using Real Processing-in-Memory Systems” Bioinformatics, [published online on] 27 March 2023. [Online link at Bioinformatics Journal] | [arXiv preprint] | [AiM Source Code]

Alain Denzler, Rahul Bera, Nastaran Hajinazar, Gagandeep Singh, Geraldo F. Oliveira, Juan Gómez-Luna, and Onur Mutlu, “Casper: Accelerating Stencil Computation using Near-cache Processing” IEEE Access, 3 March 2023. Preprint in arXiv, 28 December 2021. [arXiv preprint] [IEEE Access version]

Lukas Breitwieser, Ahmad Hesam, Fons Rademakers, Juan Gómez Luna, and Onur Mutlu, “High-Performance and Scalable Agent-Based Simulation with BioDynaMo” Proceedings of the 28th ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming (PPoPP), Montreal, QC, Canada, February 2023. [arXiv version] [Slides (pdf) (web)] [BioDynaMo Source Code (Officially Artifact Evaluated with All Badges)] Officially artifact evaluated as available, reusable and reproducible. Best artifact award at PPoPP 2023.

Ataberk Olgun, Juan Gomez Luna, Konstantinos Kanellopoulos, Behzad Salami, Hasan Hassan, Oguz Ergin, and Onur Mutlu, “PiDRAM: A Holistic End-to-end FPGA-based Framework for Processing-in-DRAM” ACM Transactions on Architecture and Code Optimization (TACO), March 2023. [arXiv version] Presented at the 18th HiPEAC Conference, Toulouse, France, January 2023. [Slides (pptx) (pdf)] [Longer Lecture Slides (pptx) (pdf)] [Lecture Video (40 minutes)] [PiDRAM Source Code]

João Dinis Ferreira, Gabriel Falcao, Juan Gómez-Luna, Mohammed Alser, Lois Orosa, Mohammad Sadrosadati, Jeremie S. Kim, Geraldo F. Oliveira, Taha Shahroodi, Anant Nori, and Onur Mutlu, “pLUTo: Enabling Massively Parallel Computation in DRAM via Lookup Tables” Proceedings of the 55th International Symposium on Microarchitecture (MICRO), Chicago, IL, USA, October 2022. [Slides (pptx) (pdf)] [Longer Lecture Slides (pptx) (pdf)] [Lecture Video (26 minutes)] [arXiv version] [Source Code (Officially Artifact Evaluated with All Badges)] Officially artifact evaluated as available, reusable and reproducible.

Jisung Park, Roknoddin Azizi, Geraldo F. Oliveira, Mohammad Sadrosadati, Rakesh Nadig, David Novo, Juan Gómez-Luna, Myungsuk Kim, and Onur Mutlu, “Flash-Cosmos: In-Flash Bulk Bitwise Operations Using Inherent Computation Capability of NAND Flash Memory” Proceedings of the 55th International Symposium on Microarchitecture (MICRO), Chicago, IL, USA, October 2022. [Slides (pptx) (pdf)] [Longer Lecture Slides (pptx) (pdf)] [Lecture Video (44 minutes)] [arXiv version]

Sina Darabi, Mohammad Sadrosadati, Joël Lindegger, Negar Akbarzadeh, Mohammad Hosseini, Jisung Park, Juan Gómez-Luna, Onur Mutlu, and Hamid Sarbazi-Azad, “Morpheus: Extending the Last Level Cache Capacity in GPU Systems Using Idle GPU Core Resources” Proceedings of the 55th International Symposium on Microarchitecture (MICRO), Chicago, IL, USA, October 2022. [Slides (pptx) (pdf)] [Longer Lecture Slides (pptx) (pdf)] [Talk Video (12 minutes)] [Lecture Video (32 minutes)] [arXiv version]

Gagandeep Singh, Dionysios Diamantopoulos, Juan Gómez-Luna, Sander Stuijk, Henk Corporaal, and Onur Mutlu, “LEAPER: Fast and Accurate FPGA-based System Performance Prediction via Transfer Learning” Proceedings of the 40th IEEE International Conference on Computer Design (ICCD), Lake Tahoe, CA, October 2022. [Slides (pptx) (pdf)] [arXiv preprint]

Mohammed Alser, Joel Lindegger, Can Firtina, Nour Almadhoun Alserr, Haiyu Mao, Gagandeep Singh, Juan Gomez-Luna, and Onur Mutlu, “From Molecules to Genomic Variations: Accelerating Genome Analysis via Intelligent Algorithms and Architectures” Invited Article in Computational and Structural Biotechnology Journal (CSBJ), August 2022. [arXiv version with all Supplementary Materials] [Online version at the Computational and Structural Biotechnology Journal] [Source Code]

Gagandeep Singh, Rakesh Nadig, Jisung Park, Rahul Bera, Nastaran Hajinazar, David Novo, Juan Gomez-Luna, Sander Stuijk, Henk Corporaal, and Onur Mutlu, “Sibyl: Adaptive and Extensible Data Placement in Hybrid Storage Systems Using Online Reinforcement Learning” Proceedings of the 49th International Symposium on Computer Architecture (ISCA), New York, June 2022. [Slides (pptx) (pdf)] [arXiv version] [Sibyl Source Code] [Talk Video (16 minutes)]

Damla Senol Cali, Konstantinos Kanellopoulos, Joel Lindegger, Zulal Bingol, Gurpreet S. Kalsi, Ziyi Zuo, Can Firtina, Meryem Banu Cavlak, Jeremie Kim, Nika Mansouri Ghiasi, Gagandeep Singh, Juan Gomez-Luna, Nour Almadhoun Alserr, Mohammed Alser, Sreenivas Subramoney, Can Alkan, Saugata Ghose, and Onur Mutlu, “SeGraM: A Universal Hardware Accelerator for Genomic Sequence-to-Graph and Sequence-to-Sequence Mapping” Proceedings of the 49th International Symposium on Computer Architecture (ISCA), New York, June 2022. [Slides (pptx) (pdf)] [arXiv version]

Christina Giannoula, Ivan Fernandez, Juan Gomez-Luna, Nectarios Koziris, Georgios Goumas, and Onur Mutlu, “SparseP: Towards Efficient Sparse Matrix Vector Multiplication on Real Processing-In-Memory Architectures” Proceedings of the ACM International Conference on Measurement and Modeling of Computer Systems (SIGMETRICS), Mumbai, India, June 2022. [Extended arXiv Version] [Slides (pptx) (pdf)] [Long Talk Slides (pptx) (pdf)] [SparseP Source Code]
[Talk Video (16 minutes)] [Long Talk Video (55 minutes)]

Juan Gomez-Luna, Izzat El Hajj, Ivan Fernandez, Christina Giannoula, Geraldo F. Oliveira, and Onur Mutlu, “Benchmarking a New Paradigm: Experimental Analysis and Characterization of a Real Processing-in-Memory System” IEEE Access, 10 May 2022. [arXiv version] [PrIM Benchmarks Source Code] [Slides (pptx) (pdf)] [Long Talk Slides (pptx) (pdf)] [Short Talk Slides (pptx) (pdf)] [SAFARI Live Seminar Slides (pptx) (pdf)] [SAFARI Live Seminar Video (2 hrs 57 mins)] [Lightning Talk Video (3 minutes)] [Short Talk Video (21 minutes)] [1-hour Talk Video (58 minutes)]

Also see my Google Scholar page
For talks not listed here, see my other invited talks and lectures on the SAFARI talks page


Open Source Code

PrIM, the first publicly-available benchmark suite for a real-world processing-in-memory architecture
Chai, a benchmark suite for heterogeneous systems with CPU/GPU/FPGA