Table of Contents
Data-Centric Architectures: Fundamentally Improving Performance and Energy (227-0085-37L)
Course Description
Data movement between the memory units and the compute units of current computing systems is a major performance and energy bottleneck. From large-scale servers to mobile devices, data movement costs dominate computation costs in terms of both performance and energy consumption. For example, data movement between the main memory and the processing cores accounts for 62% of the total system energy in consumer applications. As a result, the data movement bottleneck is a huge burden that greatly limits the energy efficiency and performance of modern computing systems. This phenomenon is an undesired effect of the dichotomy between memory and the processor, which leads to the data movement bottleneck.
Many modern and important workloads such as machine learning, computational biology, graph processing, databases, video analytics, and real-time data analytics suffer greatly from the data movement bottleneck. These workloads are exemplified by irregular memory accesses, relatively low data reuse, low cache line utilization, low arithmetic intensity (i.e., ratio of operations per accessed byte), and large datasets that greatly exceed the main memory size. The computation in these workloads cannot usually compensate for the data movement costs. In order to alleviate this data movement bottleneck, we need a paradigm shift from the traditional processor-centric design, where all computation takes place in the compute units, to a more data-centric design where processing elements are placed closer to or inside where the data resides. This paradigm of computing is known as Processing-in-Memory (PIM).
This is your perfect P&S if you want to become familiar with the main PIM technologies, which represent “the next big thing” in Computer Architecture. You will work hands-on with the first real-world PIM architecture, will explore different PIM architecture designs for important workloads, and will develop tools to enable research of future PIM systems. Projects in this course span software and hardware as well as the software/hardware interface. You can potentially work on developing and optimizing new workloads for the first real-world PIM hardware or explore new PIM designs in simulators, or do something else that can forward our understanding of the PIM paradigm.
Prerequisites of the course:
- Digital Design and Computer Architecture (or equivalent course).
- Familiarity with C/C++ programming.
- Interest in future computer architectures and computing paradigms.
- Interest in discovering why things do or do not work and solving problems
- Interest in making systems efficient and usable
The course is conducted in English.
The course has two main parts:
1. Weekly lectures on processing-in-memory.
2. Hands-on project: Each student develops his/her own project.
Mentors
Name | Office | ||
---|---|---|---|
Lead Supervisor | Juan Gómez Luna | juan.gomez@safari.ethz.ch | ETZ H 61.1 |
Supervisor | Geraldo Francisco De Oliveira Junior | geraldod@inf.ethz.ch | ETZ H 64 |
Supervisor | Konstantinos Kanellopoulos | konstantinos.kanellopoulos@inf.ethz.ch | ETZ H 64 |
Supervisor | Nika Mansouri Ghiasi | mnika@student.ethz.ch | ETZ H 64 |
Lecture Video Playlist on YouTube
Spring 2023 Meetings/Schedule
Week | Date | Livestream | Meeting | Learning Materials | Assignments |
---|---|---|---|---|---|
W1 | 09.03 Thu. | Livestream | M1: P&S PIM Course Presentation (PDF) (PPT) | Required Materials Recommended Materials | HW 0 Out |
W2 | 16.03 Thu. | Premiere | M2: How to Evaluate Data Movement Bottlenecks (PDF) (PPT) | ||
Hands-on Project Proposals | |||||
W3 | 23.03 Thu. | Premiere | M3: Real-world PIM: UPMEM PIM (PDF) (PPT) | ||
W4 | 30.03 Thu. | Premiere | M4: Real-world PIM: Microbenchmarking of UPMEM PIM (PDF) (PPT) | ||
W5 | 06.04 Thu. | Premiere | M5: Real-world PIM: Samsung HBM-PIM (PDF) (PPT) | ||
W6 | 13.04 Thu. | Premiere | M6: Real-world PIM: SK Hynix AiM (PDF) (PPT) | ||
W7 | 20.04 Thu. | Premiere | M7: Real-world PIM: Samsung AxDIMM (PDF) (PPT) | ||
W8 | 27.04 Thu. | Premiere | M8: Real-world PIM: Alibaba HB-PNM (PDF) (PPT) | ||
W9 | 04.05 Thu. | Premiere | M9: Programming PIM Architectures (PDF) (PPT) | ||
W10 | 11.05 Thu. | Premiere | M10: Benchmarking and Workload Suitability on PIM (PDF) (PPT) | ||
W11 | 18.05 Thu. | Premiere | M11: SpMV on a Real PIM Architecture (PDF) (PPT) | ||
W12 | 25.05 Thu. | Premiere | M12: ML Training on a Real PIM Architecture (PDF) (PPT) | ||
W13 | 01.06 Thu. | Premiere | M13: Efficient Transcendental Functions on PIM (PDF) (PPT) | ||
W14 | 08.06 Thu. | Premiere | M14: Genome Sequence Alignment on PIM (PDF) (PPT) | ||
W15 | 15.06 Thu. | Premiere | M15: How to Enable the Adoption of PIM? (PDF) (PPT) |
Past Lecture Video Playlists on YouTube
Learning Materials
Meeting 1: Required Materials
- Processing Data Where It Makes Sense: Enabling In-Memory Computation (summary paper about recent research in PIM):
- Mutlu O., Memory-Centric Computing (Keynote Talk at the Thoughtworks Engineering for Research Symposium (E4R), February 2022):
Meeting 1: Recommended Materials
- Mutlu, O., Ghose, S., Gómez-Luna, J., and Ausavarungnirun, R. A Modern Primer on Processing in Memory. In Emerging Computing: From Devices to Systems, 2023.
- Processing-in-memory: A workload-driven perspective (summary paper about recent research in PIM):
- Gómez-Luna, J., El Hajj, I., Fernandez, I., Giannoula, C., Oliveira, G. F., and Mutlu, O. (2022). Benchmarking a New Paradigm: Experimental Analysis and Characterization of a Real Processing-in-Memory System. IEEE Access, 2022.
- Giannoula, C., Fernandez, I., Gómez-Luna, J., Koziris, N., Goumas, G., and Mutlu, O. SparseP: Towards Efficient Sparse Matrix Vector Multiplication on Real Processing-In-Memory Architectures. SIGMETRICS 2022.
- Olgun, A., Gómez-Luna, J., Kanellopoulos, K., Salami, B., Hassan, H., Ergin, O., and Mutlu, O. PiDRAM: A Holistic End-to-end FPGA-based Framework for Processing-in-DRAM. ACM TACO, 2022.