readings
Differences
This shows you the differences between two versions of the page.
Both sides previous revisionPrevious revisionNext revision | Previous revision | ||
readings [2020/12/31 15:59] – [Lecture 23 (28.12 Mon.)] firtinac | readings [2021/01/04 08:45] (current) – [Lecture 26 (31.12 Thu.)] firtinac | ||
---|---|---|---|
Line 709: | Line 709: | ||
* {{0211027.pdf| L. G. Valiant, "A Scheme for Fast Parallel Communication", | * {{0211027.pdf| L. G. Valiant, "A Scheme for Fast Parallel Communication", | ||
* {{https:// | * {{https:// | ||
- | * {{05749724.pdf|C. Fallin, C. Craik, and O. Mutlu, " | + | * {{chipper_hpca11.pdf|C. Fallin, C. Craik, and O. Mutlu, " |
- | * {{bufferless_springer14.pdf|C. Fallin, G. Nazario, X. Yu, K. Chang, R. Ausavarungnirun, | + | * {{bufferless-and-minimally-buffered-deflection-routing_springer14.pdf|C. Fallin, G. Nazario, X. Yu, K. Chang, R. Ausavarungnirun, |
- | * {{06209256.pdf|C. Fallin, G. Nazario, X. Yu, K. Chang, R. Ausavarungnirun, | + | * {{minimally-buffered-deflection-router_nocs12.pdf|C. Fallin, G. Nazario, X. Yu, K. Chang, R. Ausavarungnirun, |
=== Suggested (lecture 22): === | === Suggested (lecture 22): === | ||
* {{p168-patel.pdf| J. Patel, " | * {{p168-patel.pdf| J. Patel, " | ||
Line 722: | Line 722: | ||
* {{https:// | * {{https:// | ||
* {{P2626.pdf| P. Baran, "On Distributed Communication Networks", | * {{P2626.pdf| P. Baran, "On Distributed Communication Networks", | ||
- | * {{p106-das.pdf|R. Das, O. Mutlu, T. Moscibroda, and C.R. Das, " | + | * {{app-aware-noc_micro09.pdf|R. Das, O. Mutlu, T. Moscibroda, and C.R. Das, " |
===== Lecture 23 (28.12 Mon.) ===== | ===== Lecture 23 (28.12 Mon.) ===== | ||
=== Described in detail during lecture 23 === | === Described in detail during lecture 23 === | ||
- | | + | * {{chipper_hpca11.pdf|C. Fallin, C. Craik, and O. Mutlu, " |
- | * {{bufferless_springer14.pdf|C. Fallin, G. Nazario, X. Yu, K. Chang, R. Ausavarungnirun, | + | |
- | * {{06209256.pdf|C. Fallin, G. Nazario, X. Yu, K. Chang, R. Ausavarungnirun, | + | |
=== Suggested (lecture 23): === | === Suggested (lecture 23): === | ||
| | ||
Line 742: | Line 742: | ||
* {{30470407.pdf| W.W.L. Fung, I. Sham, G. Yuan, and T.M. Aamodt, " | * {{30470407.pdf| W.W.L. Fung, I. Sham, G. Yuan, and T.M. Aamodt, " | ||
===== Lecture 25 (30.12 Wed.) ===== | ===== Lecture 25 (30.12 Wed.) ===== | ||
- | === Recommended | + | === Suggested |
* {{cuda_c_programming_guide.pdf|NVIDIA, | * {{cuda_c_programming_guide.pdf|NVIDIA, | ||
* {{2013_programming_massively_parallel_processors_a_hands-on_approach_2nd.pdf| Hwu and Kirk , “Programming Massively Parallel Processors ” 2017}} | * {{2013_programming_massively_parallel_processors_a_hands-on_approach_2nd.pdf| Hwu and Kirk , “Programming Massively Parallel Processors ” 2017}} | ||
- | |||
- | === Suggested (lecture 25): === | ||
- | |||
* {{p140-fisher.pdf|Fisher , “Very Long Instruction Word Architectures and the ELI-512,” ISCA 1983}} | * {{p140-fisher.pdf|Fisher , “Very Long Instruction Word Architectures and the ELI-512,” ISCA 1983}} | ||
* {{sung_2012.pdf|I. Sung, G. D. Liu, and W. W. Hwu , “DL: A data layout transformation system for heterogeneous computing ,” INPAR 2012}} | * {{sung_2012.pdf|I. Sung, G. D. Liu, and W. W. Hwu , “DL: A data layout transformation system for heterogeneous computing ,” INPAR 2012}} | ||
Line 757: | Line 753: | ||
* {{ransac-publication.pdf|M.A. Fisher, and R.C. Bolles ”Random Sample Consensus: A Paradigm for Model Fitting with Applications to Image Analysis and Automated Cartography“, | * {{ransac-publication.pdf|M.A. Fisher, and R.C. Bolles ”Random Sample Consensus: A Paradigm for Model Fitting with Applications to Image Analysis and Automated Cartography“, | ||
+ | ===== Lecture 26 (31.12 Thu.) ===== | ||
+ | === Suggested (lecture 26): === | ||
+ | * {{https:// | ||
+ | * {{https:// | ||
+ | * {{https:// | ||
+ | * {{https:// | ||
+ | * {{https:// | ||
+ | * {{https:// | ||
+ | * {{https:// | ||
+ | * {{https:// | ||
+ | * {{https:// | ||
+ | * {{https:// | ||
+ | * {{https:// | ||
+ | * {{https:// | ||
+ | * {{https:// | ||
+ | * {{https:// | ||
+ | * {{https:// | ||
+ | * {{https:// | ||
+ | * {{https:// | ||
+ | * {{https:// | ||
+ | * {{https:// | ||
+ | * {{https:// | ||
+ | |||
+ | ===== Lecture 27 (4.01 Mon.) ===== | ||
+ | === Suggested (lecture 27): === | ||
+ | * {{1982-kung-why-systolic-architecture.pdf | H.T. Kung, “Why Systolic Architectures?, | ||
+ | * {{p1-Jouppi.pdf | N. Jouppi, C. Young, N. Patil, D. Patterson, G. Agrawal, R. Bajwa, S. Bates, S. Bhatia, N. Boden, A. Borchers and R. Boyle, “In-datacenter Performance Analysis of a Tensor Processing Unit,” ISCA 2017}} | ||
+ | * {{4824-imagenet-classification-with-deep-convolutional-neural-networks.pdf | A. Krizhevsky, I. Sutskever, G.E. Hinton, " | ||
+ | * {{GoogLeNet.pdf | C. Szegedy, W. Liu, Y. Jia, P. Sermanet, S. Reed, D. Anguelov, D. Erhan, V. Vanhoucke, A. Rabinovich, "Going Deeper with Convolutions," | ||
+ | * {{resnet.pdf | K. He, X. Zhang, S. Ren, J. Sun, “Deep Residual Learning for Image Recognition, | ||
+ | * {{p346-annaratone.pdf | M. Annaratone, E. Arnould, T. Gross, H.T. Kung, and M.S. Lam, “Warp Architecture and Implementation, | ||
+ | * {{ADA184329.pdf | M. Annaratone, E. Arnould, T. Gross, H.T. Kung, M. Lam, O. Menzilcioglu, | ||
+ | * {{Smith-1982-Decoupled-Access-Execute-Computer-Architectures.pdf | J.E. Smith, “Decoupled Access/ | ||
+ | * {{p199-smith.pdf | J.E. Smith, G. E. Dermer, B. D. Vanderwarn, S. D. Klinger, and C. M. Rozewski, "The ZS-1 Central Processor, | ||
+ | * {{DynamicScheduling.pdf | J.E. Smith, “Dynamic Instruction Scheduling and the Astronautics ZS-1,” IEEE Computer, 1989}} | ||
+ | * {{microarchitecture_pentium4_2001.pdf | G. Hinton, D. Sager, M. Upton, and D. Boggs, "The Microarchitecture of the Pentium® 4 Processor," | ||
+ | * {{mutlu_hpca_2003.pdf | O. Mutlu, J. Stark, C. Wilkerson, and Y.N. Patt, " | ||
+ | |||
+ | ===== Lecture 28 (4.01 Mon.) ===== | ||
+ | === Suggested (lecture 28): === | ||
+ | |||
+ | * {{parallel1964thornton.pdf | J. Thornton, “Parallel Operation in the Control Data 6600,” AFIPS 1964.}} | ||
+ | * {{pipelined1978smith.pdf | B.J. Smith, “A Pipelined, Shared Resource MIMD Computer, | ||
+ | * {{kongetira05_niagara.pdf | P. Kongetira, A. Kathirgamar, | ||
+ | * {{hep_burton.pdf | B.J. Smith, " | ||
+ | * {{tera_alverson.pdf | R. Alverson, D. Callahan, D. Cummings, B. Koblenz, A. Porterfield, |
readings.1609430378.txt.gz · Last modified: 2020/12/31 15:59 by firtinac