- Seth Goldstein (Tuesday, 2/19)
Lazy Threads: Implementing a Fast Parallel Call
Seth Copen Goldstein, Klaus Erik Schauser, and David E. Culler. Journal of Parallel and Distributed Computing. 1996.
- David Koes (Tuesday, 2/19)
Near-Optimal Instruction Selection on DAGs
David Ryan Koes and Seth Copen Goldstein.
CGO 2008.
- GPUs and memory hierarchy [Tim Murray] (2/21)
-
Optimizing Compiler for the Cell Processor PACT 2005.
(A. Eichenberger, K. O'Brien, K. O'Brien,
P. Wu, T. Chen, P. Oden, D. Prener, J. Shepherd, B. So,
Z. Sura, A. Wang, T. Zhang, P. Zhao, and M. Gschwind.)
- Brook for GPUs: stream computing on graphics hardware.
(Buck, I., Foley, T., Horn, D., Sugerman, J., Fatahalian, K., Houston, M., and Hanrahan, P.)
In ACM SIGGRAPH 2004 Papers
-
nVidia Corporation. nVidia
CUDA programming guide, version 1.1 (Read sections
2 and 3, skim sections 4.2 and 6, and look at the other sections if
you have time.)
- Compiling for Multicore [Saagar Patel] (3/6)
- Adaptive Compilation
[Stephanie Rosenthal] (2/21)
- John Cavazos, Grigori Fursin, Felix Agakov, Edwin Bonilla, Michael
F.P. O’Boyle, and Olivier Temam. Rapidly
Selecting Good Compiler Optimizations using Performance
Counters, In CGO 2007.
- Prasad A. Kulkarni, David B. Whalley, Gary S. Tyson, Jack
W. Davidson. Evaluating Heuristic Optimization
Phase Order Search Algorithms, In CGO 2007.
- Cooper, K. D., Grosul, A., Harvey, T. J., Reeves, S., Subramanian,
D., Torczon, L., and Waterman, T. ACME: adaptive compilation made efficient. In Proceedings of the 2005 ACM SIGPLAN/SIGBED Conference on Languages, Compilers, and Tools For Embedded Systems (Chicago, Illinois, USA, June 15 - 17, 2005). LCTES '05. ACM Press, New York, NY, 69-77.
- Code Verification [Pongsin Poosankam] (2/28)
- Transactional Memory [Panchalam S. Ramanujan] (3/6)
- Adl-Tabatabai, A., Lewis, B. T., Menon, V., Murphy, B. R., Saha,
B., and Shpeisman, T. Compiler and runtime support for efficient software transactional memory. SIGPLAN Not. 41, 6 (Jun. 2006), 26-37.
- Tim Harris, Mark Plesko, Avraham Shinnar, and David Tarditi. Optimizing Memory Transactions. In PLDI 2006.
- Pointer Analysis [Michael Sanphy] (2/26)
-
Ghiya, Lavery, and Sehr, "On the importance of
points-to analysis and other memory
disambiguation methods for C programs" PLDI 2001
- Hardekopf, B. and Lin, C. The ant and the grasshopper: fast and accurate pointer analysis for millions of lines of code. In PLDI 2007.
-
Nevin Heintze and Olivier Tardieu, "Ultra-fast aliasing
analysis using CLA: a million lines of C code in a second" PLDI 2001
-
Rakesh Ghiya and Laurie J. Hendren. ``Is it a Tree, a DAG, or a
Cyclic Graph? A Shape Analysis for Heap-Directed Pointers in C,''
in Proceedings of the 23rd ACM SIGPLAN-SIGACT Symposium on
Principles of
Programming Languages, St. Petersburg, Florida, pages 1-15, January
1996. p1-ghiya.pdf
-
Lattner and Adve.
Automatic pool allocation: improving performance by controlling data structure layout in the heap.
PLDI'05.
- Learning a Better Compiler [James Cipar] (2/28)
- Mark Stephenson and Saman Amarasinghe. Predicting Unroll Factors Using Supervised Classification. In CGO 2005.
- AbouGhazaleh, N., Ferreira, A., Rusu, C., Xu, R., Liberato, F., Childers, B., Mosse, D., and Melhem, R.
Integrated CPU and l2 cache voltage scaling using machine learning. In LCTES 2007.
- Calder, B., Grunwald, D., Jones, M., Lindsay, D., Martin, J., Mozer, M., and Zorn, B. Evidence-based static branch prediction using machine learning. ACM Trans. Program. Lang. Syst. 19, 1 (Jan. 1997)