15-745: Project Web Pages, Fall 2024
Compiler Optimizations for Machine Learning Applications:
Hardware-aware Compilation of Tensor Programs for Low-Precision Computing
, Kathryn Chen and Zhibo Chen
Implementation and Analysis of Automatic Generation of Efficient Sparse Tensor Format Conversion Routines
, Roland Liu and Bear Xiong
Mode DNN Program Compilation with MagPy in TVM
, Ruihang Lai and Lauren Sands
Compiler Optimizations for Memory Hierarchy Performance:
Autocaching for GPU Shared Memory Using LLVM
, Swamynathan Siva and Eliot Solomon
Register Tiling to Obtain Fast BLAS Kernels with Polly
, Jessica Li
Optimizing Code Size through Function Merging
, Byeongjee Kang and Tianqin Li
Miscellaneous Compiler Optimizations:
Automated Compiler Generation of Log-Based Lock-Free Locks
, Humza Ikram and Deepanjali Mishra
SLP Vectorization
, Enrico Green and Jonathan Rogers
Alpaca: Task-based intermittent computing
, Devin Qu and Shubham Bhargava
Logic Program Compilation
, Yichen Ni and Chase Norman
Partial specialization for subtype polymorphism in Virgil
, Yonah Goldberg and Jason Yao
P4CKET: Efficient Online Profiling for P4 Programs
, Darshil Kaneria and Sai Jaydeep Kudumula
Implementing Predicate Transfer in DuckDB
, Christos Laspias and Ajax Shung