In general, instruction overhead is not likely to be a significant problem. To prevent it from becoming a problem, the compiler should be careful to avoid register spilling problems. For codes that benefit from spatial locality, block prefetches may help reduce instruction overhead. Almost all of the instruction overhead of dense-matrix prefetches could be eliminated through programmable streams, although the corresponding hardware support is much more significant.