pub fn prefetch<T>(p: *const T)
Prefetches every cache line occupied by a value of type T, starting at p. Cache lines are assumed to be 64 bytes.
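
The following is a minimal sketch of how a function with this signature might behave, not the crate's actual implementation. The names `CACHE_LINE`, `cache_lines_for`, and `prefetch_sketch` are hypothetical, and the line count assumes `p` is aligned to a cache line (an unaligned value can straddle one extra line). On x86_64 it uses the standard `_mm_prefetch` intrinsic with the `_MM_HINT_T0` hint; on other targets it does nothing.

```rust
use std::mem::size_of;

/// Assumed cache line size in bytes, per the description above.
const CACHE_LINE: usize = 64;

/// Hypothetical helper: number of 64-byte lines needed to cover
/// `size_of::<T>()` bytes, assuming the value starts on a line boundary.
pub fn cache_lines_for<T>() -> usize {
    (size_of::<T>() + CACHE_LINE - 1) / CACHE_LINE
}

/// Hypothetical sketch: issue one prefetch hint per assumed cache line.
pub fn prefetch_sketch<T>(p: *const T) {
    let base = p as *const i8;
    for i in 0..cache_lines_for::<T>() {
        // `wrapping_add` only computes an address; nothing is dereferenced.
        let line = base.wrapping_add(i * CACHE_LINE);

        #[cfg(target_arch = "x86_64")]
        unsafe {
            // A prefetch is only a hint to the CPU and does not fault.
            std::arch::x86_64::_mm_prefetch::<{ std::arch::x86_64::_MM_HINT_T0 }>(line);
        }

        // No-op fallback on targets without this intrinsic.
        #[cfg(not(target_arch = "x86_64"))]
        let _ = line;
    }
}
```

Under these assumptions, a 65-byte type results in two prefetch hints and a zero-sized type results in none. A caller would typically pass a pointer to data it expects to read shortly, for example `prefetch_sketch(&table[next_index] as *const _)`, where `table` and `next_index` are placeholders.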