How to cook with SIMD?

Out of a (web)search for haskell ghc simd poor performance, this:

seems to be the most recent of the first six “vaguely-relevant” results - does anything there help?