Note that it's still around 2x slower than my best sequential solution, but because uni, I use the slow sequential solution and the fast parallel solution.