The SSE3 truncation is absolutely perfect, so there aren't any differences between the results. The SSE3-optimised code is about 15-20% faster.
Akos, are you referring to the current code or an older one (the one we started S5R1 with, or even the 4.37 from S4)? I thought that in the current code FISTTP is not of much use, and it definitely doesn't speed up the overall computation by 20 or even 15%.
BM

We're quite busy, too, getting the new hierarchical search code running, which we intend to use for the next run. I doubt that I will have any time left for more work on the current code.
BM