Hi Ee Lim,
Altenbach's suggestion is still more efficient!
Simple boolean operations and logical shifts are handled by a processor much faster than "complicated" math like division (and Q&R is a division operation). If you have the FPGA toolkit available, you can easily check this by comparing the resources needed for Altenbach's solution and for your Q&R approach…
I admit that using a single Q&R function on a standard PC will not hurt performance, but even there Altenbach's approach may be faster when it comes to analyzing large chunks of data…
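
To illustrate the general idea in text form (LabVIEW diagrams don't paste well), here is a minimal Python sketch. It assumes the divisor is a power of two and the input is a non-negative integer, which may not match your exact case, and the function names are made up for this example; it is not Altenbach's actual code. It only shows that a shift plus a mask gives the same result as Q&R. It does not measure speed, since Python timings say little about an FPGA or compiled LabVIEW code.

```python
def split_with_division(value: int, bits: int) -> tuple[int, int]:
    """Quotient and remainder for a divisor of 2**bits, using division (like Q&R)."""
    return divmod(value, 1 << bits)


def split_with_bit_ops(value: int, bits: int) -> tuple[int, int]:
    """Same result using only a logical shift and a boolean AND with a mask."""
    mask = (1 << bits) - 1  # e.g. bits=8 -> 0xFF
    return value >> bits, value & mask


if __name__ == "__main__":
    # Both approaches agree for a divisor of 256 (bits=8).
    for value in (0, 1, 255, 256, 1000, 65535):
        assert split_with_division(value, 8) == split_with_bit_ops(value, 8)
    print(split_with_bit_ops(1000, 8))  # prints (3, 232)
```

If the divisor is not a power of two, this shortcut does not apply directly and a real division is needed.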