Linearize and scale the MCU blocks into the destination buffer in a single pass
Benchmarking shows that this improves performance by 35% for the invitation document from https://github.com/mozilla/pdf.js/issues/3809.
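
For illustration only, here is a minimal sketch of what a combined linearize-and-scale pass could look like. It is not the code from this change: the function name `linearizeAndScale`, its parameters, the block-major layout of 8x8 blocks with a single component, and the nearest-neighbour sampling are all assumptions made for the example.

```javascript
// Hypothetical sketch, not the actual pdf.js implementation.
// Assumed layout: `blocks` stores 8x8 sample blocks one after another
// (64 samples per block, row-major inside each block), with
// `blocksPerLine` blocks in every row of blocks.
function linearizeAndScale(blocks, blocksPerLine, blocksPerColumn,
                           destWidth, destHeight) {
  const srcWidth = blocksPerLine * 8;
  const srcHeight = blocksPerColumn * 8;
  const scaleX = srcWidth / destWidth;
  const scaleY = srcHeight / destHeight;
  const dest = new Uint8Array(destWidth * destHeight);

  // For each destination column, precompute the offset of the matching
  // source sample relative to the start of its row of blocks:
  // (block column * 64) + (column inside the block).
  const xOffsets = new Uint32Array(destWidth);
  for (let x = 0; x < destWidth; x++) {
    const sx = Math.floor(x * scaleX);
    xOffsets[x] = ((sx >> 3) << 6) + (sx & 7);
  }

  // Single pass over the destination: each output sample is read straight
  // from the block storage, so no intermediate full-size linear buffer
  // is needed before scaling.
  let out = 0;
  for (let y = 0; y < destHeight; y++) {
    const sy = Math.floor(y * scaleY);
    // (block row * blocksPerLine * 64) + (row inside the block * 8)
    const rowOffset = (sy >> 3) * blocksPerLine * 64 + (sy & 7) * 8;
    for (let x = 0; x < destWidth; x++) {
      dest[out++] = blocks[rowOffset + xOffsets[x]];
    }
  }
  return dest;
}
```

In this sketch the per-column offsets are computed once and reused for every output row, which keeps the inner loop down to one table lookup and one copy per sample.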