Can we please introduce parallel build jobs, in order to speed up long sequences of multi-target builds? For example, a default of 2-4 concurrent Docker runs would dramatically accelerate xgo runs, without breaking the average dev host.