Loading Considerations when Paralleling Transformers