Session 3: ZeRO: Memory Optimizations Toward Training Trillion Parameter Models In session 3 we covered the paper "ZeRO: Memory Optimizations Toward Training Trillion Parameter Models." Paper Slides Recording