Solving OOM issues with a modified pytorch that has Large Model Support

A topic by mjc619 created Nov 17, 2020 Views: 268 Replies: 1

IBM and Microsoft have released libraries and conda packages that modify PyTorch to avoid out-of-memory (OOM) errors when scaling up. Hopefully some of this will be implemented in future versions of PyTorch (https://github.com/pytorch/pytorch/issues/35633).

Have you explored using any of these alternatives?
https://github.com/IBM/pytorch-large-model-support
https://github.com/microsoft/DeepSpeed
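
For anyone who wants to try the first link, here is a minimal sketch of switching Large Model Support on. It assumes an LMS-patched PyTorch build that exposes `torch.cuda.set_enabled_lms`, as described in the IBM repository's README; stock PyTorch does not have this call, so the helper (a hypothetical name, `enable_lms_if_available`) checks for it and falls back quietly.

```python
try:
    import torch
except ImportError:  # keep the sketch importable on machines without PyTorch
    torch = None


def enable_lms_if_available() -> bool:
    """Enable Large Model Support if this PyTorch build provides it.

    Returns True only when LMS was actually switched on.
    """
    if torch is None or not torch.cuda.is_available():
        return False
    # Assumption: only IBM's LMS-patched builds define this function.
    set_enabled_lms = getattr(torch.cuda, "set_enabled_lms", None)
    if set_enabled_lms is None:
        return False
    set_enabled_lms(True)
    return True


if __name__ == "__main__":
    print("LMS enabled:", enable_lms_if_available())
```

Calling the helper once at startup, before the model is moved to the GPU, should be enough; on an unpatched install it simply returns `False` and training continues as usual.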

Developer

That is a pretty interesting read. It seems to require some manual compilation and a few changes. It would be great if PyTorch could implement it officially, but I will take a look as well. Thanks for that.