Slackbot
10/10/2022, 4:58 PMJiang
10/11/2022, 3:39 AMtorch.cuda.empty_cache()
will not release the memory occupied by tensors. And model weights are tensors, too.Yakir Saadia
10/11/2022, 3:52 AMJiang
10/11/2022, 3:54 AMJiang
10/11/2022, 3:55 AMYakir Saadia
10/11/2022, 3:57 AMJiang
10/11/2022, 3:58 AMempty_cache
manually. Just rely on pytorch itself to do thatJiang
10/11/2022, 3:59 AMYakir Saadia
10/11/2022, 4:04 AMJiang
10/11/2022, 4:25 AMYakir Saadia
10/11/2022, 4:28 AMChaoyu
10/11/2022, 5:03 PMYakir Saadia
10/11/2022, 5:23 PM