Slackbot
12/19/2022, 4:54 PMJim Rohrer
12/19/2022, 5:33 PMrunners:
pytorch_mnist:
resources:
<http://nvidia.com/gpu|nvidia.com/gpu>: [0, 1]
Jim Rohrer
12/19/2022, 5:33 PMThomas Jacquemin
12/19/2022, 5:38 PMserve
failed: GPU device index in [0, 1] is greater than the system available: [0]`
But I would like to specify multi-instance gpu "partition"Jim Rohrer
12/19/2022, 5:56 PMThomas Jacquemin
12/19/2022, 5:59 PMThomas Jacquemin
12/19/2022, 6:00 PMJim Rohrer
12/19/2022, 6:07 PMChaoyu
12/19/2022, 8:07 PMJiang
12/20/2022, 2:14 AMstep 3
make sure nvidia device plugin is properly set up with MIG enabled.
2. use something like <http://nvidia.com/mig-1g.5gb|nvidia.com/mig-1g.5gb>: 1
instead of <http://nvidia.com/gpu|nvidia.com/gpu>
in the resources limit.Aaron Pham
12/21/2022, 1:53 AMJiang
12/22/2022, 6:04 AM