"W external/org_tensorflow/tensorflow/core/common_runtime/bfc_allocator.cc:457] Allocator (GPU_0_bfc) ran out of memory trying to allocate 24.27GiB (rounded to 26061494272)requested by op"
Is it possible to use a full GPU? And is it possible to access the other GPUs, not only the third one?
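For reference, the size in the warning can be checked with a little arithmetic. The sketch below converts the "rounded to" byte count from the log into GiB and compares it with the 40 GB card size mentioned in this thread. Whether "40 GB" means decimal or binary units is an assumption, so both are shown; the variable names are illustrative only.

```python
GIB = 1024 ** 3  # bytes in one gibibyte

# "rounded to" value copied from the allocator warning
requested_bytes = 26_061_494_272

# Card size quoted in the thread. Assumption: it may be decimal (10**9)
# or binary (2**30) gigabytes, so compute both.
card_gb = 40
card_bytes_decimal = card_gb * 10 ** 9
card_bytes_binary = card_gb * GIB

requested_gib = requested_bytes / GIB
share_decimal = requested_bytes / card_bytes_decimal
share_binary = requested_bytes / card_bytes_binary

print(f"single allocation: {requested_gib:.2f} GiB")
print(f"share of a 40 GB card (decimal units): {share_decimal:.0%}")
print(f"share of a 40 GB card (binary units):  {share_binary:.0%}")
```

This single request alone takes well over half of a full 40 GB card, so a partition exposing only part of a card may be too small for it, whatever else is already allocated.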
Hi,
I repartitioned the GPUs on node 03 so that each partition is one full card (40 GB).
It should be possible to use both GPUs on the machine, but I doubt AlphaFold can spread across several nodes. For the time being, only node 03 is dedicated to AlphaFold.
What should I set in the --gres parameter to use one full card? I tried --gres=3g:40gb:1 but got a node configuration error. And what should I set to use two GPUs?
My problem now is that some jobs are killed when they reach the 24 h walltime limit. Is it possible to extend it to 2 or 3 days?
In addition, a new release of AlphaFold (2.2.0) is now available. It adds new model parameters and can run the relaxation step (the last step of each prediction) on the GPU, which saves computation time. Would it be possible to install this version (2.2.0, with the parameters updated as well)? It is available on the AlphaFold GitHub. This is a major update for multimer predictions.
OK, thanks. I hadn't seen that the limit had been extended from one day to three. I am going to try it.
Three days should be enough for monomer predictions. For large multimer predictions it may not be, but that is something to try with the latest version of AlphaFold, once the AlphaFold team finds time to install it.
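For anyone writing the job script, here is a minimal sketch of how a three-day limit could be requested. It assumes the scheduler is Slurm (the --gres option suggests this, but the thread does not say so) and uses Slurm's days-hours:minutes:seconds time format. The helper name slurm_time is hypothetical, and whether the partition accepts three days depends on the site configuration.

```python
def slurm_time(days: int, hours: int = 0, minutes: int = 0, seconds: int = 0) -> str:
    """Format a duration as a Slurm 'days-hours:minutes:seconds' string."""
    if days < 0 or not (0 <= hours < 24 and 0 <= minutes < 60 and 0 <= seconds < 60):
        raise ValueError("duration fields out of range")
    return f"{days}-{hours:02d}:{minutes:02d}:{seconds:02d}"


# Build the directive line for a three-day walltime request.
three_days = slurm_time(3)
directive = f"#SBATCH --time={three_days}"
print(directive)
```

The printed line can be pasted into the header of the batch script, next to the --gres line.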