
starpu-devel - Re: [Starpu-devel] StarPU code hangs when using a lot of GPU memory

Subject: Developers list for StarPU

  • From: Mirko Myllykoski <mirkom@cs.umu.se>
  • To: Mirko Myllykoski <mirkom@cs.umu.se>
  • Cc: Starpu Devel <starpu-devel@lists.gforge.inria.fr>
  • Subject: Re: [Starpu-devel] StarPU code hangs when using a lot of GPU memory
  • Date: Sat, 02 Mar 2019 23:13:31 +0100
  • List-archive: <http://lists.gforge.inria.fr/pipermail/starpu-devel/>
  • List-id: "Developers list. For discussion of new features, code changes, etc." <starpu-devel.lists.gforge.inria.fr>

Hi,

On 2019-03-02 16:39, Samuel Thibault wrote:

> [...] I guess you are not using a lot of streams
> on the same CUDA device with STARPU_NWORKER_PER_CUDA, and not using a
> very large CUDA pipeline with STARPU_CUDA_PIPELINE? I also guess you are
> not using starpu_data_acquire on the CUDA GPU?

I have not touched the STARPU_NWORKER_PER_CUDA or STARPU_CUDA_PIPELINE environment variables during these test runs. The code does not contain any starpu_data_acquire() calls either. At one point I tried setting STARPU_CUDA_PIPELINE to 0, but it had no effect.


> Could you re-run with the latest gdbinit version where I added more
> details?

Here:

(gdb) starpu-memusage


Node 0:
Total used: 28, 190MiB
WT: 0, 0MiB
home: 0, 0MiB
OOC: 28, 190MiB
diduse: 28, 190MiB
redux: 0, 0MiB
relax: 0, 0MiB
noref: 0, 0MiB
normal: 28, 190MiB
owner: 26, 175MiB
shared: 2, 14MiB
invalid: 0, 0MiB
nosubdataref: 14, 94MiB
nodataref: 14, 94MiB
reading: 14, 95MiB
writing: 0, 0MiB
overwriting: 0, 0MiB

cached: 0, 0MiB


Node 1:
Total used: 1209, 958MiB
WT: 0, 0MiB
home: 0, 0MiB
OOC: 1209, 958MiB
diduse: 1203, 941MiB
redux: 0, 0MiB
relax: 2, 29MiB
noref: 0, 0MiB
normal: 1207, 929MiB
owner: 1205, 914MiB
shared: 2, 14MiB
invalid: 0, 0MiB
nosubdataref: 0, 0MiB
nodataref: 0, 0MiB
reading: 0, 0MiB
writing: 0, 0MiB
overwriting: 0, 0MiB

cached: 0, 0MiB
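
For comparing dumps like the one above across runs, here is a minimal Python sketch that turns the text into per-node dictionaries. It assumes every entry has the "label: number, sizeMiB" layout seen in this output and treats the first number as an entry count, which is my reading of the dump rather than something confirmed here. The name parse_memusage and the shortened sample are made up for illustration and are not part of StarPU.

```python
import re

# Assumed line layout, taken from the dump above: "<label>: <number>, <size>MiB".
ENTRY = re.compile(r"^\s*([\w ]+):\s*(\d+),\s*(\d+)MiB\s*$")
NODE = re.compile(r"^\s*Node (\d+):\s*$")


def parse_memusage(text):
    """Return {node_id: {label: (count, size_mib)}} for a starpu-memusage dump."""
    nodes = {}
    current = None
    for line in text.splitlines():
        node = NODE.match(line)
        if node:
            # A "Node N:" header starts a new per-node table.
            current = nodes.setdefault(int(node.group(1)), {})
            continue
        entry = ENTRY.match(line)
        if entry and current is not None:
            label, count, size = entry.groups()
            current[label.strip()] = (int(count), int(size))
    return nodes


# Shortened copy of the output above, used only as sample input.
sample = """\
Node 0:
Total used: 28, 190MiB
owner: 26, 175MiB
shared: 2, 14MiB

Node 1:
Total used: 1209, 958MiB
relax: 2, 29MiB
"""

usage = parse_memusage(sample)
for node_id, fields in sorted(usage.items()):
    count, size = fields["Total used"]
    print(f"node {node_id}: {count} entries, {size} MiB")
```

With two such dictionaries from different runs, the per-label differences for a node can be computed directly instead of comparing the dumps by eye.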

- Mirko



