Accéder au contenu.
Menu Sympa

starpu-devel - [Starpu-devel] StarPU changeable sizes and StarPU purge

Objet : Developers list for StarPU

Archives de la liste

[Starpu-devel] StarPU changeable sizes and StarPU purge


Chronologique Discussions 
  • From: Rabab Al-omairy <rabab.omairy@kaust.edu.sa>
  • To: starpu-devel@lists.gforge.inria.fr
  • Cc: "hatem.ltaief" <Hatem.Ltaief@kaust.edu.sa>
  • Subject: [Starpu-devel] StarPU changeable sizes and StarPU purge
  • Date: Sun, 1 Dec 2019 22:03:43 +0300
  • Authentication-results: mail3-smtp-sop.national.inria.fr; spf=None smtp.pra=rabab.omairy@kaust.edu.sa; spf=Pass smtp.mailfrom=rabab.omairy@kaust.edu.sa; spf=None smtp.helo=postmaster@mail-pg1-f170.google.com
  • Ironport-phdr: 9a23:jj814xJh0cGqGpYiuNmcpTZWNBhigK39O0sv0rFitYgeK/vxwZ3uMQTl6Ol3ixeRBMOHsqkC0rWN+Pm6ACQp2tWoiDg6aptCVhsI2409vjcLJ4q7M3D9N+PgdCcgHc5PBxdP9nC/NlVJSo6lPwWB6nK94iQPFRrhKAF7Ovr6GpLIj8Swyuu+54Dfbx9HiTagb75+Ngi6oAffu8UZnIduNLs9wQbVr3VVfOhb2XlmLk+JkRbm4cew8p9j8yBOtP8k6sVNT6b0cbkmQLJBFDgpPHw768PttRnYUAuA/WAcXXkMkhpJGAfK8hf3VYrsvyTgt+p93C6aPdDqTb0xRD+v4btnRAPuhSwaMTMy7WPZhdFqjK9Drx2vpxJxzY3Jbo+LKPVzZbnScc8ASGdbQspdSy5MD4WhZIUPFeoBOuNYopHhqVQUthSxHxKjD/7zxz9Mn3P22qg63P4gEQHc3A0rAtUDsG7Oo9ruLqgdSvq5zK7NzTrZbvNWwzf85ZPLch87vPGMRq5wfdDPxkYyCgPJlE6Qppb8Mj6Ty+8DvW+b7+96WuKujW4qswBxoj61xsctkIbFnIwVykrC+C5kw4g1PcW1RFBnbdOgCpddtCGXO5FoTs8/Q2xkojs2x78atZO9YSME0o4oxwTFZPyCa4WI4gzsVOKWITpggXJqYrO/hxKr/Ui4xe3wS9C40FhXoidHjtXArH8N1xvU6siITvty4F2t1iqI1wDW8u1EIEY0mrTHK5M53LI8ip4evV7AEyL2gkn6krKae0U+9uS16enrfq3qppqGOI91jgH+PL4umsu6AekgKAgBRXOU9v6n27H5+k35Ra9FjvwskqjZq53aJt4UprOiAw9JyIos9QuwDyq83NQDh3YLNEhFdw6fj4j1J1HOJ+j1Deyjg1u2ijdr3+3GMqT8DZrTMHjMjrPhcK15605d0woz0ctf64hbCrEPJ/LzQEDxu8LCAh84NQy02efnB89n2oMQQ2LcSpOeZbjOuEWQ+6cjLveBYKcRuS3hML4q6fn0gnJ/mFkHfKDv04FERmq/G6FcKk+QazLFhpIGFG4LpBt2GPbjglSIFzlWbneoToo/7ywgFMSrAZqFXYn70+/J5zuyApADPjMOMVuLC3q9L9zYCcdJUzqbJ4paqhJBTaKoEtNz1hizrxO8xrZ6aPHdqHVB6MDTkeNt7uiWrikcsDx5C8PHjTOIRmBw23sHHno4hf8n50N6zViH3O5zhPkKTYUCtcMMaR8zMNvn98I/DtnzXgzbedLQFASlS8i9HXc8Qs932NZcOks=
  • List-archive: <http://lists.gforge.inria.fr/pipermail/starpu-devel/>
  • List-id: "Developers list. For discussion of new features, code changes, etc." <starpu-devel.lists.gforge.inria.fr>

Dear StarPU developer,

I am Rabab Alomairy, PhD student working with Hatem Ltaief at KAUST. I would like to have answers to the following two problems:

First issue:
I am working on hierarchical matrices and I am running onto some problem related to changing  sizes after matrix compression. More precisely, hierarchical matrices are divided up onto tiles. I am using starpu_matrix_interface for defining each tile of the matrix. Initially, all tiles contain the same number of rows and columns. However, after matrix compression each tile ends up having different sizes. In order to achieve this on shared memory, I am using the same recommendation that you proposed later on: https://lists.gforge.inria.fr/pipermail/starpu-devel/2019-June/002646.html. Therefore, I realloc interface pointer and change number of column. Shared memory results having very promising outcomes. However, on distributed memory on which tiles are 2D cyclically distributed among mpi nodes  I have  the following error:

[1][starpu_mpi][_starpu_mpi_test_detached_requests][assert failure] MPI_Test returning MPI_ERR_TRUNCATE: message truncated
time_zgetrf_tile: starpu_mpi.c:1153: _starpu_mpi_test_detached_requests: Assertion `req->ret == 0' failed.


How could I resolve this issue?


Second issue:
I am running LU algorithm on low rank matrix on distributed memory architecture (CPU only). For some matrix size which I can say matrix with medium size, I have the following message which cause application to stuck and not continue execution.

[starpu][starpu_memchunk_tidy] Low memory left on node RAM 0 (6405MiB over 128808MiB). Your application data set seems too huge to fit on the device, StarPU will cope by trying to purge 6474 MiB out. This message will not be printed again for further purges. The thresholds can be tuned using the STARPU_MINIMUM_AVAILABLE_MEM and STARPU_TARGET_AVAILABLE_MEM environment variables.
[starpu][_starpu_memory_reclaim_generic] Not enough memory left on node RAM 0. Your application data set seems too huge to fit on the device, StarPU will cope by trying to purge 32202 MiB out. This message will not be printed again for further purges



What could cause this issue?, What can I do to resolve this issue? and How could I avoid this issue?


Please let me know if you need any more details about those questions.
Thank you,
Rabab

--

Best Regards,

Rabab Alomairy


PhD Student

King Abdullah University of Science and Technology

Kingdom of Saudi Arabia
Extreme Computing Research Center
Al-Khawarizmi (Bldg.1)
Floor 0, Office 0203-WS06 


Mobile KSA: +966 (0) 548884792

E-mail: rabab.omairy@kaust.edu.sa






This message and its contents, including attachments are intended solely for the original recipient. If you are not the intended recipient or have received this message in error, please notify me immediately and delete this message from your computer system. Any unauthorized use or distribution is prohibited. Please consider the environment before printing this email.

  • [Starpu-devel] StarPU changeable sizes and StarPU purge, Rabab Al-omairy, 01/12/2019

Archives gérées par MHonArc 2.6.19+.

Haut de le page