Accéder au contenu.
Menu Sympa

starpu-devel - Re: [Starpu-devel] Performance with SOCL on multiple devices

Objet : Developers list for StarPU

Archives de la liste

Re: [Starpu-devel] Performance with SOCL on multiple devices


Chronologique Discussions 
  • From: Samuel Thibault <samuel.thibault@inria.fr>
  • To: Malcolm Roberts <malcolm.i.w.roberts@gmail.com>, starpu-devel@lists.gforge.inria.fr, "helluy@math.unistra.fr" <helluy@math.unistra.fr>, Bruno Weber <bruno.weber@axessim.fr>
  • Subject: Re: [Starpu-devel] Performance with SOCL on multiple devices
  • Date: Wed, 6 Jan 2016 17:16:46 +0100
  • List-archive: <http://lists.gforge.inria.fr/pipermail/starpu-devel/>
  • List-id: "Developers list. For discussion of new features, code changes, etc." <starpu-devel.lists.gforge.inria.fr>

Samuel Thibault, on Wed 06 Jan 2016 17:03:42 +0100, wrote:
> With these changes, I do see kernels running on various GPUs. It does
> not seem faster, but that is probably due to data transfers.

And also serialized kernels

> You will probably want to use FxT to dump traces and read them with
> Vite.

It notably seems that the DGMacroCellInterface kernels are completely
serialized, thus awfully bad performance due to no parallelism and
data transfers :) DGFlux does get parallelized, on the other hand. The
duration is however quite tiny (60µs), so it's really not efficient
compared with the runtime overhead.

Samuel




Archives gérées par MHonArc 2.6.19+.

Haut de le page