Accéder au contenu.
Menu Sympa

starpu-devel - Re: [Starpu-devel] Performance with SOCL on multiple devices

Objet : Developers list for StarPU

Archives de la liste

Re: [Starpu-devel] Performance with SOCL on multiple devices


Chronologique Discussions 
  • From: Malcolm Roberts <malcolm.i.w.roberts@gmail.com>
  • To: Samuel Thibault <samuel.thibault@inria.fr>, starpu-devel@lists.gforge.inria.fr, "helluy@math.unistra.fr" <helluy@math.unistra.fr>, Bruno Weber <bruno.weber@axessim.fr>
  • Subject: Re: [Starpu-devel] Performance with SOCL on multiple devices
  • Date: Wed, 6 Jan 2016 15:10:34 +0100
  • Authentication-results: mail3-smtp-sop.national.inria.fr; spf=None smtp.pra=malcolm.i.w.roberts@gmail.com; spf=Pass smtp.mailfrom=malcolm.i.w.roberts@gmail.com; spf=None smtp.helo=postmaster@mail-wm0-f44.google.com
  • Ironport-phdr: 9a23:XdqsaxGXsSQNgrICWuG3yp1GYnF86YWxBRYc798ds5kLTJ7/psiwAkXT6L1XgUPTWs2DsrQf27SQ6/iocFdDyKjCmUhKSIZLWR4BhJdetC0bK+nBN3fGKuX3ZTcxBsVIWQwt1Xi6NU9IBJS2PAWK8TWM5DIfUi/yKRBybrysXNWC0YLvj6ibwN76XUZhvHKFe7R8LRG7/036l/I9ps9cEJs30QbDuXBSeu5blitCLFOXmAvgtI/rpMYwuwwZgf8q9tZBXKPmZOx4COUAVHV1e1wysfbisATOSUOz5noWW3gSjlIcGAHA8Rz+GIvxsyHzq+5h8CicJ8z/C74uD3Dq9LtiUgf1zSsKKTM92GXWkdBryq1Vpwiuqlp+xZTVacebLr42NKPQedcHVEJEV9pRETFHA5igdYJJAfBLdbJYoo35ukomqBKiBU+xAuL01iVBwHHsi+ly+f48FwaO4wEqEshG5H7OsdPycrgfW+2v5KDYwC7YbvYQ1y2ru6bSdRV0hPiIVrV0dILrzkMiDR/exgGVoIjuODeQkOVLrC6B6OBtT/60o2EiogB15DOow5F/2cHymosJxwWcpm1Cy4EvKIj9ERYjbA==
  • List-archive: <http://lists.gforge.inria.fr/pipermail/starpu-devel/>
  • List-id: "Developers list. For discussion of new features, code changes, etc." <starpu-devel.lists.gforge.inria.fr>

Hello,

Thanks for taking a look at this. The relevant branch is socl2 (or devel); the master branch doesn't include these optimisations. I also added the option in socl2 to include all of the devices on a platform, which is currently (sorry; it's a hack) activated by passing -DSOCL=ON to cmake.

Best,

~Malcolm

On 06/01/2016 14:44, Samuel Thibault wrote:
Hello,

It seems you do not use the CL_QUEUE_OUT_OF_ORDER_EXEC_MODE_ENABLE
property. IIRC, this makes SOCL have to sequentialize all tasks queued
on the same command queue (required by the OpenCL standard). To get
parallelism, you would need to set
CL_QUEUE_OUT_OF_ORDER_EXEC_MODE_ENABLE, and use events to synchronize
the kernels that need to depend on each other.

Samuel

--
http://malcolmiwroberts.com





Archives gérées par MHonArc 2.6.19+.

Haut de le page