Subject: Developers list for StarPU
- From: Malcolm Roberts <malcolm.i.w.roberts@gmail.com>
- To: Samuel Thibault <samuel.thibault@inria.fr>, starpu-devel@lists.gforge.inria.fr, "helluy@math.unistra.fr" <helluy@math.unistra.fr>, Bruno Weber <bruno.weber@axessim.fr>
- Subject: Re: [Starpu-devel] Performance with SOCL on multiple devices
- Date: Wed, 6 Jan 2016 15:10:34 +0100
- List-archive: <http://lists.gforge.inria.fr/pipermail/starpu-devel/>
- List-id: "Developers list. For discussion of new features, code changes, etc." <starpu-devel.lists.gforge.inria.fr>
Hello,
Thanks for taking a look at this. The relevant branch is socl2 (or devel); the master branch doesn't include these optimisations. I also added the option in socl2 to include all of the devices on a platform, which is currently (sorry; it's a hack) activated by passing -DSOCL=ON to cmake.
Best,
~Malcolm
On 06/01/2016 14:44, Samuel Thibault wrote:
Hello,
It seems you do not use the CL_QUEUE_OUT_OF_ORDER_EXEC_MODE_ENABLE
property. IIRC, without it SOCL has to sequentialize all tasks queued
on the same command queue (as required by the OpenCL standard). To get
parallelism, you would need to set
CL_QUEUE_OUT_OF_ORDER_EXEC_MODE_ENABLE, and use events to synchronize
the kernels that need to depend on each other.
Samuel
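
To make the point above concrete, here is a toy model in plain C of the two queue behaviours. It is only a sketch, not OpenCL or SOCL code: `struct task`, `makespan_in_order` and `makespan_out_of_order` are hypothetical names made up for illustration, and the out-of-order model assumes there is always enough hardware to run every ready command at once. In real host code the queue property would be passed to clCreateCommandQueue and the dependencies would go in the event wait list of clEnqueueNDRangeKernel.

```c
#include <stddef.h>

#define MAX_DEPS 4

/* Hypothetical task record for illustration; not a StarPU or OpenCL type. */
struct task {
    double duration;        /* run time, in arbitrary units */
    size_t ndeps;           /* number of entries used in deps[] */
    size_t deps[MAX_DEPS];  /* indices of EARLIER tasks this one waits on */
};

/* In-order queue: each command starts only after the previous one ends,
 * so the total time is simply the sum of the durations. */
double makespan_in_order(const struct task *t, size_t n)
{
    double clock = 0.0;
    for (size_t i = 0; i < n; i++)
        clock += t[i].duration;
    return clock;
}

/* Out-of-order queue: a command waits only for the tasks in its own
 * wait list. Assumption: unlimited parallel execution resources.
 * finish[] must have room for n entries; it receives each end time. */
double makespan_out_of_order(const struct task *t, size_t n, double *finish)
{
    double end = 0.0;
    for (size_t i = 0; i < n; i++) {
        double start = 0.0;
        /* Start when the latest dependency has finished. */
        for (size_t d = 0; d < t[i].ndeps; d++) {
            double f = finish[t[i].deps[d]];
            if (f > start)
                start = f;
        }
        finish[i] = start + t[i].duration;
        if (finish[i] > end)
            end = finish[i];
    }
    return end;
}
```

For example, with three kernels of durations 2, 3 and 1, where the third waits on the first two, the in-order model takes 6 units while the out-of-order model takes 4, because the first two kernels overlap.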
--
http://malcolmiwroberts.com