Subject: Developers list for StarPU
List archives
- From: XAVIER LACOSTE <xavier.lacoste@eviden.com>
- To: Samuel Thibault <samuel.thibault@inria.fr>
- Cc: "starpu-devel@inria.fr" <starpu-devel@inria.fr>, MATTHIEU KUHN <matthieu.kuhn@eviden.com>
- Subject: RE: [starpu-devel] Late data transfers
- Date: Wed, 28 Aug 2024 09:49:59 +0000
Hello,
Thanks for your answer,
My answers are inline in the text below.
I'll need to double-check that there is no synchronisation during submission. There shouldn't be any, but there may be an error on our side.
Regards,
XL.
From: Samuel Thibault <samuel.thibault@inria.fr>
Sent: Thursday, 22 August 2024 15:13
To: XAVIER LACOSTE <xavier.lacoste@eviden.com>
Cc: starpu-devel@inria.fr <starpu-devel@inria.fr>
Subject: Re: [starpu-devel] Late data transfers
Hello,
It would be useful to provide the Paje traces; they contain much more
information than what is displayed by default.
I think I overwrote the traces; I could try to reproduce them.
I have since moved to another approach (allocate larger tiles and expose sub-parts to StarPU, while gathering them in macro-tasks).
XAVIER LACOSTE, on Wed. 07 Aug. 2024 13:23:57 +0000, wrote:
> On the attached trace of an LU factorization,
Which implementation is this? Chameleon?
It is a different implementation, not Chameleon.
> GETRF (black) and TRSMs (blue)
> are executed on CPU except for the first ones in row and column and GEMMs are
> all batched on the GPU (except the first one, which is on the GPU but not batched).
> Here are some observations on the trace that I can't explain:
>
> • The TRSMs on host wait until the block is sent to GPU to start (while they
> could start just after it).
> • data transfers from host to device do not start immediately after GETRF
I'm not sure what exactly you refer to. Do you refer to the red piece
just after the black GETRF? I don't know what that is but that is
surprising indeed. That being said, I see at the bottom a lot of task
submissions right there, and the number of submitted tasks only grows
there, so it looks like you have some spurious synchronization in your
submission thread?
I will double-check, but I thought there was no control on the submission flow and that all tasks should be submitted at the beginning.
> nor after each TRSM.
It also seems that you have task submissions showing up there at the
bottom, so probably more spurious synchronization?
> • data transfers from device to host, before the next GETRF, do not occur
> immediately after the tiny GEMM on the GPU (just after the TRSMs on the GPU)
Again, this looks to me like spurious synchronization in the submission
loop.
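The effect of such a synchronization can be sketched with a toy model. This is not StarPU code: the function `simulate`, its parameters, and the single-worker assumption are all hypothetical and chosen only for illustration. The model measures, for each task, how long the runtime knew about the task before it started, which is the window a runtime could use to move data ahead of time.

```python
def simulate(n_tasks, submit_cost, task_time, sync_every=None):
    """Toy one-worker runtime (hypothetical model, not the StarPU API).

    Returns (submit_times, start_times). If sync_every is set, the
    submission thread blocks after every sync_every tasks until the
    worker has finished everything submitted so far.
    """
    clock = 0          # time seen by the submission thread
    worker_free = 0    # time at which the single worker becomes idle
    submit_times, start_times = [], []
    for i in range(n_tasks):
        clock += submit_cost                 # cost of submitting one task
        submit_times.append(clock)
        start = max(clock, worker_free)      # task runs once submitted and worker idle
        start_times.append(start)
        worker_free = start + task_time
        if sync_every and (i + 1) % sync_every == 0:
            # spurious blocking wait: submission stalls until the worker drains
            clock = max(clock, worker_free)
    return submit_times, start_times


def lookahead(submit_times, start_times):
    """Time between a task becoming known and the task starting."""
    return [start - sub for sub, start in zip(submit_times, start_times)]


print(lookahead(*simulate(6, 1, 10)))                # [0, 9, 18, 27, 36, 45]
print(lookahead(*simulate(6, 1, 10, sync_every=1)))  # [0, 0, 0, 0, 0, 0]
```

In this model, a blocking wait inside the submission loop leaves no lookahead at all: each task becomes known only when it could already run, so any transfer it needs can only start late.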
> I use 2 streams on the CUDA device because there are small idle times between
> each kernel...
Are there really? How do you see that? StarPU already pipelines kernel
submission, by submitting kernel number n+1 while kernel n is running.
There are tiny red pieces between the kernels, which are sometimes a little bigger, and using 2 streams increases the performance.
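One way both observations could hold at once is shown by the toy timeline below. It is a sketch under stated assumptions, not a description of StarPU internals: `idle_gaps`, `launch_overhead` and `kernel_time` are hypothetical names, and the model assumes a single stream where the host needs a fixed time to prepare each kernel.

```python
def idle_gaps(n_kernels, launch_overhead, kernel_time, pipelined):
    """Idle time on the device before each kernel after the first.

    Hypothetical single-stream model: when pipelined, the host prepares
    kernel n+1 as soon as kernel n has started; otherwise it waits for
    kernel n to finish before preparing the next one.
    """
    gaps = []
    start = launch_overhead                  # kernel 0 starts once it is prepared
    for _ in range(1, n_kernels):
        end = start + kernel_time
        prepare_from = start if pipelined else end
        ready = prepare_from + launch_overhead
        next_start = max(ready, end)         # device must be free and kernel ready
        gaps.append(next_start - end)
        start = next_start
    return gaps


print(idle_gaps(4, 2, 10, pipelined=False))  # [2, 2, 2]
print(idle_gaps(4, 2, 10, pipelined=True))   # [0, 0, 0]
print(idle_gaps(4, 2, 1, pipelined=True))    # [1, 1, 1]
```

Under these assumptions, pipelining hides the preparation cost for long kernels, but a gap remains whenever a kernel is shorter than the time needed to prepare the next one, which is a case where a second stream could plausibly help.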
Samuel