Objet : Developers list for StarPU
Archives de la liste
- From: Maxim Abalenkov <maxim.abalenkov@gmail.com>
- To: Troels Henriksen <athas@sigkill.dk>
- Cc: starpu-devel@inria.fr
- Subject: Re: [starpu-devel] Trouble getting performance from a simple program
- Date: Tue, 10 Sep 2024 11:03:20 +0100
- Authentication-results: mail2-smtp-roc.national.inria.fr; spf=None smtp.pra=maxim.abalenkov@gmail.com; spf=Pass smtp.mailfrom=maxim.abalenkov@gmail.com; spf=None smtp.helo=postmaster@mail-wr1-f46.google.com
- Ironport-data: A9a23:fG0Pr6JTYkPworXdFE+RcpElxSXFcZb7ZxGr2PjKsXjdYENS1mRSz TcdWTqOM6vfZDf1e4gnb9mwoE0OuceGnNBlSwod+CA2RRqmi+KVXIXDdh+Y0wC6d5CYEho/t 63yTvGacajYm1eF/k/F3oDJ9CU6j+fSLlbFILasEjhrQgN5QzsWhxtmmuoo6qZlmtHR7zml4 LsemOWBfgb9s9JIGjhMsf7b+Uo35K2aVA4w5zTSW9gb5DcyqFFOVPrzFYnpR1PkT49dGPKNR uqr5NlVKUuEl/uFIorNfofTKiXmcJaKVeS9oiY+t5yZv/R3jndaPpDXmxYrQRw/Zz2hx7idw TjW3HC6YV9B0qbkwIzxX/TEes3X0GIvFLLveBCCXcKvI0LuNEPQ/ft3J0IKNNMm/d9zAThp9 +42N2VYBvyDr7reLLOTT+BtgoE8NpCuMt9B/H5nyj7dALAtRpWrr6fiv4cJmmdtwJoURLCCO aL1ahI3BPjESxhGKlYSTps0leOlgHr4bRVXrVuUoew85G27IAlZiei3a4eFJozRLSlTthbGl 0v07kL3ORY1CoCDwxSc30yxg9aayEsXX6pJSeTgqa806LGJ/UQIFBQMTUb+rfSnh0qWQMNaM 0VS+yw0rKF0+lbDczXmdxixoXrBoQRFHtQMTas17waCzqeS6AGcboQZctJfQJseiPE4XBcp6 lbXodTQATw2l7+HZ23Io994sgiOESQSKGYDYwoNQg0E/8TvrekPYvTnHocL/Emd3o2dJN3g/ w1muhTSkFn6sCLm/6Cy/FSCnCj145aVEEg64QLYWm/j5QR8DGJEW2BKwQmHhRqjBN/GJrVkg JTis5bEhAzpJc/U/BFhuM1XQNmUCw+taVUwe2JHEZg77CiK8HW+Z41W6zwWDB42aJtVIWG2P RSL6FI5CHpv0J2CPf4fj2WZW5RC8EQcPY20PhwpRoMRMsgvK1XflM2QTRTIgDyxzSDAbp3Ty b/ALJ/0UidEYUiW5DWxQOgZ3PcqwCt4rV4/triqpylLJYG2PSbPIZ9caAXmRrlgsMus/l+Jm /4BbJDi40sEAIXDjtz/q9F7waYidihlW/gbaqV/Koa+H+aRMDpwUKCKnu59I9ANcmY8vr6gw 0xRk3RwkDLX7UAr4y3RApy6QOq3B8gtnmFxJiE2I1ej1l4qZIvlvu9VdII6cfNjvKZvxOJ9B atNMciRIOV9ehKe8RQkbL75sNNDcjavjlmwJCaLWmU0UKNhYA3rweXaWDXT2hMANBfqivtmk YacjlvaZbEhWzVdCN3nbaPz7lGp4lkYtuFAf2rJBdhxfk/T3pBgAHHzhKVvIuUnCxbK9h2F3 Sm4XDYaoujspdcu0d/r3KqrkaagI9FcLGF7QVbJzO+RHjbI20ae2ql8afasURGBcXLr6YOgS P5wzfqhAMYYnV1PjZVwI4xrwY06+dHrgb1QlSZgI1nmcHWpDaFGMFCd/MwSqJBI+KBViTG2V m2L5NNeH7eDY+HhMVwJITsafvax7u4VlhbS/MYKDh3DvgEvx4W+UGJWIxWoow5eJuEsMIoan MEQiPRP4Am70hcXItKKix5PzFu1L1sCbr4Gs68LC4q6myspzVB/OabnMBHU263WSdtwMRgNG AS21Y7inLVXw3TQf0UjTUbt2fVvvrVQmRRo4mJbGXG3tIvkvNEV0idV0wwLdSVO7xAe0+tMK mlhbEJ0AqOV/gZXvstIXkHyOgRNGCym/lfVzn0Xnlb4VGitbHTGd0cmCNaO/WcY0mNSRSda9 7em01TYUS7mUcXy/ykqU2tnlqDHYflu0DbdweaLMt+gHZYoRRbE2IqVensuuR/rJegTlX/3j 7Bm08goYJKqKBNKhbMwDrer8Ig5STeGATdnau5g9qZYJlPsUmi+9hbWIn/gZ/4XAeLB9HK5L MlcJshvcRCa/wTWpxA5AZ88GZNFrMQL1vEjJIyyfXUntoGBpAVHqJjTryjypFE6SuVUzPoSF NnjSCKgIEex211vhG7/nOtVMDGZYP4FRjHG8sKbzeErL68H4cZQKRwc873spHiEEhpVzzTNt iP5WqLm5ehDy4NtoojSLpt+FziEcdPdaMnY8SSYkch/UtfUAMKf6yIXsgbGOipVD5swWvN2t 6i8j9rs+HPJrZMNCmX/t6icJfMY+/foTO5zN+TpJkJ7hgqHYtfnuDEYylC7KLtIsdJT3damT A2Gc/mNdcYZdtNe5X9NYQ1MOkw5J4WuSYm4vgK7jfCHKiZF4Dz9NNn9qEPYNzBKRBEHK7jVK 1HSuc/3wvt6sY4VJhsPJ88+Mq9COFW5BJcXLYzghwK5UFutrEiJ4Ib5tBwa7jrONHmIPeD67 b/BRTn8bB6Cg77J/v4Ir71NugArM1gljdkSZk49//tEuwK+BkMCLsUfNswiIbNQmSrQypr5R W/saE0PNCbDZglHIC7MuInbYgSiB+IwY4azYnRj+k6PcC65Cb+RGLYrpG8q/35yfSCl1+29b 80X/nrrJBWq35V1XqAp6+emhft8jObvrp7SFZsRT+Spa/rfPVkL6JClNA9EVCiCDNuU0UuXe y46QmdLREz9Qkn0eSqll7i5BzlB1A4DDR1xBctM/Dobk4qexexEjvb4PokfF5UdOd8SKudmq WzfHgOwDqP/5pDXkaQsstMtx6RzDJpn2yR8wLDLHWUvok171ojr0w7uU8bCoAHONTOzy2/gq wQ=
- Ironport-hdrordr: A9a23:OeuCIaFeHqmm5sqapLqExceALOsnbusQ8zAXPo5KOGRom62j5q WTdZEgvyMc5wxhPk3I9erwW5VoBEmsjKKdn7NhX4tKLTOW3FdARbsKheCD/9SjIVyFygc078 ZdmsNFebrN5DZB/L3HCcqDf+rIAuPrzEllv4jjJg9WPHxXgmhbgTtENg==
- Ironport-phdr: A9a23:Wp+2tRRQVq0vvU3uOEUPOrZeXtpsoq+WAWYlg6HPa5pwe6iut67vI FbYra00ygOSBcOCtaMP0bOempujcFJDyK7CikxKSIZLWR4BhJdetC0bK+nBN3fGKuX3ZTcxB sVIWQwt1Xi6NU9IBJS2PAWK8TW94jEIBxrwKxd+KPjrFY7OlcS30P2594HObwlSizewb69+I A+roQjQssQajpVuJrsswRbVv3VEfPhbymxvKV+PhRjw4du+8oB++CpKofIh8MBAUaT+f6smS LFTESorPWMo6sD1rBfPVQSA6GcSXWUQiRpIHhPK7ArmUZfrsyv1rfRy1S+HNsDrV780WDCi7 6B2SB/0jSoMKjA0/H3LhsF2kalWpg+qqR5izI7OeIybNPlwcb/Sct0HS2ROXNhcWTJPAo27d IYAFfYNPeNCoon9u1cDrx2zDhSsCuP1zT9Ig2f706og3OUuEAHG2RErEs4Vv3vOqNX6KqESU eOyzKnW0D7OaPZW1i3y6YjJdxAtu+uDUqlqfsrV10YvEQDFjlKOqYH+MDOV0/4Cs2mf7+Z6S e2vjGsnphh3rzOyycgilpPHiZgJylDY6yp52oA1KMWlRUN0f9OpEoVduiCbOoZrX84vTWFmt iY0xLAGtpO3YigExZYpyRPQavGLbouF7BLgWeqPLjp2im9pdrK7ihqs90Wr1+PyVs6x0FlQr ypFlMHBtnEL1xzJ68iIUOFx/km72TqXzQ/c8ftLIUQqmqrdKJ4hxaAwmoAOsUTABiP2l0T2j LOWdkU54eek8fjoYrLjppKaKoR6iRn+P7wwlsCjBek0KAsDUmiB9eiiyrHv4Ff1TbpLg/Euj KXWqozWKtoeq6KkHgBY3YQu5hOiADu4zNgVk3wKI05Zdx6bk4TkPk3BLfTjAvexhlmhkThmy vDYMbDkAJjNIGPMnKvkcLt58EFT1gQ+wNZC7J9KDrEBPenzWlPvu9zCExE5Mg21zPj/Bdlh0 I4VRHiBDbWDMKzItF+F/uIvLPeIZI8SoDv9LuIq5//qjXMgl14dZ7Sl0YIZaH23H/lqOUqZY X3rgtcOFWcFoBAyQ/DtiF2HSTJTZnCyULwg5jwjFo6qEYPOSpqujbGBxiu3AINaan1cBl2DE Hrkb4CEVO0NaCKWLM9hiDsEVb25Ro8kzx6vuw76y799IuXO4SIXr5bj28Zz5+3QkhEy6yd5D 8uY026XTmF0mnkESCMx3KB6uUB9zEyM3rBig/NEDdxT++9JUgAiOJLB1+B6Ecz9WhrdfteVT 1arWs6mAS0rQdIr298Ofl9xFMi8gRDdwSWqB6QYl6eLBZwx6qLTxWL9J8d7y3bc1akulUMqQ sVVNT7uuqkqyxLUCorS22WUkbyrdr8R2maZ/XuDxHGmp0BXUw90VePJXiZMSFHRqIHc71PDS veHA70jPw1CztDKfq5Detzsy1xMTfPuPtfZfUq+nm6xAVCDwbbaP9miQHkUwCiIUBtMqAsU5 3vTbWDWZw+kqmPaVnl1EE73Jlnr+q94oW+6SUk9y0eLaVdg3vy74E1dnuSSHtUU2L9MoyI9s 3NsBl/o2tvGCt/GrgBoeKhaYtQjyFhC3GPd8Qd6O8/oNLhs02YXaB8/pEbyz1NyA4REn9Itq SYvxRpzLOSR2lpFejWb2o7YNbjeK2209xeqOObNwl+L9tGQ9+8U7egg7VXuuAb8Dk0573Buy MVYyVOZ75TOSRUICNf/Dhhx+B99qLXXJCI64us4zFVKNq+5+n/H0tMtXq4+zwq4Os1YO+WCH RPzFMsTA46vLvYrkh6ndEBMOuca76MyM868EpnOkKe2IOZtmi6nhmVb8ch81EyL7S91Vu/P2 d4M3fiZ2gKNUzq0gk2mt4j7noVNZDdaGWTaq2CsAYdBZ6g0dIwCDWqoIMqp7tp7jp/pHXVf8 R/rBl8L3tOoZQvHd0b0jmgynQwcpX2qnzf9ziQhyWl46PrCmnaUmqK+KkliWCYDXmRpgFbyL JLhitkbWBLtdA01jF6/4k28waFHpaN5Jm2VQEFSfiGwIXsxN8n4/reEfcNL74sl9CtNV+HpK 1WTWr/45RIQ2ifuGWhZ2hg0cjirvtPymBkw2wf/ZD5j6WHUf81930KV4dXMRPsX1zAPQCR+i j7JLle5Ntitu96Tkt2Q14L2H3LkXZpVfy7xyIqGvybu/mxmDyq0mPWrk8HmGww3ucPi/+FjT j6A7BP1Y42xkr+/LfoiZE5jQln198t9HIh61Io2npAZn3YA1N2Z+n8OkGG7Nts+u+q2aXMdR DtNytTc5AXj20p9Bn2MzoP9EH6ax4NtasK7bWUfxi8mp5oSWeHEsfoex3Iz/gTwpBm0A7A1h joHzPoy9HMWy/oEvgYg1GTVA7wfG1VZIT25kh2J696kq6AELG2rcLW2yA9/hYX7VODE8lwaA iygPMt8TkoSpo1lPVnB0WP+8NThcdjUN5cIswGM1g3HhK5TIY4wkfwDgWxmP3j8tDsr0b1e7 1Qm0JeksYyAM2go8ri+B0sSMzTlZsZV8zvkiatYncGI94+qF5RlXD4MWdG7KJDgWCJXrvnhO wuURXc5o2eaHPzWFwab5UFlqGznHJWiNnXRL34ch4YHJlHVNAlUhwYaWy8/l5gyG1WxxcDvR 0x+4ygY+l/yrhYfgvItLRT0VX3T4Ru5cjphAobKNwJYt0sRgiWdedzb9O94GDtUu4GsvBDYY HLOfBxGVCkIQhDWXA2lZ+j2o4OcrK7AQbDiZ/rWPefQ9aoEDKzOnMz3lNMhpmfpVI3HP2E+X aNlnBMbBzYhXZyewW1HSjRLxXyTKZTH9VHsomsv6ZrnuPXzBFCwv83WV/0LYI8po1fv0cLhf 6aRnHordmoej8lRgyeOkP9GghYTk308LmH9V+1f6midCvqXwPYfDgZHOXosb40Rvv56hk8Vf peF77G9nr9g0qxvUwYDBQGnw5v5I5RNej71NUubVhzSavLbdXuSkpuxOeTlGPVRlLkG7UTu/ 2zAQgm4ZHLb0GC4MnLnefdFiCXRVPBHkKe6dBslSW3qTda8Lwa+LMcylzo9h7s9mnLNM2cYd zl6aUJE6LOKv2tehb1kFmpN42AAT6HMkjuF7+TeNpcdsOd6SiVymeVA5X0myrxTpChaTf1xk SHWo5ZguVaj2uWIzzNmVlJJpFMpzMqTul5+PKzC6pRacXPN/RZI8n/JThpW+Z1qDdrgv60Ww d/K1eryJDpE79PI7J4cCszTe6fleDIqNRvkHiKRDRNQF2b6czGCwRYHybfOqC7wzNBysJXnl ZsQR6UOUVU0EqlfEUF5BJkYJ5wxWDo4kLmdhcpO5HykrRCXSt8J2/KPHv+UH/jrLy6UyLdeY B5di7bxPYUWcITx0khvbFp9gqzFHkPRWZZGpSgrPWpW6A1dtWNzSGE+wRevcgS2/HoaDuK5h DYzgwp6JPs3rXLivwlxKV3Nqy891kI2nJ+25FLZOC60J6C2U4ZMDiPyvEVkKZL3TTF+agiql FBlPjPJL1q+p7RlfGFvzgTbvMkWcRa9ZahNYRtV2uvOIvt0jxJTrSKowUIB7uzAW8MKfO4Cf pulrnYG0AVmPoZdGA==
- Ironport-sdr: 66e01975_lxQCx2ivpkB0ia7S3UEvkeY519GPKuL/POIo6selOlINskM 9PPTfK391RilqgsDYHE66UsM5QTkMnuFLvVkFOw==
Hello Troels,
How are you? I hope all is well with you. There is a section in StarPU handbook on investigating performance issues:
It may give you a good start.
I’m looking at the Mandelbrot example from StarPU distribution 1.4.7 (./examples/mandelbrot/mandelbrot.c). It seems there is no need to take out the graphics commands. You may run the executable with -no-x11 option. This will omit all graphics-related processing. Otherwise, I believe this should be the efficient Mandelbrot program to study.
I hope this helps you. Thank you and have a great day ahead!
—
Best wishes,
Maxim
Maxim Abalenkov \\ maxim.abalenkov@gmail.com
+44 7 486 486 505 \\ www.maxim.abalenkov.uk
+44 7 486 486 505 \\ www.maxim.abalenkov.uk
On 9 Sep 2024, at 10:55, Troels Henriksen <athas@sigkill.dk> wrote:Hello starpu-devel, I am trying to understand how to write hybrid
applications. Specifically, I have adapted the mandelbrot example from
the STarPU distribution, taken out all the graphics stuff, and added
simplistic instrumentation. I want to start by understanding
partitioning of computation (with only trivial data).
This is my program so far:
https://github.com/diku-dk/hybrid-starpu/blob/main/mandelbrot/mandelbrot.c
However, I find that my program runs very slowly on a machine with
2xA100 GPUs and two 24-core CPUs. Specifically, using nvtop I see that
the program utilises both the CPUs and the GPUs, but that the GPUs
finish the work very quickly (as expected), and a lot of time is then
spent waiting on the CPUs. I have played around with increasing the
number of blocks (for overpartitioning), but it doesn't seem to make a
significant difference. So questions are:
1) Where can I read up on the basics of load balancing in StarPU?
2) Is there already an example of an efficient Mandelbrot program (or similar
compute-bound program) I can study?
- [starpu-devel] Trouble getting performance from a simple program, Troels Henriksen, 09/09/2024
- Re: [starpu-devel] Trouble getting performance from a simple program, Maxim Abalenkov, 10/09/2024
Archives gérées par MHonArc 2.6.19+.