starpu-devel - [starpu-devel] Trouble getting performance from a simple program

Subject: Developers list for StarPU

  • From: Troels Henriksen <athas@sigkill.dk>
  • To: starpu-devel@inria.fr
  • Subject: [starpu-devel] Trouble getting performance from a simple program
  • Date: Mon, 09 Sep 2024 11:55:19 +0200

Hello starpu-devel,

I am trying to understand how to write hybrid applications.
Specifically, I have adapted the mandelbrot example from the StarPU
distribution, removed all the graphics code, and added simplistic
instrumentation. I want to start by understanding how computation is
partitioned (with only trivial data).

This is my program so far:
https://github.com/diku-dk/hybrid-starpu/blob/main/mandelbrot/mandelbrot.c

However, I find that my program runs very slowly on a machine with
two A100 GPUs and two 24-core CPUs. Specifically, using nvtop I see that
the program utilises both the CPUs and the GPUs, but that the GPUs
finish their work very quickly (as expected), and a lot of time is then
spent waiting on the CPUs. I have tried increasing the number of blocks
(for overpartitioning), but it does not seem to make a significant
difference. My questions are:

1) Where can I read up on the basics of load balancing in StarPU?

2) Is there already an example of an efficient Mandelbrot program (or similar
compute-bound program) I can study?
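
As a rough illustration of the reasoning behind the overpartitioning
attempt described above, here is a toy model in plain C. It is not
StarPU code and makes no claim about how any StarPU scheduler behaves:
the function name greedy_makespan, the shared-queue greedy dispatch, and
the fixed per-worker speeds are all assumptions made for the sketch.

```c
#include <assert.h>
#include <stddef.h>

/* Toy model (hypothetical, not StarPU code): total_work is split into
 * n_blocks equal blocks, and each block goes to whichever worker becomes
 * idle first. speed[i] is the work per time unit of worker i.
 * Returns the time at which the last worker finishes. */
double greedy_makespan(double total_work, size_t n_blocks,
                       const double *speed, size_t n_workers)
{
    assert(n_blocks > 0 && n_workers > 0 && n_workers <= 64);
    double busy_until[64] = {0.0};
    double block = total_work / (double)n_blocks;
    double makespan = 0.0;

    for (size_t b = 0; b < n_blocks; b++) {
        /* Pick the worker that becomes idle earliest (ties: lowest index). */
        size_t w = 0;
        for (size_t i = 1; i < n_workers; i++)
            if (busy_until[i] < busy_until[w])
                w = i;
        busy_until[w] += block / speed[w];
        if (busy_until[w] > makespan)
            makespan = busy_until[w];
    }
    return makespan;
}
```

In this model, with speeds {10.0, 1.0} (one fast and one slow worker) and
a total_work of 110, two blocks finish at time 55 against an ideal of 10,
because the slow worker holds half the work. With 110 blocks the finishing
time stays within one slow-worker block (1 time unit) of the ideal. If a
real run does not improve with more blocks, this suggests the tasks are
being placed differently from such a greedy queue.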


