Skip to Content.
Sympa Menu

cado-nfs - Re: [Cado-nfs-discuss] Log: Aggregate statistics. Total CPU vs Intermediate CPU time data.

Subject: Discussion related to cado-nfs

List archive

Re: [Cado-nfs-discuss] Log: Aggregate statistics. Total CPU vs Intermediate CPU time data.


Chronological Thread 
  • From: Jason Parker-Burlingham <jasonp@panix.com>
  • To: cado-nfs-discuss@lists.gforge.inria.fr
  • Subject: Re: [Cado-nfs-discuss] Log: Aggregate statistics. Total CPU vs Intermediate CPU time data.
  • Date: Wed, 23 Sep 2020 08:34:44 -0400
  • Authentication-results: mail2-smtp-roc.national.inria.fr; spf=None smtp.pra=jason.parker.burlingham@gmail.com; spf=Pass smtp.mailfrom=jason.parker.burlingham@gmail.com; spf=None smtp.helo=postmaster@mail-qk1-f173.google.com
  • Ironport-phdr: 9a23:ly0vKBbSafENswpzun/2cVD/LSx+4OfEezUN459isYplN5qZpsy8bR7h7PlgxGXEQZ/co6odzbaP7Oa5Byddud7B6ClELMUSEUddyI0/pE8JOIa9E0r1LfrnPWQRPf9pcxtbxUy9KlVfA83kZlff8TWY5D8WHQjjZ0IufrymUoHdgN6q2O+s5pbdfxtHhCanYbN1MR66sRjdutMZjId/KKs90AbFr3pHd+lW2W9jOFafkwrh6suq85Nv7iVdt+g9+8JcVKnxYrg1Q6FfADk6KW4++dfltQPETQuB53scVnsZnx9VCAXb7x/0Q4n8vDLiuuVyxCeVM8v2TaspWTu59KdkVAXoiCYcODEn9mzcl9F9g7haoBKloBx/3pLUbYSIP/dwYq/RYdUXTndGUchMSixBGZu8YJUTAOodM+hYqIz9qEEPrRu4GAKgAOzixztNinLwwKY00fkuERve0Qw7EdwAsGraosj2OqccXu+60LTHwjfYYvNKxTvw8pTEfgw9rf2SW797bMrfyVMoFwPAllierZbqPyiS1uQLrmOV9fBvVfisi2E5sQF9uyWixto2ionNh4IVzF/E9SVjz4YwONa1T0x7bsS/EJtXsSGaL5B7Qts4TGFuuSY7xacKtoK8fCgP0ZkqwQPUZPObfYaS+B3jSPqRLilihHJjYL+xhwu//Eq+xuHgVMS51FlHoytYn9fCqn0AyRPe5tSJRPZ9/kmtxDmC2Q/R5+1aJU05lK7WJpw9zrA/lZces0bOEynrk0v4i6+Wc18r+umu6+n/ebXmuoKTN4xxig7jKKgunNKwAfggPggBWGib/uu81Ln98kLjWLVKj+Y2nrPFv5DdIMQWoLOyDQxN0os79RqzEzOr3M4bkHQHNl5JZgyLg5bzN13TIv31A++zjlCvnTtx2fzLMLnsDo/TInXNn7rtZ7hw51BaxQEu195Q/YhUBasEIP/rWk/+qtjYDhghPgyx2ennCdF92poHWWKMH6OVKa3SvUKK6+8hOeWMa4gVuDHyK/gh+fHil2M2mVgYfaWx3JsXbm63Huh+LkmHfXbhhs0NHGQKswYkUuDmlkOOXSROa3uwUa8w/jQ7B5inDYfHSICtmruB3CKjE51UfG9GEVeMEXHye4WcQfgMciGSLdF7kjAeTrehVpUh1RWvtA/8xLpnLvDb+igDuJLs0dh6/ezTlRUo+TNoCMSd1nmBT3tokWMQWz82wKd/rFRhxViZzKd4hOVUFdpd5/NSSws1KJ/cz/dmBNDvQQLBZcyESFKnQtW9HTExVcg9w9AAY0Z6AdWigQ7M0zCrA78TjbyLBYY7/rjS33jrdI5BzCPN3aUmyl8oWY5DNHatm7Vk3wzSHJLS1UqXkLyleOId2jTM/SGN1zmgpkZdBSx5SqLDFUgWfEjbt5HT+1nGTrmhCb0iel9DwNWFLLNictTvjEUAT/DmboeNK1mtknu9UE7bjoiHa5DnLjlMhXiPOA0/iwkWuE2+G00+CyOm+TyMCTVvERf2YBqp/7Ah8zW0SUg7ywzMZEpkheLspkwlwMeEQvZW5Yoq/T87omwtTlm41tPSTdGHolg5JfQOUZYG+F5CkFnhmUl4N52kIbplgwdHIQtytkLqkR5wD9cZnA==
  • List-archive: <http://lists.gforge.inria.fr/pipermail/cado-nfs-discuss/>
  • List-id: A discussion list for Cado-NFS <cado-nfs-discuss.lists.gforge.inria.fr>

On Tue, Sep 22, 2020 at 11:11:46AM +0200, eric.jeancolas@free.fr wrote:

> I made some further analyses on a log.
>
> * when a full run is done without interruptions, the overall WTC
> (elapsed time for entire factorization) is ok. But when I retart after
> an interrupt, this counter starts from the restart. That's why I used
> the Total cpu counter to have an idea of the cpu usage. In the log I
> studied, Total Usage / Total WTC is 3,68. I've got a 4-core computer,
> so it's not too weird. But it is not really exact. It's too much...
>
> * in the log, the following is reported : Info:Lattice Sieving: Total
> time: 666693s. What is this time? From my log, the WTC is 334221,78,
> close to half this value. When I looked at cpu usage with the top
> command, I saw two las tasks with two threads. Well, two las processes
> at 198% CPU.
>
> * when entries in the log are talking about WTC, the reported values
> are ok.
>
> * I have an entry with Info:Linear Algebra: Total cpu/real time for
> bwc: 90816.1/23293.6. It seems to be the sum of Krylov + Lingen +
> Mksol. It is true for WTC. So if there is a sum about total cpu, there
> should be total cpu value for each of Krylov and Mksol, shouldn't it?
> If not, I don't understand how the total is computed.
>
> * when I see Jason's log:
> https://stdkmd.net/nrr/cont/7/77377.htm#N135_C183 , I see a ratio
> about total statistics 2.37178e+08/2.53203e+06 = 93.67. It seems a
> lot, even for a AMD Ryzen Threadripper 2990WX 32-Core. That's why I
> made this log analysis about time values...

[Re-sending this message because it's a good idea to check the size of
logfile before attaching to email. Sorry if this is a near-duplicate,
Eric.]

I've uploaded a copy of my logfile for that factorization to
https://storage.googleapis.com/number-field-sieve/run/77377_135.log .

Total runtime was about 3 months, with numerous interruptions during
sieving, krylov and one failure during sqrt.

Some interruptions were because I was experimenting with thread counts,
others because of power failures, and the sqrt interruption because I
had to drop threads from 8 to 4 to not run out of memory.

In particular one interruption during krylov was checking that
utlization and s/iter did not improve by running 64 threads instead of
32. I also remmeber that total CPU utilization reported by htop during
krylov was fairly low, around 30 to 40%.

Whether this might be of particular help in understanding my CPU and
real time reports, I'm not sure. I do still have all the files
generated by the factorization, so if parameter snapshot files and so
forth are useful, I can also make those available.

jason




Archive powered by MHonArc 2.6.19+.

Top of Page