Objet : Developers list for StarPU
Archives de la liste
- From: Maxim Abalenkov <maxim.abalenkov@gmail.com>
- To: Samuel Thibault <samuel.thibault@inria.fr>
- Cc: starpu-devel@lists.gforge.inria.fr
- Subject: Re: [Starpu-devel] [LU factorisation: gdb debug output]
- Date: Thu, 28 Dec 2017 19:22:16 +0200
- Authentication-results: mail3-smtp-sop.national.inria.fr; spf=None smtp.pra=maxim.abalenkov@gmail.com; spf=Pass smtp.mailfrom=maxim.abalenkov@gmail.com; spf=None smtp.helo=postmaster@mail-lf0-f50.google.com
- Ironport-phdr: 9a23:zP9FABNxjNAibJu3K04l6mtUPXoX/o7sNwtQ0KIMzox0Ivv8rarrMEGX3/hxlliBBdydt6odzbKO+4nbGkU4qa6bt34DdJEeHzQksu4x2zIaPcieFEfgJ+TrZSFpVO5LVVti4m3peRMNQJW2aFLduGC94iAPERvjKwV1Ov71GonPhMiryuy+4ZLebxlViDanfb9+MAi9oBnMuMURnYZsMLs6xAHTontPdeRWxGdoKkyWkh3h+Mq+/4Nt/jpJtf45+MFOTav1f6IjTbxFFzsmKHw65NfqtRbYUwSC4GYXX3gMnRpJBwjF6wz6Xov0vyDnuOdxxDWWMMvrRr0vRz+s87lkRwPpiCcfNj427mfXitBrjKlGpB6tvgFzz5LIbI2QMvd1Y6HTcs4ARWdZXshfVDBODYyhYYUBDeUPI+hYoY7mqlQUsRe+ABOhBOPzxjJKgHL9wK000/4mEQHDxAEuGMgBsXvVrNXwKawfTfu6w7PMzTrdafNW2jD96JbVeR0mpPGDR71xccvUyUU1DQzKlU+fqYj7MDOT1eQNtHab4vFvVeKqkWEnqgVxriKzyccrj4nEn4QYwU3K+yV+xYY6P9y4SEhjbN6rEZtQqyWaOJFtTc84QmFovyA3waAFt56jZCUG1ogryhrFZ/GEc4WE+AzvWPuRLDtihH9pZKqzihSx/ES6yeDxWdO43EhEoyZfk9TBtnYA3AHJ5MedUPty5EKh1C6P1w/N7uFEJlg5la/BJJ4gxr48j54TsUrfEiPvlkX7jLOael8r+uiv7OTnbbHmqYGGO4BojQH+N7wims25AesmLggDR3aX9fi42bH5/kD0QK9GguMrnqTZqpzXJdgXqra8AwBP04Yj7xi/Dy2h0NQdhXQHKUhKeR2Gj4jsIV3BOuv3Au27g1uyljdrxPfGPqP6D5XCK3jMirbhfbJn50FAzwozyMhT54hIBbEZPPLzRkjxucTDDh8lKQO02f7nCMhk2owDR22PHLGWMKfJvF+M5+IvOPWMZJQPtDbyJfgl4OTujXAnllMHfKmp24EXaHGiEfh8LUWZeymkvtBUKm4Powc6BNDqiVeLTDpPLyKpVqch6zV9FIKnB47eQpyFgbqb3S79EIcANU5cDVXZNH74doDMcv0FbC+UIch/2mgBXKigTsko3xiquQn5zKRPIe/d+ylevpXmgosmr9bPnA0/oGQnR/+W1HuAGiQtxjsF
- List-archive: <http://lists.gforge.inria.fr/pipermail/starpu-devel/>
- List-id: "Developers list. For discussion of new features, code changes, etc." <starpu-devel.lists.gforge.inria.fr>
Hello Samuel,
Yes, you are right. I’m not using GPUs at the moment. It is purely aCPUs code. My understanding was that once I register the data two times: (i) for panels and (ii) for tiles I will request double the memory:
// Initialize panel and tile handles
for (int j = 0; j < A->nt; j++) {
int naj = plasma_tile_nview(*A, j);
// Initialise panel handle
starpu_matrix_data_register(&((A->panel_handles)[j]),
STARPU_MAIN_RAM,
(uintptr_t) plasma_tile_addr(*A, 0, j),
A->m, A->m, naj, eltsize);
// Invalidate panel handle
starpu_data_invalidate(A->panel_handles[j]);
// Initialise tile handle
for (int i = 0; i < A->mt; i++) {
int ldai = plasma_tile_mmain(*A, i);
int mai = plasma_tile_mview(*A, i);
starpu_matrix_data_register(&((A->tile_handles)[j*A->mt + i]),
STARPU_MAIN_RAM,
(uintptr_t) plasma_tile_addr(*A, i, j),
ldai, mai, naj, eltsize);
}
}
for (int j = 0; j < A->nt; j++) {
int naj = plasma_tile_nview(*A, j);
// Initialise panel handle
starpu_matrix_data_register(&((A->panel_handles)[j]),
STARPU_MAIN_RAM,
(uintptr_t) plasma_tile_addr(*A, 0, j),
A->m, A->m, naj, eltsize);
// Invalidate panel handle
starpu_data_invalidate(A->panel_handles[j]);
// Initialise tile handle
for (int i = 0; i < A->mt; i++) {
int ldai = plasma_tile_mmain(*A, i);
int mai = plasma_tile_mview(*A, i);
starpu_matrix_data_register(&((A->tile_handles)[j*A->mt + i]),
STARPU_MAIN_RAM,
(uintptr_t) plasma_tile_addr(*A, i, j),
ldai, mai, naj, eltsize);
}
}
Please see the above code. Therefore, now instead of “double” data registration, I would like to stick with tiles only, and provide panels as an abstract “container” entity. They will also help me to get the task dependencies right. I just don’t know how to unpack the buffers for tiles and pivots. (Please see my previous email for an example). Can you provide a simple example please? I couldn’t find anything similar in the examples directory. Thank you.
—
Best wishes,
Maxim
On 28 Dec 2017, at 19:13, Samuel Thibault <samuel.thibault@inria.fr> wrote:Hello,
Maxim Abalenkov, on jeu. 28 déc. 2017 18:05:22 +0200, wrote:I hope all is well with you. I need your help please. I have successfully implemented a panel-wise version of the LU factorisation in StarPU using the "multiple views" approach. However, multiple views create a drawback of using twice as much memory compared to the tiles approach (when the matrix is divided into square or rectangular blocks). The memory for the tiles and the memory for the panel views.
Err, but IIRC you are not using GPUs? Multiple views do not bring more
memory consumption, they only bring different ways for looking at the
same data.
Samuel
- Re: [Starpu-devel] [LU factorisation: gdb debug output], (suite)
- Re: [Starpu-devel] [LU factorisation: gdb debug output], Maxim Abalenkov, 20/12/2017
- Re: [Starpu-devel] [LU factorisation: gdb debug output], Samuel Thibault, 20/12/2017
- Re: [Starpu-devel] [LU factorisation: gdb debug output], Maxim Abalenkov, 20/12/2017
- Re: [Starpu-devel] [LU factorisation: gdb debug output], Samuel Thibault, 20/12/2017
- Re: [Starpu-devel] [LU factorisation: gdb debug output], Maxim Abalenkov, 20/12/2017
- Re: [Starpu-devel] [LU factorisation: gdb debug output], Maxim Abalenkov, 20/12/2017
- Re: [Starpu-devel] [LU factorisation: gdb debug output], Samuel Thibault, 20/12/2017
- Re: [Starpu-devel] [LU factorisation: gdb debug output], Maxim Abalenkov, 21/12/2017
- Re: [Starpu-devel] [LU factorisation: gdb debug output], Maxim Abalenkov, 28/12/2017
- Re: [Starpu-devel] [LU factorisation: gdb debug output], Samuel Thibault, 28/12/2017
- Re: [Starpu-devel] [LU factorisation: gdb debug output], Maxim Abalenkov, 28/12/2017
- Re: [Starpu-devel] [LU factorisation: gdb debug output], Maxim Abalenkov, 28/12/2017
- Re: [Starpu-devel] [LU factorisation: gdb debug output], Samuel Thibault, 28/12/2017
- Re: [Starpu-devel] [LU factorisation: gdb debug output], Maxim Abalenkov, 28/12/2017
- Re: [Starpu-devel] [LU factorisation: gdb debug output], Samuel Thibault, 28/12/2017
- Re: [Starpu-devel] [LU factorisation: gdb debug output], Maxim Abalenkov, 20/12/2017
- Re: [Starpu-devel] [LU factorisation: gdb debug output], Samuel Thibault, 28/12/2017
Archives gérées par MHonArc 2.6.19+.