[$] A parallel path for GPU restore in CRIU
Tuesday 17 June 2025, 20:02, by LWN.net
The fundamental concept of checkpoint/restore is elegant: capture a process's state and resurrect it later, perhaps elsewhere. Checkpointing meticulously records a process's memory, open files, CPU state, and more into a snapshot. Restoration then reconstructs the process from this state. This established technique faces new challenges with GPU-accelerated applications, where low-latency restoration is crucial for fault tolerance, live migration, and fast startups. Recently, the restore process for AMD GPUs has been redesigned to eliminate substantial bottlenecks.
https://lwn.net/Articles/1024747/