<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Kubernetes Contributors – WG Checkpoint Restore</title><link>https://deploy-preview-776--kubernetes-contributor.netlify.app/community/community-groups/wg/checkpoint-restore/</link><description>Recent content in WG Checkpoint Restore on Kubernetes Contributors</description><generator>Hugo -- gohugo.io</generator><language>en</language><atom:link href="https://deploy-preview-776--kubernetes-contributor.netlify.app/community/community-groups/wg/checkpoint-restore/index.xml" rel="self" type="application/rss+xml"/><item><title>Community: WG Checkpoint Restore Charter</title><link>https://deploy-preview-776--kubernetes-contributor.netlify.app/community/community-groups/wg/checkpoint-restore/charter/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>https://deploy-preview-776--kubernetes-contributor.netlify.app/community/community-groups/wg/checkpoint-restore/charter/</guid><description>
&lt;h1 id="wg-checkpoint-restore-charter">WG Checkpoint Restore Charter&lt;/h1>
&lt;p>This charter adheres to the conventions described in the &lt;a href="https://github.com/kubernetes/community/blob/master/committee-steering/governance/README.md"
target="_blank" rel="noopener">Kubernetes Charter README&lt;/a>
and uses
the Roles and Organization Management outlined in &lt;a href="https://github.com/kubernetes/community/blob/master/committee-steering/governance/wg-governance.md"
target="_blank" rel="noopener">wg-governance&lt;/a>
.&lt;/p>
&lt;h2 id="scope">Scope&lt;/h2>
&lt;p>The Checkpoint/Restore Working Group aims to solve the problem of transparently
checkpointing and restoring workloads in Kubernetes, a &lt;a href="https://github.com/kubernetes/enhancements/issues/2008"
target="_blank" rel="noopener">functionality discussed
for over five years&lt;/a>
. The group will deliver the design and
implementation of Checkpoint/Restore functionality in Kubernetes, serving as a
central hub for community information and discussion. This initiative addresses
a wide range of problems, including fault tolerance, improved resource
utilization, and accelerated application startup times.&lt;/p>
&lt;h3 id="in-scope">In scope&lt;/h3>
&lt;ul>
&lt;li>Identify core Kubernetes checkpoint/restore use cases (e.g., live migration,
fault tolerance, debugging, snapshotting) and gather stakeholder requirements.&lt;/li>
&lt;li>Investigate and propose Kubernetes APIs for checkpoint/restore operations.&lt;/li>
&lt;li>Work with SIGs for the best integration of checkpoint/restore functionality
and APIs.&lt;/li>
&lt;li>Provide guidance for developers on checkpoint-friendly app design and
recommendations for operators on feature management.&lt;/li>
&lt;li>Work closely with relevant upstream projects (CRI-O, containerd, CRIU, gVisor)
for alignment and integration.&lt;/li>
&lt;li>Revisit the existing implementations to find and remedy possible inefficiencies.
One example is the existing checkpoint archive format which has already been
identified as being a major source of slowdown.&lt;/li>
&lt;/ul>
&lt;h3 id="out-of-scope">Out of scope&lt;/h3>
&lt;ul>
&lt;li>Not focused on general OS-level checkpointing outside Kubernetes
pods/containers.&lt;/li>
&lt;li>Will not dictate internal application checkpointing logic; focuses on
Kubernetes platform orchestration of *container/pod state.&lt;/li>
&lt;/ul>
&lt;h2 id="stakeholders">Stakeholders&lt;/h2>
&lt;p>Stakeholders in this working group span multiple SIGs that own parts of the
code in core kubernetes components and addons.&lt;/p>
&lt;ul>
&lt;li>SIG Node&lt;/li>
&lt;li>SIG Scheduling&lt;/li>
&lt;li>SIG Auth&lt;/li>
&lt;li>SIG Apps&lt;/li>
&lt;/ul>
&lt;h2 id="deliverables">Deliverables&lt;/h2>
&lt;p>The list of deliverables include the following high level features:&lt;/p>
&lt;ul>
&lt;li>In the early stage, we mainly want to offer a well-defined location for the
community to find information, ask questions, and discuss the next steps of
enabling checkpoint and restore in Kubernetes.&lt;/li>
&lt;/ul>
&lt;p>Later:&lt;/p>
&lt;ul>
&lt;li>Ability to checkpoint and restore a container using kubectl&lt;/li>
&lt;li>Ability to checkpoint and restore a pod using kubectl&lt;/li>
&lt;li>Integration of container/pod checkpointing in scheduling decisions&lt;/li>
&lt;/ul>
&lt;h2 id="roles-and-organization-management">Roles and Organization Management&lt;/h2>
&lt;p>This WG adheres to the Roles and Organization Management outlined in &lt;a href="https://github.com/kubernetes/community/blob/master/committee-steering/governance/wg-governance.md"
target="_blank" rel="noopener">wg-governance&lt;/a>
and opts-in to updates and modifications to &lt;a href="https://github.com/kubernetes/community/blob/master/committee-steering/governance/wg-governance.md"
target="_blank" rel="noopener">wg-governance&lt;/a>
.&lt;/p>
&lt;p>Additionally, the WG commits to:&lt;/p>
&lt;ul>
&lt;li>maintain a solid communication line between the Kubernetes groups and the
wider CNCF community&lt;/li>
&lt;/ul>
&lt;h2 id="timelines-and-disbanding">Timelines and Disbanding&lt;/h2>
&lt;p>As a first mandate, the WG will propose a draft roadmap and identify key tasks in the first quarter of operation.&lt;/p>
&lt;p>After that, the WG will facilitate collaboration among community members to explore possible APIs and draft proposals for their integration into Kubernetes, which will then be presented to the relevant SIGs.&lt;/p>
&lt;p>Achieving the aforementioned deliverables, also mentioned in the &lt;code>In Scope&lt;/code>
section, will allow us to decide when to disband this WG. There is no
expectations that the Working Group will be converted into a SIG long term.&lt;/p></description></item></channel></rss>