
Sidecar Containers #753

Open
11 tasks done
Joseph-Irving opened this issue Jan 29, 2019 · 216 comments
Assignees
Labels
  • kind/api-change: Categorizes issue or PR as related to adding, removing, or otherwise changing an API
  • kind/feature: Categorizes issue or PR as related to a new feature.
  • sig/apps: Categorizes an issue or PR as relevant to SIG Apps.
  • sig/node: Categorizes an issue or PR as relevant to SIG Node.
  • stage/beta: Denotes an issue tracking an enhancement targeted for Beta status

Comments

@Joseph-Irving
Member

Joseph-Irving commented Jan 29, 2019

Enhancement Description

/sig node

Please keep this description up to date. This will help the Enhancement Team to track the evolution of the enhancement efficiently.

@k8s-ci-robot k8s-ci-robot added kind/feature Categorizes issue or PR as related to a new feature. sig/apps Categorizes an issue or PR as relevant to SIG Apps. sig/node Categorizes an issue or PR as relevant to SIG Node. labels Jan 29, 2019
@Joseph-Irving
Member Author

@enisoc @dchen1107 @fejta @thockin @kow3ns @derekwaynecarr, I've opened this tracking issue so that we can discuss.

@kow3ns
Member

kow3ns commented Jan 31, 2019

/assign

@Joseph-Irving
Member Author

@derekwaynecarr I've done some scoping out of the kubelet changes required for next week's sig-node meeting. I believe changes are only needed in the kuberuntime package, specifically kuberuntime_manager.go and kuberuntime_container.go.

In kuberuntime_manager.go you could modify computePodActions to implement the shutdown triggering (kill the sidecars once all non-sidecars have permanently exited) and to start the sidecars first.

In kuberuntime_container.go you could modify killContainersWithSyncResult to terminate the sidecars last and send their preStop hooks (the preStop hooks part was a bit debatable; it wasn't settled whether that should be done or not. @thockin had a good point about why you might not want to encourage that behaviour, see comment).
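
Roughly, the trigger logic would be along these lines (an illustrative sketch only, using stand-in types rather than the real kuberuntime structs):

```go
// Illustrative sketch only: stand-in types, not the real kuberuntime structs.
package main

import "fmt"

type container struct {
	name       string
	sidecar    bool
	terminated bool
}

// sidecarsShouldBeKilled is the trigger computePodActions would need:
// once every non-sidecar container has permanently exited, the remaining
// sidecars can be shut down.
func sidecarsShouldBeKilled(containers []container) bool {
	for _, c := range containers {
		if !c.sidecar && !c.terminated {
			return false
		}
	}
	return true
}

func main() {
	pod := []container{
		{name: "main", sidecar: false, terminated: true},
		{name: "proxy", sidecar: true, terminated: false},
	}
	fmt.Println(sidecarsShouldBeKilled(pod)) // true: only the sidecar is still running
}
```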

Let me know if you want me to investigate any further.

@resouer

resouer commented Feb 1, 2019

@kow3ns The discussion would make more sense to me if we defined a full description of the container sequence in the Pod spec (sig-apps) and how the kubelet should handle that sequence for start, restart, and cascading shutdown (sig-node). Let's use the Feb 5 sig-node meeting to gather more input.

cc @Joseph-Irving

@luksa

luksa commented Feb 7, 2019

The proposal says that sidecars only run after the init containers run. But what if the use case requires the sidecar to run while, or before, the init containers run? For example, if you'd like to route the pod's traffic through a proxy running as a sidecar (as in Istio), you probably want that proxy to be in place while the init containers run, in case an init container itself makes network calls.

@Joseph-Irving
Member Author

@luksa I think there's the possibility of looking at sidecars that run during the init phase at some point, but the current proposal is not going to cover that use case. There is currently no way to have containers running concurrently in the init phase, so that would potentially be a much larger/messier change than what is being suggested here.

@Joseph-Irving
Member Author

Update on this KEP:
I've spoken to both @derekwaynecarr and @dchen1107 from sig-node about this and they did not express any major concerns about the proposal. I will raise a PR to the KEP adding some initial notes around implementation details and clarifying a few points that came up during the discussion.

We still need to agree on the API. It seems there is consensus that a simple way of marking containers as sidecars is preferred over more in-depth ordering flags. A bool is somewhat limiting, though, so perhaps something along the lines of containerLifecycle: Sidecar would be preferable so that we have the option of expanding it in the future.

@luksa

luksa commented Feb 14, 2019

@Joseph-Irving Actually, neither the boolean nor containerLifecycle: Sidecar is appropriate for proper future extensibility. Instead, containerLifecycle should be an object, just like deployment.spec.strategy, with type: Sidecar. This would allow us to introduce additional fields later. For the "sidecar for the whole lifetime of the pod" solution, it would be expressed along these lines:

containerLifecycle: 
  type: Sidecar
  sidecar:
    scope: CompletePodLifetime

as opposed to

containerLifecycle: 
  type: Sidecar
  sidecar:
    scope: AfterInit

Please forgive my bad naming - I hope the names convey the idea.

But there is one problem with the approach of introducing containerLifecycle under pod.spec.containers. Namely, it's wrong to have sidecars that run in parallel with init containers specified under pod.spec.containers. So if you really want to be able to extend this to init containers eventually, you should find an alternative solution - one that would allow you to mark containers as sidecars at a higher level, i.e. not under pod.spec.containers or pod.spec.initContainers, but something like pod.spec.sidecarContainers, which I believe you already discussed but dismissed. The init containers problem definitely calls for a solution along these lines.
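
To make the extensibility argument concrete, here is a rough Go sketch of the shape I mean (the names are purely illustrative, not a proposed API):

```go
// Purely illustrative field names, not a proposed API: the point is the
// discriminated-union shape, in the same style as DeploymentSpec.Strategy,
// so additional fields can be introduced later without breaking anything.
package api

type ContainerLifecycle struct {
	Type    string            // e.g. "Sidecar"
	Sidecar *SidecarLifecycle // only set when Type == "Sidecar"
}

type SidecarLifecycle struct {
	Scope string // e.g. "CompletePodLifetime" or "AfterInit"
}
```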

@Joseph-Irving
Member Author

@luksa You could also solve the init problem by just allowing an init container to be marked as a sidecar and have that run alongside the init containers. As I understand it, the problem is that init containers sometimes need sidecars, which is different from needing a container that runs for the entire lifetime of the pod.

The problem with pod.spec.sidecarContainers is that it's a far more complex change: tooling would need to be updated, and the kubelet would require a lot of modification to support another set of containers. The current proposal is far more modest; it only builds on what's already there.

@luksa

luksa commented Feb 14, 2019

@Joseph-Irving We could work with that, yes. It's not ideal for the sidecar to shut down after the init containers run and then have the same sidecar start up again, but it's better than not having that option. The bigger problem is that older kubelets wouldn't handle init-sidecar containers properly (as is the case with main-sidecar containers).

I'd just like you to keep init-sidecars in mind when finalizing the proposal. In essence, you're introducing the concept of "sidecar" into k8s (previously, we basically only had a set of containers that were all equal). Now you're introducing actual sidecars, so IMHO, you really should think this out thoroughly and not dismiss a very important sidecar use-case.

I'd be happy to help with implementing this. Without it, Istio can't provide its features to init containers (actually, in a properly secured Kubernetes cluster running Istio, init containers completely lose the ability to talk to any service).

@Joseph-Irving
Member Author

In relation to the implementation discussion in #841, I've opened a WIP PR containing a basic PoC for this proposal kubernetes/kubernetes#75099. It's just a first draft and obviously not perfect but the basic functionality works and gives you an idea of the amount of change required.

cc @enisoc

@Joseph-Irving
Member Author

I put together a short video showing how the PoC currently behaves: https://youtu.be/4hC8t6_8bTs. Seeing it in action can be better than reading about it.
Disclaimer: I'm not a pro youtuber.

@Joseph-Irving
Member Author

I've opened two new PRs:

Any thoughts or suggestions will be much appreciated.

@SergeyKanzhelev
Member

This KEP will not progress in 1.30 and will stay in beta. As per https://kubernetes.slack.com/archives/CPNHUMN74/p1706017979795519, @matthyx will fork out another KEP for improved Pod termination logic.

@matthyx pls ping me when done so I can tag the forked KEP for 1.30 milestone

@pacoxu
Member

pacoxu commented Jan 29, 2024

@matthyx pls ping me when done so I can tag the forked KEP for 1.30 milestone

#4324 is updated using a separate KEP.

And the new KEP issue is #4438.

@k8s-triage-robot

The Kubernetes project currently lacks enough contributors to adequately respond to all issues.

This bot triages un-triaged issues according to the following rules:

  • After 90d of inactivity, lifecycle/stale is applied
  • After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied
  • After 30d of inactivity since lifecycle/rotten was applied, the issue is closed

You can:

  • Mark this issue as fresh with /remove-lifecycle stale
  • Close this issue with /close
  • Offer to help out with Issue Triage

Please send feedback to sig-contributor-experience at kubernetes/community.

/lifecycle stale

@k8s-ci-robot k8s-ci-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Apr 28, 2024
@gjkim42
Member

gjkim42 commented Apr 28, 2024

/remove-lifecycle stale

@k8s-triage-robot

The Kubernetes project currently lacks enough contributors to adequately respond to all issues.

This bot triages un-triaged issues according to the following rules:

  • After 90d of inactivity, lifecycle/stale is applied
  • After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied
  • After 30d of inactivity since lifecycle/rotten was applied, the issue is closed

You can:

  • Mark this issue as fresh with /remove-lifecycle stale
  • Close this issue with /close
  • Offer to help out with Issue Triage

Please send feedback to sig-contributor-experience at kubernetes/community.

/lifecycle stale

@k8s-ci-robot k8s-ci-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Jul 27, 2024
@matthyx
Contributor

matthyx commented Aug 4, 2024

/remove-lifecycle stale

@k8s-ci-robot k8s-ci-robot removed the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Aug 4, 2024
@matthyx
Contributor

matthyx commented Aug 4, 2024

We're going GA for 1.32.

@kfox1111

kfox1111 commented Sep 5, 2024

We hit an issue with sidecar behavior being a little undefined, and we're looking for guidance on what the right behavior is.

When the normal containers exit, the sidecars are shot.

In some cases, they then return exit codes != 0.

At least one k8s client, https://k9scli.io/, considers a completed pod with exit codes other than 0 to be bad and marks it in its UI as RED.

It's a bit unclear what the expectation around this is. Should it be documented that exit codes != 0 for sidecar containers are normal on main container exit and should be ignored? That would mean k9s's code should be adjusted.

If exit code 0 is expected, each sidecar that doesn't return 0 should be adjusted, and perhaps alternate shutdown signals should be supported for the sidecar to allow more "graceful shutdown" signaling?

@SergeyKanzhelev
Member

When the normal containers exit, the sidecars are shot.

In some cases, they then return exit codes != 0.

At least one k8s client, https://k9scli.io/, considers a completed pod with exit codes other than 0 to be bad and marks it in its UI as RED.

From the Kubernetes perspective, sidecars will receive SIGTERM followed by SIGKILL. If implemented correctly, they will exit cleanly and will not mark the pod as "red". @kfox1111 what do you see as the Pod status when it is red?

Should it be documented that exit codes != 0 for sidecar containers are normal on main container exit and should be ignored?

Hiding, or recommending to hide, a sidecar's bad exit code may mask real issues with it. We would really hope that sidecars are implemented correctly and don't get into this situation, so I want to make sure there is only a limited set of situations in which a sidecar will be marked as failed. Any additional details on how the sidecar was implemented, what grace period was set, and how long the container took to terminate will help us understand your use case better.
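
To illustrate what "implemented correctly" means here: the sidecar should catch SIGTERM, finish any cleanup within the pod's termination grace period, and exit 0. A minimal sketch (illustrative only, not taken from any real sidecar):

```go
// Minimal illustration: a sidecar main loop that exits 0 on SIGTERM so the
// pod can still be reported as Succeeded. The work and cleanup are placeholders.
package main

import (
	"log"
	"os"
	"os/signal"
	"syscall"
	"time"
)

func main() {
	sigs := make(chan os.Signal, 1)
	signal.Notify(sigs, syscall.SIGTERM, syscall.SIGINT)

	for {
		select {
		case <-sigs:
			log.Println("received SIGTERM, shutting down cleanly")
			// do any cleanup here, staying within the pod's termination grace period
			os.Exit(0)
		case <-time.After(10 * time.Second):
			log.Println("sidecar still running...") // placeholder for real work
		}
	}
}
```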

@kfox1111

kfox1111 commented Sep 5, 2024

phase: Succeeded
initContainerStatuses:
  - containerID: docker://e8fd1c45cd995ef68ad84f9825faf4a35f53797d3308877adb59383a48ddfb3d
    image: ghcr.io/spiffe/spiffe-helper:nightly
    imageID: docker-pullable://ghcr.io/spiffe/spiffe-helper@sha256:6b09786251ae4e3ccef193b98eceae4b3d2bed091f3a8dd92be33a4c207455f2
    lastState: {}
    name: spiffe-helper
    ready: false
    restartCount: 0
    started: false
    state:
      terminated:
        containerID: docker://e8fd1c45cd995ef68ad84f9825faf4a35f53797d3308877adb59383a48ddfb3d
        exitCode: 2
        finishedAt: "2024-09-05T20:29:29Z"
        reason: Error
        startedAt: "2024-09-05T20:29:19Z"
containerStatuses:
  - containerID: docker://eda62dbdb9196efa6723f025194d71aa92cedd3d4dd8c862cc803353d0901f38
    image: busybox:latest
    imageID: docker-pullable://busybox@sha256:34b191d63fbc93e25e275bfccf1b5365664e5ac28f06d974e8d50090fbb49f41
    lastState: {}
    name: main
    ready: false
    restartCount: 0
    started: false
    state:
      terminated:
        containerID: docker://eda62dbdb9196efa6723f025194d71aa92cedd3d4dd8c862cc803353d0901f38
        exitCode: 0
        finishedAt: "2024-09-05T20:29:29Z"
        reason: Completed
        startedAt: "2024-09-05T20:29:24Z"

We already decided we're going to update the spiffe-helper to exit 0 on SIGTERM. It was exiting 2 on SIGTERM. But it may be good to clarify this behavior for other users of the sidecar API so everyone knows what to expect.

@SergeyKanzhelev
Member

We already decided we're going to update the spiffe-helper to exit 0 on SIGTERM. It was exiting 2 on SIGTERM.

That makes sense. Please let us know if there are other cases where it fails unexpectedly.

I wonder if this sidecar was used before the feature was implemented. I would think that in the past a sidecar (implemented as a regular container) would also have received SIGTERM on a pod delete, so it was suffering from the same issue.

@SergeyKanzhelev
Member

We can definitely benefit from updating this document to mention this: https://kubernetes.io/docs/tutorials/configuration/pod-sidecar-containers/
