Serverless Stack #help

Dan Van Brunt

09/20/2021, 7:41 PM

ohhhhh…. just saw this now… https://docs.serverless-stack.com/constructs/NextjsSite Did you guys find a workaround for the SSR Lambda@Edge function that causes stack delete to fail? It’s our main issue with using NextJS for our sites, since we want to be able to fire up and tear down quickly.

Dan Van Brunt

09/20/2021, 7:46 PM

Doesn’t look like it. Would love to see something that could be done to `retain` and then use async cleanup to delete orphan functions 2hrs later. Is this something one could use EventBridge or CloudTrail triggered Functions for?

thdxr

09/20/2021, 8:05 PM

I do believe you can receive cloudformation events through the default EventBridge bus
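
A minimal sketch of what that could look like, assuming the account delivers "CloudFormation Stack Status Change" events on the default EventBridge bus (worth verifying for your account and region; the alternative mentioned above is CloudTrail-based events). The names `stackDeletedPattern`, `isStackDeleted` and `stackNameFromId` are hypothetical and not part of SST.

```typescript
// Assumed shape of a CloudFormation stack status change event on the default bus.
// Only the fields this sketch reads are declared.
interface StackStatusEvent {
  source: string;
  "detail-type": string;
  detail: {
    "stack-id": string;
    "status-details": { status: string };
  };
}

// Hypothetical EventBridge rule pattern: fire only once a stack is fully deleted.
const stackDeletedPattern = {
  source: ["aws.cloudformation"],
  "detail-type": ["CloudFormation Stack Status Change"],
  detail: { "status-details": { status: ["DELETE_COMPLETE"] } },
};

// Defensive re-check inside the cleanup function, in case the rule is broader.
function isStackDeleted(event: StackStatusEvent): boolean {
  return (
    event.source === "aws.cloudformation" &&
    event.detail["status-details"].status === "DELETE_COMPLETE"
  );
}

// Stack ARNs look like arn:aws:cloudformation:<region>:<account>:stack/<name>/<uuid>.
function stackNameFromId(stackId: string): string {
  const parts = stackId.split("/");
  return parts[1] ?? stackId;
}
```

The cleanup function could then use the stack name to look up which retained functions belong to the deleted stack, for example through a naming convention or tags.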

thdxr

09/20/2021, 8:05 PM

Great use case

Frank

09/20/2021, 8:05 PM

Hey @Dan Van Brunt, currently no. One thought I had would be to mark the Lambdas to `retain`, and then have custom resources periodically retry until they are successfully removed. The downside being the stack could take hours to remove, but will eventually remove successfully. This is similar to what you suggested. What do you think?

Dan Van Brunt

09/20/2021, 8:07 PM

I think there is a 30min max to CFN or CRs so I think it’s off the table.

Dan Van Brunt

09/20/2021, 8:08 PM

I was thinking of just retaining the lambdas so the stack (minus the replicas + main function) can be deleted normally (fast)…. then just have some kinda orphan function cleanup, completely outside of that stack.
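
A rough sketch of the decision logic such an out-of-stack cleanup could use, assuming the delete call fails with `InvalidParameterValueException` and a "replicated function" message while edge replicas still exist, and that the error exposes `name` and `message` fields. All type and function names here are hypothetical; the actual Lambda delete call is left out so the logic stays testable.

```typescript
// Outcome of one delete attempt on a retained Lambda@Edge function.
type DeleteOutcome = "deleted" | "retry-later" | "failed";

// Only the error fields this sketch inspects (assumption: SDK errors expose these).
interface SdkErrorLike {
  name?: string;
  message?: string;
}

// One attempt: the function name and the error from the delete call (null on success).
interface DeleteAttempt {
  functionName: string;
  error: SdkErrorLike | null;
}

function classifyDeleteAttempt(error: SdkErrorLike | null): DeleteOutcome {
  if (error === null) return "deleted";
  // Already gone counts as success, so the sweep is safe to re-run.
  if (error.name === "ResourceNotFoundException") return "deleted";
  const message = error.message ?? "";
  const stillReplicated =
    error.name === "InvalidParameterValueException" &&
    message.indexOf("replicated function") !== -1;
  // Replicas are removed asynchronously, so try again on the next sweep.
  return stillReplicated ? "retry-later" : "failed";
}

// Names of functions that should be retried on the next scheduled sweep.
function pendingAfterSweep(attempts: DeleteAttempt[]): string[] {
  return attempts
    .filter((a) => classifyDeleteAttempt(a.error) === "retry-later")
    .map((a) => a.functionName);
}
```

Run on a schedule, each sweep would attempt the deletes, keep the `retry-later` names for the next run, and surface anything classified as `failed`.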

Frank

09/20/2021, 8:08 PM

I think you can have async CRs, i.e. have a step function periodically retry, and then hit the CFN API to signal the CR is completed.

Dan Van Brunt

09/20/2021, 8:09 PM

Ya, I think you can do that… but I think a single CFN deploy…. has a 30min max

Dan Van Brunt

09/20/2021, 8:10 PM

I remember this being the case for either CFN or CRs about a year ago. It wasn’t easy to find in the docs either.

thdxr

09/20/2021, 8:11 PM

^ I think I ran into this as well. I had a broken CR and after 30min aws killed it

Frank

09/20/2021, 8:11 PM

I see.. I was reading the CDK docs on the different ways to implement CRs https://docs.aws.amazon.com/cdk/api/latest/docs/core-readme.html#custom-resource-providers

Frank

09/20/2021, 8:12 PM

And they claim the `custom-resources.Provider` (they create a step function internally) has Unlimited timeout.
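
For context, the CDK provider framework lets a custom resource supply a separate completion handler that is polled until it reports `IsComplete: true` or the configured total timeout runs out. A minimal sketch of the response logic for the retry-until-removed idea, with hypothetical names (`completionResponse`, `pendingFunctions`) and the actual delete calls left out:

```typescript
// CloudFormation custom resource request types.
type RequestType = "Create" | "Update" | "Delete";

// Response shape the provider framework expects from a completion handler.
interface IsCompleteResponse {
  IsComplete: boolean;
}

// pendingFunctions: retained Lambda@Edge functions that could not be deleted yet
// (hypothetical input, e.g. the result of the latest delete attempts).
function completionResponse(
  requestType: RequestType,
  pendingFunctions: string[]
): IsCompleteResponse {
  // Nothing to wait for on create or update.
  if (requestType !== "Delete") {
    return { IsComplete: true };
  }
  // On delete, keep the framework polling until every function is gone.
  return { IsComplete: pendingFunctions.length === 0 };
}
```

With this shape the stack delete stays in progress while the framework keeps polling, which is the "stack could take hours to remove" trade-off discussed above.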

Frank

09/20/2021, 8:13 PM

Maybe they mean the CR has unlimited timeout, but the entire Stack has 30min timeout?

Dan Van Brunt

09/20/2021, 8:13 PM

no… I think you might be right… I found `AWS::CloudFormation::WaitCondition` has a max time of 12 hours…. so CFN must not have this limitation.

Dan Van Brunt

09/20/2021, 8:14 PM

Not sure where I got that idea then.

thdxr

09/20/2021, 8:14 PM

What happens if the CR returns immediately but kicks off some job? CF wouldn't be aware of that, right?

thdxr

09/20/2021, 8:14 PM

The 30min comes from the CR function not returning in 30min and CF marks the stack as failed

Dan Van Brunt

09/20/2021, 8:14 PM

ah…. so that isn’t a thing now though @thdxr?

Frank

09/20/2021, 8:15 PM

Ah I see. In this case, the CR would return something right away, like a token.

Dan Van Brunt

09/20/2021, 8:15 PM

or are you agreeing that the limit is still 30mins on CRs?

thdxr

09/20/2021, 8:15 PM

I think you can do this but just not sure what would own the stepfunction and how it would clean up itself 😵 . Unless we had another stack along with it

Dan Van Brunt

09/20/2021, 8:15 PM

could you not just trigger something to happen in 2hrs? EventBus, StepFunctions etc?

Dan Van Brunt

09/20/2021, 8:16 PM

I was thinking another stack.

thdxr

09/20/2021, 8:16 PM

It might make sense to have a standard SST stack just like how there's a standard CDK stack

Dan Van Brunt

09/20/2021, 8:16 PM

ya something like that would allow for various cleanup things

Dan Van Brunt

09/20/2021, 8:17 PM

I wasn’t yet thinking about SST…. but just that “another stack” could be used to handle the cleanup.

Dan Van Brunt

09/20/2021, 8:18 PM

All the better though if there was an optional SST stack and constructs were able to use it optionally. Like the NextJS construct would do what it does now without it… and WITH it would retain and handle the deletion.

Dan Van Brunt

09/20/2021, 8:20 PM

If this is of interest… I wouldn’t mind lending a hand to set up this cleanup function / function caller.

thdxr

09/20/2021, 8:21 PM

yeah will let frank give his pov - I'm just talking without really knowing anything about the underlying problem 😄

Frank

09/20/2021, 8:26 PM

Give me 10min to wrap up something and I will share some thoughts.

Frank

09/20/2021, 10:24 PM

Documented the 2 solutions above in this issue, with some pros/cons off the top of my head.

Frank

09/20/2021, 10:25 PM

Personally, I lean towards Solution 1, the custom resource approach. Mainly b/c the removal logic is self-contained in the stack.

Frank

09/20/2021, 10:31 PM

Ideally we want to minimize the amount of “SST resources” deployed into a user’s account. So I think let’s introduce an `SST Toolkit` stack when we absolutely have to 🤔

Frank

09/20/2021, 10:32 PM

Open for thoughts (cc’ing @Jay in case he wants to chime in)

Dan Van Brunt

09/20/2021, 11:03 PM

@Frank isn’t that kinda what those debug stacks are? would it not be another thing that could move to a toolkit stack and reuse a single stack to debug all deployed stacks?

Frank

09/21/2021, 5:11 AM

Yeah, I think we’ll definitely move towards the toolkit stack architecture (we had some thoughts on deploying resources to help with monitoring/debugging). But as for the Next.js construct, I’m curious why you prefer solution 2 over 1? Is it b/c the CR solution will make the removal process take hours?

Dan Van Brunt

09/21/2021, 2:39 PM

@Frank oh sorry, I didn’t see the link to the ticket that had your thoughts in it?

Frank

09/21/2021, 6:14 PM

Oh my bad 😮 https://github.com/serverless-stack/serverless-stack/issues/835

Frank

09/21/2021, 6:14 PM

Just noticed ur comment. I will follow up in the issue
