guillaume blaquiere
1 min readNov 27, 2022

Hey Jacob, it's a good question

Imagine you want to reduce cold start with Cloud Run, let's say up to 3 concurrent instances. You set the min instance to 3. Same thing with App Engine (you also have a min instance parameter).

Now, what happens at the forth instance if you have more traffic than expected? A new instance is created and the cold start impacts your users.

With the idle instance, you can say to App Engine to keep a number of warm instances, whatever the min instance parameter. Let's say, you set this param set to 2.

When all your min instances are used (3, as before), there are still 2 instances warm ready to serve traffic. If a 4th instance is used, there is no cold start, and the system starts another instance to keep it warm, in the idle instance pool.

It's like a "sliding min instance", you can scale more seamlessly.

guillaume blaquiere
guillaume blaquiere

Written by guillaume blaquiere

GDE cloud platform, Group Data Architect @Carrefour, speaker, writer and polyglot developer, Google Cloud platform 3x certified, serverless addict and Go fan.

No responses yet

Write a response