调用lambda
by Yan Cui
崔燕
如何使用Lambda调用上下文动态设置超时 (How to set timeouts dynamically using Lambda invocation context)
With API Gateway and Lambda, you’re forced to use short timeouts on the server-side:
使用API Gateway和Lambda,您不得不在服务器端使用较短的超时:
API Gateway has a 29s max timeout on all integration points
API网关在所有集成点上的最大超时为29秒
The Serverless framework uses a default of 6s for AWS Lambda functions
无服务器框架对AWS Lambda函数使用默认值6s
However, you have limited influence over a Lambda function’s cold start time. And you have no control over how much overhead API Gateway adds. So the actual latency you’d experience from a calling function is far less predictable than you might think.
但是,你要通过lambda函数的冷启动时间的影响有限。 而且,您无法控制API网关添加多少开销。 因此,您从调用函数遇到的实际延迟远比您想象的要难预测。
We don’t want a slow HTTP response to cause the calling function to timeout. This has a negative impact on the user experience. Instead, we should stop waiting for a response before the calling function times out.
我们不希望缓慢的HTTP响应导致调用函数超时。 这对用户体验有负面影响。 相反,我们应该在调用函数超时之前停止等待响应。
“The goal of the timeout strategy is to give HTTP requests the best chance to succeed, provided that doing so does not cause the calling function itself to err.”
“超时策略的目标是为HTTP请求提供成功的最佳机会,前提是这样做不会导致调用函数本身出错。”
- Me
- 我
Most of the time, I see folks use fixed timeout values — but it’s often tricky to decide:
在大多数情况下,我看到人们使用固定的超时值-但要决定通常很棘手:
Too short, and you won’t give the request the best chance to succeed. For example, there’s 5s left in the invocation, but the timeout is set to 3s.
太短了,您将不会给请求获得成功的最佳机会。 例如,调用中还剩下5s,但是超时设置为3s。
- Too long, and you run the risk of letting the request timeout the calling function. For example, there’s 5s left in the invocation but the timeout is set to 6s. 太长了,您冒着使请求超时的调用函数的风险。 例如,调用中还剩下5s,但是超时设置为6s。
Things are further complicated by the fact that we often perform more than one HTTP request during a function invocation. For example,
在函数调用期间我们经常执行多个HTTP请求这一事实使事情变得更加复杂。 例如,
read from DynamoDB
从DynamoDB读取
- perform business logic on the data 对数据执行业务逻辑
save the update to DynamoDB
将更新保存到DynamoDB
publish an event to Kinesis
向Kinesis发布事件
Let’s look at two common approaches for picking timeout values, and where they fall short.
让我们看一下两种选择超时值的常见方法,以及它们的不足之处。
Instead of following these approaches, I propose we should set the request timeout based on the amount of invocation time left. We should also reserve some time to perform recovery steps in the event of failures.
我建议不要基于这些方法, 而应根据剩余的调用时间来设置请求超时 。 如果发生故障,我们还应该保留一些时间来执行恢复步骤 。
You can find out how much time is left in the current invocation through the context
object.
您可以通过context
对象找出当前调用中还剩下多少时间。
For example, if a function’s timeout
is 6s, and we’re 1s into the invocation. If we reserve 500ms for recovery, then that leaves us with 4.5s to wait for a HTTP response.
例如,如果一个函数的timeout
为6s,而调用的timeout
为1s。 如果我们保留500毫秒用于恢复,那么剩下4.5秒钟的时间来等待HTTP响应。
With this approach, we get the best of both worlds:
通过这种方法,我们可以两全其美:
- Allow requests the best chance to succeed based on the actual amount of invocation time we have left 允许请求根据我们剩下的实际调用时间获得成功的最佳机会
- Prevent slow responses from timing out the function, which gives us a window of opportunity to perform recovery actions. 防止响应缓慢导致功能超时,这为我们提供了执行恢复操作的机会。
But what are you going to do after you time out these requests? Aren’t you still going to have to respond with a HTTP error, since you couldn’t finish whatever operations you needed to perform?
但是,这些请求超时后您将要做什么? 由于您无法完成所需执行的任何操作,您是否仍将不得不以HTTP错误进行响应?
At the minimum, the recovery actions should include:
至少,恢复操作应包括:
Log the timeout incident with as much context as possible. For example, request target, timeout value, correlation IDs, and the request object.
使用尽可能多的上下文记录超时事件。 例如,请求目标,超时值, 相关性ID和请求对象。
Track custom metrics for
serviceX.timedout
so it can be monitored and the team can be alerted if the situation escalates跟踪
serviceX.timedout
自定义指标,以便在情况升级时可以对其进行监视并向团队发出警报Return an application error code and the original request ID in the response body. The client app can then display a user-friendly message like
“Oops, looks like this feature is currently unavailable, please try again later. If this is urgent, please contact us at xxx@domain.com and quote the request ID f19a7dca. Thank you for your cooperation :-)”
在响应正文中返回应用程序错误代码和原始请求ID。 然后,客户端应用程序可以显示一条用户友好的消息,例如
“Oops, looks like this feature is currently unavailable, please try again later. If this is urgent, please contact us at xxx@domain.com and quote the request ID f19a7dca. Thank you for your cooperation :-)”
“Oops, looks like this feature is currently unavailable, please try again later. If this is urgent, please contact us at xxx@domain.com and quote the request ID f19a7dca. Thank you for your cooperation :-)”
{ "errorCode": 10021, "requestId": "f19a7dca", "message": "service X timed out" }
In some cases, you can also recover even more gracefully using fallbacks.
在某些 情况下,您还可以使用后备功能更优雅地恢复。
Netflix’s Hystrix library supports several flavors of fallbacks via the Command pattern it employs heavily. I recommend reading its wiki page, as there is tons of useful information and ideas there.
Netflix的猬库支持通过它使用大量的Command模式回退的几种口味。 我建议阅读其Wiki页面 ,因为那里有大量有用的信息和想法。
Every Hystrix command lets you specify a fallback action.
每个Hystrix命令都可以让您指定一个后备操作。
You can also chain the fallback together by chaining commands via their respective getFallback
methods.
您还可以通过各自的getFallback
方法将命令链接在一起,从而将后备链接链接在一起。
For example,
例如,
execute a DynamoDB read inside
CommandA
执行
CommandA
内部读取的DynamoDBIn the
getFallback
method, executeCommandB
which would return a previously cached response if available在
getFallback
方法中,执行CommandB
,它将返回以前缓存的响应(如果有)If there is no cached response, then
CommandB
would fail and trigger its owngetFallback
method如果没有缓存的响应,则
CommandB
将失败并触发自己的getFallback
方法Execute
CommandC
, which returns a stubbed response执行
CommandC
,它返回存根响应
You should check out Hystrix if you haven’t already. Most of the patterns that are baked into Hystrix can be easily adopted in our serverless applications to help make them more resilient to failures.
如果还没有,请检查Hystrix 。 大多数被烤成豪猪的模式可以在我们的无服务器应用程序,帮助轻易采取使他们更适应故障。
翻译自: https://www.freecodecamp.org/news/how-to-set-timeouts-dynamically-using-lambda-invocation-context-3e78fa832a5b/
调用lambda