Improve error, if ECS task can not be submitted #282
Comments
Hi @brodul, I'm experiencing the same issue. We are using an ECS cluster with an EC2 capacity provider, but Prefect tries to add an endless number of flow runs (and therefore ECS tasks) without checking the available capacity first. The team stated this should have been solved in this issue, but I'm still experiencing the problem. By looking at the code the issue comes by just firing the |
@coffeeandcloud Hey, I have figured out that it's not just hitting the quota that fails with this error. We have increased the quota and cannot hit it currently. Thanks for the retry fix.
In my case, the retry just adds a layer of traceback but does not help pinpoint the cause of the issue:
I get this error semi-randomly using Fargate, so it might not be a quota issue. EDIT: it was a quota issue. I was asking for more vCPUs than were available. It would be nice to get a more explicit error. I might have a look at that in the future.
FYI: this error also happens if you set EC2 as the launch type but the EC2 capacity provider cannot supply enough resources for the task to start. For instance, the task needs 2 vCPUs but your capacity provider can only launch 1-vCPU instances. This problem is different from a quota issue using Fargate, yet Prefect returns the same traceback.
Facing a similar issue where my agent logs out the following message:
I can see that the boto's ECS library spits out a |
Improve the error trace when the ECS worker cannot create ECS tasks. One example is when the Fargate vCPU quota is reached.
Traceback / Example
Currently an IndexError is returned in the stack. In the Cloud UI, the State Message is: Submission failed. IndexError: list index out of range
Expectation / Proposal
Pass more details to the error. Maybe wrap the error in another error like SubmissionError.
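To illustrate the proposal, here is a minimal sketch, not the worker's actual code. It assumes the IndexError comes from indexing an empty "tasks" list in the boto3 ECS run_task response, and that the "failures" list in the same response carries the reason. SubmissionError and first_task_or_raise are hypothetical names used only for this example, and the sample response is made up.

```python
class SubmissionError(RuntimeError):
    """Hypothetical error type for a task that ECS refused to start."""


def first_task_or_raise(response: dict) -> dict:
    """Return the first started task, or raise with the reasons ECS reported.

    `response` is assumed to have the shape of a boto3 ECS `run_task`
    response: a "tasks" list and a "failures" list whose entries may
    contain "arn", "reason" and "detail".
    """
    tasks = response.get("tasks") or []
    if tasks:
        return tasks[0]

    # No task was started: collect whatever ECS reported instead of
    # letting `tasks[0]` fail with a bare IndexError.
    parts = []
    for failure in response.get("failures") or []:
        text = failure.get("reason", "unknown reason")
        if failure.get("detail"):
            text += f" ({failure['detail']})"
        parts.append(text)
    reasons = "; ".join(parts) or "no failure reason reported"
    raise SubmissionError(f"ECS did not start any task: {reasons}")


if __name__ == "__main__":
    # Made-up response for illustration; not captured from a real cluster.
    fake_response = {
        "tasks": [],
        "failures": [{"arn": "arn:aws:ecs:example", "reason": "RESOURCE:CPU"}],
    }
    try:
        first_task_or_raise(fake_response)
    except SubmissionError as exc:
        print(f"Submission failed. {type(exc).__name__}: {exc}")
```

With something along these lines, the State Message could show the reason reported by ECS rather than "list index out of range", which would also help tell the Fargate quota case apart from the EC2 capacity provider case mentioned in the comments.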