How much flakiness do you tolerate in end to end tests?

kersplort@programming.dev · edit-2 10 months ago

How much flakiness do you tolerate in end to end tests?

kersplort@programming.dev · 10 months ago

My team has just decided to make working smokes a mandatory part of merging a PR. If the smokes don’t work on your branch, it doesn’t merge to main. I’m somewhat conflicted - on one hand, we had frequent breaks in the smokes that developers didn’t fix, including ones that represented real production issues. On the other, smokes can fail for no reason and are time consuming to run.

We use playwright, running on github actions. The default free tier runner has been awful, and we’re moving to larger runners on the platform. We have a retry policy on any smokes that need to run in a step by step order, and we aggressively prune and remove smokes that frequently fail or don’t test for real issues.

AggressivelyPassive@feddit.de · 10 months ago

Shouldn’t you find out, why the smoke tests keep failing? It’s not a good sign, if you can’t even guarantee stability in a controlled environment.

kersplort@programming.dev · 10 months ago

It’s not a fully controlled environment, that is the point of smokes.

AggressivelyPassive@feddit.de · 10 months ago

How do you not fully control the environment your PRs are tested in?

kersplort@programming.dev · 10 months ago

dependencies.