HomeMesosphereNo notifications. 4 unresolved issues.

Increase default expunge timeout tasks
ClosedAll Users

Authored by timcharper on Dec 8 2016, 10:05 PM.

Details

Summary

The default expunge time of 6 minutes is too aggressive and was leading
to persistent volumes getting deleted regular maintenance tasks such as
reboot a host.

Also, update text describing the expungeAfter to make sure it's clear
that it will result in volume deletion.

addresses #4603

Test Plan

n/a

Diff Detail

Repository
rMARATHON marathon
Lint
Automatic diff as part of commit; lint not applicable.
Unit
Automatic diff as part of commit; unit tests not applicable.
timcharper retitled this revision from to Increase default expunge timeout tasks.Dec 8 2016, 10:05 PM
timcharper updated this object.
timcharper edited the test plan for this revision. (Show Details)

@meichstedt is 15 minutes enough for TimeUntilInactive? Or should we go higher? An hour?

jdef added a comment.Dec 9 2016, 12:00 AM

should we use longer defaults for stateful things vs. shorter defaults for stateless things?

docs/docs/rest-api/public/api/v2/types/unreachableStrategy.raml
12

since we're making changes in here, this description should indicate that inactiveAfterSeconds must be less than expungeAfterSeconds

18

it'd be nice to see a readable translation of this number in the description, like "the default of 604800 seconds translates to 7 days" (or some such)

I think using different defaults for resident tasks would be nice; My understanding is @meichstedt votes against it.

timcharper updated this revision to Diff 1171.Dec 9 2016, 12:25 AM

describe default. Clarify that inactiveAfterSeconds is < expungeAfterSeconds

timcharper marked 2 inline comments as done.Dec 9 2016, 12:27 AM

done

jdef accepted this revision.Dec 9 2016, 1:25 AM
This revision is now accepted and ready to land.Dec 9 2016, 1:25 AM
meichstedt accepted this revision.EditedDec 9 2016, 11:32 AM

Thanks. We need to update the docs as well, see https://github.com/mesosphere/marathon/issues/4816

meichstedt updated this revision to Diff 1178.Dec 9 2016, 12:29 PM

Fixed unit test

Thank you for fixing the test! @meichstedt

This revision was automatically updated to reflect the committed changes.