The current iteration-based runner couples checkpointing with the number of iterations per inner epoch. The user requests the ability to configure checkpointing, logging, and learning rate scheduling frequencies independently.
Currently, when using the iteration-based runner, checkpointing always happens at the end of each inner epoch, so the checkpoint frequency is tied to the number of iterations per inner epoch. Ideally, the frequencies of checkpointing, logging, and LR scheduling should each be configurable independently of that number. There are no real technical obstacles here; it just needs to be done.
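
To make the request concrete, here is a minimal sketch of what decoupled intervals could look like. It is not the runner's actual API: `IntervalHook`, `run`, and the interval values are hypothetical names chosen for illustration, and the training step is a no-op placeholder. Each action carries its own interval in iterations, and the loop has no notion of an inner epoch.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple


@dataclass
class IntervalHook:
    """Hypothetical hook that fires `action` every `interval` iterations."""
    name: str
    interval: int
    action: Callable[[int], None]

    def maybe_fire(self, iteration: int) -> None:
        # `iteration` is 1-based: the number of iterations completed so far.
        if self.interval > 0 and iteration % self.interval == 0:
            self.action(iteration)


def run(max_iters: int,
        hooks: List[IntervalHook],
        train_step: Callable[[int], None]) -> None:
    """Flat iteration loop: no inner epochs, every hook checks its own interval."""
    for iteration in range(1, max_iters + 1):
        train_step(iteration)
        for hook in hooks:
            hook.maybe_fire(iteration)


# Record which action fired at which iteration instead of doing real work.
events: List[Tuple[str, int]] = []

hooks = [
    IntervalHook("checkpoint", 4, lambda i: events.append(("checkpoint", i))),
    IntervalHook("log", 2, lambda i: events.append(("log", i))),
    IntervalHook("lr_step", 3, lambda i: events.append(("lr_step", i))),
]

run(max_iters=8, hooks=hooks, train_step=lambda i: None)

for name, iteration in events:
    print(f"iter {iteration}: {name}")
```

With this shape, changing how often checkpoints are written does not affect logging or LR scheduling, and none of the three depends on how many iterations make up an inner epoch.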