Replies: 1 comment
@yining043 tagging you here too, since I guess you will need step-wise states (or at least rewards) to save for improvement methods!
Problem
In most combinatorial settings such as the ones we consider, the initial `td` (e.g. `locs` in a Euclidean routing problem) does not really change, so we do not need to carry information about the whole computational graph. This is why, unlike TorchRL, we modified the `step()` function of the environment here not to save all previous `td` (since they would just increase runtime). However, this is not true in general when we consider dynamic / stochastic settings.
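To make this concrete, here is a minimal sketch of a static-case step (field names like `locs`, `visited` and `current_node` are illustrative, not the exact rl4co schema): only the dynamic part of the state is rewritten, so there is nothing to gain from keeping the previous TensorDicts around.

```python
from tensordict import TensorDict


def step_static(td: TensorDict) -> TensorDict:
    """Advance one decoding step without stacking previous TensorDicts."""
    action = td["action"]
    # Only the dynamic fields are rewritten; the static instance data ("locs")
    # never changes, so carrying old states would just add runtime and memory.
    visited = td["visited"].scatter(-1, action.unsqueeze(-1), True)
    td.update(
        {
            "current_node": action,
            "visited": visited,
            "done": visited.all(-1),
        }
    )
    return td
```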
Solution
We should better explain why we do this and allow users to save intermediate states during decoding as an option, perhaps by specifying the problem as `static` or `dynamic` instead of having the `_torchrl_mode` in here.

PS: optionally, one could save the TensorDicts alongside the actions here - i.e., saving each step inside of the `DecodingStrategy` from @LTluttmann upon request and giving back the full nested `td` as usually done in TorchRL.

CC: @Furffico @cbhua
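As a rough sketch of what the opt-in could look like (the `save_steps` flag, the loop structure and the TorchRL-style `env.step(td)["next"]` call are assumptions for illustration, not the current API), the decoding loop would only pay the memory cost when intermediate states are actually requested:

```python
import torch
from tensordict import TensorDict


def decode(env, td: TensorDict, policy, save_steps: bool = False):
    """Roll out the policy; optionally keep every intermediate TensorDict."""
    step_tds = []
    while not td["done"].all():
        td["action"] = policy(td)        # pick the next action
        if save_steps:
            step_tds.append(td.clone())  # needed for dynamic / stochastic problems
        td = env.step(td)["next"]        # assumed TorchRL-style step output
    if save_steps:
        # give back the full nested trajectory, as usually done in TorchRL
        return torch.stack(step_tds, dim=1)
    return td  # static case: the final state (and reward) is all we need
```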