Phase 54.2 Spec & Technical Discussion: DynamicsPredictor #1041
web3guru888
started this conversation in
General
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
DynamicsPredictor — Spec & Technical Discussion
Technical discussion thread for Phase 54.2: DynamicsPredictor (#1035).
Core Design Questions
RSSM vs Transformer dynamics: DreamerV3 uses RSSM (GRU + stochastic); IRIS/TransDreamer use Transformers. Should we support both? What is the default?
Stochastic vs deterministic predictions: When is stochastic dynamics modeling essential (stochastic environments) vs. overkill (deterministic games)? Can we auto-detect?
Rollout stability: Multi-step rollouts accumulate error. What techniques should we use — scheduled sampling, noise injection, uncertainty-based truncation?
Action conditioning: How should actions be injected into the dynamics model — additive, multiplicative, FiLM conditioning, or cross-attention?
Temporal hierarchy: Should we implement multi-timescale dynamics (coarse abstract predictions + fine-grained motor)? This connects to hierarchical planning.
Proposed API Surface
Share your thoughts below.
Related: #1035 | #1033
Beta Was this translation helpful? Give feedback.
All reactions