Q&A: Phase 33.2 ProgressiveNetExpander — Design Questions #699
web3guru888 asked this question in Q&A
Q&A: Phase 33.2 — ProgressiveNetExpander Design Questions
This thread is for technical questions about the ProgressiveNetExpander component.
Open design questions
Scalability ceiling: With O(T²) lateral connections for T tasks, how do we prevent quadratic growth? Should we limit lateral connections to the K most relevant columns?
Column architecture selection: Should new columns match the base architecture, or should we use NAS to find task-specific optimal architectures? What is the search cost budget?
Similarity threshold tuning: When using similarity-based reuse, how do we set the threshold? Too high → unnecessary expansion; too low → negative transfer from reusing dissimilar columns.
Pruning timing: Should columns be pruned immediately after training, or after consolidation once importance is clearer? How does pruning interact with lateral connections?
Hybrid with EWC: Can we combine progressive expansion (for structurally different tasks) with EWC (for similar tasks) in the same pipeline? What triggers the switch?
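To make the scalability and threshold questions more concrete, here is a minimal sketch of one possible policy, not the ProgressiveNetExpander implementation. It assumes each existing column can be summarized by a task embedding vector and that similarity is cosine similarity; the thread specifies neither. The names `plan_column`, `lateral_count`, `reuse_threshold`, and `top_k` are hypothetical and exist only for illustration.

```python
import math
from typing import Dict, List, Optional, Sequence, Tuple


def cosine(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a > 0 and norm_b > 0 else 0.0


def plan_column(
    new_task: Sequence[float],
    columns: Dict[str, Sequence[float]],
    reuse_threshold: float,
    top_k: int,
) -> Tuple[str, List[str]]:
    """Decide between reusing a column and expanding (hypothetical policy).

    Returns ("reuse", [best_column]) when the most similar column reaches
    reuse_threshold; otherwise ("expand", sources), where sources are the
    top_k most similar columns to wire lateral connections from.
    """
    # Rank existing columns from most to least similar to the new task.
    ranked = sorted(
        columns,
        key=lambda name: cosine(new_task, columns[name]),
        reverse=True,
    )
    if ranked and cosine(new_task, columns[ranked[0]]) >= reuse_threshold:
        return "reuse", [ranked[0]]
    return "expand", ranked[:top_k]


def lateral_count(num_tasks: int, top_k: Optional[int] = None) -> int:
    """Incoming lateral connections per layer, summed over all columns.

    Column k (1-indexed) can receive from its k - 1 predecessors, so the
    uncapped total is T(T - 1)/2. Capping each column at top_k sources
    bounds the total by top_k * T.
    """
    return sum(
        (k - 1) if top_k is None else min(k - 1, top_k)
        for k in range(1, num_tasks + 1)
    )


# Illustrative embeddings only; real ones would come from the task data.
columns = {"a": [1.0, 0.0], "b": [0.0, 1.0], "c": [1.0, 1.0]}
decision_loose = plan_column([1.0, 0.1], columns, reuse_threshold=0.9, top_k=2)
decision_strict = plan_column([1.0, 0.1], columns, reuse_threshold=0.999, top_k=2)
```

In this toy setup the looser threshold reuses column `a`, while the stricter one expands and wires laterals from `a` and `c` only. With 10 tasks, capping at `top_k=3` reduces the per-layer lateral count from 45 to 24. This matches the trade-off described above: raising the threshold trades negative-transfer risk for more expansion. How the embeddings are computed and whether cosine similarity is the right relevance measure remain open.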