Change the repository type filter
All
Repositories list
3 repositories
DEEP-GRPO
Public- A New DataSet API with Efficient Shuffle Mechanism for PyTorch (SGD/Adam without Full Data Shuffle)
- Segment Policy Optimization: Effective Segment-Level Credit Assignment in RL for Large Language Models
ProTip! When viewing an organization's repositories, you can use the
props. filter to filter by custom property.