-
Notifications
You must be signed in to change notification settings - Fork 5.6k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[RLlib] Fix Issue #25316: unconfigurable dist_dim
for custom multi-action distributions
#25490
base: master
Are you sure you want to change the base?
Conversation
4814ac7
to
132d15a
Compare
|
||
|
||
@DeveloperAPI | ||
class MultiActionDistributionMixIn(ActionDistribution): |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
one thing I am curious is why does this have to be a MixIn, instead of having all this logic as part of MultiActionDistribution class?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
one thing I am curious is why does this have to be a MixIn, instead of having all this logic as part of MultiActionDistribution class?
The point here is to reduce duplicate code. All these methods in MultiActionDistributionMixIn
are shared for MultiActionDistribution
for TensorFlow, PyTorch, and maybe Jax in the future.
251f987
to
6fb61f1
Compare
6fb61f1
to
ad170d0
Compare
ad170d0
to
3171889
Compare
@kouroshHakha may be the best person to review this PR once he is back from break. |
This pull request has been automatically marked as stale because it has not had recent activity. It will be closed in 14 days if no further activity occurs. Thank you for your contributions.
|
3171889
to
015aeae
Compare
Any updates for this PR? |
2945dcb
to
c8c4c76
Compare
c8c4c76
to
26fcf79
Compare
26fcf79
to
147bded
Compare
This pull request has been automatically marked as stale because it has not had recent activity. It will be closed in 14 days if no further activity occurs. Thank you for your contributions.
|
147bded
to
0c151eb
Compare
0c151eb
to
16359a3
Compare
a11e545
to
aa748e1
Compare
Signed-off-by: Xuehai Pan <XuehaiPan@pku.edu.cn>
This pull request has been automatically marked as stale because it has not had recent activity. It will be closed in 14 days if no further activity occurs. Thank you for your contributions.
|
A gentle ping for this. Any feedback is welcome. cc @sven1977 @avnishn @ArturNiederfahrenhorst |
This pull request has been automatically marked as stale because it has not had recent activity. It will be closed in 14 days if no further activity occurs. Thank you for your contributions.
|
I'll ping internally to get a review. Apologies for the delay. |
This pull request has been automatically marked as stale because it has not had recent activity. It will be closed in 14 days if no further activity occurs. Thank you for your contributions.
|
This pull request has been automatically marked as stale because it has not had recent activity. It will be closed in 14 days if no further activity occurs. Thank you for your contributions.
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
is this still relevant? should we close this?
8559e25
to
8a14d0f
Compare
I think this is still relevant. |
Why are these changes needed?
Make
dist_dim
for custom multi-action distributions configurable when the user subclassingMultiActionDistribution
.Changes:
ActionDistribution.required_model_output_shape
fromstaticmethod
toclassmethod
Distribution.required_input_dim
fromstaticmethod
toclassmethod
Related issues:
dist_dim
for custom multi-action distributions #25316Related PR:
cc @sven1977
Related issue number
Fixes #25316
Checks
scripts/format.sh
to lint the changes in this PR.