Previous control is not used? #8
Hi, it's been a while since I last worked with this, so I'm going from memory, with no guarantee that what I'm saying is correct. Regarding the mask: if I remember correctly, using the mask didn't improve the final performance, so I removed it to reduce overhead. Regarding the control unit: the previous control is never used because the control unit uses different parameters for each step (
Thanks for your feedback! I looked into the code more carefully over the past few days. If I may, I guess the reason masking is not helpful lies in what is used in pytorch-mac-network/code/mac.py, lines 43 to 48 in 15a65d1.
Regarding the control unit, I think the original paper may describe a non-recurrent variant, which is like what you did here. But the version published in the paper combines the previous control with the current step's question representation before computing the next control, so the previous control is actually used. Anyway, thanks a lot for sharing this awesome implementation and for answering my questions! Greatly appreciated.
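To make the recurrent variant concrete, here is a minimal, framework-free sketch of one control update in which the previous control is concatenated with the current step's question vector before attending over the question words. It is not the repository's code: the function name `control_step`, the weights `w_mix` and `w_attn`, and the omission of biases are all assumptions made for illustration.

```python
import math


def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    peak = max(xs)
    exps = [math.exp(x - peak) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def control_step(prev_control, question_step, context_words, w_mix, w_attn):
    """One recurrent control update (hypothetical sketch, biases omitted).

    prev_control, question_step: vectors of size d
    context_words: list of S word vectors of size d
    w_mix: d x 2d matrix applied to [prev_control ; question_step]
    w_attn: vector of size d used to score each word
    """
    joined = prev_control + question_step  # list concatenation, size 2d
    cq = [sum(w * x for w, x in zip(row, joined)) for row in w_mix]
    # Score each word by a weighted elementwise product with cq.
    logits = [
        sum(a * c * w for a, c, w in zip(w_attn, cq, word))
        for word in context_words
    ]
    attn = softmax(logits)
    d = len(prev_control)
    # New control is the attention-weighted sum of the word vectors.
    return [sum(p * word[k] for p, word in zip(attn, context_words)) for k in range(d)]


# Toy setup: two one-hot "words", w_mix adds prev_control and question_step.
words = [[1.0, 0.0], [0.0, 1.0]]
w_mix = [[1.0, 0.0, 1.0, 0.0], [0.0, 1.0, 0.0, 1.0]]
w_attn = [1.0, 1.0]
question = [0.0, 0.0]

c_a = control_step([1.0, 0.0], question, words, w_mix, w_attn)
c_b = control_step([0.0, 1.0], question, words, w_mix, w_attn)
```

With the same question vector, the two different previous controls produce different new controls, which is exactly the dependence that disappears when the previous control is dropped from the function signature.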
Hi,
Thanks for sharing this great repo! It's awesome.
I just have a few questions about the control unit:
(1) From
pytorch-mac-network/code/mac.py
Lines 74 to 77 in 15a65d1
these lines seem to be commented out, but I think they are used for masking padded tokens in the sequence. Is that intentional? Am I missing something here?
(2) It seems in this control function,
pytorch-mac-network/code/mac.py
Line 51 in 15a65d1
The previous control is never passed in as an argument, so the control from one reasoning step is not passed to the next. I noticed that the docstring mentions an argument named control, but it seems to have been removed from the function signature. Also, question_lengths is not used, since the mask function call is commented out (see the previous question). Is that intentional as well?
Thanks a lot!
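For reference, this is roughly what masking padded tokens means for the attention over question words. The sketch below is plain Python and not taken from mac.py; `masked_softmax` and the example logits are hypothetical, and it assumes padding sits at the end of the sequence.

```python
import math


def masked_softmax(logits, length):
    """Softmax over the first `length` entries; padded positions get weight 0."""
    if not 0 < length <= len(logits):
        raise ValueError("length must be in 1..len(logits)")
    valid = logits[:length]
    peak = max(valid)  # subtract the max for numerical stability
    exps = [math.exp(x - peak) for x in valid]
    total = sum(exps)
    return [e / total for e in exps] + [0.0] * (len(logits) - length)


# Hypothetical attention logits for a 3-word question padded to length 5.
logits = [2.0, 1.0, 0.5, 3.0, 3.0]
weights = masked_softmax(logits, length=3)
```

Without the mask, the two padded positions (which here have the largest logits) would take most of the attention weight; with it, all of the weight stays on the real words.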