-
Notifications
You must be signed in to change notification settings - Fork 11
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Low accuracy when trianing your MAC model in clevr dataset #4
Comments
And I just follow your training command: scripts/train/mac_flatqa.sh --data_dir $DATA/sqoop-variety_1-repeats_30000 --checkpoint_path model.pt |
How long have you been training the model? |
As my running command shows, num_iterations is 100000 |
The training procedure lasts about 10 hours |
OK, I will run this experiment later today myself. |
Thank you for your reply, waiting for your results |
I am working on it. I presume I broke the model at some point, or maybe a
PyTorch change is to blame. If I don't find the issue today, this will have
to wait until January though.
…On Sun, 22 Dec 2019 at 23:37, songyy14 ***@***.***> wrote:
OK, I will run this experiment later today myself.
Any updates? I also miss the same problem
—
You are receiving this because you commented.
Reply to this email directly, view it on GitHub
<#4?email_source=notifications&email_token=AAE7YYRZM24EB5NJFLLWMZTQ2A6BPA5CNFSM4J6KSQ62YY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOEHQFXWQ#issuecomment-568351706>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AAE7YYTOLBLOOXBTJYMLRXDQ2A6BPANCNFSM4J6KSQ6Q>
.
|
While I am tinkering with my setup, could one of you try to run this experiment multiple (like 5) times, please? |
I could not reproduce your issue. I have just trained 10 models, and they all worked fine. Can you please try running the experiment many times and tell me if the issue occurs all the time, or some of the time? |
|
Still,I obtain the lower performance as I stated in the question. Maybe I need some detailed information about your training. Here my setup is CUDA10.1 and torch 1.3.1 |
Before running your MAC model,I utilize Resnet101 to extract features from Clevr dataset and convert them to . h5 file. Also I preprocess the questions. Is my way correct? |
Hi rizar! It seems that the clevr dataset is different from your sqoop dataset. Can you give me some instructions? |
I have trained the MAC model in clevr dataset for more than 10 times. All the results are similar with what I mentioned in my question. I believe that you changed something in training but I did not! Need help |
I am sorry to hear the code doesn't work for you. For now all I can do is to give an extra info w.r.t the environment. I run the code in a Docker image that is based on "nvidia/cuda:9.1-cudnn7-devel-ubuntu16.04". I build the conda environment in the image. Here is the the output of
I can give you more info on Monday. |
As I mentioned in this issue,An error occured: |
I have looked at both |
As the image shows, the training accuracy is 0.7 and the val accuracy is 0.549. I think both of the two accuracies are much lower than the MAC network in https://github.com/stanfordnlp/mac-network. Any instructions?
The text was updated successfully, but these errors were encountered: