-
Notifications
You must be signed in to change notification settings - Fork 173
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Actions are Empty Strings: Blocksworld+ToT with Llama3.1 #111
Comments
This can be reproduced by running step 4 with the default parameters provided, or running step 2 with increased depth. |
Hi! Did you use the instruct model or base model? |
I used the base model, should I be using the instruct model for the prompts? |
Sorry for late reply.. No, the prompt was written for base models. We will look into the problem. |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
I am trying to run ``examples/ToT/blocksworld/test_tot_v1_dfs.sh'' on step 2, step 4, step 6. However, I noticed that Llama 8B returns empty strings as actions.
example: ['unstack the yellow block from on top of the red block', 'unstack the blue block from on top of the yellow block', '', 'pick up the blue block']
Upon inspection I see that prompt that is being passed is `` the block'' and thats it. I believe `generate' function the input is being trimmed resulting in this issue. I also observed that increasing max_depth is causing more empty strings as actions.
Is there any easy fix to this? Maybe some hyperparameter that can be tuned/code fix?
The text was updated successfully, but these errors were encountered: