-
Notifications
You must be signed in to change notification settings - Fork 6
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Support maze dataset tokenizers update #214
Open
aaron-sandoval
wants to merge
58
commits into
main
Choose a base branch
from
update-maze-dataset-tokenizers-step2
base: main
Could not load branches
Branch not found: {{ refName }}
Loading
Could not load tags
Nothing to show
Loading
Are you sure you want to change the base?
Some commits from the old base branch may be removed from the timeline,
and old review comments may become outdated.
Open
Changes from 15 commits
Commits
Show all changes
58 commits
Select commit
Hold shift + click to select a range
0ff866e
Add check on <UNK> token for `maze-dataset` update
aaron-sandoval ac2c09c
Update dependencies, including `maze-dataset = "^1.0.0"`
aaron-sandoval a52baca
maze-dataset PR #37 moved token_utils.py and util.py to a different d…
aaron-sandoval 404e49f
Update mostly just type hints for `MazeTokenizerModular`. No updates …
aaron-sandoval 8424609
Updated unit tests to incorporate `MazeTokenizerModular`. Not run yet
aaron-sandoval 58b6ef5
made a comment
aaron-sandoval 283bdd0
wip making a make recipe to run tests with a user-provided branch of …
aaron-sandoval eab11f4
wip, bogged down in Windows vs Linux crap
aaron-sandoval f83e169
wip, still stuck
aaron-sandoval c0a48a2
m-d git branch environment specified in maze-dataset_test directory
aaron-sandoval d34e6f5
Environment was broken in subdirectory. Move it to the main environment
aaron-sandoval f4f9303
Merge branch 'update-maze-dataset-tokenizers-step2' into add-maze-dat…
aaron-sandoval 667c6ce
Merge pull request #216 from understanding-search/add-maze-dataset-br…
aaron-sandoval dc30b32
Merge branch 'update-maze-dataset-tokenizers-step2' of https://github…
aaron-sandoval 6711a7e
Small edits to get unit tests to collect
aaron-sandoval fb0b1da
Merge branch 'main' into update-maze-dataset-tokenizers-step2
mivanit 02537b8
bump maze-dataset
mivanit 1b39086
run format
mivanit ec748ef
fix imports, unit tests collect
mivanit 74df787
upstream mmtokenizer summary() fix
mivanit 34e9e74
?????????
mivanit 24ca2c4
run format
mivanit 7138677
legacy mt was loaded as mmt by mistake
mivanit 6c3cce2
re-run nb
mivanit 3c244b0
fix loading maze tokenizers
mivanit ced372d
update dep
mivanit af3f953
`test_tokenization_encoding` passing
aaron-sandoval e2f94ac
`test_tokenizer_inside_hooked_transformer` passing
aaron-sandoval 0ddfa23
`test_cfg_post_init` passing
aaron-sandoval fa825c0
Everything in `test_config_holder.py` passing
aaron-sandoval 2c1d19e
`test_random_baseline` passing. 2 zanj tests are the only ones still …
aaron-sandoval 6e585d3
format
aaron-sandoval be99a06
zanj save load tests with multiple tokenizers
mivanit 4157519
Merge branch 'main' into update-maze-dataset-tokenizers-step2
mivanit 9e7b888
poetry update
mivanit 3a589b1
fix failing model loading tests
mivanit 3956867
integration test where too many vocab elements caused argsort fail
mivanit 0650190
trained new demo model
mivanit abc3dcc
replaced demo model path in tests, chnaged notebook cfg to test
mivanit 8311eb8
move training tests to test_train_model.py
mivanit 0064898
trying to fix pytest hang issue by closing wandb run
mivanit 7cf67a7
return logger in TrainingResult object from train_model
mivanit fc42faa
update maze-dataset, new version should maybe fix wandb issues?
mivanit 104e004
re-run notebook to get new model with fixed keys
mivanit 3e254b2
changed cfg back to test in train nb, re-run
mivanit 9f15e7e
format
mivanit 4df4a24
update maze-dataset dep
mivanit 4b3e8ca
ok this bug is incomprehensible
mivanit 98a2083
fixed bug - passing configs passed by ref and modified
mivanit 85ee878
format
mivanit 1f46532
fix paths in notebooks
mivanit 3f795f7
remove old stuff from makefile
mivanit 096f6df
update dep
mivanit 97fba7b
update dep
mivanit 0d05191
update dep
mivanit 04039a4
update deps??
mivanit 31442ec
update dep
mivanit 16e60e4
update dep to maze-dataset 1.0.0
mivanit File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
It looks like
freeze
can only act on objects, not types. It raises an exception when it recursively callsfreeze(TRAIN_SAVE_FILES.__dict__)
, since it can't act on amappingproxy
.