Multiple changes to VQA data prep #21

Merged
merged 3 commits into from
Oct 18, 2024

Conversation

finalelement
Collaborator

  • Fixed the broken links for PathVQA, SLAKE, and RadVQA. The GitHub dataset links originally given for PathVQA and SLAKE have been removed.
  • The original PathVQA dataset could not be found anywhere except at a Hugging Face link, which meant adding extra processing steps for the data. A new script that processes the parquet files has been added.
  • The README was also updated.
  • The instruction tuning files have been removed, since providing them would be a form of data redistribution.
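
For readers unfamiliar with the parquet step, here is a rough sketch of what such processing might look like. It is not the script added in this PR. The function names, the output file names, the `.jpg` extension, and the row layout (`image` holding a dict with a `bytes` entry, plus `question` and `answer` columns) are all assumptions made for illustration and should be checked against the actual parquet schema.

```python
import json
from pathlib import Path


def load_parquet_records(parquet_path):
    """Read one parquet shard into a list of row dicts (needs pandas and pyarrow)."""
    import pandas as pd  # imported lazily so export_records works without pandas

    return pd.read_parquet(parquet_path).to_dict("records")


def export_records(records, out_dir, prefix="pathvqa"):
    """Write each row's image bytes to disk and collect the QA pairs.

    Assumed (hypothetical) row layout:
    {"image": {"bytes": ...}, "question": str, "answer": str}
    """
    out_dir = Path(out_dir)
    image_dir = out_dir / "images"
    image_dir.mkdir(parents=True, exist_ok=True)

    qa_pairs = []
    for idx, row in enumerate(records):
        # The .jpg extension is an assumption; the bytes are written unchanged.
        image_name = f"{prefix}_{idx:06d}.jpg"
        (image_dir / image_name).write_bytes(row["image"]["bytes"])
        qa_pairs.append(
            {"image": image_name, "question": row["question"], "answer": row["answer"]}
        )

    # One JSON file listing every question/answer pair and its image file name.
    (out_dir / f"{prefix}_qa.json").write_text(json.dumps(qa_pairs, indent=2))
    return qa_pairs
```

A shard would then be handled with something like `export_records(load_parquet_records("train.parquet"), "pathvqa_out")`, where both paths are placeholders. Keeping the parquet read separate from the export makes the export step easy to check without the dataset on hand.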

Signed-off-by: Vishwesh Nath <[email protected]>
@mingxin-zheng
Collaborator

Thanks for the link fixes. Verified they're working now in #22

Signed-off-by: Vishwesh Nath <[email protected]>
Collaborator

@holgerroth holgerroth left a comment


Looks good!

@holgerroth holgerroth merged commit 7795abd into main Oct 18, 2024
2 checks passed
@mingxin-zheng mingxin-zheng deleted the vqa_fixes branch October 25, 2024 03:32