[IMP] Switch to GPU for add_id by default and make that setting configurable
#473
Labels
enhancement
New feature or request
I tried to add IDs to 983 GB of data for a workflow, and it took over an hour.
I now realize that we hard-code backend="pandas" here, which is what causes the slowdown:
NeMo-Curator/nemo_curator/scripts/add_id.py
Lines 37 to 44 in 2d7e857
We should default to the GPU backend and make the backend configurable.
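One way this could look is sketched below. This is not the current code in add_id.py: the `--backend` flag name, the `build_parser` and `resolve_backend` helpers, and the use of "cudf" as the name of the GPU backend are assumptions made for illustration. The idea is just to replace the hard-coded value with a command-line option that defaults to GPU.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Build a parser with a hypothetical --backend option (not the real script's parser)."""
    parser = argparse.ArgumentParser(description="Add IDs to a dataset (sketch).")
    parser.add_argument(
        "--backend",
        choices=["cudf", "pandas"],  # assumed names: "cudf" for GPU, "pandas" for CPU
        default="cudf",              # default to GPU, as proposed in this issue
        help="DataFrame backend used to read and process the data.",
    )
    return parser


def resolve_backend(argv=None) -> str:
    """Return the backend chosen on the command line, or the GPU default."""
    args = build_parser().parse_args(argv)
    return args.backend
```

The resolved value would then be passed wherever backend="pandas" is hard-coded today, so CPU-only users can still opt in with `--backend pandas`.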