Batch processing implementation for Roformer models: Seeking feedback on potential pull request #118
Replies: 4 comments
-
Thanks for thinking about this!
-
Thank you for your response. I agree that without a tangible benefit, a major code change might not be necessary. However, after further consideration, I think it might be helpful to add a comment in the code explaining why `batch_size` is not used. For example:

```python
def demix(self, mix: np.ndarray) -> dict:
    # ...
    if self.is_roformer:
        # Note: Currently, for Roformer models, `batch_size` is not utilized due to negligible performance improvements.
        # ...
```

This approach would improve documentation without changing the functionality. What do you think?
-
Sounds good to me!
-
Thanks! I'll submit a pull request then.
-
Hello,

I noticed that the `batch_size` parameter wasn't being utilized for Roformer models. To address this, I've implemented batch processing for these models. I'd like to share my findings and seek your feedback on whether this change would be beneficial to incorporate into the main repository.

Key points:

- The original code didn't use the `batch_size` parameter for Roformer models, resulting in no batch processing.
- I've implemented batch processing for Roformer models, allowing the `batch_size` parameter to be used effectively.
- After implementation, I confirmed that setting larger `batch_size` values increases GPU VRAM usage for Roformer models.
- Interestingly, I observed little to no improvement in processing speed, despite the batch processing implementation.
- The README mentions that the `batch_size` parameter "may process slightly faster" for other models, so this behavior might be expected.
- I've created a modified version of the `demix` function in the `MDXCSeparator` class to implement this change. You can find the updated code here: https://github.com/ntamotsu/fork_python-audio-separator/blob/e44834b4d05203b82c61fb8c6b06205edde26276/audio_separator/separator/architectures/mdxc_separator.py#L248-L282
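For context, the core idea of the change — grouping fixed-size audio chunks so the model runs on `batch_size` chunks per forward pass instead of one at a time — can be sketched roughly like this. This is only an illustration of the technique, not the actual code in `MDXCSeparator`; the names `demix_batched` and `model_fn` are hypothetical:

```python
import numpy as np

def demix_batched(model_fn, chunks, batch_size=4):
    """Run inference on pre-split audio chunks in groups of `batch_size`.

    `chunks` is a list of equally-shaped arrays, e.g. (channels, samples).
    `model_fn` stands in for the separation model's forward pass and is
    assumed to accept a (batch, channels, samples) array.
    """
    outputs = []
    for i in range(0, len(chunks), batch_size):
        # Stack up to `batch_size` chunks into one (B, C, T) batch,
        # so the model processes them in a single call.
        batch = np.stack(chunks[i:i + batch_size])
        outputs.append(model_fn(batch))
    # Re-flatten the per-batch results back into one array of chunks.
    return np.concatenate(outputs, axis=0)
```

Larger `batch_size` values increase peak memory (the whole batch is resident at once), which matches the VRAM growth I observed, while throughput gains depend on how well the model's kernels exploit the extra batch dimension.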
Given these observations, I'm wondering if this change would be valuable to include in the main repository. If the lack of batch processing for Roformer models was intentional, then this modification may not be necessary.
I'd greatly appreciate your thoughts on this matter. Should I proceed with creating a pull request for this change, or do you feel it's not needed at this time?
Thank you for your time and consideration.