Fix ernie ci auto trainer error #9758

Open · wants to merge 12 commits into develop

Conversation

@blacksheep-Aristotle (Contributor) commented Jan 8, 2025

PR types

Bug fixes

PR changes

Others

Description

Fix the Ernie CI error bug.
Allow users to control param_init themselves and decide when the intermediate API is used.
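To make the intent concrete, here is a rough sketch (not the actual PaddleNLP code) of how user-controlled param init and opt-in use of the intermediate API could fit together, pieced together from the diff hunks reviewed below; parallelize_model is a placeholder name, not a real API:

# Illustrative sketch only, based on the hunks reviewed below.
def _prepare_model(self, model, training_args):
    # Apply the intermediate API only when the user opted in and the model
    # has not been parallelized yet.
    if training_args.use_intermediate_api and not parallelize.has_parallelized_model:
        model = parallelize_model(model, self.auto_dist_config)  # placeholder call

    # Defer param init: only materialize params that still carry an init
    # function and are not yet initialized.
    for param in model.parameters():
        if not param._is_initialized() and param._init_func is not None:
            param.initialize()
    return model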

paddle-bot bot commented Jan 8, 2025

Thanks for your contribution!

codecov bot commented Jan 8, 2025

Codecov Report

Attention: Patch coverage is 3.44828% with 28 lines in your changes missing coverage. Please review.

Project coverage is 52.37%. Comparing base (fb60645) to head (47691b9).
Report is 1 commit behind head on develop.

Files with missing lines                           Patch %   Lines
paddlenlp/trainer/auto_trainer.py                  0.00%     26 Missing ⚠️
paddlenlp/transformers/llama/modeling_auto.py      0.00%     1 Missing ⚠️
paddlenlp/transformers/llama/modeling_network.py   0.00%     1 Missing ⚠️
Additional details and impacted files
@@             Coverage Diff             @@
##           develop    #9758      +/-   ##
===========================================
- Coverage    52.70%   52.37%   -0.33%     
===========================================
  Files          731      727       -4     
  Lines       117313   115146    -2167     
===========================================
- Hits         61827    60306    -1521     
+ Misses       55486    54840     -646     

☔ View full report in Codecov by Sentry.


auto_dist_degree = {
    "tensor_parallel": training_args.tensor_parallel_degree > 1,
    "sequence_parallel": sequence_parallel,
@jeff41404 (Contributor) commented Jan 9, 2025
sequence_parallel has already been moved into training_args, so we can use "sequence_parallel": training_args.sequence_parallel directly; line 110 above is then no longer needed, right?
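A sketch of the suggested change (assuming training_args already exposes sequence_parallel, as the comment states):

auto_dist_degree = {
    "tensor_parallel": training_args.tensor_parallel_degree > 1,
    "sequence_parallel": training_args.sequence_parallel,  # read directly from training_args
}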

)
# NOTE(zhangwl): in pipeline mode a param may have been initialized earlier and had its init_func deleted, yet still report not _is_initialized, so both conditions are checked
if not param._is_initialized() and param._init_func is not None:
    param.initialize()
@jeff41404 (Contributor) commented Jan 9, 2025
If param._init_func is not None, should we call param._init_func() or model._init_weights(layer) instead?
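A sketch of the alternative being raised, i.e. running the stored init function (or the model-level _init_weights) rather than param.initialize(); whether these are equivalent depends on Paddle internals:

if not param._is_initialized() and param._init_func is not None:
    # Reviewer's first option: invoke the recorded init function directly.
    param._init_func()
    # Reviewer's second option, at the model level:
    # model._init_weights(layer)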

if (
    kwargs.get("args", None) is not None
    and kwargs["args"].use_intermediate_api
    and not parallelize.has_parallelized_model
@jeff41404 (Contributor) commented Jan 9, 2025
Move the judgment not parallelize.has_parallelized_model inside this branch; the condition here should only decide whether to use the basic API or the intermediate API.
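Something like the following restructuring seems to be what is meant (a sketch; surrounding code elided):

if kwargs.get("args", None) is not None and kwargs["args"].use_intermediate_api:
    # First decide between the basic API and the intermediate API, then
    # check whether the model has already been parallelized.
    if not parallelize.has_parallelized_model:
        ...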

    and kwargs["args"].use_intermediate_api
    and not parallelize.has_parallelized_model
):
    if self.auto_dist_config is not None:
Contributor commented:

We can check parallelize.has_parallelized_model here; if the model has already been parallelized, it must have an auto_dist_config.
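In other words, a sketch of the invariant being pointed out:

if parallelize.has_parallelized_model:
    # A model that has already been parallelized must carry an auto_dist_config.
    assert self.auto_dist_config is not None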

Labels: None yet
Projects: None yet
2 participants