Why it is necessary to use a ValueLearner? #6

xuruiyang · 2024-06-05T22:47:50Z

I just have a question regarding the necessity of ValueLearner. Given that we are training on the same offline dataset, why don't we just pick the Return directly from the dataset when computing the advantage? Why would it be beneficial to train another value model to predict the Return value?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Why it is necessary to use a ValueLearner? #6

Why it is necessary to use a ValueLearner? #6

xuruiyang commented Jun 5, 2024

Why it is necessary to use a ValueLearner? #6

Why it is necessary to use a ValueLearner? #6

Comments

xuruiyang commented Jun 5, 2024