How to set the model's inference parameters, such as top_k? How to set the generation config of the model, like the temperature. #329
-
For fair evaluation, the generation configs (https://huggingface.co/docs/transformers/main_classes/text_generation#transformers.GenerationConfig) should be the same. I tried to find the setting, but setting it directly in the model configuration fails. I noticed that there is a huggingface.py file at opencompass/models, in which the generation function is called with some kwargs, but the specific parameters are unknown.
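For reference, the linked `GenerationConfig` API can be driven directly in plain transformers code. A minimal sketch (the gpt2 checkpoint and prompt are just placeholders):

```python
from transformers import AutoModelForCausalLM, AutoTokenizer, GenerationConfig

# Any causal LM checkpoint works here; gpt2 is only a small placeholder.
tokenizer = AutoTokenizer.from_pretrained("gpt2")
model = AutoModelForCausalLM.from_pretrained("gpt2")

# Explicit generation config: sampling must be enabled (do_sample=True)
# for temperature/top_p/top_k to have any effect.
gen_cfg = GenerationConfig(do_sample=True, temperature=0.7, top_p=0.85, top_k=40)

inputs = tokenizer("The capital of France is", return_tensors="pt")
output_ids = model.generate(**inputs, generation_config=gen_cfg, max_new_tokens=20)
print(tokenizer.decode(output_ids[0], skip_special_tokens=True))
```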
-
Note that by default HF uses greedy decoding if no extra kwargs are passed to the generation function, so the evaluation should be fair. In case you do want to customize the generation parameters, you may modify the `HuggingFace` class and add `generation_kwargs` to its `__init__` function, which saves the kwargs on construction and then uses them in `generate()`. With such a modification, you can set these parameters in the model configuration the same way `model_kwargs` or `tokenizer_kwargs` are set.
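A minimal standalone sketch of that idea (this is not the actual OpenCompass class; the wrapper name and `generate()` signature here are hypothetical):

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

class HuggingFaceWithGenKwargs:
    """Hypothetical wrapper: save generation kwargs on construction,
    then reuse them on every generate() call."""

    def __init__(self, path, generation_kwargs=None, **model_kwargs):
        self.tokenizer = AutoTokenizer.from_pretrained(path)
        self.model = AutoModelForCausalLM.from_pretrained(path, **model_kwargs)
        self.generation_kwargs = generation_kwargs or {}  # saved once here

    def generate(self, prompt, max_out_len=512):
        inputs = self.tokenizer(prompt, return_tensors="pt").to(self.model.device)
        output_ids = self.model.generate(
            **inputs,
            max_new_tokens=max_out_len,
            **self.generation_kwargs,  # e.g. do_sample=True, temperature=0.7
        )
        return self.tokenizer.decode(output_ids[0], skip_special_tokens=True)
```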
-
For now I simply add the generation configs into `model_kwargs`. I don't know whether this is right. The args in `model_kwargs` are used this way (from the `_load_model` function in the `HuggingFaceCausalLM` class): `self.model = AutoModelForCausalLM.from_pretrained(path, **model_kwargs)`. HuggingFace would then set the generation config automatically; at least no errors are raised this way.
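One way to check whether this actually works: kwargs that `from_pretrained` does not consume are typically forwarded to the model config, which is why no error is raised; whether they reach decoding is another matter. A quick verification sketch, assuming a recent transformers release (the path is the checkpoint from the config in the next reply):

```python
from transformers import AutoModelForCausalLM

# Same checkpoint as in the config below; substitute your own path.
path = "/data/share_user/cls/models/Baichuan-13B-Base"

model = AutoModelForCausalLM.from_pretrained(
    path,
    trust_remote_code=True,
    temperature=0.7,
    top_p=0.85,
    top_k=40,
)

# If from_pretrained absorbed the extra kwargs into the model config,
# they show up here; otherwise generate() never sees them. Note that
# greedy decoding ignores temperature/top_p/top_k unless do_sample=True.
print(model.generation_config)
```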
-
The config setting is below. I wonder whether it actually sets the generation config as I expect (at least it doesn't raise any error):

```python
models = [
    dict(
        type=HuggingFaceCausalLM,
        abbr='baichuan-13b-base',
        path="/data/share_user/cls/models/Baichuan-13B-Base",
        tokenizer_path='/data/share_user/cls/models/Baichuan-13B-Base',
        tokenizer_kwargs=dict(padding_side='left',
                              truncation_side='left',
                              trust_remote_code=True,
                              use_fast=False),
        max_out_len=512,
        max_seq_len=2048,
        batch_size=1,
        model_kwargs=dict(device_map='auto',
                          trust_remote_code=True,
                          revision='77d74f449c4b2882eac9d061b5a0c4b7c1936898',
                          temperature=0.7,
                          top_p=0.85,
                          top_k=40,
                          num_beams=1,
                          repetition_penalty=1.2),
        run_cfg=dict(num_gpus=1),
    )
]
```
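If the `HuggingFace` class were modified as suggested in the first reply, the same intent could be expressed with a separate `generation_kwargs` entry instead of overloading `model_kwargs` (note this field is hypothetical until that modification exists):

```python
models = [
    dict(
        type=HuggingFaceCausalLM,
        abbr='baichuan-13b-base',
        path="/data/share_user/cls/models/Baichuan-13B-Base",
        tokenizer_path='/data/share_user/cls/models/Baichuan-13B-Base',
        tokenizer_kwargs=dict(padding_side='left',
                              truncation_side='left',
                              trust_remote_code=True,
                              use_fast=False),
        max_out_len=512,
        max_seq_len=2048,
        batch_size=1,
        # Model loading only; no sampling parameters here.
        model_kwargs=dict(device_map='auto',
                          trust_remote_code=True,
                          revision='77d74f449c4b2882eac9d061b5a0c4b7c1936898'),
        # Hypothetical field: requires the __init__/generate() change above.
        # do_sample=True is needed for temperature/top_p/top_k to take effect.
        generation_kwargs=dict(do_sample=True,
                               temperature=0.7,
                               top_p=0.85,
                               top_k=40,
                               num_beams=1,
                               repetition_penalty=1.2),
        run_cfg=dict(num_gpus=1),
    )
]
```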
-
The models perform poorly with the default generation settings. If these settings could be customized, it would be enlightening for how to better use the models.