I'm curious about some more details regarding FIM and its effect on the pre-trained model.
Here's a paragraph from the SantaCoder paper:
FIM for cheap
We observe a minor drop in performance of the FIM model compared to the No-FIM model. Specifically, we see that the pass@100 performance of the FIM model is 2-4% lower on HumanEval and 1% lower on MBPP. While Bavarian et al. (2022) presented evidence for the existence of a FIM-for-free property (i.e., arguing that autoregressive models can be trained with FIM without harming left-to-right capabilities), we do find a small but consistent drop of FIM models on left-to-right text2code benchmarks.
Was a similar analysis carried out on StarCoder?
Was StarCoder pre-trained on a 50-50 split between FIM and next-token data? (as indicated in this Megatron script)
Hello, we didn't perform this ablation for StarCoder given the amount of compute it requires for training, but you can check the Code Llama paper, where the authors observed similar behavior at different scales.
Regarding the FIM percentage, we used 50%.
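For anyone unfamiliar with what a 50% FIM rate means in practice, here is a rough, character-level sketch in plain Python. The sentinel names and the per-character splitting are simplifications for illustration only; the actual Megatron preprocessing operates on token ids and uses the model's real special tokens.

```python
import random

# Placeholder sentinel strings for illustration; the real special tokens
# and token-level splitting live in the training/preprocessing code.
FIM_PREFIX, FIM_MIDDLE, FIM_SUFFIX = "<fim_prefix>", "<fim_middle>", "<fim_suffix>"

def maybe_apply_fim(sample: str, fim_rate: float = 0.5) -> str:
    """With probability fim_rate, rewrite a sample into prefix-suffix-middle
    (PSM) order; otherwise return it unchanged as plain left-to-right data."""
    if random.random() >= fim_rate or len(sample) < 2:
        return sample  # ordinary next-token (autoregressive) sample
    # Choose two random cut points, splitting the document into three spans.
    i, j = sorted(random.sample(range(len(sample) + 1), 2))
    prefix, middle, suffix = sample[:i], sample[i:j], sample[j:]
    # PSM layout: the model conditions on prefix and suffix, then predicts the middle.
    return f"{FIM_PREFIX}{prefix}{FIM_SUFFIX}{suffix}{FIM_MIDDLE}{middle}"

# Roughly half of the samples come out FIM-transformed; the rest stay as-is.
print(maybe_apply_fim("def add(a, b):\n    return a + b\n"))
```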
I have a question: since it's known that many evaluation scores drop when FIM is used during the pre-training stage, why did you still use FIM at a 50% rate?