About Llama-2 7B training #155

sunnywyang · 2024-11-05T07:38:35Z

Hello, contributors. When I was training the draft model of Llama-2 7B, I used 10,000 rounds of dialogue and did not use deepspeed for training. The loss at the beginning was more than 1400, and after four rounds, the loss was 1100. Is this normal?

sunnywyang · 2024-11-05T08:09:15Z

But the accuracy rate is still high, at 65 percent

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

About Llama-2 7B training #155

About Llama-2 7B training #155

sunnywyang commented Nov 5, 2024

sunnywyang commented Nov 5, 2024

About Llama-2 7B training #155

About Llama-2 7B training #155

Comments

sunnywyang commented Nov 5, 2024

sunnywyang commented Nov 5, 2024