Index-1.9B-Chat Lora 微调合并模型出现ValueError: Can't find 'adapter_config.json' at './output/Index-1.9B-Chat-lora/checkpoint-600' #175

tangyipeng100 · 2024-06-19T02:18:57Z

采用微调脚本微调出现错误：
ValueError: Can't find 'adapter_config.json' at './output/Index-1.9B-Chat-lora/checkpoint-600'

KMnO4-zx · 2024-06-19T15:50:38Z

是路径错误，还是没有训练到600step啊

lisherlock · 2024-06-23T02:05:36Z

我Lora微调Qwen2，加载时也遇到这个问题，改了transformer和peft的版本也都没有adapter_config.json，头疼

EEE1even · 2024-06-24T08:11:05Z

我也有同样的问题

lisherlock · 2024-06-24T08:15:28Z

我也有同样的问题

我目前解决了，官方例程里的部分改成下面：

加载tokenizer

tokenizer = AutoTokenizer.from_pretrained(model_path, trust_remote_code=True)

加载模型

model = AutoModelForCausalLM.from_pretrained(lora_path, device_map="auto",torch_dtype=torch.bfloat16, trust_remote_code=True).eval()

应该是高版本的transformers和peft在lora时直接合并了，不需要再单独加载lora的checkpoint。

lisherlock · 2024-06-24T08:33:28Z

我也有同样的问题

我目前解决了，官方例程里的部分改成下面：

加载tokenizer

tokenizer = AutoTokenizer.from_pretrained(model_path, trust_remote_code=True)

加载模型

model = AutoModelForCausalLM.from_pretrained(lora_path, device_map="auto",torch_dtype=torch.bfloat16, trust_remote_code=True).eval()
应该是高版本的transformers和peft在lora时直接合并了，不需要再单独加载lora的checkpoint。

我刚刚测试了可以跑通，但是这样合并岂不是污染了base Model

我理解应该是把Lora的部分直接合并，base model参数不变，只是加载方式不同。我这边fine-tune之后，用这种加载方式确实有效能提升说明应该有用。看其他有的github的解决方案类似，可能Qianwen2的训练版本比较高，所以降低版本的方法不大行。

EEE1even · 2024-06-24T08:34:08Z

我也有同样的问题

我目前解决了，官方例程里的部分改成下面：

加载tokenizer

tokenizer = AutoTokenizer.from_pretrained(model_path, trust_remote_code=True)

加载模型

model = AutoModelForCausalLM.from_pretrained(lora_path, device_map="auto",torch_dtype=torch.bfloat16, trust_remote_code=True).eval()
应该是高版本的transformers和peft在lora时直接合并了，不需要再单独加载lora的checkpoint。

我刚刚测试了可以跑通，但是这样合并岂不是污染了base Model

我理解应该是把Lora的部分直接合并，base model参数不变，只是加载方式不同。我这边fine-tune之后，用这种加载方式确实有效能提升说明应该有用。看其他有的github的解决方案类似，可能Qianwen2的训练版本比较高，所以降低版本的方法不大行。

理解了，感谢老哥回复答疑

lisherlock · 2024-06-24T08:34:33Z

我也有同样的问题

我目前解决了，官方例程里的部分改成下面：

加载tokenizer

tokenizer = AutoTokenizer.from_pretrained(model_path, trust_remote_code=True)

加载模型

model = AutoModelForCausalLM.from_pretrained(lora_path, device_map="auto",torch_dtype=torch.bfloat16, trust_remote_code=True).eval()
应该是高版本的transformers和peft在lora时直接合并了，不需要再单独加载lora的checkpoint。

我刚刚测试了可以跑通，但是这样合并岂不是污染了base Model

我理解应该是把Lora的部分直接合并，base model参数不变，只是加载方式不同。我这边fine-tune之后，用这种加载方式确实有效能提升说明应该有用。看其他有的github的解决方案类似，可能Qianwen2的训练版本比较高，所以降低版本的方法不大行。

理解了，感谢老哥回复答疑

客气啦

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Index-1.9B-Chat Lora 微调合并模型出现ValueError: Can't find 'adapter_config.json' at './output/Index-1.9B-Chat-lora/checkpoint-600' #175

Index-1.9B-Chat Lora 微调合并模型出现ValueError: Can't find 'adapter_config.json' at './output/Index-1.9B-Chat-lora/checkpoint-600' #175

tangyipeng100 commented Jun 19, 2024

KMnO4-zx commented Jun 19, 2024

lisherlock commented Jun 23, 2024

EEE1even commented Jun 24, 2024

lisherlock commented Jun 24, 2024

lisherlock commented Jun 24, 2024

加载tokenizer

加载模型

EEE1even commented Jun 24, 2024

加载tokenizer

加载模型

lisherlock commented Jun 24, 2024

加载tokenizer

加载模型

Index-1.9B-Chat Lora 微调合并模型出现ValueError: Can't find 'adapter_config.json' at './output/Index-1.9B-Chat-lora/checkpoint-600' #175

Index-1.9B-Chat Lora 微调合并模型出现ValueError: Can't find 'adapter_config.json' at './output/Index-1.9B-Chat-lora/checkpoint-600' #175

Comments

tangyipeng100 commented Jun 19, 2024

KMnO4-zx commented Jun 19, 2024

lisherlock commented Jun 23, 2024

EEE1even commented Jun 24, 2024

lisherlock commented Jun 24, 2024

加载tokenizer

加载模型

lisherlock commented Jun 24, 2024

加载tokenizer

加载模型

EEE1even commented Jun 24, 2024

加载tokenizer

加载模型

lisherlock commented Jun 24, 2024

加载tokenizer

加载模型