patch sft_trainer to favor max_seq_length over max_length in config #2669

mmathew23 · 2025-06-01T17:19:32Z

newest trl has changed from max_seq_length to max_length. In order to maintain old behavior unsloth will favor max_seq_length if it's in the args or if it's set inside the model for sft trainer only.

tested with qwen3:
trl==0.15.2 takes 26 mins
https://colab.research.google.com/drive/1A9DJ6SYsgYWtDciL2wvw-A-1ja238gs1?usp=sharing

upgrade to trl==0.18.1 changes the run to take 13 mins.
https://colab.research.google.com/drive/1IYGn6psJyfhZz-Zabg0IrrhUOBnRMtpR?usp=sharing

Issue is that max_length is defaulted to 1024 and unsloth prepare sft dataset takes this as preference. I added a check to the trl patches for sft_trainer file, and add explicit handling of max_length and max_seq_length. Now the notebook takes the expected ~20ish minutes.
https://colab.research.google.com/drive/1A6uj-VZsRPvPBLNy0ySPyC1kyV07Xeud?usp=sharing

You can also inspect the compiled files to see that it's only sfttrainer that gets impacted.

patch sft_trainer to favor max_seq_length over max_length in config

cbc31fb

danielhanchen merged commit 0c21999 into unslothai:main Jun 3, 2025

ebinan92 mentioned this pull request Jun 6, 2025

ZeroDivisionError: Unsloth: All labels in your dataset are -100. Training losses will be all 0 (Phi3.5-mini and Phi4-mini) #2364

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

patch sft_trainer to favor max_seq_length over max_length in config #2669

patch sft_trainer to favor max_seq_length over max_length in config #2669

Uh oh!

mmathew23 commented Jun 1, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

patch sft_trainer to favor max_seq_length over max_length in config #2669

patch sft_trainer to favor max_seq_length over max_length in config #2669

Uh oh!

Conversation

mmathew23 commented Jun 1, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants