grad_norm is too large #55

Open

opened

on Dec 11, 2025

When I fine-tune on my own dataset, grad_norm becomes too large, reaching up to 1e6. Have any of you encountered this situation?

Metadata

Assignees

No one assigned

Labels

No labels

No labels

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests