Init on CPU with deferred init #29

jonb377 · 2023-08-10T00:56:48Z

It seems directly initializing onto the XLA device impacts the steady-state HLO and increases memory usage. This change will first initialize on CPU, then move the tensors to the XLA device.

alanwaketan

LGTM.

jonb377 · 2023-08-10T02:22:43Z

Verified with a run of 70B, we see an improvement on both memory utilization and MFU

Init on CPU with deferred init

7e1f6ee

jonb377 requested a review from alanwaketan August 10, 2023 00:56

jonb377 self-assigned this Aug 10, 2023

jonb377 requested a review from JackCaoG August 10, 2023 00:57

alanwaketan approved these changes Aug 10, 2023

View reviewed changes

jonb377 merged commit e169167 into llama2-google-next-training Aug 10, 2023

jonb377 deleted the jonbolin-cpu-init branch August 10, 2023 02:22

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Init on CPU with deferred init #29

Init on CPU with deferred init #29

Uh oh!

jonb377 commented Aug 10, 2023

Uh oh!

alanwaketan left a comment

Uh oh!

jonb377 commented Aug 10, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Init on CPU with deferred init #29

Init on CPU with deferred init #29

Uh oh!

Conversation

jonb377 commented Aug 10, 2023

Uh oh!

alanwaketan left a comment

Choose a reason for hiding this comment

Uh oh!

jonb377 commented Aug 10, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants