2025-02-28 02:08:05 +00:00

torch
transformers
wandb
datasets
accelerate>=0.26.0
deepspeed
flash-attn