2025-03-03 05:40:38 +00:00

torch
transformers
wandb
datasets
accelerate>=0.26.0
deepspeed
flash-attn
s3tokenizer