GPU Training for 14b Models
I’m a researcher and for my research I’m training a 14B-parameter model. However my available compute resources are limited to a single NVIDIA H100 GPU with 95 GB of VRAM provided by my institution via SSH. How do you all manage situations like this when working with large models? Please share your thoughts and experiences.
submitted by /u/StatusArrival3382
[link] [comments]
Like
0
Liked
Liked