Mastodon discussion Discussions Apr 16 5 views

Title: P1: Distributed training [2023-09-15 Fri]- Huggingface/accelerate with DeepSpeed or Megatron-LM- FairScale by Met...

by anoncheg

Title: P1: Distributed training [2023-09-15 Fri]- Huggingface/accelerate with DeepSpeed or Megatron-LM- FairScale by Meta, facebook- Megatron-LM by Nvidia- DeepSpeed by Microsoft- Horovod Uber- Ray- ColossalAI- PyTorch Lightning- FFCV: Fast Forward Computer Vision⛧I have made distributed training of ResNet50 in FSDP, the new PyTorch distribute training approach allow to train modes that not fit to one GPU\n#nn #ai #neural #automl #tensorflow #tf #torch #pytorch #llama #llama2

Read Original

Hugging Face

Metadata

Reblogs Count: 1
Account: anoncheg@mastodon.in.th

Mastodon discussion 22m ago

Generative AI can run completely offline. By downloading a model's neural weights and using local hardware, you skip the...

Generative AI can run completely offline. By downloading a model's neural weights and using local hardware, you skip the cloud entirely for maximum privacy. #AI #TechEducation #Mac...

Mastodon discussion 24m ago

Adventures in #LLM. Like, am i missing something/doing something wrong? Is this really why all the rivers are being boil...

Adventures in #LLM. Like, am i missing something/doing something wrong? Is this really why all the rivers are being boiled? It burns this many tokens for "greetz bro"? (Qwen 3.5_2b...

Mastodon discussion 25m ago

@mrencyclopedia Nevertheless, using an LLM as glorified rubberducky does have some merit if you grill it for sources, ac...

@mrencyclopedia Nevertheless, using an LLM as glorified rubberducky does have some merit if you grill it for sources, actually use/read the sources and have it grill you back accor...

Title: P1: Distributed training [2023-09-15 Fri]- Huggingface/accelerate with DeepSpeed or Megatron-LM- FairScale by Met...

Metadata

Related

Generative AI can run completely offline. By downloading a model's neural weights and using local hardware, you skip the...

Adventures in #LLM. Like, am i missing something/doing something wrong? Is this really why all the rivers are being boil...

@mrencyclopedia Nevertheless, using an LLM as glorified rubberducky does have some merit if you grill it for sources, ac...