Akshay (@akshay_pachaar)대규모 언어모델을 커스터마이즈할 때 알아두면 좋은 파인튜닝 기법 목록이 정리됐다. LoRA, QLoRA, Prefix Tuning, Adapter Tuning, Instruction Tuning, P-Tuning, BitFit, Soft Prompts, RLHF, RLAIF, DPO, GRPO 등이 포함된다.https://x.com/akshay_pachaar/status/2045125478391099858#llm #finetuning #lorA #rlhf #dpo
Related
Amazon Prime members can buy a car online now - and get a $1,500 gift cardAmazon is now partnering with local dealership...
Amazon Prime members can buy a car online now - and get a $1,500 gift cardAmazon is now partnering with local dealerships to help you buy, sell, or lease your car - and Prime membe...
Open AI, the company behind ChatGPT and destruction of our watersheds and breathable air, just sent out an email about t...
Open AI, the company behind ChatGPT and destruction of our watersheds and breathable air, just sent out an email about their privacy policy to users. They have probably about a doz...
RE: https://mastodon.social/@Sheril/116720919490343442Soudain, nous sommes devenus aux #Éloïs de "Machine à explorer le ...
RE: https://mastodon.social/@Sheril/116720919490343442Soudain, nous sommes devenus aux #Éloïs de "Machine à explorer le temps", de H. G. Wells, en ignorant des #Morlocks qui se cac...