#Fine-Tuning | AI Hub

GitHub Trending repo Jun 14

MarcosCarbajoEchalecu/Local-ai-image-pipeline: Entorno local RTX 5090 con ComfyUI y Stable Diffusion para generación de contenido visual fotorrealista · ControlNet · IP-Adapter · LoRAs · Florence-2

Entorno local RTX 5090 con ComfyUI y Stable Diffusion para generación de contenido visual fotorrealista · ControlNet · IP-Adapter · LoRAs · Florence-2

Stability AI Fine-Tuning

35

Dev.to tutorial Jun 12

PEFT Explained: How to Fine-Tune LLMs Without Retraining Billions of Parameters

Hello, I'm Shrijith Venkatramana. I'm building git-lrc, an AI code reviewer that runs on every...

Fine-Tuning

12

GitHub Trending repo Jun 11

Goekdeniz-Guelmez/MLX-LoRA-Studio: A native Mac App for LLM fine-tuning on Apple Silicon — fully on-device, fully open source.

A native Mac App for LLM fine-tuning on Apple Silicon — fully on-device, fully open source.

LLM Fine-Tuning Open Source

55

Papers with Code paper Jun 11

The Hidden Power of Scaling Factor in LoRA Optimization

In Low-Rank Adaptation (LoRA), the scaling factor α is often treated as a mere complement to the learning rate, yet its role in optimization remains poorly understood. In this pape...

Fine-Tuning

21

Mastodon discussion Jun 10

Anthropic confirms Claude Opus 5 embeds invisible safeguards — prompt modification, steering vectors, PEFT — specificall...

Anthropic confirms Claude Opus 5 embeds invisible safeguards — prompt modification, steering vectors, PEFT — specifically to limit its usefulness for training frontier LLMs. A tech...

Anthropic Fine-Tuning

9

GitHub Trending repo Jun 9

nagara214/SSR-Merge: [ICML 2026] SSR-Merge: Subspace Signal Routing for Training-Free LoRA Merging in Diffusion Models

[ICML 2026] SSR-Merge: Subspace Signal Routing for Training-Free LoRA Merging in Diffusion Models

Fine-Tuning

35

Mastodon discussion Jun 5

This adapter adds Android Auto to most GM EVs, but there’s a catchThere isn't a subscription, but there's a high upfront...

This adapter adds Android Auto to most GM EVs, but there’s a catchThere isn't a subscription, but there's a high upfront cost and no guarantee it'll work forever.https://www.androi...

Google Fine-Tuning

9

Mastodon discussion Jun 4

Cooler Master shows off new MWE Gold V4 Power supplies and GPU Shield adapter — per-pin monitoring can dynamically scale...

Cooler Master shows off new MWE Gold V4 Power supplies and GPU Shield adapter — per-pin monitoring can dynamically scale down power to stop cables meltingCooler Master has new powe...

Google Fine-Tuning AI Hardware

9

GitHub Trending repo Jun 4

3xela/ideogram-lora: ideogram lora pipeline trainer.

ideogram lora pipeline trainer.

Fine-Tuning

35

Papers with Code paper Jun 4

DRIFT: A Residual Flow Adapter for Decoding Continuous Outputs in Vision-Language Models

Many modern vision-language models (VLMs) build on autoregressive decoding of discrete tokens. While text-based output interfaces enable scalable pretraining and strong zero-shot g...

Multimodal Fine-Tuning

21

Mastodon discussion Jun 3

Zespół Trajectory i UC Berkeley prezentują architekturę C-LoRA, która dzięki równoległemu przetwarzaniu adapterów pozwal...

Zespół Trajectory i UC Berkeley prezentują architekturę C-LoRA, która dzięki równoległemu przetwarzaniu adapterów pozwala skrócić cykl uczenia modeli AI o blisko 300 procent. #si #...

Fine-Tuning

18

Papers with Code paper Jun 2

Training-Free Multi-Concept LoRA Composition with Prompt-Aware Weighting

Low-Rank Adaptation (LoRA) successfully enables personalization in text-to-image generation by adapting pre-trained diffusion models to specific visual concepts and styles. However...

Fine-Tuning

21

Dev.to tutorial Jun 1

qwen2.5-lora-finetuning-colab

This guide walks through the complete process of fine-tuning Qwen2.5-3B-Instruct — a 3-billion...

Fine-Tuning

12

GitHub Trending repo Jun 1

wallnavigatorhook/fine-tuning-llm-lora-qlora-unsloth: Fine-tuning LLM — lora, qlora, unsloth, fine tune tutorial.

Fine-tuning LLM — lora, qlora, unsloth, fine tune tutorial.

LLM Fine-Tuning

51

Papers with Code paper Jun 1

On the Scaling of PEFT: Towards Million Personal Models of Trillion Parameters

Parameter-efficient fine-tuning (PEFT) is usually treated as a cheaper alternative to full fine-tuning. We study a broader role: small trainable adapters as persistent local state ...

Fine-Tuning

21

Papers with Code paper Jun 1

LayerRoute: Input-Conditioned Adaptive Layer Skipping via LoRA Fine-Tuning for Agentic Language Models

Agentic language model systems alternate between two structurally distinct step types: structured tool calls (short, deterministic, low perplexity) and open-ended planning/reasonin...

Fine-Tuning Agents

21

Mastodon discussion May 31

Trajectory has released a concurrent multi-LoRA training stack for continual learning, built with UC Berkeley Sky Lab an...

Trajectory has released a concurrent multi-LoRA training stack for continual learning, built with UC Berkeley Sky Lab and Anyscale. Each experiment maps to a dedicated LoRA adapter...

Fine-Tuning

9

Mastodon discussion May 28

Platforma Krea uruchamia zaawansowany system trenowania modeli LoRA, pozwalający tworzyć powtarzalne style i postaci z p...

Platforma Krea uruchamia zaawansowany system trenowania modeli LoRA, pozwalający tworzyć powtarzalne style i postaci z precyzją dostępną dotąd tylko w rozwiązaniach open-source. #s...

Fine-Tuning

18

Mastodon discussion May 28

Cerebras (@cerebras)Cerebras Inference에서 Multi-LoRA가 비공개 프리뷰로 제공된다. 하나의 베이스 모델 위에 여러 LoRA 어댑터를 올려 요청마다 전환할 수 있으며, 재로딩이나 ...

Cerebras (@cerebras)Cerebras Inference에서 Multi-LoRA가 비공개 프리뷰로 제공된다. 하나의 베이스 모델 위에 여러 LoRA 어댑터를 올려 요청마다 전환할 수 있으며, 재로딩이나 별도 배포 없이 지연시간 증가도 없다고 한다. 여러 도메인·고객별 어댑터를 운영하는 추론 인프라에 실용적인 ...

Fine-Tuning

18

Papers with Code paper May 28

How LoRA Remembers? A Parametric Memory Law for LLM Finetuning

Large Language Models (LLMs) must continuously learn and update knowledge to remain effective in dynamic real-world environments. While Low-Rank Adaptation (LoRA) is widely used fo...

LLM Fine-Tuning

21

Papers with Code paper May 28

Token-Level Generalization in LoRA Adapter Backdoors: Attack Characterization and Behavioral Detection

We show that LoRA adapters, the dominant distribution format for fine-tuned LLMs, can be reliably backdoored through training data poisoning while preserving baseline task performa...

Fine-Tuning

21

Papers with Code paper May 28

MAAT: Multi-phase Adapter-Aware Targeted Unlearning

Machine unlearning evaluation is structurally skewed: Why-type questions, which probe causal and relational knowledge, comprise less than 0.06% of CounterFact, 0.6% of ZSRE, and le...

Fine-Tuning

21

GitHub Trending repo May 27

QuickCricketCherish/Stable-Diffusion-WebUI-Portable-Full-Pack: Stable Diffusion WebUI Portable with model pack, ControlNet, LoRA library, and extensions—full local AI art studio unlocked.

Stable Diffusion WebUI Portable with model pack, ControlNet, LoRA library, and extensions—full local AI art studio unlocked.

Stability AI Fine-Tuning

53

Mastodon discussion May 27

Ingi Erlingsson (@ingi_erlingsson)ComfyUI로 제작한 멀티툴 워크플로를 공유. LTX Model 2.3, FLW LoRA, Alibaba Wan 2.2 I2V/T2V, Florence-...

Ingi Erlingsson (@ingi_erlingsson)ComfyUI로 제작한 멀티툴 워크플로를 공유. LTX Model 2.3, FLW LoRA, Alibaba Wan 2.2 I2V/T2V, Florence-2, Meta Sapiens2, OpenAI GPT Image 2.0, Suno, Adobe AE 등을 조합...

OpenAI Fine-Tuning

18