#RAG | AI Hub

Dev.to tutorial May 26

Master RAG Systems: Build an End-to-End LangChain Pipeline with Milvus, Reranking & Azure OpenAI 🚀

Beyond Basic RAG: Learn LangChain + RAG End-to-End 🚀 ...

OpenAI RAG

25

Mastodon discussion May 26

A naive RAG pipeline pulls 281 docs to deliver 25. Real backpressure pulls 40. How WorkIt paces the producer to consumer demand in Node.js & TypeScript. https://hackernoon.com/back...

RAG

18

Dev.to tutorial May 26

How I Built an AI-Powered Incident RCA Platform with LangGraph and RAG

It’s 2:13 AM. A payment API suddenly starts failing in production. Customers can’t complete...

RAG

12

GitHub Trending repo May 26

NeXra-AI/awesome-ai-image-prompts: Prompt-as-Code library — 955 curated prompts for GPT-Image-2, Nano Banana, Seedance and more. RAG-ready, multi-language (zh/en/ms).

RAG

35

Mastodon discussion May 26

🧠 What is RAG (Retrieval-Augmented Generation) and why it matters in 2026: RAG = LLM + Your Own Data. Instead of retraining models, you inject relevant context at query time. Result? ...

LLM RAG

18

Dev.to tutorial May 26

I built a local-first movie recommender with Corrective-RAG (cited explanations, hybrid retrieval, runs entirely on Ollama)

Hey — sharing a project I've been building for the last few months. It's a movie recommendation...

RAG

12

Mastodon discussion May 25

From RAG prototype to production agent: a path guided by metrics, not by fashion. This is Sergey Smirnov, AI engineer at LLMStart.ru. Today I'll walk through a full case study that we delivered for ...

RAG

9

Dev.to tutorial May 25

turbovec: Local RAG Without the 60 GB Tax

A 1536-dimensional float32 embedding is 6 KB. A corpus of 10 million documents is roughly 60 GB of...

RAG

12
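
The storage figures quoted in the turbovec teaser above can be checked with a few lines of arithmetic. This is a rough sketch that assumes one uncompressed float32 vector per document and ignores index and metadata overhead; the function names are illustrative and do not come from turbovec.

```python
# Back-of-the-envelope storage estimate for uncompressed float32 embeddings.
# Assumption: one vector per document, no index or metadata overhead.

DIMENSIONS = 1536            # embedding width quoted in the teaser
BYTES_PER_FLOAT32 = 4        # a float32 occupies 4 bytes
NUM_DOCUMENTS = 10_000_000   # corpus size quoted in the teaser


def embedding_bytes(dimensions: int, bytes_per_value: int = BYTES_PER_FLOAT32) -> int:
    """Size in bytes of a single dense embedding."""
    return dimensions * bytes_per_value


def corpus_bytes(num_documents: int, dimensions: int) -> int:
    """Size in bytes of all embeddings in the corpus."""
    return num_documents * embedding_bytes(dimensions)


per_vector = embedding_bytes(DIMENSIONS)
total = corpus_bytes(NUM_DOCUMENTS, DIMENSIONS)

print(f"{per_vector} bytes per vector ({per_vector / 1024:.0f} KiB)")
print(f"{total / 1e9:.2f} GB for {NUM_DOCUMENTS:,} documents")
```

Each vector comes to 6,144 bytes, and ten million of them come to about 61 GB, which matches the teaser's "roughly 60 GB".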

Dev.to tutorial May 25

Building a Local-Only RAG System with Ollama and TypeScript

Most RAG tutorials send your...

RAG

12

Dev.to tutorial May 25

RAG Explained: How Retrieval-Augmented Generation Actually Works

A visual walkthrough of RAG's two pipelines — ingestion and query — covering chunking, embeddings, vector databases, and why it beats sending all your text to an LLM.

RAG

12

Mastodon discussion May 25

RAG in the enterprise: why the demo works and production doesn't. Picture a typical meeting. Someone from management comes back from a conference, sits down across from you, and says: "They've got a bot over there...

OpenAI RAG

9

Mastodon discussion May 25

Building a Go-based CLI tool to export Confluence spaces to local Markdown files for RAG pipelines, offline docs, and Git-based knowledge management. https://hackernoon.com/meet-co...

RAG

18

Mastodon discussion May 24

Cost-efficiency is everything in production AI! I build custom agents with LangGraph + RAG — affordable alternatives to expensive platforms, starting $20 💼 #AI #LangChain

RAG

9

Mastodon discussion May 24

Most developers overcomplicate AI agents. My production stack 👇
🔀 LangGraph: agent flow control
🔍 RAG + Pinecone: searches your docs
🐍 FastMCP: runs Python code live
🧠 PostgreSQL: ...

RAG

24

Dev.to tutorial May 24

Building a RAG Document Q&A System with Hybrid Retrieval (No Embeddings API Needed)

Building a production-quality RAG (Retrieval-Augmented Generation) system taught me one thing: the...

RAG API

12

Mastodon discussion May 24

Most teams reach for fine-tuning when they should be using RAG. The confusion usually comes from one thing: people know what both are, but nobody gives a clear way to decide. Here's t...

RAG

24

Dev.to tutorial May 24

Four production pitfalls that turn RAG demos into broken chatbots

A perfect 50-question demo on Tuesday. By the second week of production, users were filing tickets faster than the team could close them. The model had not changed. The retrieval p...

RAG

12

Dev.to tutorial May 24

Multimodal RAG: when summary-based stops being enough

A founder asked why their AI assistant kept saying 'the chart shows a positive trend' instead of reading the actual numbers. The pipeline was doing exactly what it was designed to ...

Multimodal RAG

12

Mastodon discussion May 24

LMIM OS v2.1 "Tezcat · Sharpened" drops tomorrow. This isn't a patch. It's a different animal:
• RAG Lite: drop a PDF, ask anything about it
• Workspace sandbox: point LMIM at a fol...

RAG

18

Dev.to tutorial May 24

How We Built a Production RAG Chatbot for a Client in 72 Hours (Full Stack Breakdown)

A client messaged us on a Tuesday night. By Friday afternoon, their customer support chatbot was live...

RAG

12

Mastodon discussion May 23

Show HN: I built a RAG and knowledge graph agent that runs locally https://news.ycombinator.com/item?id=48248801 #HackerNews #Tech #AI

RAG

18

Dev.to tutorial May 23

Building a Private RAG System: Lessons from a Local-First AI Journal

Most AI apps quietly send your data to the cloud. DiaryGPT does the opposite — and this is the full...

RAG

12

GitHub Trending repo May 23

study8677/awesome-architecture: 🗺️ Think like a software architect, not just a coder: 21 architecture maps (incl. AI gateway, RAG, agents, inference serving, vector DB) + a language-agnostic system-design tutorial. Every template links to real open-source prototypes. Bilingual (Chinese and English).

RAG Open Source

62

Dev.to tutorial May 23

From Manual RAG to Real Retrieval — Embedding-Based RAG with NVIDIA NIM

Replace hardcoded context with real retrieval using NVIDIA's nv-embedqa-e5-v5 embedding model. Cosine similarity, the query vs passage input distinction most beginners get wrong, n...

NVIDIA RAG

20
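
The cosine-similarity ranking mentioned in the NVIDIA NIM teaser above can be sketched in plain Python. This does not call NVIDIA NIM or any embedding model; the two-dimensional vectors and passage names are made up for illustration, standing in for real query and passage embeddings.

```python
import math


def cosine_similarity(a: list[float], b: list[float]) -> float:
    """Cosine of the angle between two equal-length, non-zero vectors."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        raise ValueError("cosine similarity is undefined for zero vectors")
    return dot / (norm_a * norm_b)


# Toy stand-ins for embeddings (hypothetical values, not model output).
query_vector = [1.0, 0.0]
passage_vectors = {
    "passage_a": [2.0, 0.0],   # same direction as the query
    "passage_b": [0.0, 3.0],   # orthogonal to the query
}

# Rank passages from most to least similar to the query.
ranked = sorted(
    passage_vectors,
    key=lambda name: cosine_similarity(query_vector, passage_vectors[name]),
    reverse=True,
)
print(ranked)
```

Because cosine similarity depends only on direction, `passage_a` ranks first even though its vector is longer than the query's.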