#RAG | AI Hub

GitHub Trending repo Jun 23

Emmimal/context-graph-benchmark: A pure-Python structured memory benchmark for multi-agent LLM systems — context graph vs vector RAG vs raw history dump, five scenarios, 18 graded queries, zero API calls.

A pure-Python structured memory benchmark for multi-agent LLM systems — context graph vs vector RAG vs raw history dump, five scenarios, 18 graded queries, zero API calls.

LLM RAG Benchmark

45

Mastodon discussion Jun 23

🧠 Why most AI apps fail in production:1. Hallucinations not handled → RAG solves this2. No fallback when LLM is down → C...

🧠 Why most AI apps fail in production:1. Hallucinations not handled → RAG solves this2. No fallback when LLM is down → Circuit breakers3. Costs explode at scale → Cache aggressivel...

LLM RAG

9

Mastodon discussion Jun 23

Где заканчивается вызов LLM и начинается backend система: локальный RAG на FastAPI и OllamaХотел разобраться где заканчи...

Где заканчивается вызов LLM и начинается backend система: локальный RAG на FastAPI и OllamaХотел разобраться где заканчивается простой вызов локальной LLM и начинается backend сист...

LLM RAG

18

Dev.to tutorial Jun 22

RAG Systems with Claude: From Documentation to Production

Meta: Build production-grade RAG systems using Claude and vector search. Step-by-step guide to...

Anthropic RAG

12

Dev.to tutorial Jun 22

I Built RAG From Scratch in Python to Understand It. Here's What I Learned.

Every RAG tutorial I read used LangChain or LlamaIndex and hid the interesting parts. So I built a 500-line RAG pipeline with no frameworks — just pypdf, ChromaDB, and Ollama. The ...

RAG

12

Dev.to tutorial Jun 22

Build a Local RAG Chatbot in 30 Minutes with .NET 8, Ollama, and React

I built a PDF question-answering chatbot in .NET 8 + React that runs entirely on my laptop — no API keys, no cloud, no monthly bill. The whole thing is 9 source files, ~400 lines, ...

RAG

12

Dev.to tutorial Jun 22

How to Evolve a Linear LangChain RAG Pipeline into a Stateful, Multi-Agent Consensus Architecture

We’ve all built the classic, straight-line RAG pipeline: chunk a document, toss it into a vector...

RAG

12

Dev.to tutorial Jun 22

Why My RAG App Kept Hallucinating (and How I Fixed It)

A few months ago I was demoing my RAG-powered support bot to a colleague, feeling pretty confident...

RAG

33

Dev.to tutorial Jun 22

Agentic RAG: Designing Self-Correcting Retrieval Loops for Production

Standard RAG retrieves once and hopes for the best. Agentic RAG retrieves, reflects, decides it was...

RAG Agents

20

Dev.to tutorial Jun 22

Manticore Search 27.1.5: Authentication, sharded tables, conversational search and faster vector search

Manticore Search 27.1.5 has been released. This release brings built-in authentication and...

RAG

12

Dev.to tutorial Jun 22

Why Twio Chose Vertex AI Search over pgvector for Production RAG

When we first built RAG at Twio, pgvector was the obvious pick. Our business data was already in...

RAG

12

Papers with Code paper Jun 22

ChartWalker: Benchmarking the Cross-Chart RAG Task

Cross-Chart Retrieval-Augmented Generation (RAG) is critical for complex multi-modal analytical tasks in scientific, business, and political domains. However, existing benchmarks e...

RAG

21

Mastodon discussion Jun 22

RAG не только для вопросов и ответов: почему он естественно подходит для рекомендацийRetrieval-Augmented Generation (RAG...

RAG не только для вопросов и ответов: почему он естественно подходит для рекомендацийRetrieval-Augmented Generation (RAG) чаще всего рассматривается в контексте вопросно-ответных с...

RAG

9

Dev.to tutorial Jun 21

De-mystifying the GenAI Stack: From LLMs to RAG (A Systems Perspective)

As a backend engineer who has spent more than a decade designing distributed systems, asynchronous...

RAG

12

Mastodon discussion Jun 21

Learned #RAG and #Agents for using the #LLM Api with free #llmzoomcamp course.

LLM RAG API

18

Dev.to tutorial Jun 21

Building a RAG System With Chinese AI Models: Complete Tutorial

Building a RAG System With Chinese AI Models Retrieval-Augmented Generation (RAG) is the...

RAG

12

Dev.to tutorial Jun 21

# Vector Search and RAG: A Primer

A short learning path from a weekend project: I indexed my personal markdown notes (~800 chunks),...

RAG

12

Dev.to tutorial Jun 21

why a simple string match beat apple's nlembedding for local rag

Why a simple string match beat Apple's NLEmbedding for local RAG how apple's nlembedding drove me...

RAG

12

Mastodon discussion Jun 21

Build self-hosted AI systems with OpenClaw, Hermes, RAG, and local LLM infrastructure. Learn to orchestrate assistants w...

Build self-hosted AI systems with OpenClaw, Hermes, RAG, and local LLM infrastructure. Learn to orchestrate assistants with memory, retrieval, routing, and observability.#AI #LLM #...

LLM RAG

24

Dev.to tutorial Jun 21

Building RAG that doesn't hallucinate

Every RAG tutorial promises the same thing: hook a vector database up to an LLM, and suddenly your...

RAG

25

Dev.to tutorial Jun 21

The Hidden Layer Behind Every Smart AI App: RAG, MCP, and Agentic Systems

If you've spent any time with ChatGPT, Gemini, or Claude, you already know they're impressive. Ask...

OpenAI Anthropic Google

12

Mastodon discussion Jun 20

The Senior Go Engineer Interview Guide: AI Platform Engineering: Production-Grade Go, LLM Platforms, RAG, Vector Search,...

The Senior Go Engineer Interview Guide: AI Platform Engineering: Production-Grade Go, LLM Platforms, RAG, Vector Search, and Cloud Native Systems by Luca Sepe is a new release on L...

LLM RAG

24

Dev.to tutorial Jun 20

RAG Pipeline: Complete Node.js Implementation Guide

Build production RAG systems in Node.js - Know where it breaks, why it works, and when to use...

RAG

20

Dev.to tutorial Jun 20

RAG Pipeline: The Uncle-Nephew Complete Learning Guide

How to Build Systems That Actually Know Your Data (Not Hallucinate About It) ...

RAG

12