- Популярные видео
- Авто
- Видео-блоги
- ДТП, аварии
- Для маленьких
- Еда, напитки
- Животные
- Закон и право
- Знаменитости
- Игры
- Искусство
- Комедии
- Красота, мода
- Кулинария, рецепты
- Люди
- Мото
- Музыка
- Мультфильмы
- Наука, технологии
- Новости
- Образование
- Политика
- Праздники
- Приколы
- Природа
- Происшествия
- Путешествия
- Развлечения
- Ржач
- Семья
- Сериалы
- Спорт
- Стиль жизни
- ТВ передачи
- Танцы
- Технологии
- Товары
- Ужасы
- Фильмы
- Шоу-бизнес
- Юмор
Everyone Is Using RAG Wrong
RAG became the default AI architecture before people even understood when to use it.
Here’s the problem:
Most AI apps don’t actually need retrieval systems.
If your entire knowledge base fits inside the model’s context window, you can skip vector databases completely and inject the data directly into the prompt.
Example:
A startup building an HR chatbot with 25 pages of company policy usually doesn’t need embeddings, chunking, retrieval pipelines, and reranking. The model can read the whole thing directly.
RAG makes sense when:
- your data changes frequently
- your knowledge base becomes very large
- your product needs citations or external retrieval
Otherwise, you may just be adding cost and latency.
Comment "RAG" and I’ll send you the full video explaining how RAG actually works.
Have a product idea in mind?
We build and launch AI product MVPs in 15 days. 4+ years of experience, 30+ projects shipped — websites, mobile apps, AI agents, and SaaS tools. Our team handles the full build: product thinking, design, architecture, development, and launch.
Contact: https://thesquirrel.tech
#RAG #VectorSearch #LLMArchitecture #PromptEngineering #AIInfrastructure #Embeddings #RetrievalAugmentedGeneration
Видео Everyone Is Using RAG Wrong канала Ganesh Ghatti AI
Here’s the problem:
Most AI apps don’t actually need retrieval systems.
If your entire knowledge base fits inside the model’s context window, you can skip vector databases completely and inject the data directly into the prompt.
Example:
A startup building an HR chatbot with 25 pages of company policy usually doesn’t need embeddings, chunking, retrieval pipelines, and reranking. The model can read the whole thing directly.
RAG makes sense when:
- your data changes frequently
- your knowledge base becomes very large
- your product needs citations or external retrieval
Otherwise, you may just be adding cost and latency.
Comment "RAG" and I’ll send you the full video explaining how RAG actually works.
Have a product idea in mind?
We build and launch AI product MVPs in 15 days. 4+ years of experience, 30+ projects shipped — websites, mobile apps, AI agents, and SaaS tools. Our team handles the full build: product thinking, design, architecture, development, and launch.
Contact: https://thesquirrel.tech
#RAG #VectorSearch #LLMArchitecture #PromptEngineering #AIInfrastructure #Embeddings #RetrievalAugmentedGeneration
Видео Everyone Is Using RAG Wrong канала Ganesh Ghatti AI
Комментарии отсутствуют
Информация о видео
18 мая 2026 г. 21:20:04
00:00:46
Другие видео канала
