Question 1

What is RAG development?

Accepted Answer

RAG development is building the pipeline that retrieves relevant passages from your own data and feeds them to a language model so its answers are grounded in your documents. It is the standard way to give an LLM access to private or up-to-date knowledge without retraining the model.

Question 2

Have you built RAG systems in production?

Accepted Answer

Yes. I engineered a production RAG chatbot using the OpenAI API and the Qdrant vector database for context-aware information retrieval, and built EigenTalk, a RAG-powered research assistant with a Next.js frontend for document ingestion and AI-powered retrieval.

Question 3

How do you stop a RAG chatbot from hallucinating?

Accepted Answer

Most hallucinations are a retrieval problem, not a model problem. The fixes are better chunking, stronger embeddings, reranking the retrieved passages, setting a confidence threshold so the system abstains when it has no good context, and requiring citations so answers stay tied to sources.

Question 4

Which vector database do you use?

Accepted Answer

It depends on the project. Qdrant is a strong default for filtered vector search; pgvector is ideal when the data already lives in PostgreSQL and the corpus is moderate in size. The choice follows the scale, the filtering needs, and the existing infrastructure.

RAG Development

What you get

How I approach it

Frequently asked questions

What is RAG development?

Have you built RAG systems in production?

How do you stop a RAG chatbot from hallucinating?

Which vector database do you use?

RAG vs Fine-Tuning vs Long Context: When to Use Each