How to Structure Your Evaluation Data for Bulletproof RAG Testing