Test. Validate. Ship.

Ship Better AI, Faster

Systematically test and validate prompts, models, and RAG pipelines to find the optimal configuration for your AI agents and services.

Join waitlist

experiments.do

import { Experiment } from 'experiments.do';

const RAGvsFinetune = new Experiment({
  name: 'RAG vs. Finetuned Model',
  description: 'Compare retrieval-augmented generation against a finetuned model for product Q&A.',
  variants: [
    {
      id: 'rag_pipeline',
      agent: 'productExpertAgent',
      config: { useRAG: true, model: 'gpt-4-turbo' }
    },
    {
      id: 'finetuned_model',
      agent: 'productExpertAgent',
      config: { useRAG: false, model: 'ft:gpt-3.5-turbo-product-qa' }
    }
  ],
  metrics: ['accuracy', 'latency', 'hallucination_rate'],
  sampleSize: 1000
});

RAGvsFinetune.run().then(results => {
  console.log(results.winner);
});

Deliver economically valuable work

Frequently Asked Questions

Do Work. With AI.