Page MenuHomePhabricator

[Spike] Research & recommend method for model fine-tuning/training
Open, Needs TriagePublic

Description

Goal: Learn about ways to fine-tune/adapt OOB models vs train our own vs prompt-engineer

Past & in-progress Research team work:

Questions to answer:

  1. Would any of the above be suitable for a quick (~couple of weeks to build & launch) experiment? How much added benefit would they give us over prompt engineering?
  2. What is a sensible way to test these out? Does Research team have a test query set and/or method for scoring desirability of output that they've been using that we could use? (Talk to Xiao about this)