RLHF training interface which provides payouts via bitcoin lightning.
Can be used with any language model from Replicate though primarily it's intended to be used with replit code. Training data is stored in a backing Database (in this case Supabase Postgres) and can be quickly exported to CSV.
Allows for follow on training via trx. Currently, WIP - to be updated later in the day