Heretic statistics

#1
by redaihf - opened

Please publish the initial and residual refusal and KL divergence scores. Please also classify the model according to the degree of heretication. Thank you.

Sign up or log in to comment