Skip to content

Instantly share code, notes, and snippets.

@itsderek23
Created October 9, 2023 23:32
Show Gist options
  • Select an option

  • Save itsderek23/8a1fde245a24889b68b521cdf3e0f09a to your computer and use it in GitHub Desktop.

Select an option

Save itsderek23/8a1fde245a24889b68b521cdf3e0f09a to your computer and use it in GitHub Desktop.
I'm comparing the output of two different systems. I need you to compare the two responses to the same question and see if they are functionally equivalent.
It's ok if one response is more detailed than the other, as long as they are functionally equivalent.
It's ok if when comparing floats (ie 10.5 or 12.4%), if the two numbers are within 30% of each other.
Question: %%QUESTION%%
# Response
%%RESPONSE_TO_EVAL%%
# Reference Response
%%REFERENCE_RESPONSE%%
# Response format
Answer in the following format:
{
"confidence": Return a confidence value btw 0.0-1.0 that the responses are functionally equivalent,
"why": "A single sentence explaining why you chose this confidence value"
}
Begin!
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment