-
-
Save itsderek23/8a1fde245a24889b68b521cdf3e0f09a to your computer and use it in GitHub Desktop.
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| I'm comparing the output of two different systems. I need you to compare the two responses to the same question and see if they are functionally equivalent. | |
| It's ok if one response is more detailed than the other, as long as they are functionally equivalent. | |
| It's ok if when comparing floats (ie 10.5 or 12.4%), if the two numbers are within 30% of each other. | |
| Question: %%QUESTION%% | |
| # Response | |
| %%RESPONSE_TO_EVAL%% | |
| # Reference Response | |
| %%REFERENCE_RESPONSE%% | |
| # Response format | |
| Answer in the following format: | |
| { | |
| "confidence": Return a confidence value btw 0.0-1.0 that the responses are functionally equivalent, | |
| "why": "A single sentence explaining why you chose this confidence value" | |
| } | |
| Begin! |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment