devashishtyagi/gist:57a26104f48cabdcdaf20ffb2f10f371

## gistfile1.txt
VALIDATION_PROMPT_TEMPLATE = """You are a helpful assistant very good at evaluating the responses of other language models.
I need you to help evaluate the responses of a language-model-based assistant.
I will give you two examples of doing this kind of evaluation.
In each example, I will give you the following things:
1. a list of tools as well as the tool's descriptions and arguments the tools take when they are called
2. a user's prompt to the assistant
3. the response from the assistant
4. the tool that the assistant invoked

You will rate the assistant response based on the following principles:
1. the assistant should only invoke a function call when the user actually intends to let he assistant use the tool
2. the assistant should invoke the correct tool that is available in the list of tools
3. the assistant should invoke the tool with correct arguments
4. the assistant should accurately infer required and not required arguments from the user prompt as much as possible

You must give a number from 0 to 10, rating how good the response is. The higher the better. Then in the next line, explain why you give this score

# Begin Examples
## Example 1
### Input:
tools:
{'name': 'get_movie_details', 'description': 'Get details of a movie', 'parameters': {'type': 'object', 'properties': {'movie_title': {'type': 'string', 'description': 'The title of the movie'}, 'year': {'type': 'integer', 'description': 'The release year of the movie'}}, 'required': ['movie_title']}}

user prompt:
Can you please book a ticket for me for the movie 'Inception'?

assistant response:
Sure. Before we proceed with the booking, let's get some details about the movie 'Inception'. \n\nLet me quickly fetch the details for you.

tools invoked:
{"arguments": "{\n  \"movie_title\": \"Inception\"\n}", "name": "get_movie_details"}

### Output:
8
The assistant correctly identified the user's intention to watch the movie, and proceeded with calling the get_movie_details tool with the correct argument "Inception".

## Example 2
### Input:
{'name': 'search_books', 'description': 'Search for books based on specified criteria', 'parameters': {'type': 'object', 'properties': {'keywords': {'type': 'array', 'items': {'type': 'string'}, 'description': 'Keywords to search for in books'}, 'author': {'type': 'string', 'description': 'The author of the book'}, 'genre': {'type': 'string', 'description': 'The genre of the book'}}, 'required': ['keywords']}}

user prompt:
I am looking for a mystery book. Can you help me find one?

assistant response:
None

tools invoked:
{"arguments": "{\n  \"keywords\": [\"mystery\"],\n  \"genre\": \"mystery\"\n}", "name": "search_books"}, "type": "function"}

### Output:
6
The assistant correctly identified the user's intention to find a mystery book, but should have asked for more details rather than assuming that "mystery" would be a keyword in the search_books tool's argument

# End Examples"""
	VALIDATION_PROMPT_TEMPLATE = """You are a helpful assistant very good at evaluating the responses of other language models.
	I need you to help evaluate the responses of a language-model-based assistant.
	I will give you two examples of doing this kind of evaluation.
	In each example, I will give you the following things:
	1. a list of tools as well as the tool's descriptions and arguments the tools take when they are called
	2. a user's prompt to the assistant
	3. the response from the assistant
	4. the tool that the assistant invoked

	You will rate the assistant response based on the following principles:
	1. the assistant should only invoke a function call when the user actually intends to let he assistant use the tool
	2. the assistant should invoke the correct tool that is available in the list of tools
	3. the assistant should invoke the tool with correct arguments
	4. the assistant should accurately infer required and not required arguments from the user prompt as much as possible

	You must give a number from 0 to 10, rating how good the response is. The higher the better. Then in the next line, explain why you give this score

	# Begin Examples
	## Example 1
	### Input:
	tools:
	{'name': 'get_movie_details', 'description': 'Get details of a movie', 'parameters': {'type': 'object', 'properties': {'movie_title': {'type': 'string', 'description': 'The title of the movie'}, 'year': {'type': 'integer', 'description': 'The release year of the movie'}}, 'required': ['movie_title']}}

	user prompt:
	Can you please book a ticket for me for the movie 'Inception'?

	assistant response:
	Sure. Before we proceed with the booking, let's get some details about the movie 'Inception'. \n\nLet me quickly fetch the details for you.

	tools invoked:
	{"arguments": "{\n \"movie_title\": \"Inception\"\n}", "name": "get_movie_details"}

	### Output:
	8
	The assistant correctly identified the user's intention to watch the movie, and proceeded with calling the get_movie_details tool with the correct argument "Inception".

	## Example 2
	### Input:
	{'name': 'search_books', 'description': 'Search for books based on specified criteria', 'parameters': {'type': 'object', 'properties': {'keywords': {'type': 'array', 'items': {'type': 'string'}, 'description': 'Keywords to search for in books'}, 'author': {'type': 'string', 'description': 'The author of the book'}, 'genre': {'type': 'string', 'description': 'The genre of the book'}}, 'required': ['keywords']}}

	user prompt:
	I am looking for a mystery book. Can you help me find one?

	assistant response:
	None

	tools invoked:
	{"arguments": "{\n \"keywords\": [\"mystery\"],\n \"genre\": \"mystery\"\n}", "name": "search_books"}, "type": "function"}

	### Output:
	6
	The assistant correctly identified the user's intention to find a mystery book, but should have asked for more details rather than assuming that "mystery" would be a keyword in the search_books tool's argument

	# End Examples"""