(based on training)
Under this interpretation, the frequency penalty parameter makes the model less likely to predict words that it saw more frequently during training.
This encourages the model to generate novel or less common words. It works by lowering the log probabilities of words that appeared frequently in the training data, which makes the model less likely to generate these common words (and thus common phrases).
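
To make this reading concrete, here is a minimal sketch of what it would mean in practice. It only illustrates the training-based interpretation described above and is not a claim about how any particular API implements the parameter. The function names, the toy vocabulary, the `training_counts` table, and the choice to subtract `penalty` times each word's relative training frequency are all assumptions made for illustration.

```python
import math


def softmax(logits):
    """Turn a {word: logit} dict into a {word: probability} dict."""
    m = max(logits.values())  # subtract the max for numerical stability
    exps = {w: math.exp(v - m) for w, v in logits.items()}
    total = sum(exps.values())
    return {w: e / total for w, e in exps.items()}


def penalize_by_training_frequency(logits, training_counts, penalty):
    """Hypothetical helper: lower each word's logit in proportion to how
    often the word appeared in the (assumed) training data."""
    total = sum(training_counts.values())
    return {
        w: v - penalty * (training_counts.get(w, 0) / total)
        for w, v in logits.items()
    }


# Toy values, invented for this example only.
logits = {"the": 2.0, "cat": 1.0, "ocelot": 0.5}
training_counts = {"the": 9000, "cat": 900, "ocelot": 100}

before = softmax(logits)
after = softmax(
    penalize_by_training_frequency(logits, training_counts, penalty=2.0)
)

for word in logits:
    print(f"{word}: {before[word]:.3f} -> {after[word]:.3f}")
```

With these toy numbers, the common word "the" loses probability and the rare word "ocelot" gains it, which is the effect this interpretation predicts.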