Skip to content

Instantly share code, notes, and snippets.

@ymoslem
Last active July 18, 2020 21:23
Show Gist options
  • Star 0 You must be signed in to star a gist
  • Fork 0 You must be signed in to fork a gist
  • Save ymoslem/27747f9e10c057ee13867f3a61b6a144 to your computer and use it in GitHub Desktop.
Save ymoslem/27747f9e10c057ee13867f3a61b6a144 to your computer and use it in GitHub Desktop.
Calculate BLEU score for sentence by sentence and save the result to a file
# BLEU for segment by segment
import sacrebleu
from sacremoses import MosesDetokenizer
md = MosesDetokenizer(lang='en')
# Open the test dataset human translation file and detokenize the references
refs = []
with open("target.test") as test:
for line in test:
line = line.strip().split()
line = md.detokenize(line)
refs.append(line)
print("Reference 1st sentence:", refs[0])
# Open the translation file by the NMT model and detokenize the predictions
preds = []
with open("target.pred") as pred:
for line in pred:
line = line.strip().split()
line = md.detokenize(line)
preds.append(line)
# Calculate BLEU for sentence by sentence and save the result to a file
with open("bleu.txt", "w+") as output:
for line in zip(refs,preds):
test = line[0]
pred = line[1]
print(test, "\t--->\t", pred)
bleu = sacrebleu.sentence_bleu(pred, [test], smooth_method='exp')
print(bleu.score, "\n")
output.write(str(bleu.score) + "\n")
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment