Asankhaya Sharma codelion

| Model | ARC | HellaSwag | MMLU | TruthfulQA | Winogrande | GSM8K |
|---|---|---|---|---|---|---|
| mera-mix-4x7B | 65.7 | 84.73 | Error: File does not exist | 51.03 | 79.48 | 66.34 |

ARC

| Task | Version | Metric | Value | Stderr |
|---|---|---|---|---|
| arc_challenge | 1 | acc,none | 0.62 | |
| | | acc_stderr,none | 0.01 | |
| | | acc_norm,none | 0.66 | |


| Model | ARC | HellaSwag | MMLU | TruthfulQA | Winogrande | GSM8K | Average |
|---|---|---|---|---|---|---|---|
| mera-mix-4x7B | 72.01 | 88.82 | 63.67 | 77.45 | 84.61 | 71.65 | 76.37 |

ARC

| Task | Version | Metric | Value | Stderr |
|---|---|---|---|---|
| arc_challenge | 1 | acc,none | 0.70 | |
| | | acc_stderr,none | 0.01 | |
| | | acc_norm,none | 0.72 | |


| Model | ARC | HellaSwag | MMLU | TruthfulQA | Winogrande | GSM8K | Average |
|---|---|---|---|---|---|---|---|
| mera-mix-4x7B-v1 | 46.67 | 69.64 | 39.65 | 46.78 | 68.27 | 2.73 | 45.62 |

ARC

| Task | Version | Metric | Value | Stderr |
|---|---|---|---|---|
| arc_challenge | 1 | acc,none | 0.44 | |
| | | acc_stderr,none | 0.01 | |
| | | acc_norm,none | 0.47 | |
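
The Average column in the summary rows above appears to be the unweighted mean of the six benchmark scores, rounded to two decimals. That is an assumption inferred from the numbers, not something the source states. A minimal Python sketch that recomputes it from the reported values (the `average_score` helper is hypothetical, introduced only for illustration):

```python
from statistics import mean

# Scores copied from the summary rows above, in the order
# ARC, HellaSwag, MMLU, TruthfulQA, Winogrande, GSM8K (percent).
scores = {
    "mera-mix-4x7B": [72.01, 88.82, 63.67, 77.45, 84.61, 71.65],
    "mera-mix-4x7B-v1": [46.67, 69.64, 39.65, 46.78, 68.27, 2.73],
}


def average_score(values):
    """Unweighted mean rounded to two decimals (assumed convention)."""
    return round(mean(values), 2)


averages = {model: average_score(vals) for model, vals in scores.items()}

for model, avg in averages.items():
    print(f"{model}: {avg:.2f}")
```

The first summary row has no Average because its MMLU cell holds an error message rather than a score, so a mean over six benchmarks cannot be formed for it.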