Model | AGIEval | GPT4All | TruthfulQA | Bigbench | Average |
---|---|---|---|---|---|
NeuralBeagle14-7B | 46.06 | 76.77 | 70.32 | 47.86 | 60.25 |
Task | Version | Metric | Value | Stderr | |
---|---|---|---|---|---|
agieval_aqua_rat | 0 | acc | 26.38 | ± | 2.77 |
acc_norm | 25.98 | ± | 2.76 | ||
agieval_logiqa_en | 0 | acc | 38.56 | ± | 1.91 |