@CultriX-Github
Last active July 22, 2024 14:59
Yet Another LLM
| Model | Average | AGIEval | GPT4All | TruthfulQA | Bigbench |
| --- | --- | --- | --- | --- | --- |
| CultriX/MonaCeption-7B-SLERP-DPO | 62.93 | 45.57 | 76.99 | 78.98 | 50.19 |
| CultriX/MonaCeption-7B-DPO | 62.91 | 45.55 | 76.97 | 78.95 | 50.19 |
| CultriX/MonaCeption-7B-SLERP-SFT | 62.89 | 45.18 | 76.94 | 79.31 | 50.14 |
| CultriX/MonaCeption-7B-SLERP | 62.88 | 45.51 | 77.04 | 78.88 | 50.07 |
| CultriX/MonaTrix-7B-DPOv2 | 62.86 | 45.63 | 76.98 | 78.63 | 50.18 |
| CultriX/MonaTrix-v4-7B-DPO | 62.83 | 45.59 | 76.89 | 78.74 | 50.11 |
| CultriX/AlphaCeption-7B-v1 | 62.8 | 45.22 | 76.94 | 78.87 | 50.19 |
| CultriX/MergeCeption-7B-v3 | 62.79 | 45.16 | 76.86 | 79.27 | 49.86 |
| abideen/AlphaMonarch-daser | 62.77 | 45.48 | 76.95 | 78.46 | 50.21 |
| CultriX/MonaTrix-v4 | 62.77 | 45.54 | 76.97 | 78.5 | 50.05 |
| Kukedlc/NeuralMaxime-7B-slerp | 62.77 | 45.75 | 76.89 | 78.28 | 50.17 |
| CultriX/ACultriX-7B | 62.75 | 45.23 | 77.0 | 78.95 | 49.82 |
| abideen/AlphaMonarch-dora | 62.75 | 45.42 | 76.93 | 78.48 | 50.18 |
| abideen/AlphaMonarch-laser | 62.74 | 45.39 | 77.0 | 78.4 | 50.15 |
| mlabonne/AlphaMonarch-7B | 62.74 | 45.37 | 77.01 | 78.39 | 50.2 |
| CultriX/NeuralCeptrix-7B-SLERPv3 | 62.73 | 45.28 | 77.03 | 78.84 | 49.75 |
| CultriX/NeuralCeptrix-7B-SLERPv2 | 62.73 | 45.28 | 77.03 | 78.84 | 49.75 |
| mlabonne/NeuralMonarch-7B | 62.73 | 45.31 | 76.99 | 78.35 | 50.28 |
| CultriX/NeMoTrix-v1 | 62.68 | 45.23 | 76.87 | 78.53 | 50.09 |
| mlabonne/Monarch-7B | 62.68 | 45.48 | 77.07 | 78.04 | 50.14 |
| CultriX/MonaTrix-v3 | 62.66 | 45.27 | 77.13 | 78.42 | 49.81 |
| CultriX/NeuralTrixlaser-bf16 | 62.66 | 44.43 | 77.0 | 79.33 | 49.9 |
| automerger/YamshadowExperiment28-7B | 62.65 | 44.73 | 77.28 | 78.85 | 49.73 |
| CultriX/NeuralCeptrix-7B-SLERP | 62.65 | 44.49 | 76.75 | 79.77 | 49.6 |
| CultriX/MoNeuTrix-7B-v1 | 62.64 | 45.04 | 77.11 | 78.07 | 50.34 |
| CultriX/ShadowTrix | 62.61 | 44.6 | 77.05 | 78.59 | 50.19 |
| CultriX/MonaTrix-v2 | 62.61 | 45.13 | 77.18 | 78.05 | 50.06 |
| CultriX/CultMerge-7B-v1 | 62.6 | 45.2 | 77.1 | 78.22 | 49.87 |
| mlabonne/Monarch-7B-slerp | 62.6 | 45.13 | 77.09 | 78.63 | 49.56 |
| eren23/ogno-monarch-jaskier-merge-7b-OH-PREF-DPO | 62.59 | 45.31 | 77.2 | 78.0 | 49.87 |
| eren23/ogno-monarch-jaskier-merge-7b | 62.59 | 45.18 | 77.24 | 78.05 | 49.88 |
| CultriX/MonaTrix-v5 | 62.58 | 45.07 | 77.01 | 78.42 | 49.82 |
| mlabonne/Monarch-7B-dare | 62.58 | 45.16 | 77.22 | 77.98 | 49.95 |
| bardsai/jaskier-7b-dpo-v3.3 | 62.58 | 44.57 | 76.53 | 80.0 | 49.22 |
| CultriX/NeuralShadow-7B | 62.57 | 44.58 | 77.21 | 78.91 | 49.6 |
| CultriX/NeMoTrix-v2 | 62.57 | 45.25 | 76.61 | 78.16 | 50.26 |
| CultriX/NeuralTrix-bf16 | 62.57 | 44.43 | 76.43 | 80.18 | 49.23 |
| CultriX/MergeCeption-7B-v1 | 62.56 | 45.27 | 77.0 | 77.66 | 50.3 |
| CultriX/NeuralShadow-7B-v2 | 62.52 | 44.95 | 77.17 | 78.53 | 49.44 |
| eren23/dpo-binarized-NeuralTrix-7B | 62.5 | 44.57 | 76.34 | 79.81 | 49.27 |
| CultriX/NeuralTrix-7B-dpo | 62.5 | 44.61 | 76.33 | 79.8 | 49.24 |
| liminerity/Neurotic-Jomainotrik-7b-slerp | 62.46 | 45.09 | 77.1 | 78.44 | 49.23 |
| CultriX/Monatrix-v4-dpo | 62.44 | 45.4 | 76.33 | 78.44 | 49.59 |
| mlabonne/UltraMerge-v2-7B | 62.41 | 44.16 | 76.72 | 79.58 | 49.2 |
| EryriLabs/TriFusionNexus-7b | 62.41 | 44.72 | 76.88 | 78.56 | 49.47 |
| mlabonne/NeuBeagle-7B | 62.39 | 44.43 | 76.62 | 79.13 | 49.38 |
| mlabonne/OmniTruthyBeagle-7B-v0 | 62.39 | 45.72 | 77.49 | 76.16 | 50.18 |
| mlabonne/Zebrafish-slerp-7B | 62.37 | 44.83 | 77.13 | 78.27 | 49.25 |
| CultriX/NeuralTrix-v4-bf16 | 62.37 | 44.22 | 76.33 | 79.5 | 49.43 |
| MTSAIR/multi_verse_model | 62.36 | 44.9 | 76.9 | 78.25 | 49.39 |
| chihoonlee10/T3Q-Mistral-Orca-Math-DPO | 62.36 | 44.41 | 76.83 | 78.78 | 49.43 |
| CultriX/NeuralTrix-V2 | 62.36 | 44.43 | 77.01 | 78.2 | 49.81 |
| liminerity/M7-7b | 62.34 | 44.84 | 77.01 | 78.4 | 49.1 |
| mlabonne/UltraMerge-7B | 62.33 | 44.36 | 77.15 | 78.47 | 49.35 |
| mlabonne/NeuralOmniBeagle-7B | 62.3 | 45.85 | 77.26 | 76.06 | 50.03 |
| mlabonne/Zebrafish-dare-7B | 62.29 | 44.68 | 77.0 | 78.28 | 49.21 |
| yam-peleg/Experiment26-7B | 62.28 | 44.49 | 77.06 | 78.58 | 49.0 |
| shadowml/MBeagleX-7B | 62.28 | 45.02 | 76.87 | 78.04 | 49.18 |
| yam-peleg/Experiment24-7B | 62.27 | 44.03 | 77.0 | 78.99 | 49.05 |
| yleo/OgnoMonarch-7B | 62.22 | 44.9 | 76.99 | 77.27 | 49.74 |
| mlabonne/OmniTruthyBeagle-7B | 62.21 | 45.65 | 77.22 | 75.77 | 50.21 |
| Unknown | 62.2 | 44.31 | 77.09 | 78.06 | 49.34 |
| CultriX/OmniBeagleTrix-SLERP-7B | 62.2 | 44.8 | 77.0 | 77.09 | 49.9 |
| mlabonne/NeuralOmniBeagle-7B-v2 | 62.15 | 45.86 | 77.31 | 75.34 | 50.09 |
| eren23/dpo-binarized-NeutrixOmnibe-7B | 62.15 | 44.96 | 77.3 | 77.24 | 49.1 |
| Kquant03/NeuralTrix-7B-dpo-laser | 62.14 | 44.51 | 76.24 | 78.56 | 49.26 |
| CultriX/NeuralTrix-v3-fp16 | 62.14 | 43.76 | 76.79 | 77.92 | 50.07 |
| CultriX/NeuralTrix-v3-bf16 | 62.13 | 44.52 | 76.84 | 77.78 | 49.37 |
| shadowml/MBTrix-7B | 62.12 | 44.92 | 77.14 | 77.26 | 49.18 |
| Kukedlc/NeuTrixOmniBe-7B-model-remix | 62.12 | 44.74 | 77.31 | 77.27 | 49.18 |
| paulml/DPOB-INMTOB-7B | 62.11 | 45.09 | 77.37 | 76.81 | 49.18 |
| paulml/OGNO-7B | 62.09 | 45.07 | 77.28 | 76.91 | 49.12 |
| Kukedlc/NeuTrixOmniBe-DPO | 62.06 | 44.32 | 77.33 | 77.43 | 49.17 |
| mlabonne/OmniBeagle-7B | 62.05 | 45.64 | 77.48 | 75.03 | 50.03 |
| mlabonne/OmniBeagle-7B | 62.02 | 45.66 | 77.38 | 75.03 | 50.03 |
| liminerity/Omningotex-7b-slerp | 61.97 | 44.9 | 77.36 | 76.59 | 49.04 |
| mlabonne/Beyonder-4x7B-v3 | 61.91 | 45.85 | 76.67 | 74.98 | 50.12 |
| CultriX/NeuralTrix-7B-v1 | 61.91 | 44.75 | 77.61 | 75.44 | 49.82 |
| mlabonne/NeuralOmni-7B | 61.9 | 45.8 | 77.5 | 74.51 | 49.8 |
| mlabonne/FrankenMonarch-11b | 61.85 | 44.01 | 76.45 | 76.7 | 50.22 |
| shadowml/OmnixBeagle-7B | 61.84 | 45.3 | 77.64 | 75.24 | 49.2 |
| liminerity/binarized-ingotrix-slerp-7b | 61.78 | 45.17 | 76.74 | 75.64 | 49.57 |
| mlabonne/Omnarch-7B | 61.75 | 45.88 | 77.28 | 74.07 | 49.76 |
| mlabonne/Beagle4 | 61.61 | 45.5 | 77.38 | 73.84 | 49.7 |
| mlabonne/BeagleB-7B | 61.5 | 45.19 | 77.75 | 73.19 | 49.88 |
| mlabonne/ArchBeagle-7B | 61.4 | 45.56 | 77.32 | 73.36 | 49.36 |
| shadowml/BeagleSempra-7B | 61.38 | 45.56 | 77.44 | 73.35 | 49.15 |
| shadowml/BeagSake-7B | 61.35 | 45.9 | 77.36 | 72.82 | 49.32 |
| CultriX/MergeTrix-v3 | 61.33 | 45.17 | 77.56 | 73.1 | 49.48 |
| CultriX/Wernicke-7B-dpo | 61.31 | 45.09 | 77.15 | 73.95 | 49.05 |
| CultriX/NeuralTrix-7B-NO-INST | 61.27 | 43.61 | 74.64 | 78.69 | 48.16 |
| shadowml/WestBeagle-7B | 61.21 | 46.19 | 77.23 | 72.25 | 49.15 |
| shadowml/WestBeagle-7B-gen3 | 61.13 | 45.74 | 77.28 | 72.29 | 49.23 |
| CultriX/Wernicke-7B-v9 | 61.12 | 45.63 | 77.59 | 72.4 | 48.86 |
| shadowml/BeagleX-7B | 61.11 | 45.39 | 77.52 | 72.91 | 48.63 |
| FelixChao/WestSeverus-7B-DPO-v2 | 60.98 | 45.29 | 77.2 | 72.72 | 48.71 |
| shadowml/Beaglake-7B | 60.97 | 45.03 | 77.8 | 72.58 | 48.48 |
| shadowml/FoxBeagle-7B | 60.97 | 45.46 | 77.42 | 72.08 | 48.91 |
| CultriX/MergeTrix-v4 | 60.9 | 45.54 | 77.35 | 72.19 | 48.52 |
| CultriX/MergeTrix-v6 | 60.89 | 45.54 | 77.39 | 72.01 | 48.61 |
| shadowml/Beagwake-7B | 60.88 | 45.03 | 77.54 | 72.37 | 48.56 |
| CultriX/Wernicke-7B-v8 | 60.83 | 45.32 | 77.67 | 71.84 | 48.49 |
| vanillaOVO/supermario_v2 | 60.74 | 45.2 | 77.21 | 71.67 | 48.87 |
| CultriX/Wernicke-7B-v1 | 60.73 | 45.59 | 77.36 | 71.46 | 48.49 |
| flemmingmiguel/MBX-7B-v3 | 60.71 | 45.12 | 77.48 | 71.76 | 48.48 |
| CultriX/Wernicke-7B-v5 | 60.68 | 45.12 | 77.44 | 71.64 | 48.51 |
| CultriX/Wernicke-7B-v2 | 60.65 | 45.59 | 77.4 | 71.23 | 48.38 |
| CultriX/Wernicke-7B-v3 | 60.62 | 45.57 | 77.39 | 71.06 | 48.46 |
| CultriX/Wernicke-MoE | 60.58 | 45.36 | 77.62 | 70.74 | 48.59 |
| CultriX/CombinaTrix-7B | 60.58 | 45.52 | 77.42 | 71.12 | 48.24 |
| CultriX/SymbioTrix-v2 | 60.53 | 44.99 | 77.46 | 71.13 | 48.55 |
| CultriX/NextGen-7B | 60.49 | 44.96 | 77.45 | 71.17 | 48.38 |
| CultriX/Wernicke-7B-v6 | 60.48 | 45.11 | 77.39 | 71.21 | 48.22 |
| CultriX/Aphasia-7B | 60.43 | 45.24 | 77.25 | 71.04 | 48.17 |
| CultriX/Wernicke-7B-v7 | 60.42 | 44.54 | 77.18 | 71.02 | 48.93 |
| shadowml/TurdusBeagle-7B-gen3 | 60.41 | 45.08 | 77.52 | 70.36 | 48.69 |
| CultriX/SymbioTrix-v1 | 60.37 | 44.94 | 76.99 | 71.06 | 48.47 |
| CultriX/OmniTrixAI | 60.35 | 44.94 | 77.31 | 70.62 | 48.52 |
| mlabonne/FrankenMonarch-7B | 60.32 | 45.1 | 75.53 | 73.86 | 46.79 |
| flemmingmiguel/MBX-7B-v2 | 60.25 | 44.23 | 77.27 | 71.04 | 48.47 |
| mlabonne/NeuralBeagle14-7B | 60.25 | 46.06 | 76.77 | 70.32 | 47.86 |
| mlabonne/FrakenBeagle14-11B | 60.17 | 45.08 | 76.08 | 70.93 | 48.58 |
| CultriX/MistralTrix-v1 | 60.05 | 44.98 | 76.62 | 71.44 | 47.17 |
| jsfs11/TurdusTrixBeagle-DARETIES-7B | 59.99 | 44.46 | 77.81 | 69.15 | 48.54 |
| CultriX/ActualNextGen-7B | 59.92 | 45.95 | 76.78 | 68.91 | 48.04 |
| shadowml/mibe-7B | 59.91 | 44.22 | 76.9 | 71.25 | 47.27 |
| CultriX/SevereNeuralBeagleTrix-7B | 59.82 | 44.37 | 77.38 | 69.59 | 47.95 |
| CultriX/SymbioTrix-v3 | 59.74 | 45.39 | 76.73 | 69.28 | 47.55 |
| CultriX/MergeTrix-7B-v2 | 59.53 | 44.7 | 77.66 | 67.52 | 48.23 |
| senseable/WestLake-7B-v2 | 59.42 | 44.27 | 77.86 | 67.46 | 48.09 |
| mlabonne/Beagle14-7B | 59.4 | 44.38 | 76.53 | 69.44 | 47.25 |
| mlabonne/NeuralDaredevil-7B | 59.39 | 45.23 | 76.2 | 67.61 | 48.52 |
| fblgit/UNA-TheBeagle-7b-v1 | 59.17 | 42.73 | 77.12 | 70.82 | 46.01 |
| argilla/distilabeled-Marcoro14-7B-slerp | 58.93 | 45.38 | 76.48 | 65.68 | 48.18 |
| CultriX/MergeTrix-7B | 58.88 | 44.93 | 76.85 | 66.56 | 47.18 |
| SanjiWatsuki/Kunoichi-DPO-v2-7B | 58.29 | 44.79 | 75.05 | 65.68 | 47.65 |
| mlabonne/Daredevil-7B | 58.22 | 44.85 | 76.07 | 64.89 | 47.07 |
| fblgit/una-cybertron-7b-v2-bf16 | 57.76 | 43.29 | 74.98 | 65.32 | 47.45 |
| mlabonne/Marcoro14-7B-slerp | 57.67 | 44.66 | 76.24 | 64.15 | 45.64 |
| CultriX/DominaTrix-7B-v2 | 56.55 | 36.81 | 74.12 | 72.09 | 43.17 |
| CultriX/DominaTrix-7B-v1 | 55.19 | 36.23 | 73.8 | 68.71 | 42.01 |
| mistralai/Mistral-7B-Instruct-v0.2 | 54.81 | 38.5 | 71.64 | 66.82 | 42.29 |
| CultriX/MergeCeption-7B-v2 | 54.44 | 42.38 | 67.83 | 66.6 | 40.94 |
| NousResearch/Hermes-2-Pro-Mistral-7B | 54.19 | 44.54 | 71.2 | 59.12 | 41.9 |
| microsoft/Phi-3-mini-4k-instruct | 54.0 | 44.44 | 71.88 | 57.77 | 41.9 |
| openchat/openchat-3.5-1210 | 53.14 | 42.62 | 72.84 | 53.21 | 43.88 |
| HuggingFaceH4/zephyr-7b-alpha | 51.72 | 38.0 | 72.24 | 56.06 | 40.57 |
| Replete-AI/Phi-3-Goru | 51.7 | 38.59 | 70.54 | 59.44 | 38.23 |
| openchat/openchat_3.5 | 51.34 | 42.67 | 72.92 | 47.27 | 42.51 |
| cognitivecomputations/dolphin-2.2.1-mistral-7b | 51.05 | 38.64 | 72.24 | 54.09 | 39.22 |
| HuggingFaceH4/zephyr-7b-beta | 50.99 | 37.33 | 71.83 | 55.1 | 39.7 |
| deepnet/Subnet6Model3 | 50.91 | 38.86 | 70.83 | 55.86 | 38.09 |
| cognitivecomputations/dolphin-2.8-mistral-7b-v02 | 50.9 | 38.99 | 72.22 | 51.96 | 40.41 |
| beowolx/CodeNinja-1.0-OpenChat-7B | 50.35 | 39.98 | 71.77 | 48.73 | 40.92 |
| Weyaxi/Einstein-v4-7B | 49.92 | 37.83 | 67.52 | 55.56 | 38.78 |
| mlabonne/Mistralpaca-7B | 48.53 | 33.48 | 70.71 | 52.89 | 37.06 |
| mlabonne/phixtral-3x2_8 | 48.23 | 33.58 | 72.1 | 49.59 | 37.67 |
| meetkai/functionary-small-v2.2 | 47.99 | 33.15 | 70.35 | 51.5 | 36.97 |
| stabilityai/StableBeluga-7B | 47.65 | 35.36 | 69.37 | 50.09 | 35.77 |
| lxuechen/phi-2-dpo | 46.93 | 30.39 | 71.68 | 50.75 | 34.9 |
| microsoft/phi-2 | 44.61 | 27.96 | 70.84 | 44.46 | 35.17 |
| TheBloke/guanaco-7B-HF | 40.38 | 23.12 | 66.85 | 38.92 | 32.64 |
| TinyLlama/TinyLlama-1.1B-Chat-v1.0 | 36.32 | 20.77 | 54.28 | 37.84 | 32.4 |
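
The Average column appears to be the unweighted mean of the four benchmark scores (AGIEval, GPT4All, TruthfulQA, Bigbench), rounded to two decimals. That is an inference from the numbers, not something the leaderboard states. A minimal Python sketch that checks it against three rows copied from the table follows; the names `ROWS`, `recomputed_average`, and `find_mismatches` are illustrative and not part of any leaderboard tooling.

```python
from statistics import mean

# Rows copied from the table:
# (model, listed average, AGIEval, GPT4All, TruthfulQA, Bigbench)
ROWS = [
    ("CultriX/MonaCeption-7B-SLERP-DPO", 62.93, 45.57, 76.99, 78.98, 50.19),
    ("mistralai/Mistral-7B-Instruct-v0.2", 54.81, 38.5, 71.64, 66.82, 42.29),
    ("TinyLlama/TinyLlama-1.1B-Chat-v1.0", 36.32, 20.77, 54.28, 37.84, 32.4),
]


def recomputed_average(scores):
    """Unweighted mean of the benchmark scores, rounded to two decimals (assumed rule)."""
    return round(mean(scores), 2)


def find_mismatches(rows, tolerance=0.01):
    """Return (model, listed, recomputed) for rows whose listed average is off by more than tolerance."""
    mismatches = []
    for model, listed, *scores in rows:
        recomputed = recomputed_average(scores)
        # Small epsilon so a difference of exactly `tolerance` is not flagged by float noise.
        if abs(recomputed - listed) > tolerance + 1e-9:
            mismatches.append((model, listed, recomputed))
    return mismatches


if __name__ == "__main__":
    for model, listed, *scores in ROWS:
        print(f"{model}: listed={listed}, recomputed={recomputed_average(scores)}")
    print("mismatches:", find_mismatches(ROWS))
```

The tolerance allows for rows whose mean falls exactly on a rounding boundary, where the listed value can differ from the recomputed one by 0.01.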