Ranking LLMs based on 180k French votes (French government's AI arena) comparia.beta.gouv.fr 1 points by simon-inta 6 hours ago
simon-inta 6 hours ago The data is open : https://huggingface.co/ministere-culture Colab notebook to reproduce the calculations : https://colab.research.google.com/drive/1j5AfStT3h-IK8V6FSJY...Note on Mistral Medium's great performance : there is no style control on compar:IA, and the majority of questions are not focused on code, thus explaining the great performance of the model.
The data is open : https://huggingface.co/ministere-culture Colab notebook to reproduce the calculations : https://colab.research.google.com/drive/1j5AfStT3h-IK8V6FSJY...
Note on Mistral Medium's great performance : there is no style control on compar:IA, and the majority of questions are not focused on code, thus explaining the great performance of the model.