Replies: 1 comment 2 replies
-
We're checking all splits datasets results |
Beta Was this translation helpful? Give feedback.
2 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
The calculation logic of MTEB(Multilingual) seems to involve many multilingual datasets, where the main score for each dataset is the average of the results across various languages (considering only the languages that the model has been evaluated on). I am quite puzzled about how it is ensured that all the models currently on the leaderboard have been evaluated on all the languages. Are these results now comparable?
Beta Was this translation helpful? Give feedback.
All reactions