Optimizing Benchmark Dataset Size #2170
Replies: 2 comments 2 replies
-
So since For both of these cases; however, I suspect that they are machine translated (though the metadata does specify found I think that this is an error?). I think better alternatives are available, e.g. Miracl (which already have a HardNegatives version) |
Beta Was this translation helpful? Give feedback.
-
@mehran-sarmadi Some more details: |
Beta Was this translation helpful? Give feedback.
-
Hi everyone,
Some of the datasets in the Persian benchmark, like MSMARCO-Fa and FEVER-Fa, are pretty large, and users might have trouble running them efficiently. To help with this, we’re considering two options:
Which option do you think is better?
Beta Was this translation helpful? Give feedback.
All reactions