Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

List all potential test benchmarks #63

Open
6 tasks
faneshion opened this issue Feb 28, 2024 · 3 comments
Open
6 tasks

List all potential test benchmarks #63

faneshion opened this issue Feb 28, 2024 · 3 comments

Comments

@faneshion
Copy link
Collaborator

faneshion commented Feb 28, 2024

List all most used datasets in RAG researches, and we will add them to the benchmarks.

@FBzzh
Copy link
Collaborator

FBzzh commented Feb 29, 2024

@FBzzh
Copy link
Collaborator

FBzzh commented Feb 29, 2024

@Wenshansilvia
Copy link
Collaborator

Select and implement typical benchmarks, collect RAG papers that utilized these benchmarks, and try to reproduce evaluation result in the paper.

  1. List benchmark and related papers & metrics.
  2. Produce testset using baseline RAG in the paper. Pack testset as dataset format and upload to HuggingFace.
  3. Reproduce evaluation result in RAGEval.

Eli5 @QianHaosheng , ASQA @bugtig6351 , Fever @henan991201

This was referenced Mar 20, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment