[R-310] Test set generation improvements #1577

Open
5 of 10 tasks
shahules786 opened this issue Oct 25, 2024 · 6 comments
Labels
enhancement New feature or request module-testsetgen Module testset generation

Comments

@shahules786
Member

shahules786 commented Oct 25, 2024

For Ragas 0.2, we released our third iteration of synthetic test generation for RAG. While developing this new approach, we kept in mind some important feedback gathered from earlier versions:

  1. Ability to generalize over more data formats and domains.
  2. Ability to customize and create scenarios related to one's own domain.
  3. Ability to persist and resample Q&A from the same list of documents without redoing preprocessing.
  4. Lower cost and faster iteration.

We have already noted some feedback from the community and we welcome more, hence this discussion thread. Please feel free to share your thoughts and questions here. This will help us greatly improve the feature in the coming weeks.

R-310

Known issues

Feature enhancement

Quality enhancement

Documentation improvements

TODO

  • Improve quality of generated questions
  • Add node filter to avoid query creation from nodes with poor information quality.
  • Heading splitter - add minimum and maximum chunk size
  • Make sure default settings work with small document sets (PubMed summaries, etc., <500 tokens per doc)
  • Fix long context issues with extraction (OOC)
  • Adjust transforms/query generation for small number of documents (n=1)
  • Improve sampling algorithm to sample diverse scenarios from available scenarios.
  • Simplify code for writing custom scenarios
  • Add documentation for prompt adaptation in test set generation
  • Add documentation for writing custom scenarios and query types, for example Yes/No questions
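
To make the "minimum and maximum chunk size" item concrete, here is a rough sketch of what such bounds could look like on a heading-based splitter. This is not the Ragas implementation: `split_by_headings`, `enforce_chunk_bounds`, and the `min_words`/`max_words` parameters are hypothetical names, and whitespace-separated words stand in for tokens.

```python
import re


def split_by_headings(text: str) -> list[str]:
    """Split markdown-style text into sections, each starting at a '#' heading line."""
    # Zero-width split at the start of every line that begins with 1-6 '#' and a space.
    parts = re.split(r"(?m)^(?=#{1,6}\s)", text)
    return [p.strip() for p in parts if p.strip()]


def enforce_chunk_bounds(
    sections: list[str], min_words: int = 50, max_words: int = 200
) -> list[str]:
    """Merge undersized sections and cut oversized ones (hypothetical helper).

    Sizes are counted in whitespace-separated words, as an assumed proxy for tokens.
    Line breaks inside a section are collapsed to single spaces.
    """
    chunks: list[str] = []
    buffer: list[str] = []
    for section in sections:
        buffer.extend(section.split())
        # Cut off full-size chunks while the buffer exceeds the maximum.
        while len(buffer) > max_words:
            chunks.append(" ".join(buffer[:max_words]))
            buffer = buffer[max_words:]
        # Emit the buffer once it is large enough; otherwise keep merging.
        if len(buffer) >= min_words:
            chunks.append(" ".join(buffer))
            buffer = []
    if buffer:
        # Leftover below min_words: fold into the last chunk if it still fits,
        # otherwise emit it as a final, undersized chunk.
        if chunks and len(chunks[-1].split()) + len(buffer) <= max_words:
            chunks[-1] = chunks[-1] + " " + " ".join(buffer)
        else:
            chunks.append(" ".join(buffer))
    return chunks
```

A call such as `enforce_chunk_bounds(split_by_headings(doc), min_words=50, max_words=200)` would then return chunks that never exceed the maximum, with only the last one allowed to fall below the minimum.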
@shahules786 shahules786 added the enhancement New feature or request label Oct 25, 2024
@shahules786 shahules786 self-assigned this Oct 25, 2024
@shahules786 shahules786 changed the title Test set generation improvements [R-310] Test set generation improvements Oct 25, 2024
@shahules786 shahules786 pinned this issue Oct 25, 2024
@dosubot dosubot bot added the module-testsetgen Module testset generation label Oct 25, 2024
@shahules786
Member Author

#1621
#1618
Both of these items address quality-related problems raised in issues such as #1568.

@ahgraber
Contributor

ahgraber commented Nov 7, 2024

Personas and the improvements to multihop look really good. I'll have to spend some time with the new multihop generation.

@shahules786
Member Author

Update on node filter and headline splitter

  • Added a custom node filter to remove nodes without good content for query creation, such as references in research papers. One can modify it or add their own filters.
  • Tuned headline splitter for better node creation

merged #1645
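
For readers wondering what a filter of this kind might look like, here is a minimal standalone sketch. It is an illustration only, not the filter merged in #1645: the `Node` class, `looks_like_references`, `filter_nodes`, the regex heuristic, and both thresholds are assumptions made for the example.

```python
import re
from dataclasses import dataclass


@dataclass
class Node:
    """Hypothetical stand-in for a document chunk; not the Ragas node type."""
    text: str


# Assumed heuristic: a line looks like a bibliography entry if it starts with
# "[12] " or "12. ", or if it contains "doi:" or "et al.".
_CITATION_LINE = re.compile(r"^\s*(\[\d+\]|\d+\.)\s|doi:|et al\.", re.IGNORECASE)


def looks_like_references(text: str, threshold: float = 0.5) -> bool:
    """Return True when at least `threshold` of the non-empty lines look like citations."""
    lines = [ln for ln in text.splitlines() if ln.strip()]
    if not lines:
        return True  # nothing to generate a query from
    hits = sum(1 for ln in lines if _CITATION_LINE.search(ln))
    return hits / len(lines) >= threshold


def filter_nodes(nodes: list[Node], min_words: int = 20) -> list[Node]:
    """Keep nodes that are long enough and do not look like a reference list."""
    return [
        n
        for n in nodes
        if len(n.text.split()) >= min_words and not looks_like_references(n.text)
    ]
```

A custom filter would follow the same shape: a predicate over node text, applied before any query is generated from the node.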

@shahules786
Member Author

shahules786 commented Nov 8, 2024

@ahgraber with the new, simplified way of generating queries in multi-hop, I am hoping it will also be easier to create multi-turn conversations. I will explore this soon.

@shahules786
Member Author

This week's updates

  1. feat: improvements in default test generation (#1661): make default settings work with different types of input docs
  2. fixes: handle long context extraction (#1680)

@shahules786
Member Author

language adaptation for testgen #1734


2 participants