-
Notifications
You must be signed in to change notification settings - Fork 30
Issues: instructlab/sdg
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
[Epic] Ability to resume/continue an SDG cycle
epic
Larger tracking issue encompassing multiple smaller issues
jira
#267
opened Sep 9, 2024 by
ktam3
Grounded skill samples generated by the simple pipeline are missing context?
question
Further information is requested
#258
opened Aug 28, 2024 by
markmc
Checkpoint files make iterating on a taxonomy awkward
UX
Affects the User Experience
#245
opened Aug 16, 2024 by
bbrowning
Empty dataset error kills the workflow
bug
Something isn't working
#240
opened Aug 9, 2024 by
aakankshaduggal
2 tasks
No feedback from ilab data generate
UX
Affects the User Experience
#259
opened Aug 8, 2024 by
jjasghar
Include precomputed dataset and datamixing recipes
enhancement
New feature or request
#234
opened Aug 5, 2024 by
aakankshaduggal
Support more than 3 qna per context chunk
enhancement
New feature or request
#232
opened Jul 30, 2024 by
markmc
Simplify New feature or request
base_document
column usage with auxiliary instructions in pipeline config
enhancement
#228
opened Jul 29, 2024 by
bbrowning
ilab data generate
does not specify the correct num of samples generated
bug
#227
opened Jul 29, 2024 by
alinaryan
checkpointing: consider allowing users to specify save frequency
enhancement
New feature or request
#225
opened Jul 28, 2024 by
markmc
Make generate_data(batch_size=None) default to a batch size of 8
refactor
Same results, different method
#224
opened Jul 27, 2024 by
markmc
INFO logging seems more like DEBUG
refactor
Same results, different method
#223
opened Jul 27, 2024 by
danmcp
Provide name of leaf node in error message when taxonomy contains older knowledge version
enhancement
New feature or request
#218
opened Jul 26, 2024 by
bbrowning
Add Functionality in LLMBlock to Override Global OpenAI Client Variable
enhancement
New feature or request
#217
opened Jul 25, 2024 by
npalaska
Enhance SDG to Support Multiple OpenAI Endpoints for Improved Performance
enhancement
New feature or request
#216
opened Jul 25, 2024 by
npalaska
Add test to cover schema correctness for blocks not used in upstream pipeline configs
testing
Relates to testing
#207
opened Jul 25, 2024 by
markmc
Pull taxonomy precomputed dataset from hugging face
enhancement
New feature or request
#201
opened Jul 24, 2024 by
aakankshaduggal
Reduce the MMLU evaluation benchmark dataset to the minimum set of features
enhancement
New feature or request
#183
opened Jul 23, 2024 by
markmc
Generated data has single quote or \n in the beggining of the setence
bug
Something isn't working
#181
opened Jul 21, 2024 by
tsailiming
Resolve confusion about "batching" support
question
Further information is requested
#174
opened Jul 19, 2024 by
markmc
Add precomputed dataset to skills data generation
enhancement
New feature or request
#171
opened Jul 19, 2024 by
bbrowning
Set a default New feature or request
seed
value for gen_kwargs
enhancement
#169
opened Jul 18, 2024 by
russellb
Make pipeline validation script more general purpose
testing
Relates to testing
#139
opened Jul 15, 2024 by
russellb
Previous Next
ProTip!
Type g i on any issue or pull request to go back to the issue listing page.