Issues: instructlab/sdg
Issues list
Create XL E2E CI job
CI/CD
Affects CI/CD configuration
enhancement
New feature or request
#457
opened Dec 20, 2024 by
nathan-weinberg
e2e tests occasionally flake with out of disk space
bug
Something isn't working
CI/CD
Affects CI/CD configuration
#442
opened Dec 10, 2024 by
bbrowning
Update docling and docling-parse dependencies to use docling-parse >= v3
#436
opened Dec 10, 2024 by
courtneypacheco
Change default skills sampling size during datamixing to "1.0" from "30"
jira
#420
opened Nov 29, 2024 by
bbrowning
[EPIC] Refactor Generate Functionality into a Standalone Python API
#412
opened Nov 26, 2024 by
aakankshaduggal
4 tasks
data generate --model parameter used for local file path and where to point to remote teacher model endpoint (need two separate variables)
bug
Something isn't working
#425
opened Nov 21, 2024 by
relyt0925
Remove / replace spellcheck auxiliary instruction in knowledge pipeline
#405
opened Nov 20, 2024 by
bbrowning
Download tokenizer artifacts in CI instead of storing them in tests/testdata/models
testing
Relates to testing
#384
opened Nov 14, 2024 by
khaledsulayman
Documentation Update for docling_model_path:
documentation
Improvements or additions to documentation
#383
opened Nov 14, 2024 by
aakankshaduggal
[Epic] Fully Utilize Docling V2 Capabilities
enhancement
New feature or request
#374
opened Nov 12, 2024 by
ktam3
6 tasks
Use Docling v2 hierarchical chunking instead of the existing context-aware chunking implementation
jira
#350
opened Nov 8, 2024 by
jwm4