-
Notifications
You must be signed in to change notification settings - Fork 289
Pull requests: NVIDIA-NeMo/Curator
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
docs(fern): add local library autodocs without Fern auth
#2102
opened Jun 22, 2026 by
lbliii
Contributor
Loading…
2 of 3 tasks
docs: fix tutorial doc links and add data curation challenges
#2098
opened Jun 22, 2026 by
lbliii
Contributor
Loading…
2 tasks done
docs: expand tutorials Quick Start with docs and Core Concepts links
#2097
opened Jun 22, 2026 by
lbliii
Contributor
Loading…
2 tasks
docs: clarify fuzzy dedup input blocksize
community-request
waiting-on-customer
Waiting on the original author to respond
#2096
opened Jun 22, 2026 by
nightcityblade
Contributor
Loading…
3 tasks done
feat: Add DataChef recipe generation integration (Issue #1760)
community-request
#2095
opened Jun 22, 2026 by
anushkagupta200615-jpg
Loading…
3 tasks done
Aaftabv/qwen1967 global bucketing benchmark
#2094
opened Jun 22, 2026 by
mohammadaaftabv
Contributor
•
Draft
3 tasks
Aaftabv/qwen1967 local noio control
#2093
opened Jun 22, 2026 by
mohammadaaftabv
Contributor
•
Draft
3 tasks
fix: make quickstart Ray startup Xenna-safe
community-request
waiting-on-maintainers
Waiting on maintainers to respond
#2089
opened Jun 18, 2026 by
nightcityblade
Contributor
Loading…
3 tasks done
Add option to drop deduplication id field
community-request
waiting-on-maintainers
Waiting on maintainers to respond
#2078
opened Jun 16, 2026 by
nightcityblade
Contributor
Loading…
3 tasks done
fix: open remote JSONL files before cuDF reads
community-request
waiting-on-maintainers
Waiting on maintainers to respond
#2076
opened Jun 14, 2026 by
nightcityblade
Contributor
Loading…
2 of 3 tasks
Draft: ASR Open Source Datasets Processing Pipeline
#2067
opened Jun 11, 2026 by
sushmitha-deva-09
Contributor
Loading…
3 tasks
[wip][nightly] RAPIDS 26.08* / Ray 3* / Dynamo 1.3* + bump transformers 5 + data-designer 0.61
#2065
opened Jun 11, 2026 by
praateekmahajan
Contributor
•
Draft
3 tasks
[wip] Ray 2.56 nightly + Dynamo 1.3.0 + vLLM 0.22 (cu129)
#2064
opened Jun 10, 2026 by
praateekmahajan
Contributor
•
Draft
Pipeline resumability via source-level counter checkpointing
#2063
opened Jun 10, 2026 by
abhinavg4
Contributor
Loading…
test: cover remote pairwise file paths
community-request
waiting-on-maintainers
Waiting on maintainers to respond
#2061
opened Jun 10, 2026 by
nightcityblade
Contributor
Loading…
3 tasks done
Add support for Slurm arrays
#2059
opened Jun 9, 2026 by
sarahyurick
Contributor
Loading…
5 of 6 tasks
fix: auto-detect Ray fanout stages
community-request
waiting-on-customer
Waiting on the original author to respond
#2056
opened Jun 8, 2026 by
nightcityblade
Contributor
Loading…
3 tasks done
Add review-curator-audio-pr Cursor skill for reviewing audio Curator PRs
#2051
opened Jun 5, 2026 by
mohammadaaftabv
Contributor
Loading…
4 of 5 tasks
Dynamo Server Fixes + Nemotron Parsing PDF Benchmark changes
#2050
opened Jun 5, 2026 by
praateekmahajan
Contributor
Loading…
3 tasks
fix: default workflow input extensions by filetype
community-request
#2045
opened Jun 3, 2026 by
nightcityblade
Contributor
Loading…
3 tasks done
Previous Next
ProTip!
Follow long discussions with comments:>50.