-
Notifications
You must be signed in to change notification settings - Fork 3.2k
Pull requests: huggingface/datasets
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
fix(arrow_dataset): clear stale local temp dir before re-downloading from remote FS
#8196
opened May 14, 2026 by
xodn348
Loading…
3 tasks done
Fix spurious label column when folder builders see split-named directories
#8195
opened May 12, 2026 by
1fanwang
Loading…
Escape glob chars in
base_path so directory paths with [] work (#7468)
#8192
opened May 10, 2026 by
jbbqqf
Loading…
Preserve
info.features across IterableDataset.map(remove_columns=...) (#7568)
#8191
opened May 10, 2026 by
jbbqqf
Loading…
Reject
num_shards > len(dataset) in Dataset.shard (#7443)
#8190
opened May 10, 2026 by
jbbqqf
Loading…
docs: clarify
num_proc semantics in Dataset.batch / Dataset.filter (#7700)
#8189
opened May 10, 2026 by
jbbqqf
Loading…
Don't infer labels from split-named directories in folder-based builders (#7880)
#8188
opened May 10, 2026 by
jbbqqf
Loading…
Better error for sliced splits in streaming mode (
train[:10%], train[:N]) (#7721)
#8187
opened May 10, 2026 by
jbbqqf
Loading…
docs: fix broken WebDataset documentation link in audio/video/image dataset pages (#7699)
#8186
opened May 10, 2026 by
jbbqqf
Loading…
docs: make
Dataset.map batched example self-contained (#7703)
#8185
opened May 10, 2026 by
jbbqqf
Loading…
Preserve triple-slash in remote URLs (HDFS, file://, ...) in
_as_str (#7934)
#8184
opened May 10, 2026 by
jbbqqf
Loading…
Add dtype="list" to List / Sequence / LargeList features (#8002)
#8183
opened May 10, 2026 by
jbbqqf
Loading…
Fix Dataset.map crash when first examples return None and later return dict (#7990)
#8182
opened May 10, 2026 by
jbbqqf
Loading…
Nightly OSS: Bug loading
sentence-transformers/parallel-sentences
#8177
opened May 7, 2026 by
xodn348
Loading…
feat: add
on_error to Image/Audio/Video features and IterableDatase…
#8174
opened May 6, 2026 by
LIUYellowBlack
Loading…
fix(image): guard exif_transpose against corrupted EXIF rational tags
#8173
opened May 5, 2026 by
xodn348
Loading…
fix(map): fix progress bar exceeding total when load_from_cache_file=False
#8170
opened May 3, 2026 by
Nitin-Rajasekar
Loading…
docs(map): clarify that fn_kwargs yields a stable cache fingerprint vs closures
#8168
opened May 1, 2026 by
xodn348
Loading…
Include all test files in sdist via MANIFEST.in
#8162
opened Apr 29, 2026 by
VladimirGutuev
Loading…
feat: add TsFile (Apache IoTDB) packaged builder with per-device wide format
#8160
opened Apr 29, 2026 by
JackieTien97
Loading…
1 of 3 tasks
Fix revision-aware cache lookup in load_dataset
#8156
opened Apr 26, 2026 by
Jatin-Shihora
Loading…
6 tasks done
fix: tolerate chmod PermissionError on flat-permission filesystems
#8146
opened Apr 22, 2026 by
dschulmeist
Loading…
4 tasks done
Previous Next
ProTip!
Exclude everything labeled
bug with -label:bug.