-
Notifications
You must be signed in to change notification settings - Fork 2.9k
Pull requests: huggingface/datasets
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
fix(iterable): ensure MappedExamplesIterable supports state_dict for resume
#7656
opened Jun 29, 2025 by
ArjunJagdale
fix(load): strip deprecated use_auth_token from config_kwargs
#7654
opened Jun 28, 2025 by
ArjunJagdale
feat(load): fallback to
load_from_disk()
when loading a saved dataset directory
#7653
opened Jun 28, 2025 by
ArjunJagdale
Add columns support to JSON loader for selective key filtering
#7652
opened Jun 27, 2025 by
ArjunJagdale
Introduces automatic subset-level grouping for folder-based dataset builders #7066
#7646
opened Jun 26, 2025 by
ArjunJagdale
Add ignore_decode_errors option to Image feature for robust decoding #7612
#7638
opened Jun 24, 2025 by
ArjunJagdale
Fix: Preserve float columns in JSON loader when values are integer-like (e.g. 0.0, 1.0)
#7635
opened Jun 24, 2025 by
ArjunJagdale
Pass user-agent from DownloadConfig into fsspec storage_options
#7631
opened Jun 21, 2025 by
ArjunJagdale
Add test for
as_iterable_dataset()
method in DatasetBuilder
#7629
opened Jun 19, 2025 by
ArjunJagdale
Add
as_iterable_dataset()
method to DatasetBuilder for streaming from cached Arrow files
#7628
opened Jun 19, 2025 by
ArjunJagdale
feat(map): reuse unchanged columns when input_columns specified to reduce disk usage (#6013)
#7626
opened Jun 19, 2025 by
ArjunJagdale
Guard against duplicate builder_kwargs/config_kwargs in load_dataset_…
#7622
opened Jun 17, 2025 by
Shohail-Ismail
fix: raise error when folder-based datasets are loaded without data_dir or data_files
#7618
opened Jun 16, 2025 by
ArjunJagdale
7 of 9 tasks
Enhance error handling and input validation across multiple modules
#7602
opened Jun 8, 2025 by
mohiuddin-khan-shiam
Previous Next
ProTip!
Follow long discussions with comments:>50.