Add support for long-format DFs #150

Merged
shchur merged 5 commits into autogluon:main from shchur:long-format-df-support
Jun 10, 2026

@shchur shchur commented Jun 10, 2026

Contributor

Issue #, if available:

Description of changes:

  • Support using long-format DFs as datasets inside fev.Task
  • Update the documentation about supported dataset formats
  • Speed up convert_long_df_to_hf_dataset by using pyarrow operations instead of pandas groupby
  • Replace Dataset.sort with sorting on the pa.Table to avoid creating _indices that need to be flattened afterwards

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.

@shchur shchur merged commit ac7f5d6 into autogluon:main Jun 10, 2026
2 checks passed
@shchur shchur deleted the long-format-df-support branch June 10, 2026 12:53