Import Q&A datasets directly from the Hugging Face Hub into your FloTorch workspace. This guide explains what you need and how to use the import form.
- A dataset created with HuggingFace as the source (in Dataset Form → Source)
- The dataset repository ID from Hugging Face
- Knowledge of the file structure in the target dataset (file names and column names)
- Open your Workspace → Datasets
- Open a dataset with HuggingFace source
- Go to the Files tab → click Add → Import from Hugging Face
- Or: Dataset detail page → Add → Import from Hugging Face
| Field | Description | Example |
|---|---|---|
| Repository | Hugging Face dataset repository ID in username/dataset-name format | allenai/mmlu |
| GT File | Filename of the ground truth / evaluation file in the repo | test.jsonl, validation.jsonl |
| Question Map From | Name of the source column that contains the question/prompt | input, question, prompt |
| Answer Map From | Name of the source column that contains the expected answer | output, answer, target |
| Limit | Maximum number of Q&A pairs to import | 100 |
| Field | Description | Example |
|---|---|---|
| Access Token | Hugging Face token for private repositories. Leave empty for public datasets | hf_xxxx... |
| Examples File | Filename for few-shot examples (used for N-Shot Prompts) | train.jsonl |
| Examples Question Map From | Column name for questions in the examples file | input |
| Examples Answer Map From | Column name for answers in the examples file | output |
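The fields in the two tables above can be pictured as a small record with a few sanity checks. The following is only a sketch for pre-checking values before typing them into the form: the `ImportRequest` class, its attribute names, and the validation rules are hypothetical, inferred from the field descriptions, and are not part of FloTorch.

```python
import re
from dataclasses import dataclass
from typing import List, Optional

# Assumed shape of "username/dataset-name"; the Hub's actual naming rules may be stricter.
REPO_ID = re.compile(r"[\w.-]+/[\w.-]+")


@dataclass
class ImportRequest:
    """Hypothetical mirror of the import form fields (not a FloTorch API)."""
    # Required fields
    repository: str
    gt_file: str
    question_map_from: str
    answer_map_from: str
    limit: int
    # Optional fields
    access_token: Optional[str] = None
    examples_file: Optional[str] = None
    examples_question_map_from: Optional[str] = None
    examples_answer_map_from: Optional[str] = None

    def problems(self) -> List[str]:
        """Return human-readable issues; an empty list means nothing obvious is wrong."""
        found = []
        if not REPO_ID.fullmatch(self.repository):
            found.append("Repository should look like username/dataset-name")
        if self.limit < 1:
            found.append("Limit should be a positive number")
        # Assumption: an examples file is only useful with both column mappings set.
        has_maps = self.examples_question_map_from and self.examples_answer_map_from
        if self.examples_file and not has_maps:
            found.append("Examples File is set but its column mappings are not")
        return found


request = ImportRequest("allenai/mmlu", "test.jsonl", "input", "target", 100)
print(request.problems())  # []
```
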
- Browse datasets: huggingface.co/datasets
- From the dataset page URL: https://huggingface.co/datasets/allenai/mmlu → use allenai/mmlu
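Deriving the repository ID from a dataset page URL can also be done in a few lines of code. This is a minimal sketch using only the Python standard library; the helper name `repo_id_from_url` is illustrative, and it assumes the two-part `username/dataset-name` form that the Repository field expects.

```python
from urllib.parse import urlparse


def repo_id_from_url(url: str) -> str:
    """Return 'username/dataset-name' from a Hugging Face dataset page URL."""
    parts = [p for p in urlparse(url).path.split("/") if p]
    # Expect a path of the form /datasets/<username>/<dataset-name>[/...]
    if len(parts) < 3 or parts[0] != "datasets":
        raise ValueError(f"Not a dataset page URL: {url}")
    return f"{parts[1]}/{parts[2]}"


print(repo_id_from_url("https://huggingface.co/datasets/allenai/mmlu"))  # allenai/mmlu
```

Anything after the dataset name in the path (for example a link into the file browser) is ignored, so a URL copied from a sub-page of the dataset still yields the same ID.
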
- Open the dataset page on Hugging Face
- Go to the Files and versions tab to see available files (e.g. train.jsonl, test.jsonl)
- Use the Preview or Explore feature to inspect the structure and column names
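If you prefer to check column names outside the browser, a JSONL file downloaded from the dataset can be inspected locally. The sketch below assumes the file has one JSON object per line; the helper name `jsonl_columns` and the sample size are illustrative choices, not anything FloTorch or Hugging Face prescribes.

```python
import json
from typing import Iterable, List


def jsonl_columns(lines: Iterable[str], sample: int = 5) -> List[str]:
    """Collect column names seen in the first `sample` records, in order of appearance."""
    columns: List[str] = []
    seen = 0
    for line in lines:
        line = line.strip()
        if not line:
            continue  # tolerate blank lines between records
        for key in json.loads(line):
            if key not in columns:
                columns.append(key)
        seen += 1
        if seen >= sample:
            break
    return columns


rows = [
    '{"input": "What is 2 + 2?", "target": "4"}',
    '{"input": "Capital of France?", "target": "Paris", "subject": "geography"}',
]
print(jsonl_columns(rows))  # ['input', 'target', 'subject']
```

For a real file, pass an open file handle instead of a list, for example `with open("test.jsonl", encoding="utf-8") as fh: print(jsonl_columns(fh))`. The names it prints are the candidates for Question Map From and Answer Map From.
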
| Field | Value |
|---|---|
| Repository | allenai/mmlu |
| GT File | test.jsonl (or the appropriate split) |
| Question Map From | input |
| Answer Map From | target |
| Limit | 100 |
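To make the mapping fields concrete, the sketch below shows conceptually what values like those in the table above select from the source rows. It is an illustration only, not FloTorch's implementation: the function name `map_qa_pairs`, the output keys `question` and `answer`, and the choice to skip rows that lack a mapped column are all assumptions.

```python
from typing import Any, Dict, Iterable, List


def map_qa_pairs(
    rows: Iterable[Dict[str, Any]],
    question_from: str,
    answer_from: str,
    limit: int,
) -> List[Dict[str, Any]]:
    """Pick the mapped question/answer columns from each row, up to `limit` pairs."""
    pairs: List[Dict[str, Any]] = []
    for row in rows:
        if len(pairs) >= limit:
            break
        # Assumed policy: rows missing either mapped column are skipped.
        if question_from not in row or answer_from not in row:
            continue
        pairs.append({"question": row[question_from], "answer": row[answer_from]})
    return pairs


source_rows = [
    {"input": "What is 2 + 2?", "target": "4", "subject": "math"},
    {"input": "Capital of France?", "target": "Paris", "subject": "geography"},
    {"input": "Largest planet?", "target": "Jupiter", "subject": "astronomy"},
]
print(map_qa_pairs(source_rows, "input", "target", limit=2))
```

With `limit=2`, only the first two rows are kept, and the unmapped `subject` column is dropped from the result.
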
- An import job runs in the background
- Progress is shown in the dataset Files tab
- When complete, the imported Q&A pairs appear as ground truth (and optionally examples) data