Uploading Datasets¶
This guide walks you through uploading a BIDS dataset to NEMAR.
Prerequisites¶
Before uploading:
- [ ] Dataset is in valid BIDS format
- [ ] Logged in with
nemar auth login - [ ] DataLad and git-annex installed
Step 1: Validate Your Dataset¶
Always validate before uploading:
Validation Must Pass
Datasets with validation errors cannot be uploaded. Fix all errors before proceeding.
Common Validation Issues¶
| Issue | Solution |
|---|---|
| Missing dataset_description.json | Create the required BIDS metadata file |
| Invalid JSON | Check for syntax errors in JSON files |
| Missing required fields | Add Name and BIDSVersion to dataset_description.json |
| Invalid modality data | Ensure data files match BIDS naming conventions |
Step 2: Upload¶
Options¶
| Option | Description |
|---|---|
--name, -n |
Dataset name (defaults to folder name) |
--description |
Brief description |
--skip-validation |
Skip BIDS validation (not recommended) |
--dry-run |
Show what would be uploaded without doing it |
Step 3: What Happens¶
The upload process:
- Validation - Runs BIDS validator
- Repository Creation - Creates private GitHub repo under nemarDatasets
- Metadata Push - Pushes small files to GitHub
- Data Upload - Uploads large files to S3 via git-annex
- Workflow Setup - Configures GitHub Actions for PR workflow
Step 4: Making Updates¶
After initial upload, make changes via pull requests:
cd nm000104 # Your dataset directory
# Create a branch
git checkout -b add-new-subjects
# Make changes...
# Commit and push
git add .
git commit -m "Add subjects 101-110"
git push -u origin add-new-subjects
# Create PR (opens in browser)
gh pr create
Changes are merged after admin review.
Troubleshooting¶
Upload Fails with Authentication Error¶
git-annex Errors¶
Large File Upload Timeout¶
For very large datasets, files are uploaded in batches. If a timeout occurs: