
Install DagsHub:
pip install dagshub
To stream this data directly on DagsHub
from dagshub.streaming import DagsHubFilesystem
fs = DagsHubFilesystem(".", repo_url="https://test.dagshub.com/DagsHub-Datasets/gatk-test-data-dataset")
fs.listdir("s3://gatk-test-data")
Description
The GATK test data resource bundle is a collection of files for resequencing human genomic data with the Broad Institute’s Genome Analysis Toolkit (GATK).
Additional information
Update frequency
Every 3 months
Managed by
Broad Institute
License
CC0 1.0 Universal (CC0 1.0) Public Domain Dedication