
Install DagsHub:
pip install dagshub
To stream this data directly on DagsHub
from dagshub.streaming import DagsHubFilesystem
fs = DagsHubFilesystem(".", repo_url="https://test.dagshub.com/DagsHub-Datasets/ccle-dataset")
fs.listdir("s3://gdc-ccle-2-open")
Description
The Cancer Cell Line Encyclopedia (CCLE) project is an effort to conduct a detailed genetic characterization of a large panel of human cancer cell lines. The CCLE provides public access to genomic data, visualization and analysis for over 1100 cancer cell lines. This dataset contains RNA-Seq Aligned Reads, WXS Aligned Reads, and WGS Aligned Reads data.
Additional information
Documentation
Update frequency
Genomic Data Commons (GDC) is source of truth for this dataset; GDC offers monthly data releases,
although this dataset may not be updated at every release.
Managed by
License
NIH Genomic Data Sharing Policy: https://gdc.cancer.gov/access-data/data-access-policies