Projects are a basic unit of data organization in the HCA Data Portal. Project contributors contribute raw sequencing and associated files along with rich metadata describing:
This metadata is included in the project's Metadata Manifest (a TSV file). When the HCA Data Portal processes the contributor's raw data with uniform pipelines, the processing information is also added to the Metadata Manifest.
The Data Portal Explore page lists all projects by title along with key project metadata. The project list is filterable by metadata values.
Selecting a project title on the project list takes you to the project's information page.
The project information page contains:
For each project, the HCA Data Portal maintains a project-specific TSV file containing the full project metadata. The TSV contains one row for each file in the project and one column for each metadata property. The meanings of the metadata properties are listed in the Metadata Dictionary.
The metadata TSV file is a tabular representation of the project's metadata graph that can be sorted and filtered using standard spreadsheet or data manipulation tools.
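As an illustration of the row-per-file, column-per-property layout, the following is a minimal sketch of filtering and sorting such a file with Python's standard `csv` module. The column names (`file_name`, `file_format`, `file_size`) and the sample rows are made up for this example and are not taken from a real manifest; consult the Metadata Dictionary for the actual property names.

```python
import csv
import io


def load_manifest(handle):
    """Read a tab-separated manifest into a list of dicts, one per file row."""
    return list(csv.DictReader(handle, delimiter="\t"))


def filter_rows(rows, column, value):
    """Keep only the rows whose given column equals the given value."""
    return [row for row in rows if row.get(column) == value]


# Hypothetical sample data: these column names and values are assumptions,
# not the real HCA manifest schema.
SAMPLE = (
    "file_name\tfile_format\tfile_size\n"
    "sample_b.fastq.gz\tfastq.gz\t2048\n"
    "sample_a.fastq.gz\tfastq.gz\t1024\n"
    "sample_a.bam\tbam\t4096\n"
)

rows = load_manifest(io.StringIO(SAMPLE))

# Filter to one file format, then sort the result by file size.
fastq_rows = filter_rows(rows, "file_format", "fastq.gz")
fastq_sorted = sorted(fastq_rows, key=lambda row: int(row["file_size"]))

for row in fastq_sorted:
    print(row["file_name"], row["file_size"])
```

To run this against a downloaded manifest, replace `io.StringIO(SAMPLE)` with a file opened via `open(path, newline="", encoding="utf-8")` and substitute the column names used in that file.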
The "Project Metadata" tab on the left of the Project Information page contains a link to download the project's metadata file.
Metadata file sizes vary across projects but will generally be between 1 and 100 megabytes.
The TSV file is named after the project and includes the date and time the file was created. For example:
ProstateCellAtlas 2023-11-09 08.10.tsv
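When handling several downloaded files, it can help to split the file name back into its parts. The sketch below assumes the pattern seen in the example above: a project name, a space, a `YYYY-MM-DD` date, a space, and a time written as `HH.MM`, followed by `.tsv`. The function name `parse_manifest_filename` is hypothetical, and the reading of `08.10` as hours and minutes is an assumption based on the single example.

```python
from datetime import datetime


def parse_manifest_filename(filename):
    """Split '<project> <YYYY-MM-DD> <HH.MM>.tsv' into (project, datetime).

    The pattern is inferred from one example file name and may not hold
    for every project.
    """
    if not filename.endswith(".tsv"):
        raise ValueError(f"not a TSV file name: {filename!r}")
    stem = filename[: -len(".tsv")]

    # Split from the right so that project names containing spaces survive.
    parts = stem.rsplit(" ", 2)
    if len(parts) != 3:
        raise ValueError(f"unexpected file name layout: {filename!r}")
    project, date_part, time_part = parts

    created = datetime.strptime(f"{date_part} {time_part}", "%Y-%m-%d %H.%M")
    return project, created


project, created = parse_manifest_filename("ProstateCellAtlas 2023-11-09 08.10.tsv")
print(project, created.isoformat())
```

Splitting from the right keeps the date and time intact even if a project name itself contains spaces.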
A partial example of a TSV file is shown below:
Each project processed with HCA Data Portal pipelines has HCA Data Portal-generated matrices. To download these project matrices, navigate to the Project Information page and select the "Project Matrices" tab on the left.
Scroll to identify the relevant matrix and then select the download icon.
Contributor-generated matrices are optionally provided by the project contributors. These matrices vary in file format and content. For questions about a specific contributor-generated matrix, reach out to the Project Contacts listed on the Project Information page.
To download a contributor-generated matrix, select the "Project Matrices" tab on the left of the Project Information page.
Scroll to the Contributor-Generated Matrices section and select the download icon.