Welcome to the Galaxy Human Cell Atlas project

Human Cell Atlas

The Human Cell Atlas Galaxy setup comprises of analysis tools and workflows for the analysis of Single Cell RNA-Seq data. It includes a module that connects to the Matrix Service API of the HCA’s Data Coordination Platform that enables retrieval of gene expression matrices from any data sets in the Human Cell Atlas. The instance is based on the Galaxy framework, which guarantees simple access, easy extension, flexible adaption to personal and security needs, and sophisticated analyses independent of command-line knowledge.

This setup aims to give users access to as much granularity as possible in terms of the downstream analysis steps provided by the major software for single cell data analysis: Scanpy, SC3, Scater and Seurat. For each of these tools, this Galaxy instance has decomposed modules for each the main functionalities: ingestion from 10x/loom, filtering (by cells or genes), scaling, normalisation, clustering, marker genes, and dimensionality reduction, among others. In the short term we expect to have interoperability between these tools through the Loom exchange format. Additionally, we provide specialised viewers for single cell clustering data: UCSC CellBrowser and cellxgene interactive tool (contributed by the great Galaxy community).

Tools available under HCA-Single Cell section were mainly brought to Galaxy by the Gene Expression Team at EMBL-EBI and the Teichmann Team at the Wellcome Sanger Institute.

Content

  1. Get started
  2. Available Workflows
  3. Available tools
    1. Single Cell Galaxy Tools developed for the Human Cell Atlas
  4. Contributors

Get started

Are you new to Galaxy, or returning after a long time, and looking for help to get started? Take a guided tour through Galaxy’s user interface.

Available Workflows

Workflow Description
Human Cell Atlas - Scanpy - CellBrowser Retrieve data from the Human Cell Atlas matrix service, analysis with Scanpy and visualisation with UCSC CellBrowser
EBI Single Cell Expression Atlas - Scanpy - CellBrowser Retrieve expression matrices from Single Cell Expression Atlas, analysis with Scanpy and visualisation with UCSC CellBrowser
EBI Single Cell Expression Atlas Scanpy Prod 1.3 Workflow used for clustering data in the release 6 to 9 of Single Cell Expression Atlas
EBI Single Cell Expression Atlas Tertiary Analysis Rel 10 Workflow used for clustering data in the release 10 of Single Cell Expression Atlas
EBI Single Cell Expression Atlas Tertiary Analysis Rel 11 Workflow used for clustering data in the releases 11 and 12 of Single Cell Expression Atlas
EBI Single Cell Expression Atlas Tertiary Analysis Rel 13 Workflow used for downstream analysis in the releases 13 to 15 of Single Cell Expression Atlas.

Available tools

In this section we list all tools that have been integrated in the RNA workbench. The list is likely to grow as soon as further tools and workflows are contributed. To ease readability, we divided them into categories.

Single Cell Galaxy Tools developed for the Human Cell Atlas

Data retrieval from Single Cell data Repositories

Tool Description Reference
hca_matrix_downloader Human Cell Atlas Matrix Downloader retrieves expression matrices and metadata from the Human Cell Atlas. Regev et al. 2018
retrieve_scxa EBI SCXA Data Retrieval downloads expression matrices and metadata from the EBI Single Cell Expression Atlas (SCXA) Papatheodorou et al. 2018

10x files produced by these tools can be consumed by 10x reader modules in the tools below.

Visualisation

Tool Description Reference
ucsc_cell_browser UCSC Cell Browser displays single-cell clusterized data in an interactive web application. cells.ucsc.edu

Scanpy

Granular tools for accessing the main Scanpy functionalities.

Tool Description Reference
scanpy_read_10x Scanpy Read10x into hdf5 object handled by scanpy Wolf et al. 2018
scanpy_filter_genes Scanpy FilterGenes based on counts and numbers of cells expressed Wolf et al. 2018
scanpy_filter_cells Scanpy FilterCells based on counts and numbers of genes expressed Wolf et al. 2018
scanpy_normalise_data Scanpy NormaliseData to make all cells having the same total expression Wolf et al. 2018
scanpy_find_variable_genes Scanpy FindVariableGenes based on normalised dispersion of expression Wolf et al. 2018
scanpy_scale_data Scanpy ScaleData to make expression variance the same for all genes Wolf et al. 2018
scanpy_run_pca Scanpy RunPCA for dimensionality reduction Wolf et al. 2018
scanpy_compute_graph Scanpy ComputeGraph to derive kNN graph Wolf et al. 2018
scanpy_find_cluster Scanpy FindCluster based on community detection on KNN graph Wolf et al. 2018
scanpy_find_markers Scanpy FindMarkers to find differentially expressed genes between groups Wolf et al. 2018
scanpy_run_tsne Scanpy RunTSNE visualise cell clusters using tSNE Wolf et al. 2018
scanpy_run_umap Scanpy RunUMAP visualise cell clusters using UMAP Wolf et al. 2018

Seurat

Granular tools for accessing the main Seurat functionalities. These tools received contributions from Maria Doyle @mblue9.

Tool Description Reference
seurat_read10x Seurat Read10x Loads 10x data into a serialized seurat R object Satija et al. 2015
seurat_create_seurat_object Seurat CreateSeuratObject create a Seurat object Satija et al. 2015
seurat_filter_cells Seurat FilterCells filter cells in a Seurat object Satija et al. 2015
seurat_normalise_data Seurat NormaliseData normalise data Satija et al. 2015
seurat_find_variable_genes Seurat FindVariableGenes identify variable genes Satija et al. 2015
seurat_scale_data Seurat ScaleData scale and center genes Satija et al. 2015
seurat_run_pca Seurat RunPCA run a PCA dimensionality reduction Satija et al. 2015
seurat_find_clusters Seurat FindClusters find clusters of cells Satija et al. 2015
seurat_find_markers Seurat FindMarkers find markers (differentially expressed genes) Satija et al. 2015
seurat_dim_plot Seurat Plot dimension reduction graphs the output of a dimensional reduction technique (PCA by default). Cells are colored by their identity class. Satija et al. 2015
seurat_run_tsne Seurat RunTSNE run t-SNE dimensionality reduction Satija et al. 2015
seurat_export_cellbrowser Seurat Export2CellBrowser produces files for UCSC CellBrowser import. Satija et al. 2015

Scater

Granular tools for accessing the main Scater functionalities. Normally used in combination with SC3.

Tool Description Reference
Unknown Tool Scater read 10x data Loads 10x data into a serialized scater R object McCarthy et al. 2017
Unknown Tool Scater CalculateQcMetrics based on expression values and experiment information McCarthy et al. 2017
scater_filter Scater Filter cells and genes based on pre-calculated stats and QC metrics McCarthy et al. 2017
Unknown Tool Scater DetectOutlier cells based on expression metrics McCarthy et al. 2017
Unknown Tool Scater CalculateCPM from raw counts McCarthy et al. 2017
scater_normalize Scater Normalise expression values by library size in log2 scale McCarthy et al. 2017

SC3

Granular tools for accessing the main SC3 functionalities. Normally used in combination with Scater.

Tool Description Reference
sc3_prepare SC3 Prepare a sc3 SingleCellExperiment object Kisilev et al. 2017
sc3_calc_consens SC3 Calculate Consensus from multiple runs of k-means clustering Kisilev et al. 2017
sc3_calc_transfs SC3 Calculate Transformations of distances using PCA and graph Laplacian Kisilev et al. 2017
sc3_calc_biology SC3 DiffExp calculates DE genes, marker genes and cell outliers Kisilev et al. 2017
sc3_estimate_k SC3 Estimate the number of clusters for k-means clustering Kisilev et al. 2017
sc3_calc_dists SC3 Calculate Distances between cells Kisilev et al. 2017
sc3_kmeans SC3 K-Means perform k-means clustering Kisilev et al. 2017

…plus all the great tools normally available at the usegalaxy.eu.

Contributors

Our Data Policy

Registered UsersUnregistered UsersFTP DataGDPR Compliance
User data on UseGalaxy.eu (i.e. datasets, histories) will be available as long as they are not deleted by the user. Once marked as deleted the datasets will be permanently removed within 14 days. If the user "purges" the dataset in the Galaxy, it will be removed immediately, permanently. An extended quota can be requested for a limited time period in special cases. Processed data will only be accessible during one browser session, using a cookie to identify your data. This cookie is not used for any other purposes (e.g. tracking or analytics). If UseGalaxy.eu service is not accessed for 90 days, those datasets will be permanently deleted. Any user data uploaded to our FTP server should be imported into Galaxy as soon as possible. Data left in FTP folders for more than 3 months, will be deleted. The Galaxy service complies with the EU General Data Protection Regulation (GDPR). You can read more about this on our Terms and Conditions.