Nuwa

Stars GitHub last commit CI tests docs

A bioinformatics web tool built with scanpy for genomics data processing and analysis.

Warning

Currently in beta version, not recommended for use in research or commercial use.

What it does?

Nuwa is an open-source, graphical web tool for single cell RNA seq analysis. It primarily focuses on functions available within the Scanpy library, but also incorporates other software packages and deep learning models. Nuwa runs on a local web server which users access in their browser.

Goals of the project

Nuwa is a free resource that aims to speed up single cell analysis with no coding required. However users can also export scripts in python or R for reproducible analysis results. When the project matures, institutions such as universities may wish to host it as a service to students/researches in their internal organisation (made easier with our docker compose files). We hope that in the future Nuwa will be an addition to other open-source software for lowering the cost of scRNA-seq analysis.

Quick start

Clone the repo:

#clone repo
git clone https://github.com/nuwa-genomics/Nuwa.git && cd Nuwa
Note

If using a GPU

If you are planning to use a GPU for faster training:

Then bring up containers:

for GPU:

docker-compose -f cuda.docker-compose.yml up -d --build

Or for CPU only:

docker-compose -f cpu.docker-compose.yml up -d --build

Usage

Navigate to http://localhost to use the app. Files are mounted to the streamlit-volume directory in the installation path.

Features

  • Upload scRNA expression matrices in multiple formats (h5ad, mtx, loom supported) including built-in datasets and online datasets using their EBI accession key
  • Preprocess data (includes mito, ribo and haem annotation, cell & gene filtering, highly variable genes, data scaling and normalization, cell-cycle scoring, batch effect correction, doublet detection, downsampling, subsampling and automated preprocessing recipes included in scanpy)
  • Integrate different dataset types using BBKNN, scanorama and Ingest
  • Visualise differential gene expression using multiple statistical tests across clusters or individual genes
  • A variety of deep learning models such as doublet detection with Solo and cluster analysis using Cite-seq autoencoder. Adjustable hyperparameters and hardware devices
  • Trajectory inference on data and easily adding PAGA embeddings to data
  • Spatial transcriptomics with metrics such as Ripley and Centrality scoring
  • 3D interactive chart to view generated clusters
  • Terminal to interact with the docker container running a bioconda environment

License

Nuwa is under MIT license, a short and simple permissive license with conditions only requiring preservation of copyright and license notices. Licensed works, modifications, and larger works may be distributed under different terms and without source code. See licence