Next, in the upload page select the covid dataset. You may wish to learn more about the dataset at ncbi.
In the file info in the sidebar we can see there are 9000 obs and 33538 vars.
Obs (observations) include things like cell annotations (e.g. tumour or normal cell) or other information relating to the dataset.
In our case we have 3 obs columns:
Var (variables) refers to annotation of gene/feature metadata. This is a dataframe indexed by unique gene names (or other gene identifier such as ensembl ID).
In our case we have 3 columns:
To see more about the structure of Scanpy's anndata oject see here
Gene format: the format of gene symbols (the way the .var dataframe is indexed) whether they are ensembl IDs or gene symbols. It is recommended not to change these unless necessary.
Obs/var make names unique: Make obs/var names unique
Generally you will be uploading your own datasets. Currently H5AD, H5, loom and mtx are supported. You can also import a dataset from the EBI expression atlas using an accession key.