STELLA

Pre-training a Large Single-Cell RNA Model from Scratch for Educational Purposes.

🎉 0. Introduction

Data:
- Input: gene symbol sequence and gene expression sequence. (added)
- Preprocess: normalize_total (1e4) and log1p, then globally divided into 100 bins using tdigest algorithm.
Model Architecture:
- Transformer with DeepSeekMoE.
Pretraining Task:
- Masked Language Modeling (MLM). (Just like scBERT)

⚙️ 1. Environment Configuration

conda create -y -n stella python=3.10
conda activate stella

pip install notebook ipywidgets

# CUDA 12.4
pip install torch==2.4.0 torchvision==0.19.0 torchaudio==2.4.0 --index-url https://download.pytorch.org/whl/cu124

# transformers<=4.49.0
pip install transformers==4.49.0 datasets evaluate accelerate peft tensorboard
pip install deepspeed

pip install scanpy igraph leidenalg gseapy joblib tdigest

# Perturbation Prediction Tutorial Needed!
pip install torch_geometric

🧑🏻‍💻 2. Scripts

Pretrain: src/wanglab_workflow

# bash
cd src/wanglab_workflow
sh pretrain.sh

# slurm
cd src/wanglab_workflow
sbatch submit.sh

Downstream Tasks: tutorials
Cell Type Annotation
Gene Regulatory Network Inference (GRN)
Genetic Perturbation Prediction

📚 3. Results Overview

Cell Type Annotation & GRN

Genetic Perturbation Prediction

💕 4. Acknowledgement

Huggingface
scBERT
Geneformer

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
assets		assets
pretrained_models		pretrained_models
src		src
tutorials		tutorials
.gitattributes		.gitattributes
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

STELLA

🎉 0. Introduction

⚙️ 1. Environment Configuration

🧑🏻‍💻 2. Scripts

📚 3. Results Overview

💕 4. Acknowledgement

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

STELLA

🎉 0. Introduction

⚙️ 1. Environment Configuration

🧑🏻‍💻 2. Scripts

📚 3. Results Overview

💕 4. Acknowledgement

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages