-
Notifications
You must be signed in to change notification settings - Fork 31
MachineLearning
Investigating feature identification approaches for AA sequences, K-mers, in-silico prediction of epitopes, and study of secondary structures.
For the time being, there is a #machinelearning channel on the Slack group (check out the virtual-biohackathon@googlegroups.com group for the invitation link). During the BioHackathon, we'll update this section.
We've setup a dedicated GitHub organization here. For a detailed list of all tasks, code and resources, please go there. This page will be updated for the main points only.
- 1st e-meeting Sunday, April 5th @ 17:00 CEST, using zoom.
- 2nd e-meeting: tbd
- Phillip Davis (co-coordinator)
- Fotis Psomopoulos (co-coordinator)
- David Yuan
- Elizabeth Pereira
- Yachee Gupta
- Lukas Heumos
- Simon Verheyen
- Gonzalo Colmenarejo
- Maciej Bak
- Ceci Valenzuela
- Aneesh Panoli
- Alexandros Dimopoulos
- Tunca Dogan
- Thanasis Vergoulis
- Kostis Zagganas
- Marco Pietrosanto PhD in Biology (Bioinformatics)
- Andrea Guarracino PhD student in Biology (Bioinformatics)
- Francesco Ballesio PhD student in Biology (Bioinformatics)
- Nikolaos Pechlivanis
- Festus Nyasimi
- Tasos Papadopoulos
- Odysseas Platias
- Anastasis Togkousidis
- Hesham Haikal
Please check out the Datasets and Tools page.
Any new resources you might have in mind, please add them there directly.
Left here for reference / legacy - refer to the covid19-bh-machine-learning GitHub repo for details.
Investigating Multi-Layer Perceptrons, Convolutional Neural Networks, Regression Models, and Ensembl methods for prediction of disease progression, impact of geographical distribution, etc.
(Side Note: There seems to be some overlap between this Task and the BioStatistics Task. It may be worth considering merging these two.)
-
Machine learning requires much computing resources, in many cases GPUs. Kubeflow, as a highly portable and cloud native platform for workflows, is highly optimised for machine learning. Containerised workloads can easily be ported onto it.
-
Apply Markovian Clustering (MCL) on the currently available SARS-CoV-2 sequences GenBank sequences in order to identify potential groupings beyond the traditional phylogenetic ones. Apply both at the NT and the AA level, based on a number of distance metrics (aka e-value, string distance, etc).
-
Diagnose COVID-19 based on image data from CT scans and X-rays, using neural net models for image classification.
- Similar projects: SMART-CT-SCAN_BASED-COVID19_VIRUS_DETECTOR and COVID-Net