CV
Basics
| Name | Ilyes Baali |
| Label | PhD candidate in Computational Bilogy |
| ilb4001@med.cornell.edu | |
| Phone | (646) 236-4417 |
| Url | https://ilyes495.github.io/ |
Work
-
2021.07 - Present DOCTORAL RESEARCHER
Morris Lab (Prof. Quaid Morris) MSKCC and WCM, New York, NY
• Develop a machine learning-based recalibration model that can accurately predict the cellular binding sites of RNA-Binding Proteins (RBPs) given only their intrinsic binding preferences.
• Reanalyze all available CLIP data to identify biases and experimental artifacts in the dataset. We found evidence that as much as a third of the eCLIP data associated with ENCODE RBPs does not measure direct binding.
• Investigate various machine learning techniques such as AlphFold, for representing proteins to predict the binding preferences of RNA-binding Proteins (RBPs) from their amino-acid sequences.
• Analyze large genomic data to understand the role of MSI2-RBP in AML, this include running various bioinformatics pipelines to detect binding sites from HyperTRIBE assay and measure translation efficiency from RiboSTAMP assay.
• Collaborate with wet-lab teams to address their bioinformatics needs, prepared presentations to communicate the findings and results.
• Supervise undergraduate interns and graduate rotation students in CLIP-seq data analysis, applying machine learning techniques, and utilizing HPC clusters.
• Organize regular lab-meeting for a whole year, including scheduling presentations, developing automated scripts to collect and distribute presentation feedback.- RNA-Binding Proteins
- Machine Learning
- CLIP-seq
- AML
-
2019.06 - 2019.09 VISITING RESEARCHER
Morris Lab at University of Toronto (Prof. Quaid Morris), Toronto, Canada
• Designed and implemented a multitask deep learning model for predicting in vivo binding sites for RNA-binding proteins (RBPs) using in vitro binding preferences learned from paired data.
• This model utilizes advanced machine learning techniques to accurately predict binding site locations within the genome, which can provide valuable insights into the regulatory mechanisms of RBPs and their role in various biological processes. -
2018.10 - 2020.07 RESEARCH ASSISTANT
Computational Biology Lab (Assoc. Prof. Hilal Kazan & Prof. Cesim Erten), Antalya, Turkey
• Developed a novel algorithm for identifying overlapping cancer driver modules using multi-omics data from The Cancer Genome Atlas (TCGA).
• Performed the classification and survival analysis for a novel approach named MEXCOWalk for identifying cancer driver modules.
• Co-supervised a final year undergraduate student’s independent study project, providing support and guidance on various aspects including experimental design, data analysis, and report writing.- Cancer Driver Genes
-
2016.10 - 2017.08 SENIOR PROJECT
Machine Learning Lab (Assoc. Prof. Hilal Kazan), Antalya, Turkey
• Participated in the Neuroblastoma data Integration challenge organized by the CAMDA consortium, with the goal of improving the prediction of clinical outcome, survival time, or disease mechanisms through the integration of multiple data types, as compared to the original expression only study conducted by the FDA SEQC.
• Integrated various types of data such as gene expression and aCGH data to improve the prediction of survival time. -
2016.06 - 2016.09 SUMMER INTERNSHIP
Digital Media and Data Reconstruction Lab (Prof. Lizhuang Ma), Shanghai, China
• Learned the principles of machine and deep learning methods, with a particular focus on convolutional neural networks.
• Built a face recognition system using the GoogleNet neural network, which achieved an accuracy of 94% on the LFW (Labelled Faces in the Wild) dataset.
Education
-
2020.09 - Present New York, NY
PhD
Weill Cornell Medicine and Memorial Sloan Kettering Cancer Center
Computational Biology
- Genomics
- Bioinformatics
- Biostatistic
- Molecular Biology and Cell Biology
-
2018.10 - 2020.08 Antalya, Turkey
-
2014.09 - 2017.06 Antalya, Turkey
-
2013.09 - 2017.06 Antalya, Turkey
Awards
- 2020.09.01
PhD Fellowship
Followship for the whole duration of the PhD
The PhD Fellowship is a prestigious award that aims to support the most talented students in the field of computational biology.
Publications
-
2022 Machine learning for wearable IoT-based applications: A survey
Transactions on Emerging Telecommunications Technologies
A comprehensive survey on the application of machine learning techniques in wearable IoT-based applications.
-
2020 MEXCOwalk: mutual exclusion and coverage based random walk to identify cancer modules
Oxford University Press
MEXCOwalk is a method for identifying cancer modules based on mutual exclusion and coverage using random walk algorithms.
-
2020 DriveWays: a method for identifying possibly overlapping driver pathways in cancer
Nature Publishing Group UK London
DriveWays is a seed-and-extend based heuristic that identifies overlapping cancer driver modules from the graph built from the IntAct PPI network.
-
2018 Predicting clinical outcomes in neuroblastoma with genomic data integration
BioMed Central
This study integrates genomic data to predict clinical outcomes in neuroblastoma.
Skills
| Computational Biology | |
| Perform various differential analyses, Analyze CLIP and RNA-editing based assays, Apply machine learning techniques to functional genomic data |
| Programming | |
| Python | |
| R | |
| Bash | |
| JavaScript | |
| SQL | |
| LaTeX |
Languages
| Arabic | |
| Native speaker |
| French | |
| Good |
| English | |
| Fluent |
| Turkish | |
| Basic |
Interests
| Artificial Intelligence |
| Computational Biology |