Mohammed Shaban
Data Scientist • AI/ML Expert
Available for New Opportunities

Mohammed Shaban

Data Scientist • AI/ML • Healthcare & Drug Discovery

Data Scientist with 3+ years delivering AI/ML-based project delivery in healthcare, oncology, and drug discovery. I drive data contextualization and model deployment pipelines to maximize efficiency and translate complex models into measurable business outcomes aligned with long-range strategic objectives.

MS
AI Impact Metrics
Delivering measurable outcomes
6mo
Accelerated
$2M
Investment
0.82
AUC Score
Mestastop Solutions
Turing Minds.Ai
Case Western Reserve University
AWS SageMaker
XGBoost
Random Forest
SHAP Explainability
Drug Discovery
Oncology AI
Multi-Omics
Mestastop Solutions
Turing Minds.Ai
Case Western Reserve University
AWS SageMaker
XGBoost
Random Forest
SHAP Explainability
Drug Discovery
Oncology AI
Multi-Omics
Career Journey

Professional Experience

A track record of delivering measurable AI/ML outcomes across healthcare, oncology, and drug discovery.

Jr. Data Scientist
Mestastop Solutions
June 2024 – Present
  • Lead AI/ML-based project delivery, articulating complex genomic and clinical insights to senior leadership and C-suite
  • Developed and optimized ML workflows reducing delivery cycles and maximizing efficiency across business operations
  • Accelerated 2 drug programs by 6 months through prioritized experiments and effective resource allocation
  • Influenced $2M budget allocation for AI-driven drug discovery by presenting ROI-focused roadmaps to C-suite and board
  • Applied SHAP-based explainability (80–85% interpretability) for actionable insights via management dashboards
Leadership Strategic Planning SHAP MLOps
Data Scientist
Turing Minds.Ai
July 2022 – March 2023
  • Built ensemble models (XGBoost, Random Forest) on 1,500+ structured datasets
  • Drove model performance from 70% to 80% through iterative preprocessing, feature selection, and cross-validation
  • Delivered analytical insights via dashboards and presentations to management and stakeholders
XGBoost Random Forest Feature Engineering Dashboards
Data Scientist Trainee
Turing Minds.Ai
April 2022 – June 2022
  • Conducted EDA to identify patterns and anomalies using Pandas, Matplotlib, and Seaborn
  • Documented technical workflows and provided insights to internal teams
  • Supported junior talent development and onboarding processes
EDA Pandas Matplotlib Seaborn
Portfolio

Key Projects

Real-world AI/ML projects delivering quantifiable outcomes in healthcare, oncology, and drug discovery.

Companion Diagnostics (Oncology)
Team: 2 | Python, XGBoost, Random Forest, SHAP

Integrated patient clinical and molecular data for 100+ patients using ensemble ML, achieving 0.82 AUC. Designed predictive algorithms validated across 3 oncology biomarkers (92% sensitivity, 89% specificity) to drive targeted interventions.

Refined 500+ patient sample data via Python and Bioconductor, reducing experimental time by 3 weeks and improving throughput.

Oncology Biomarkers 0.82 AUC 92% Sensitivity
Drug Discovery & Drug Repurposing
Team: 3 | Python, Scikit-learn, SHAP, AutoDock Vina, ChEMBL

Collaborated with multi-disciplinary teams to analyze 100+ compounds, achieving 65% improvement in binding affinity and reducing experimental testing time by 35%.

Engineered feature extraction pipeline processing 500GB+ of multi-omics data (RNA-seq, proteomics), reducing dimensionality by 95% while retaining biological signals.

Conducted retrospective clinical analysis (N=73) uncovering patient subgroups with 2.3x higher treatment efficacy.

Drug Discovery Multi-Omics 65% Improvement 2.3x Efficacy
Pima Indian Diabetes Prediction
SKDNN Classifier, KNN Optimization

Implemented predictive modeling on the Pima Indian Diabetes Dataset achieving 85% accuracy with a 90% training and 10% testing split.

Proposed a novel distance calculation formula reducing computation time by 25% and increasing nearest-neighbor accuracy by 15% in KNN.

Diabetes Prediction SKDNN 85% Accuracy KNN Optimization
Technical Arsenal

Skills & Expertise

Comprehensive expertise spanning machine learning, cloud infrastructure, and domain-specific healthcare AI.

Programming & ML Frameworks

Python, SQL, Bash Expert
Scikit-learn, XGBoost Expert
TensorFlow, PyTorch Advanced
SHAP, LIME Advanced

Data Science & MLOps

EDA, Statistical Modeling Expert
AWS (S3, EC2, SageMaker) Advanced
Docker, GitHub Advanced
Genomics, Bioinformatics Advanced
Academic Background

Education & Certifications

Strong academic foundation complemented by specialized training in data science and healthcare AI.

PG Program in Data Science
Case Western Reserve University, upGrad & INSOFE
2023 | Bengaluru
B.E. Information Science & Engineering
T John Institute of Technology
2022 | Bengaluru | CGPA: 7.0/10
Professional Certifications
  • Biology of Cancer - Coursera, 2024
  • Computational Data Science - Case Western Reserve University & INSOFE, 2023
Get In Touch

Let's Work Together

Ready to discuss AI/ML opportunities in healthcare, oncology, or drug discovery? I'm open to new challenges and collaborations.

📧
Email
mohammedshaban854@gmail.com
📱
Phone
+91-9886549746
📍
Location
Bengaluru, India
💼
LinkedIn
mohammed-shaban-6a8915229