Muhammad Umar Salman

Abu Dhabi, UAE · (+971)-561-732912 · umar.salman1997@gmail.com · umar.salman@mbzuai.ac.ae

Currently I am pursuing my Master's degree at Mohamed Bin Zayed University of Artificial Intelligence (MBZUAI) in Natural Language Processing. In my last working experience, I was a Data Scientist Intern at G42 Healthcare, UAE's leading company in Artifial Intelligence and Cloud Computing. Before joining MBZUAI's Master's program I worked as a Data Scientist/Python Developer in New York-based startup, Decklaration. My major interests lie in Data Engineering and Data Science and would love to pursue a career in either of these fields after my graduation in 2023 with priority given to the former. I am very passionate to learn new things as I constantly strive to become a better version of myself. Through my experiences I have learnt that data-driven based strategic decisions are what primarily differntiates a growing business from a stagnant one and thats what drives me to work in the data field.

I am also looking for Data Engineering or Data Science internship oppurtunities.

If you want to contact me or get to know me better, send me an email at either of the email adresses above, follow me at LinkedIn/GitHub, or download my resume.

...

Education

Mohamed Bin Zayed University of Artificial Intelligence (MBZUAI), ABU DHABI, UAE

MSc. Natural Language Processing
Relevant Courses: Advanced Natural Language Processing, Artificial Intelligence, Big Data Processing, Speech Processing

GPA: 3.95

August 2021 - June 2023

Lahore University of Management Sciences (LUMS), Lahore, Pakistan

B.S Computer Science
Relevant Courses: Data Structures, Algorithms, Databases, Data Science, Deep Learning, Machine Learning, Software Engineering, Computer Vision.

GPA: 3.50

August 2016 - June 2020

Experience

Data Science Intern

G42 Healthcare - Abu Dhabi, UAE

Designed and worked on the development of a natural language interface that can parse a user question or statement, transform it into a structured criteria representation and produce an executable clinical data query represented as an SQL query conforming to an EHR (Electronic Health Records) and OMOP (Observational Medical Outcomes Partnership) Common Data Model.

Built a NLP pipeline which parses and preprocesses a user's statement and then passes it sequentially through to a Named Entity Recognition model, Entity Linking model and lastly a SQL Generation model which generates an executable query.

For the project I primarily used Python's Deep Learning frameworks and libraries such as PyTorch, Hugging Face and Scikit-Learn.

May 2022 - Aug 2022

Graduate Teaching Assistant

MBZUAI - Abu Dhabi, UAE
Aug 2022 - Dec 2022
Course: Machine Learning

Designed and created the lab session material and taught the assigned curriculum in lab sessions.

Evaluating student projects, labs, reports and other graded components.

Maintaining records on student progress/grades.

Jan 2022 - May 2022
Course: Data Mining

Designed and created the lab session material and taught the assigned curriculum in lab sessions.

Evaluated student projects, labs, reports and other graded components.

Maintained records on student progress/grades.

Python Developer | Data Scientist

Decklaration - Lahore, Pakistan

Developed the backend of a Crypto + Finance News Aggregator in Django (Python) and Postgres SQL in a team using its rest framework and deployed it on AWS

Designed and implemented a Trending Score Model and Recommendation System based on Collaborative Filtering and Natural Language Processing for a Crypto + Finance News Aggregator in Python.

Dec 2020 - Aug 2021

Research Assistant

United Arab Emirates University - (Remote) Lahore, Pakistan

Developed simulation codes of a selection algorithm for On‑Demand Mass Transport in Python that models the dynamic routing of public services to minimize the waiting time along with increasing bus capacity utilization.

Oct 2020 - Jan 2021

Data Analyst

Afiniti - Lahore, Pakistan

Designed models to optimize pairing between call-center agents and customer call groups to maximize the profit of the company in the Artificial Intelligence department where I also analyzed the effects of the model on the data using SQL and Python and R Programming Language.

Jun 2020 - Sep 2020

Research Assistant

LUMS - Lahore, Pakistan

Worked under the supervision of two professors to construct different machine learning models which predicts possible diabetes after looking at prediabetics vitals and their food consumption behavior of individuals.

Compared results of different using Scikit‑Learn’s machine learning models such as Naive Bayes, Support Vector Machine, Logistic Regression, Decision Trees, Random Forrest and lastly Multi‑Linear Perceptron (MLP) to see which models performed better.

Jan 2020 - May 2020

Undergraduate Teaching Assistant

LUMS - Lahore, Pakistan
Courses: Computer Problem Solving, Artificial Intelligence, Data Mining

Helped develop and analyze the course structure and various graded instruments for the aforementioned courses.

Facilitated the students with learning and building holistic understanding of various concepts in the areas mentioned above.

Implemented the automated testing from scratch to reduce checking times.

Aug 2018 - Jan 2020

Projects

Digital Email Assistant


Developed a model which takes a speech source input from a user and converts it to a structured email and then sends it to the recipient.

For the project I primarily used GCP's Speech-to-Text model, PyTorch, Hugging Face and Scikit-Learn.

OLX Business Analytics (Capstone Project)


Provided insights on the behavioral pattern of the customers as to which actions they take on the OLX website platform through clickstream and cluster analysis.

Constructed a conversion metric to identify serious buyers from casual window shoppers, using various Machine Learning and Association Mining algorithms.

Maedaan Application


Developed the backend for an application for booking playing fields in Django (Python framework) using it's rest framework to provide endpoint API's.

Deployed the backend application on AWS' EC2 instance and connected it to AWS S3 bucket and AWS RDS as a storage and database solution.

Forecasting Prices for Cryptocurrencies


Built a time-series model which forecasted a 3-month period for future closing prices for cryptocurrencies: Bitcoin and Ethereum, using an Auto-Regressive model (ARIMA) in Python with an accuracy of more than 90%.

Airline Database Management System


Designed and built a fully functional backend-system for an Airline to book tickets by administration and customers, using SQL, completely digitalizing the whole experience


Skills

Programming Languages
  • Python

  • Bash Scripting

  • SQL

  • Java

  • JavaScript

  • Golang

  • C/C++

  • R

  • VBA in Excel

Scraping and Automation
  • Scrapy

  • Selenium

Databases
  • Apache Cassandra

  • MySQL

  • PostgreSQL

  • MongoDB

Data Modelling and Manipulation
  • Pandas

  • Numpy

  • NLTK

  • Regex

  • Scikit-Learn

  • TensorFlow

  • PyTorch

Story Telling and Dashboarding
  • Tableau

  • Plotly and Dash

  • Chart.js

  • Powerpoint

  • Matplotlib

  • Seaborn

Big Data Tools
  • Apache Spark

  • Map Reduce

  • Hadoop

  • Apache Kafka

  • Apache Airflow

  • AWS EMR

Backend Development
  • Django

  • Flask

  • Node.js

Data Warehousing and Data Lakes
  • AWS Redshift

  • Snowflake

Tools
  • Git

  • Docker

  • Postman

  • Google Colab

  • Jupyter Notebooks

  • Slack

  • VS Code


Interests

  1. In my free time these days I am trying to learn Arabic.
  2. Enjoy watching and playing Tennis and Cricket.
  3. Play football, badminton and recently started learning to play squash.
  4. Love to swim.
  5. Just recently started following Formula 1.
  6. Occasionally play Call of Duty and Fifa.

Awards & Honors

  • Received UAE Golden Visa under the category of talented individuals, UAE
  • 3rd Place at the G42 and Coders HQ #HackForSpace event in 2021, Dubai Link
  • Graduated with high merit from Lahore University of Management Sciences 2020, Pakistan
  • Paced on Dean's Honors List for 3 semesters at Lahore University of Management Sciences, Pakistan
  • Earned an IBM Data Science Professional Certificate after completing a 9-course specialization on Coursera Link