Olusegun Stephen Omotunde

Timeline


  • April 2023 - Present

    Dream Chase Technology

    Position: Data Scientist

  • August 2022 - Feb 2023

    WeCloudData
    Toronto, Canada

    Program: Diploma in Data Science & Machine Learning

  • August 2021 - December 2022

    Dept. of Finance, College of Business, Bowling Green State University
    Bowling Green, Ohio

    Position: Graduate Teaching & Research Assistant

  • January 2021 - December 2022

    Bowling Green State University
    Bowling Green, OH

    Program: Master of Applied Statistics & Operations Research with a Business Analytics Specialization

  • January 2020 - December 2020

    Nigerian Airspace Management Agency
    Lagos, Nigeria

    Position: Financial Data Analyst

  • Jan 2018 - October 2018

    Caverton Helicopters
    Nigeria

    Position: Business Analyst Intern

  • September 2015 - October 2019

    University of Ilorin
    Kwara State, Nigeria

    Program: Bachelor Science in Finance

Projects


Queuing Analysis

Dunkin' Donuts Drive-Thru Queueing Analysis

Performed queueing analysis on customer arrival, service, and waiting times at a Dunkin' Donuts drive-thru. Collected and analyzed data for 54 customers during peak morning period. Explored distribution fits.

Skills:
Probability models, microsoft excel , R (R Studio & R Markdown), Queuing Theories, Binomial and Exponential Distribution

GitHub Repository
Mario Kart Experimental Design

Mario Kart Experimental Design

Designed and analyzed an ANOVA experiment evaluating the effect of game sound on Mario Kart race times. Checked model assumptions, performed ANOVA, and generated plots.

Skills:
Statistical Analysis, R (R Studio & R Markdown), Experiment Design

GitHub Repository
Baseball EDA Project

Baseball Exploratory Data Analysis

Performed exploratory analysis on career statistics of major league baseball players. Used visualizations and statistical methods to understand relationships between performance metrics like AtBat, Hits and Salary. Applied data cleaning, transformations, resistant models, binning and rootograms. Uncovered insights into distributions, correlations and deviations from normality

Skills:
Exploratory Data Analysis, statistical Analysis, R (R Studio & R Markdown)

GitHub Repository

Dashboard of King County House Sales

King County House Sales Tableau Dashboard

Interactive Tableau dashboard visualizing King County home sales data to deliver insightful market overview. Advanced calculations and filters enable drilling into property details. Visualizes sales trends, pricing, and feature impacts on value. Showcases data storytelling skills through effective charts, maps, parameters, and tooltips

Skills:
Tableau, Data Visualization


GitHub Repository
Dashboard of 365 Careers Customer Behavior

365 Careers Customer Behavior Analysis Tableau Dashboard

Interactive Tableau dashboard visualizing 365 Careers Customer Behavior to track Cohort Engagement and Customers Metrics.

Skills:
Tableau, Data Visualization


GitHub Repository
University Library Database

University Library Database

Designed and implemented a relational database using SQL in MariaDB Server and HeidiSQL for a university library system to track books, authors, publishers, students, and employees. Included tables for main entities, constraints, relationships, and sample data

Skills:
SQL, Relational Database, MariaDB Server, HeidiSQL

GitHub Repository
UFO Sightings

Tableau Citi Bike Analytics

Analysis of New York Citi's Bike Program which is responsible for overseeing the largest bike sharing program for 200,000+ data points in the United States through visualization in Tableau

Skills:
Tableau-Desktop


GitHub Repository

311 Calls San Diego

Bike Sales Analysis

This project analyzes bike sales data to understand purchasing trends and identify opportunities to boost sales. The analysis was performed in Excel using pivot tables, dashboards, and visualizations.

Skills:
Advanced Excel (Pivot table, custom formulas, dashboard & Visualization)

GitHub Repository
HR Analytics

HR Analytics

This project focuses on leveraging a substantial dataset to extract insights that can aid HR departments in understanding the market's recruitment landscape. The report delves into various insights obtained through detailed Exploratory Data Analysis (EDA) and data cleaning processes. Explore HR Analytics using SQL and Python for dataset creation, cleaning, and EDA. Visualize insights with Python (matplotlib, seaborn) and Tableau. Enhance HR decision-making with comprehensive analysis.

Skills:
Python, Panda, Matplotlib, MySQL, DataGrip, Tableau Desktop


GitHub Repository
NBA Project

NBA THEN AND NOW: A RE- CATEGORIZATION

NBA Player Clustering- Performed unsupervised clustering analysis on player stats to categorize NBA players based on style of play and on-court contributions. Analyzed box score data from the 2020-2021 season using K-means, hierarchical, and model-based clustering algorithms. Identified an optimal 3-cluster solution that divides players into groups called "In-the-Paints", "Generals", and "Versatiles" based on statistical profiles

Skills:
Unsupervised learning, machine learning, web-scraping, Clustering (K-means, hierarchical, and model-based) Algorithms, R (R Studio & R Markdown)

GitHub Repository
Big Data Sentiment Project

Big Data Sentiment Analysis on Twitter

This project focuses on sentiment analysis of tweets related to Black Friday shopping events. The dataset is obtained from the Twitter API, stored in CSV format, and loaded into an Amazon S3 bucket via Amazon Kinesis Data Firehose. A machine learning pipeline is established to train a Logistic Regression model for supervised sentiment analysis. The model's accuracy is then calculated, and the prediction data is written to a personal S3 bucket for further analysis.

Skills:
Supervised Machine Learning (Logistics Regression), AWS (s3, Athena, Kinesis Firehose, Quicksight), PySpark

GitHub Repository
Baseball Salary Prediction

Baseball Regression Project

Developed a regression model using R to predict MLB player salaries based on performance statistics. Selected predictors from a dataset of 263 player-seasons using methods like variable selection, transformations, and influence diagnostics. The final 10 variable model attained an adjusted R-squared of 0.75 and accurately predicted salaries on test data

Skills:
Linear Regression, Machine Learning, Supervised Learning, R (R Studio & R Markdown)

GitHub Repository
CSV Data Viz

CSV Data Viz Analysis Tool

CSV Data Viz Analysis Tool is a user-friendly application designed for visualizing structured data. This tool allows users to effortlessly create plots with accompanying summary explanations and easily embed them in a downloadable PowerPoint file. Powered by advanced language models, the application is tailored for seamless integration with Streamlit

Skills:
Python, Regex, Pandas, Matplotib, llm, gpt-3.5-turbo, gpt-4-vision-preview, OpenAI API, Streamlit

GitHub Repository

AWS

Implementation of an E-Commerce System on AWS in an automated way using Terraform and Ansible

In this project, I worked as a Cloud Engineer using DevOps, where I created and implemented an e-Commerce MVP (Minimum Viable Product) on AWS in less than 2 hours and in an automated way using Terraform and Ansible (Infrastructure as Code — IaC)

Skills:
AWS, Terraform, EC2, Ansible

GitHub Repository
AWS

Migration of a Workload running in a Corporate Data Center to AWS

In this project, I as the Cloud Specialist was responsible for migrating a workload running in a Corporate DataCenter to AWS using the Amazon EC2 and RDS service

Skills:
Amazon Ec2, Amazon VPC, MySQL, Amazon RDS, Internet Gateway

GitHub Repository
AWS

Implementation and deployment of a Scalable Web Application

In this project, I was responsible for implementing an application that needs to support the high demand of a large number of users accessing it simultaneously

Skills:
Amazon Route 53, Amazon Cloudfront, Amazon Elastic Beanstalk,Amazon Cloudwatch, Amazon Dynamo DB

GitHub Repository

Certificates


Amazon Web Service

Microsoft Azure

Google Cloud Platform

Oracle Cloud Infrastructure

Python

Tableau

LangChain

R Programming

CONTACT ME

I will be more than happy to talk with you.