Hi there!

I'm

About

Hello there! My name is Pratheep Raju and I am currently pursuing a Master's degree in Business Analytics from the esteemed University of Texas at Dallas. Prior to this, I completed my undergraduate degree in Industrial Engineering from the National Institute of Technology in Jalandhar.

With over four years of professional experience in the field of analytics, I have had the opportunity to work across multiple industries including Fintech, Supply chain, Consulting, and Manufacturing domains. As a data professional, I possess a strong understanding of data analytics tools, techniques, and methodologies, which I have applied to solve complex business problems and drive data-driven decision making.

  • Website: https://pratheepraju.github.io/
  • Phone: (469) 350-0114
  • City: Dallas, TX
  • Age: 27
  • Degree: Master
  • Email: pratheepraju03@gmail.com

In my previous roles, I have worked closely with cross-functional teams to design and develop solutions that have helped organizations to optimize their operations, improve their customer experience, and enhance their bottom line. I am passionate about leveraging data to derive insights that drive business value and am always eager to learn and stay up-to-date with the latest trends and innovations in the field.

Resume

Hey there! Are you ready for some serious resume entertainment? Head on over to my website and check out my resume. It's so impressive, it might just make you want to hire me on the spot! Okay, maybe not on the spot, but I promise it's worth a look. Thanks a bunch!

Sumary

Pratheep Raju

A data analyst with 4 years of analytics experience and knowledge in the Fintech, Supply chain, Consulting, and Manufacturing domains. Proficient in collecting, analyzing, and presenting decisive insights from large datasets using SQL, Python, Excel, PowerBI, and Tableau.

  • Dallas, Texas
  • (469) 350-0114
  • pratheepraju03@gmail.com

Education

Master of Science in Business Analytics

2021 - 2023

University of Texas at Dallas, Richardson, TX

Dean’s Excellence Scholarship Recipient

Bachelor of Technology in Industrial & Production Engineering

2013 - 2017

Dr.B.R Ambedkar National Institute of Technology, Jalandhar, India

Academic projects

Payments Default Prediction – Credit Risk Analytics (Python)

  • Analyzed 30k+ credit card clients to create a logistic regression model and report the strongest predictors of payment default
  • Examined 23 financial metrics and constructed an SVM model to predict the risk of payment defaults with an F1 score of 0.85

Medicare spend by physicians in the US – Healthcare data analytics (Tableau)

  • Analyzed 10 million CMS claims data to segregate Physician’s practicing Oncology based on their number of beneficiaries and claims
  • Created Crosstab visualizations based on these segmentations and inferred insights for better beneficiary targeting

E-Commerce Goods Shipment – Supply Chain and Logistics Analytics (PowerBI, Python)

  • Performed data processing and exploratory analysis on 0.2 M e-commerce orders in PowerBI to understand relationships and discover patterns
  • Built a Multi-Output Decision Tree Regression model to determine the maximum range of e-commerce shipping time with an MSE of 0.05

Wildlife Data Engineering (MySQL, MongoDB, Relational Database)

  • Structured and systematized a database by normalization of 1.8M records of national fire program survey data to maintain data integrity
  • Performed data extraction, utilized data manipulation techniques to streamline data and tuned queries reducing execution time by 20%

Professional Experience

Data Scientist Intern - Blue Ridge Solutions

August 2022 - December 2022

Atlanta, GA

  • Built a classification model using ensemble algorithms (XGBoost, LGBM) in python with an accuracy of 90% to reduce model training time by 12%
  • Collaborated with the product team to understand reporting needs of the customers and established performance tracking by building reports in PowerBI extracting JSON data from AWS S3
  • Performed hyperparameter tuning increasing the prediction model accuracy by 9% to forecast demand and optimize inventory for retail clients
  • Supported in design, development and deployment of seasonal forecasting models using SARIMA and Prophet algorithms with 87% accuracy

Process Excellence Analyst - CRED

January 2020 – August 2021

Bengaluru, India

  • Extracted financial data (payments and reconciliation) from various data sources (AWS redshift, DynamoDB), conducted analysis using SQL and proposed process improvement suggestions reducing payment failures by 5% and cutting transaction costs by 3%
  • Performed in-depth ad hoc analysis on contact center data (tickets, impressions, surveys) using SQL and Python to create data models and recommended strategies to reduce customer support contacts by 20%
  • Designed 15+ interactive dashboards and advanced real-time data visualizations in Tableau for the senior management to monitor business metrics (KPIs), display statistical analysis and provide actionable insights to identify and solve critical business problems
  • Collaborated with the data science team to build classification and forecasting machine learning models to improve customer satisfaction by 17%
  • Setup data workflows and maintained data pipelines using ETL with the data engineering team to extract tickets data at 15-minute intervals
  • Co-ordinated with cross-functional stakeholders to gather requirements, define, build, and deliver solutions for customer facing problems through Agile Scrum methodology to manage operational aspects of the product and drive business excellence

Lean Engineer - Gemba Concepts

January 2018 – December 2019

Bengaluru, India

  • Improved revenue by 15% through financial analysis and product costing model using Excel VBA
  • Decreased inventory costs by 20% by developing databases in SQL Server, drafting queries, and generating data visualizations in Tableau
  • Achieved $90,000 cost reduction through analytical and Lean Six Sigma methodologies for process improvement
  • Provide regular project evaluations and recommendations to solve client's business problems.

Portfolio

Take a glance at my projects listed below to gain insight into the type of work I have undertaken, the manner in which I employ my expertise, and my methodology in carrying out tasks from conception to completion.

To gain a better understanding of both the projects showcased here and my broader portfolio, I invite you to peruse my Github and Tableau profile.

Payment default Prediction – Credit Risk analytics(Python)
  • Developed a classification model to predict payment default of Taiwanese credit clients
  • Utilized 6 months of financial records with over 25 credit card metrics
  • EDA and feature selection to prepare the data
  • Built supervised Machine Learning models using Ensemble Algorithms.


Data Visualisation(Tableau and PowerBI)
  • Medicare spend by physicians in the US – Healthcare Analytics (Tableau)
  • Customer Churn Analysis – Marketing (Tableau)
  • Retail Inventory Analytics – Supply chain (PowerBI)
  • Competitor Sales Analysis (PowerBI)



Wildlife Data Engineering (MySQL, MongoDB, Relational Database)
  • Structured and systematized a database by normalization of 1.8M records of national fire program survey data in MySQL
  • Generated results through queries and validated the same on MongoDB to reduce execution time


Vehicle Insurance – Customer Response Prediction(Python)
  • This Project aims at building a classification model for an insurance company to predict whether the policyholders (customers) from past year will be interested in buying the Vehicle Insurance provided by the company.
  • I have used 10 features providng information about demographics, Vehicles details and Policy information to predict customer response.
  • Building ML Pipelines of 6 supervised ML Algorithms
Patient admission duration prediction – Healthcare Analytics (Python)
  • This project aims to accurately predict the Length of Stay for each patient on case-by-case basis so that the Hospitals can use this information for optimal resource allocation and better functioning. The length of stay is divided into 11 different classes ranging from 0-10 days to more than 100 days.
  • Designed and trained multiple classification models to predict length of patient stay.
E-Commerce Goods Shipment - Supply chain analytics (Python)
  • E-commerce goods Late delivery risk identification by predicting fastest and normal Shipping Durations.
  • Built a Multi-Output Decision Tree Regressor to determine the maximum range of shipping time, by predicting the Fastest and Normal duration for shipping of goods for both Inland and International customers.
  • Built a Binary Classifier to classify orders with high probability of late delivery (Late Delivery Risk analyser).

Contact

Need to reach me urgently? Send a carrier pigeon, smoke signals, or just use the good ol' contact information below!

Location:

Dallas, Texas

Call:

(469) 350-0114