Education
- Georgia Institute of Technology, May 2020
- GPA: 3.50 / 4.00
- Track: Business Analytics
- University of Georgia, B.A. Economics, Athens, GA, May 2017
- Minor: Statistics, Certificate: Public Policy
- Magna Cum Laude, GPA: 3.85 / 4.00
Skills
Technical
- TensorFlow, PySpark, Keras, Python, D3, HTML, CSS, JS, Django, Spark, Hadoop, jQuery, SQL, Tableau, R, Stata, RATS, JMP, Simio, Dash, Plotly
Data Analysis Techniques
- Linear Regression, Random Forest, K-Nearest Neighbors (KNN), SVM, Neural Networks, Logistic Regression, Lasso, Ridge, Elastic Net, Outlier Detection/Correction, Time Series, Change Detection, and Principal Component Analysis
Big Data/Cloud
- AWS: EMR, EC2, S3
- Familiar with using SSH to connect to a cluster directly or using Zeppelin to run PySpark
- Microsoft: Azure
- SSH into a VM to run Pig or Scala scripts
- Databricks: Community version to run PySpark
- Google Colab: Used mainly for TensorFlow projects that require GPU speed
Georgia Tech Projects
STATISTICS: REGRESSION TO ANALYZE USED CAR LIST PRICES
- Wrote a Python script to scrape relevant data from Carfax.com
- Created a model using a stepwise approach that maximized validation R-squared
- Final model explained about 90% of the variability in the test data
MARKETING RESEARCH: CLUSTERING BASED ON SURVEY DATA
- Cleaned and scaled data to identify distinct groups within the population
- Employed clustering methods such as agglomerative hierarchical clustering (AHC) and K-Means
- Clearly identified trends among music preferences and lifestyle choices
VISUALIZATION: D3 TO INTERACT WITH DATA
- Created a site where users could input health data and update a live line chart
- Utilized US geographical data to generate a heatmap of the US in which color corresponded to the number of earthquakes, with interactive mouse hover events
- Created a heatmap that filtered data based on a dynamic dropdown menu with a fixed color scale to visualize US earthquake data by state
- Created a multiline bar graph that generated a detailed bar plot based on dynamic filtering triggered by mouse hover
D3, PYTHON, AND SQL TO STUDY POLICY EFFECTS ON EDUCATION LEVEL
- Utilized SQL to join, filter, and clean a large dataset with the goal of finding state-level factors that affect educational attainment
- Applied a variety of algorithms and used statistical methods for verification and validation
- Compiled data using interactive D3 visualizations
Experience
ANALYTICS INTERN (ECONOMIST/STATISTICIAN), CLARKE COUNTY SCHOOL DISTRICT, ATHENS, GA (August - December 2016)
- Explored an annual survey of high school students and spearheaded an effort to find patterns behind binge drinking, with the objective of reducing drinking among teens
- Designed four predictive models to identify students at risk of binge drinking
- Navigated complex ethical and technical issues inherent to the nature of CCSD’s objective
- Combined results in a report designed for the layman and provided data-driven insights for superiors to act upon
PHARMACY TECHNICIAN, KROGER, CEDARTOWN, GA (August 2017 - June 2018)
- Assisted pharmacist with administrative duties
- Managed sensitive patient information and offered other pharmacy services such as vaccines and over-the-counter products
- Maintained an awareness of all laws and policies to ensure compliance with regulatory bodies