Education

  • Georgia Institute of Technology, May 2020
    • GPA: 3.50 / 4.00
    • Track: Business Analytics
  • University of Georgia, B.A. Economics, Athens, GA, May 2017
    • Minor: Statistics, Certificate: Public Polity
    • Magma Cum Laude, GPA: 3.85 / 4.00

Skills


Technical

  • Tensorflow, Pyspark, Keras, Python, D3, HTML, CSS, JS, Django, Spark, Hadoop, JQuery, SQL, Tableau, R, Stata, Rats, JMP, Simio, Dash, Plotly

Data Analysis Techniques

  • Linear Regression, Random Forest, KNN, K-Nearest Neighbor, SVM, Neural Networks, Logistic Regression, Lasso, Ridge, Elastic Net, Outlier detection/correction, Time Series, Change Detection, and Principle Component Analysis

Big Data/Cloud

  • AWS: EMR, EC2, S3
    • I am familiar using SSH to connect to a cluster directly or using Zeppelin to run Pyspark
  • Microsoft: Azure
    • SSH into a VM to run Pig or Scala scripts
  • Databricks: Community version to run Pyspark
  • Google Colab: Used mainly for Tensorflow projects that require GPU speed


Georgia Tech Projects

STATISTICS: REGRESSION TO ANALYZE USED CAR LIST PRICES

  • Wrote a python script to scrape relevant data from Carfax.com
  • Created a model using a stepwise approach that maximized validation R-squared
  • Final model was able to correctly explain about 90% of the variability in test data

MARKETING RESEARCH: CLUSTERING BASED ON SURVEY DATA

  • Cleaned and scaled data to identify distinct groups within population
  • Employed robust clustering methods such as AHC and K-Means
  • Clearly identified trends among music preferences and lifestyle choices

VISUALIZATION: D3 TO INTERACT WITH DATA

  • Created site where user could input health and update a live line chart
  • Utilized US geographical data to generate a heatmap of the US where color corresponded to number of earthquakes with interactive mouse hover events
  • Created a heatmap that filtered data based on a dynamic dropdown menu with a fixed color scale to visualize US earthquake data by state
  • Created a multiline bar graph that generated detailed bar plot based on a dynamic filtering triggered by mouse hover

D3, PYTHON, AND SQL TO STUDY POLICY EFFECTS ON EDUCATION LEVEL

  • Utilized SQL to join, filter and clean a large dataset with the goal of finding factors at the state level that affect educational attainment
  • Generated a variety of algorithms and used statistical methods for verification and validation
  • Compiled data using interactive D3 visualizations

Experience

ANALYTICS INTERN (ECONOMIST/STATISTICIAN), CLARKE COUNTY SCHOOL DISTRICT, ATHENS, GA (August - December 2016)

  • Explored a survey administered every year to high school students and spearheaded effort to find pattern behind binge drinking. Objective was to reduce drinking among teens
  • Designed four predictive models to identify students at risk of binge drinking
  • Navigated complex ethical and technical issues inherent to the nature of CCSD’s objective
  • Combined results in a report designed for the layman and provided data-driven insights for superiors to act upon

PHARMACY TECHNICIAN, KROGER, CEDARTOWN, GA (August 2017 - June 2018)

  • Assisted pharmacist with administrative duties
  • Managed sensitive patient information and offered other pharmacy services such as vaccines or over the counter products
  • Maintained an awareness of all laws and policies to ensure compliance with regulatory bodies