Curriculum Vitae

Email Me Email

Education

Ph.D. in Statistics, University of South Carolina (USC), Columbia, SC, 08/2014—Present
  • GPA: 4.0/4.0, Research Interest: Spatial-Temporal Data Modeling, Simulation and Computation, Machine Learning Algorithms.
Master of Science in Statistics, The George Washington University (GWU), Washington, DC 08/2012—05/2014 Bachelor of Science in Actuarial Science, Central University of Finance and Economics, Beijing, China 09/2008—06/2012
  • GPA: 89/100, Rank: 1/40. Commencement Valedictorian of Class 2012

Projects

Customer Behavior Classification: Step-Up or Not, 07/2017
  • Designed a machine learning workflow on Spark to predict customers’ decision in switching insurance policies; applied ensemble methods (boosting and stacking) to improve accuracy.
Natural Language Processing: Fraud Detection In Insurance Company, 06/2017
  • Worked with claim department to cleanse, tokenize and vectorize the text description in auto insurance claims. Used SVM clustering to detect suspicious claims for further inspection.
Flood Risk in South Carolina: A Spatial-Temporal Data Analysis, 09/2016
  • Developed a Bayesian hierarchical model based on Gaussian process to evaluate local flood probability; worked with visualization APIs in R to display geographic distribution of rainfall.
Deep Learning Project: Convolutional Neural Network, 01/2016
  • Applied the Tensorflow framework to train and fine-tune a convolutional neural network to detect facial expression; compared with other methods e.g., KNN and random forest.

Working Experience

Lecturer, Department of Statistics, USC, Columbia, SC — 08/2014—Present
  • Taught 4 different undergraduate level courses (STAT 201, STAT 205, STAT 509 and STAT 512) and over 400 students in total since fall 2014.
Intern Data Scientist, the Hartford Financial Services Group, Hartford, CT 05/2017—08/2017
  • Worked on large data sets with H2O and Hadoop; designed a two stage frequency/severity model to predict the loss ratio; included random effects to improve model performance.
Associate Data Analyst, Galaxy Futures, Ltd, Beijing, China 05/2013—08/2013
  • Worked with SQL to fetch transaction data, based on which completed an arbitrage model for futures and derivatives; performed sensitivity test based on MCMC.

Programming Skills

  • R(5+ Years), Python(5+ years), Spark (1+ year), SAS(3+ years), SQL (3+ years), Linux, EXCEL