My passion lies in harnessing the power of data to create better lives for all.

I currently live in San Francisco, but mostly breathe in Palo Alto on weekdays. My expertise and interests include statistical learning, big data engineering, and neural network applications.

When I'm not playing with data, I enjoy running, painting, and spending quality time with loved ones, among many other things. Looking forward to connecting!

Featured Work


American Journal of Lifestyle Medicine, 2018
Used statistical models to evaluate the feasibility of Community-based, Supervised Exercise Programs (CSEP) for improving the health of people with chronic medical conditions.

Machine Learning

Building Spotify’s “Discover Weekly” with Spark
Collaborative filtering algorithm in an audio recommendation system with MLlib & PySpark.
Developing a Matching Algorithm
Classification models, feature importances, & pairwise comparison for entity resolution.
Direct Marketing Optimization using Mobile Data
Classification ensembles for improving growth forecasts in user subscriptions.
Leveraging Philanthropy Impacts with Data Mining
An overview of the Beyond Profit Project sponsored by Bloomberg Philanthropies.
Meta-Learning for Credit Card Fraud Detection
Research study & presentation on fraud detection with Bayesian networks, k-NN, & decision trees.
Predicting NYC Renting Prices using Lasso Regression
Linear models for predicting next month's rent across New York neighborhoods.
Recommendation Systems for Purchase Data
An implementation of popularity & collaborative filtering models with Python & Turi Create.

Deep Learning

Improved Wasserstein Generative Adversarial Networks (GANs)
Building an improved version of Facebook's Deep Convolutional GANs implementation.
Starting out with Keras
Experiments with multilayer perceptrons and convolutional neural networks.

Natural Language Processing

Can a Chat a Day Keep the Doctor Away?
Building an end-to-end healthcare messenger bot using NLP and a matching algorithm.
Exploring Trending Topic Bias in News vs. Social Media
An NLP-based analysis of topics on New York Times data versus Twitter streams.
Making Boston Safer using Natural Language Processing
A set of classification methods to predict text data & model semantic categories.
Topic Modeling for The New York Times News Dataset
Nonnegative Matrix Factorization (NMF) approach for classifying news topics.

Data Visualizations

Legacy of a Century: South Africa Today
Exploration of the nation's journey after the life of Nelson Mandela (best to use Chrome).
Comparing Marvel and DC Superheroes
An attempt to settle the age-old fight with data and D3.
Exploratory Data Analysis & Visualization Resources
Site repository of visualization resources with JavaScript, HTML, CSS, and SVG.
How does Trump's budget cut affect you?
Effects of Trump's billion-dollar cuts in city transportation with R, Carto, and Processing.
Ranking the Top 100 Sci-fi Books
Using D3 and JavaScript to observe patterns.
Visualizations with R
Compilation of STAT GR5702 course assignments in descriptive statistics.
Visualizing the World's Poverty Rates
A quick attempt to visualize poverty rates using D3 and UN open source data.
What Makes Us Happy?
Definition of happiness in countries around the world.


  • It's the possibility of having a dream come true that makes life interesting.

    Paulo Coelho
  • If I had asked people what they wanted, they would've said faster horses.

    Henry Ford
  • We make a living by what we get. We make a life by what we give.

    Winston Churchill


Columbia University

M.S. in Data Science Dec '17

Coursework: Machine Learning, Applied Machine Learning, Deep Learning & Neural Networks, Algorithms, Exploratory Data Analysis & Visualizations, Computer Systems, Bayesian Modeling, Storytelling with Data, Tech Entrepreneurship.

Georgia Institute of Technology

B.S. in Industrial Engineering & Statistics May '14
Graduated Summa Cum Laude

Relevant Coursework: Probability Theory, Statistical Inference and Modeling, Database Systems Design and Manipulation, Regression and Forecasting, Quality Control, Optimization, Reliability Engineering (graduate level), Stochastic and Queueing Theory.



Data Scientist Mar '18 - present

Building end-to-end products involving big data engineering, machine learning (regression, classification, NLP), time series, algorithm development, and exploratory analyses.

NASA Goddard Institute for Space Studies

Machine Learning Intern Oct '17 - Jan '18

Constructed unsupervised clustering algorithms to assess ocean carbon cycle models and their atmospheric properties for ModelE climate simulations.

Columbia University

Graduate Teaching Assistant May '17 - Aug '17

Supervised the Applied Analytics capstone course (~140 graduate students) covering scenario modeling, data democratization, and information network mining in healthcare.

NBC Universal

Data Scientist Intern May '17 - Aug '17

Performed statistical inference, multivariate analyses, sampling, and clustering on high-dimensional consumer data. Built and automated R&D tools using Spark & Python.

Target Marketeam, Inc.

Data Analyst Jul '14 - Jun '16

Collaborated with SVP Analytics and worked cross-functionally to manage databases, develop linear models, and build A/B testing tools for nonprofit orgs' direct mail products.

United Nations World Food Programme

Research Assistant Aug '13 - May '14

Constructed centralized hub models for Specialized Nutritious Foods with Dr. Nazzal and the Spatial Risk Calendar team. Valuation resulted in a 30% decrease in food shortages in Zambia.

Georgia Institute of Technology

Computer Science & Statistics Teaching Assistant Dec '12 - Dec '13

Led weekly recitations, graded exams, and tutored students for the Data Manipulation & Database Systems and Applied Statistics courses (~650 students).

Technical Skills

  • Python, Spark, Hive/SQL, D3, HTML
  • R, TensorFlow, Scala
  • SAS, JMP

Selected Honors

Columbia Annual Data Science Hackathon, 1st Place Winner
Columbia Data Science Institute '17
Columbia Impact Hackathon, 1st Place Winner
Columbia Business School '16
Helen Grenga Nominee for Outstanding Woman Engineer
Georgia Institute of Technology '14
Rockwell Automation
Society of Women Engineers '13
Shannon & Wilson Technology Scholar
Shannon & Wilson, Inc. '11
International Leadership Award
International House NYC '17
Toyota Scholarship
International House NYC '16
President’s Undergraduate Research Award
Georgia Tech Research Institute '14
Faculty Honors
Georgia Institute of Technology '12
Dean's List
Georgia Institute of Technology '11-'14
Student Spotlight
Seattle Colleges Foundation '11