Farzana Patel

Data Scientist | Psychologist | Lead Data Consultant

Experience

Education

Languages and Technological skills

Python | R | Git | PowerBI | Tableau | SPSS | SQL | ETL | Data science pipeline | Statistics | Time series | Experimental design | Hypothesis testing | ML | NLP | Deep Learning

Project 1: World Happiness Report 2021

This project integrates various visualizations of happiness scores across the globe, along with other important parameters.

Project 2: Understanding the space of Facebook political adverts using topic modelling

The aim of this project was to understand the space of Facebook political adverts by topic, as well as differences by timing, party, and type of source (e.g. local political actors, campaigners, national parties). Using automated content analysis, the project aims to give the community meaningful insights into the inner workings of the political ad domain.

In the project, I implemented topic modelling on political advert texts to detect latent topics and temporal trends within the dataset. After a series of experiments, I settled on the optimal topic model, which I used to answer the research questions.

Top and bottom 10 happiest countries

Project 3: Credit Card Fraud Detection

This project predicts fraudulent credit card transactions using machine learning models.

AUC scores of models | Accuracy scores of models | Recall scores of models
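
The three metrics compared for this project can tell very different stories on imbalanced data such as fraud labels. The sketch below is not the project's code. It is a minimal, standard-library-only illustration of how accuracy, recall, and ROC AUC are computed. The labels and scores are made-up toy values, and the 0.5 decision threshold is an assumption.

```python
from typing import List, Sequence


def accuracy(y_true: Sequence[int], y_pred: Sequence[int]) -> float:
    """Share of predictions that match the true label."""
    correct = sum(1 for t, p in zip(y_true, y_pred) if t == p)
    return correct / len(y_true)


def recall(y_true: Sequence[int], y_pred: Sequence[int]) -> float:
    """Share of actual positives (frauds) that were flagged."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    return tp / (tp + fn) if (tp + fn) else 0.0


def roc_auc(y_true: Sequence[int], scores: Sequence[float]) -> float:
    """Probability that a random positive outscores a random negative.

    Ties count as half a win. This pairwise form is O(n_pos * n_neg),
    which is fine for a small illustration.
    """
    pos: List[float] = [s for t, s in zip(y_true, scores) if t == 1]
    neg: List[float] = [s for t, s in zip(y_true, scores) if t == 0]
    if not pos or not neg:
        raise ValueError("ROC AUC needs at least one positive and one negative")
    wins = sum(
        1.0 if p > n else 0.5 if p == n else 0.0
        for p in pos
        for n in neg
    )
    return wins / (len(pos) * len(neg))


# Toy data (hypothetical): 8 legitimate transactions, 2 frauds.
y_true = [0, 0, 0, 0, 0, 0, 0, 0, 1, 1]
scores = [0.10, 0.20, 0.10, 0.30, 0.05, 0.40, 0.20, 0.60, 0.35, 0.90]

# Assumed decision threshold of 0.5 to turn scores into hard labels.
y_pred = [1 if s >= 0.5 else 0 for s in scores]

print("accuracy:", accuracy(y_true, y_pred))
print("recall:  ", recall(y_true, y_pred))
print("ROC AUC: ", roc_auc(y_true, scores))
```

On this toy data the accuracy looks respectable even though half of the frauds are missed, which is why recall and AUC are usually reported alongside accuracy for fraud detection.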

Project 4: Advanced Statistics - Investigation of knowledge and skill development over a lifetime

In this project, I investigated how people develop skills and knowledge throughout their lifetime. In particular, I investigated how language exposure impacts later linguistic skills, cognitive abilities, and academic achievement. The goal was to build several models that quantify and test the postulated theoretical assumptions. Previous studies in language acquisition showed that language skills depend on the richness of the environment as well as a number of other factors. For this exam, I focused on the question of how people learn language and whether this influences other outcomes, such as university enrollment.

SEM Model output:
Structural Equation Model

Generalized linear model (GLM) prediction:
GLM prediction

Full GLM prediction:
Full GLM prediction

Project 5: Car Price Prediction

An automobile consulting company wants to understand the factors that determine car prices. The model's predictions will be used by management to understand the pricing dynamics of a new market.

Scatterplot of all numeric datapoints:
Scatterplot

Residual plot:
residuals

Linear Model prediction:
model prediction
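
As a rough illustration of the linear model and residual plot listed above, here is a minimal sketch of a one-feature ordinary least squares fit using only the Python standard library. It is not the project's actual model. The feature name `engine_size` and all values are hypothetical toy data chosen to keep the arithmetic easy to follow.

```python
from typing import List, Sequence, Tuple


def fit_simple_ols(x: Sequence[float], y: Sequence[float]) -> Tuple[float, float]:
    """Fit y = intercept + slope * x by ordinary least squares."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def residuals(
    x: Sequence[float], y: Sequence[float], slope: float, intercept: float
) -> List[float]:
    """Observed value minus fitted value for each data point."""
    return [yi - (intercept + slope * xi) for xi, yi in zip(x, y)]


# Hypothetical toy data: one predictor and a price in arbitrary units.
engine_size = [1.0, 1.5, 2.0, 2.5, 3.0]
price = [10.0, 14.0, 19.0, 22.0, 27.0]

slope, intercept = fit_simple_ols(engine_size, price)
res = residuals(engine_size, price, slope, intercept)

print(f"price = {intercept:.2f} + {slope:.2f} * engine_size")
print("residuals:", [round(r, 2) for r in res])
```

With an intercept in the model, OLS residuals sum to zero, so a residual plot should scatter around the zero line with no visible pattern if the linear form is adequate.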


Let's connect and chat! Open to anything under the sun.