Turning Data into Actionable Insights

Data Analyst and Project Manager specializing in Python, SQL, machine learning, and statistical modeling to drive business decisions.

About Me

alt="Fouzia Ashfaq" class="w-full h-full object-cover" onerror="this.onerror=null;this.src='images/fallback-image.jpg'">

Fouzia Ashfaq

Data Analyst & Project Manager

Professional Summary

Highly analytical Data Analyst and Project Manager with expertise in Python, SQL, machine learning (Scikit-learn, TensorFlow), and statistical modeling. Adept at data visualization (Power BI, Tableau, Seaborn) and building predictive models for HR analytics, healthcare diagnostics, fraud detection, and NLP tasks.

Skilled in project planning, risk management, and stakeholder communication. Passionate about leveraging data science and project management to drive actionable business decisions.

Location

Islamabad, Pakistan

Contact

fouziaashfaq0298@gmail.com
+92-3060651952

Projects

Employee Attrition Prediction: From Data to Decision-Making

HR Analytics

Developed a predictive analytics solution to forecast employee attrition and help HR teams retain talent effectively.

Skills Demonstrated:

EDA Business Insight Generation SHAP/LIME Dashboard Creation Machine Learning (Logistic Regression, XGBoost)

Outcome:

Built a high-accuracy churn model and provided actionable insights to improve retention strategies; results were visualized via interactive dashboards.

GitHub Repository

From Articles to Insights: Building a Text Summarization System

NLP

Built a text summarization system to extract key information from lengthy articles using NLP techniques and the TextRank algorithm.

Skills Demonstrated:

Text Preprocessing Summarization Algorithms Transformer Models Performance Evaluation (ROUGE scores),

Outcome:

Designed and implemented a working summarization tool capable of condensing long-form content while preserving core meaning; integrated into a web application for accessibility.

GitHub Repository

IMDB Sentiment Explorer: From Reviews to Insights

Sentiment Analysis

Created a sentiment analysis tool to classify movie reviews as positive or negative using NLP and logistic regression.

Skills Demonstrated:

Text preprocessing TF-IDF vectorization Logistic Regression Model Tuning Performance Metrics.

Outcome:

Trained a classifier with 88.87% accuracy; provided insights on improving model scalability and performance for future use.

GitHub Repository

Diabetes Risk Prediction Using Machine Learning

Healthcare

Constructed a predictive model to assess the likelihood of diabetes onset in patients using clinical health indicators.

Skills Demonstrated:

EDA Feature Engineering Healthcare Analytics API Development classification modeling (Decision Trees, Random Forest)

Outcome:

A medical prediction model with AUC-ROC: 0.89, providing actionable insights for early disease detection.

GitHub Repository

EDA & Visualization, Titanic Dataset

EDA

Performed exploratory data analysis on the Titanic dataset to understand survival patterns and relationships between passenger characteristics.

Skills Demonstrated:

Exploratory Data Analysis (EDA) Data cleaning storytelling with data. Data Visualization (Matplotlib/Seaborn)

Outcome:

Discovered meaningful trends such as gender, class, and family size impacting survival rates; communicated findings visually and clearly.

GitHub Repository

Time Series Traffic Forecasting

Time Series

Developed a time series forecasting model using ARIMA to predict website traffic trends over time, enabling better planning and resource allocation.

Skills Demonstrated:

Data preprocessing ARIMA algorithm Exploratory Data Analysis Time Series Modelling Data Visualization Python programming

Outcome:

Successfully built and tuned an ARIMA model that provided accurate short-term traffic forecasts, visualized using Matplotlib for clear insights.

GitHub Repository

Understanding Housing Markets:A Data-Driven Analysis of Boston House Prices

Real Estate

Analyzed the Boston housing dataset to uncover key factors affecting property prices and developed a regression model for price prediction.

Skills Demonstrated:

Statistical analysis Linear Regression Pandas NumPy Data visualization (Matplotlib/Seaborn)

Outcome:

Identified significant variables influencing house prices and created a predictive model to support real estate decision-making and investment strategies.

GitHub Repository

Loan Default Prediction

Finance

Created a machine learning system to predict loan defaults based on historical borrower data, helping financial institutions mitigate risk and reduce losses.

Skills Demonstrated:

Data Cleaning Feature Engineering Risk Assessment SMOTE Scikit-learn Classification Algorithms (Logistic Regression, Random Forest)

Outcome:

Trained and evaluated multiple models to identify high-risk borrowers; deployed a functional prototype with a user interface for real-time predictions.

GitHub Repository

Fraud Detection System

Security

Developed a fraud detection pipeline for credit card transactions using the Credit Card Fraud dataset, handling imbalanced data with SMOTE and training a Random Forest model.

Skills Demonstrated:

Imbalanced Data Handling Fraud Detection Model Evaluation Random Forest

Outcome:

A fraud detection pipeline with 90% recall, ensuring accurate identification of fraudulent transactions.

GitHub Repository

Work Experience

Project Manager intern

Excelerate

May 2025 - Present

  • Collaborating with global teams to create and execute a project plan for a large-scale experiential event.
  • Identifying potential risks and developing mitigation strategies to ensure smooth project execution.
  • Formulating comprehensive budget plans for global events, balancing cost-efficiency and quality outcomes.
  • Delivering presentations to senior management and stakeholders, showcasing data-driven recommendations.
  • Managing multiple projects independently while adhering to strict deadlines and maintaining high-quality standards.

Skills: Python (Pandas, Scikit-learn), NLP (spaCy, BERT), SQL, GitHub, Matplotlib, Data Visualization

Data VisuaIization lntern

Excelerate

May 2025 - Present

  • Cleaning and preparing raw data for effective analysis and visualization .
  • Conducting statistical analyses to extract meaningful insights and trends .
  • Designing audience-appropriate data visualizations aligned with best practices .
  • Creating clear and concise reports, charts, and dashboards to present findings to stakeholders .
  • Collaborating with cross-functional teams to ensure accurate and impactful data storytelling.

Bank of America SaIes and Trading AnaIyst

Forage

May 2025

  • Completed a job simulation focused on analyzing market trends and delivering client-centric solutions within the sales and trading division.
  • Conducted in-depth data analysis using tools like Excel and Bloomberg to identify key financial trends, assess market dynamics, and align insights with client objectives.
  • Researched and proposed strategic recommendations for optimizing trade execution processes and enhancing workflow efficiency using automation and process analysis.
  • Developed a client proposal outlining tailored investment strategies, leveraging data-driven insights to address client goals such as portfolio diversification, sustainability, and moderate growth.

Tata Data Visualisation: Empowering Business with Effective Insights

Forage

Apr 2025

  • Completed a simulation involving creating data visualizations for Tata Consultancy Services .
  • Prepared questions for a meeting with client senior leadership
  • Created visuals for data analysis to help executives with effective decision making

Deloitte Australia Data Analytics

Forage

Apr 2025

  • Completed a Deloitte job simulation involving data analysis and forensic technology.
  • Created a data dashboard using Tableau
  • Used Excel to classify data and draw business conclusions

Data Analyst

DevelopersHub Corporation

Mar 2025 – Apr 2025

  • Developed 5+ machine learning models across domains (HR, finance, healthcare, NLP), achieving 85-92% accuracy in predictions including employee attrition (Random Forest) and loan defaults (LightGBM).
  • Built text summarization system using BERT/GPT (HuggingFace), reducing article length by 70% while preserving key information through abstractive techniques.
  • Created fraud detection pipeline for credit card transactions (Python, SMOTE) with 90% recall, and diagnostic models for diabetes/heart disease (AUC-ROC: 0.89).
  • Automated data cleaning/EDA for 4+ datasets (Titanic, Airbnb, medical), generating dashboards (Seaborn) that cut analysis time by 40%.
  • Implemented custom ML algorithms (Linear Regression/XGBoost from scratch) for Boston housing prices, achieving R² > 0.88.
  • Skills: Python (Pandas, Scikit-learn), NLP (spaCy, BERT), SQL, GitHub

Technical Skills

Data Analysis

  • Statistical Analysis
  • Data Cleaning & Preparation
  • Exploratory Data Analysis
  • Data Storytelling
  • Feature Distribution Analysis

Programming & ML

  • Python (Pandas, Scikit-learn)
  • SQL
  • NLP (spaCy, BERT)
  • TensorFlow & Machine Learning
  • Feature Engineering

Visualization

  • Power BI
  • Tableau
  • Matplotlib & Seaborn
  • Ploty
  • Custom Data Visualization

Databases & Tools

  • MySQL
  • PostgreSQL
  • GitHub (version control, collaboration)
  • Jupyter Notebook
  • Google Colab

Business & Analytical Skills

Data Storytelling & Reporting

  • Creating clear, actionable dashboards
  • Presenting insights to stakeholders
  • Audience-appropriate visualization design

Financial & Market Analysis

  • Bloomberg Terminal
  • Excel for financial modeling
  • Trend analysis
  • Client proposal writing

Problem Solving & Critical Thinking

  • Root cause analysis
  • Risk identification and mitigation
  • Strategic decision-making using data

Project Management Skills

Planning & Execution

  • Project planning
  • Budgeting
  • Timeline management
  • Risk management

Team Collaboration & Leadership

  • Stakeholder communication
  • Cross-functional teamwork
  • Delivering presentations to senior leadership
  • Managing multiple projects simultaneously

Education

MS Applied Mathematics

Comsats Islamabad

Jan 2022 - May 2025

Advanced studies in applied mathematics with focus on statistical modeling and data analysis techniques.

BS Mathematics

University of Sargodha

Sept 2016 - Sept 2020

Bachelor's degree in Mathematics with GPA: 3.25. Coursework included statistical analysis, calculus, and mathematical modeling.

Certification

AI-Driven Data Analytics

WsCube Tech

May 2025

  • Skills Acquired : Data Analytics, Data Analysis, Data Cleaning, Artificial Intelligence (AI), Dashboards, Data Visualization
  • Data Cleaning

    Kaggle

    May 2025

  • Skills Acquired : Data Cleaning, Data Analysis
  • Introduction to Career Skills in Data Analytics

    LinkedIn

    April 2025

  • Skills Acquired : Tech Career Skills, Data Analytics, Business Intelligence (BI), Microsoft Excel
  • AI-Powered Marketing Data Analytics for Beginners

    WsCube Tech

    April 2025

  • Skills Acquired : Product Marketing, Digital Marketing, Marketing, Data Analysis, Market Research
  • Dashboard Design Secrets: Power BI Design Smart, AI-Driven Visuals

    WsCube Tech

    April 2025

  • Skills Acquired : Microsoft Power BI, Data Analysis, Microsoft Excel
  • Description : The session was a game-changer, packed with powerful takeaways on blending data storytelling and AI-driven visuals
  • What Is Generative AI?

    LinkedIn

    April 2025

  • Skills Acquired : Artificial Intelligence (AI), Generative AI Tools, Generative AI
  • Google Analytics Certification

    Google Digital Academy (Skillshop)

    March 2025

  • Skills Acquired :Analytical Skills, Google Analytics
  • IELTS Academic Test Report Form

    British Council

    January 2021

  • Overall Band Score : 7.0 (C1 Level)
  • Individual Scores:
  • Listening: 8.0
  • Reading: 7.5
  • Writing: 6.5
  • Speaking: 6.5
  • Get In Touch

    Location

    Islamabad, Pakistan

    Links