Skip to content
View Fshahnaj's full-sized avatar

Block or report Fshahnaj

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
fshahnaj/README.md

πŸ‘‹ Hi, I'm Fujaila Shahnaj

Data/Analytics Engineer β€’ Data Analyst β€’ BI Developer β€’ ML Engineer (Healthcare & Product Analytics)

πŸŽ“ MS Computer Science @ Clemson University (GPA 3.87)
πŸ“ Raleigh–Durham–Cary–RTP (NC)
πŸ’‘ Specializing in Healthcare Analytics, ML Pipelines, BI Systems & Data Engineering


🌟 About Me

I’m a data/analytics engineering professional with experience building end-to-end clinical analytics platforms, enterprise-grade Power BI dashboards, and NLP-driven product insights systems. I thrive in turning messy real-world datasets into clean, validated, explainable insights that guide business decisions.

  • Built ML + NLP pipelines analyzing 368K+ records
  • Developed a clinical risk prediction model (ROC-AUC: 0.79)
  • Designed BI dashboards for senior leadership (KPI, RLS, automated refresh)
  • Implemented dbt + DuckDB star-schema warehouses for healthcare datasets
  • Fine-tuned BERT & RoBERTa models (F1: 0.84)

  • Built ML pipelines analyzing 368K+ records
  • Designed clinical risk prediction model (ROC-AUC: 0.79)
  • Delivered Power BI dashboards for senior leadership decision-making
  • dbt + DuckDB star-schema modeling for healthcare data
  • NLP modeling with BERT & RoBERTa (F1: 0.84)

πŸ“Š Featured Projects

πŸ”Ή CardioInsight-AI β€” Healthcare Analytics Platform

End-to-end cardiovascular risk analytics system
β€” HIPAA-style de-ID β†’ dbt warehouse β†’ ML β†’ Power BI clinical dashboard
ROC-AUC: 0.79

πŸ“ˆ Live Power BI Dashboard β€’ πŸ“‚ GithubLink


πŸ”Ή Product Hunt Community Insights (368K+ records)

NLP pipeline using BERT/RoBERTa to classify user complaints, praise, and feature requests
F1 Score: 0.84 πŸ“˜ Coming Soon


πŸ› οΈ Technical Skills

πŸ”§ Languages

Python, SQL

πŸ“Š Data Analytics

EDA, KPI Development, A/B Testing, Statistical Analysis, Visualization

🧠 Machine Learning

Logistic Regression, Tree Models, BERT/RoBERTa, Feature Engineering, Model Evaluation

πŸ—οΈ Data Engineering

dbt (models, tests, documentation), ETL/ELT, Dimensional Modeling
DuckDB, MySQL, Oracle, Spark

πŸ“ˆ BI & Visualization

Power BI (DAX, M, Star Schema, RLS), Tableau, Matplotlib, Seaborn

☁️ Cloud

AWS (S3, Glue, Redshift), EC2, IAM


πŸ’Ό Experience

Research Assistant β€” HAIE Lab | Clemson University
β€’ Analyzed 368K+ Product Hunt comments using ML/NLP
β€’ Built automated data pipelines (reduced processing time 60%)
β€’ Developed BERT multi-label classifier (F1: 0.84)


Graduate Assistant β€” Data Analytics | Clemson Graduate School
β€’ Designed enterprise Power BI dashboards
β€’ Implemented RLS and automated refresh schedules
β€’ Supported VPs/Deans with KPI tracking


Data Science Intern β€” Data Visualization Lab, Clemson Library
β€’ Built forecasting models (85% accuracy)
β€’ Created operational dashboards (Tableau/Power BI)
β€’ ETL across 50K+ records


Senior Lecturer β€” PCIU (Study Leave)
β€’ Taught DBMS, DS, Algorithms
β€’ Supervised ML/AI research projects


πŸ“¬ Contact

πŸ“§ Email: shahnajfujaila@gmail.com
πŸ”— LinkedIn: linkedin.com/Fujaila-Shahnaj
🌐 Portfolio: https://fshahnaj.github.io


⭐ Thanks for visiting! ⭐

Pinned Loading

  1. CardioInsight-AI CardioInsight-AI Public

    Production-grade ETL pipeline for cardiovascular risk analytics with HIPAA-compliant data processing, automated quality validation, and enterprise BI dashboards.

    Jupyter Notebook 1

  2. data-engineer-handbook data-engineer-handbook Public

    Forked from DataExpert-io-Community/data-engineer-handbook

    This is a repo with links to everything you'd ever want to learn about data engineering

    Jupyter Notebook

  3. Deep-Learning-Projects Deep-Learning-Projects Public

    Jupyter Notebook 1

  4. fork-commit-merge fork-commit-merge Public

    Forked from fork-commit-merge/fork-commit-merge

    Fork, Commit, Merge. A project designed to help you familiarize yourself with the open source contribution workflow on GitHub!

    JavaScript

  5. SoftwareFoundations SoftwareFoundations Public

    Forked from abastola0/SoftwareFoundations

    JavaScript 1

  6. FShahnaj.github.io FShahnaj.github.io Public

    HTML 1