Skip to content
/ EDA Public

In-depth exploratory data analysis, including distribution characteristics, multidimensional correlation, missing value pattern, PCA, etc.

Notifications You must be signed in to change notification settings

FinDii/EDA

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

29 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

πŸ“Š In-depth exploratory data analysis

A clean and reproducible workflow for exploring correlations in datasets using Python.
Ideal for data analysis, visualization, and statistical exploration.


πŸ“ Methods Used

The files inside are aimed at data analysis:

  • Distribution characteristics: fat tails, skewness, kurtosis
  • Multidimensional correlation: Pearson/Spearman/mutual information
  • Missing value pattern
  • PCA dimensionality reduction and feature clustering
  • Time series-aware EDA methods
  • Weighted statistical analysis
  • (con't)

πŸ“ Project Structure

─ Data used:

  1. data_cp.csv (too big for uploading)
  2. data.csv

─ Notebooks:

  1. DataVisualization.ipynb
  2. FatTail&DriftAwareness.ipynb
  3. MissingDataAnalysis.ipynb
  4. PCA.ipynb
  5. Time-sliced EDA.ipynb
  6. CorrelationAnalysis_basic.ipynb

About

In-depth exploratory data analysis, including distribution characteristics, multidimensional correlation, missing value pattern, PCA, etc.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published