🔧 Feature Engineering Toolkit (Python)

A modular and reusable toolkit for performing feature engineering on structured datasets.
This repository provides essential utilities for preprocessing, transforming, and optimizing features for quant finance and machine learning workflows.

📦 Overview

Feature engineering is one of the most critical steps in quant projects.
Here provides a clean pipeline and practical examples for:

🧹 Data preprocessing：missing value handling, outlier detection
📊 Exploratory Feature Analysis：time series analysis, classical technical indices, correlation, visual comparison
🔣 Feature Transformation：temporal dimension, Cross-sectional features, interaction, contextual dimension, demensional reduction
📐 Advanced Engineering：improved engineering methods based on the previous results and comparisons
✂️ Feature Selection：selection based on correlation changes, SHAP from Catboost models

This repository can serve both as a reference and a reusable feature engineering module.

📁 Project Structure

Data used: data_cp.csv (too big for uploading)
Notebooks:
1. Technical Indices.ipynb
2. TimeSeriesAnalysis.ipynb
3. FeatureEngeering_basic.ipynb

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
data		data
notebooks		notebooks
src		src
.gitignore		.gitignore
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

🔧 Feature Engineering Toolkit (Python)

📦 Overview

📁 Project Structure

About

Uh oh!

Releases

Packages

Languages

paramedick/FeatureEngineering

Folders and files

Latest commit

History

Repository files navigation

🔧 Feature Engineering Toolkit (Python)

📦 Overview

📁 Project Structure

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages