🧠 Hate Speech Detection Application

A Streamlit-based web application that detects whether a given text contains hate speech, offensive language, or clean content (Not hate) using a Decision Tree Classifier trained on labeled tweet data.

🚀 Overview

This project demonstrates a simple yet effective Natural Language Processing (NLP) pipeline for classifying text as Hate Speech, Offensive Language, or Not Hate Speech. The app allows users to input any text, which is then cleaned, processed, and classified using a trained Decision Tree model.

It includes a visually appealing Streamlit interface with color-coded results for easy interpretation.

🔗 Want to try it out? Visit the live app here:
👉 https://hatespeechdetection-vikash.streamlit.app

🧩 Features

🧼 Text Preprocessing: URL removal, punctuation cleaning, stopword filtering, and stemming.
💬 Real-Time Prediction: Classifies input text instantly when submitted.
🎨 Custom UI Styling: Styled using HTML and CSS for a clean and modern look.
📊 Model Training & Evaluation: Decision Tree Classifier trained using CountVectorizer features.
💾 Pickle Integration: Uses preprocessed data and trained model stored as .pkl files.

⚙️ Installation & Setup

1. Clone the repository

git clone https://github.com/yourusername/hate-speech-detection.git
cd hate-speech-detection

2. Create a virtual environment (recommended)

python -m venv venv
source venv/bin/activate       # On Linux/Mac
venv\Scripts\activate        # On Windows

3. Install required dependencies

Create a requirements.txt file with the following:

streamlit
nltk
scikit-learn
pandas
numpy
matplotlib
seaborn

Then install them:

pip install -r requirements.txt

4. Download NLTK stopwords

python -m nltk.downloader stopwords

5. Run the app

streamlit run app.py

🧠 Model Details

Vectorizer: CountVectorizer()
Algorithm: DecisionTreeClassifier()
Training Data: Preprocessed tweets labeled as
- Hate Speech
- Offensive Language
- Not Hate Speech

The model is trained on cleaned text data (clean_data.pkl) and label data (dataset.pkl).

🧹 Text Cleaning Steps

The cleaning() function performs several preprocessing steps:

Lowercasing
Removing URLs, HTML tags, mentions, digits, and punctuation
Removing stopwords
Stemming words using SnowballStemmer

🖥️ User Interface

🔹 Normal View:

🔹 Offensive:

🔹 Not hate sentence:

🔹 Hate sentence:

🧑‍💻 Technologies Used

Category	Tools/Libraries
Frontend UI	Streamlit, HTML, CSS
NLP	NLTK
ML	scikit-learn
Data Handling	pandas, numpy
Visualization	matplotlib, seaborn

🏁 Future Improvements

🔍 Integrate advanced models like Logistic Regression, Random Forest, or BERT.
🗣️ Add multilingual support for hate speech detection.
📊 Include data visualization dashboards for model insights.

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
.devcontainer		.devcontainer
Datasets		Datasets
.gitignore		.gitignore
Hate Speech Detecton.ipynb		Hate Speech Detecton.ipynb
README.md		README.md
app.py		app.py
clean_data.pkl		clean_data.pkl
dataset.pkl		dataset.pkl
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

🧠 Hate Speech Detection Application

🚀 Overview

🧩 Features

⚙️ Installation & Setup

1. Clone the repository

2. Create a virtual environment (recommended)

3. Install required dependencies

4. Download NLTK stopwords

5. Run the app

🧠 Model Details

🧹 Text Cleaning Steps

🖥️ User Interface

🔹 Normal View:

🔹 Offensive:

🔹 Not hate sentence:

🔹 Hate sentence:

🧑‍💻 Technologies Used

🏁 Future Improvements

About

Uh oh!

Releases

Packages

Uh oh!

Languages

Vicky9890/Hate_Speech_Detection

Folders and files

Latest commit

History

Repository files navigation

🧠 Hate Speech Detection Application

🚀 Overview

🧩 Features

⚙️ Installation & Setup

1. Clone the repository

2. Create a virtual environment (recommended)

3. Install required dependencies

4. Download NLTK stopwords

5. Run the app

🧠 Model Details

🧹 Text Cleaning Steps

🖥️ User Interface

🔹 Normal View:

🔹 Offensive:

🔹 Not hate sentence:

🔹 Hate sentence:

🧑‍💻 Technologies Used

🏁 Future Improvements

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages