The repository carries a diverse range of themes, difficulty levels, sizes and attributes. They offer hands-on practice to boost their skills in exploratory data analysis, data visualization, data wrangling and machine learning.
GoogleNews-vectors-negative300.bin.gz can be download from internet/kaggle
-
Find out the age of Abalone from physical measurements => Dateset: Abalone)
Regression Models | Environment -
Predict student's knowledge level => Dateset: User Knowledge Modeling)
Classification/Clustering | Education/Web -
Can you predict the price of a house? => Dateset: Real Estate Valuation)
Regression Models | Real Estate -
Can you estimate location from WIFI Signal Strength => Dateset: Wireless Indoor Localization)
Classification Models | Mobile/Location -
Predict acceptability of a car => Dateset: Car Evaluation)
Classification Models | Automobile -
Predict seminal quality of an individual => Dateset: Fertility)
Regression/Classification Models | Healthcare/Life -
Estimate chance of bankruptcy from qualitative parameters by experts => Dateset: Qualitative Bankruptcy)
Classification Models | Finance/Banking -
Understand driving patterns of Birmingham with respect to time and date => Dateset: Birmingham Parking Dataset)
Regression/Classification Models | Transport and Mobility -
Explore the effect of time, date and weather on traffic volume on a US Interstate] Regression Models | Transport and Mobility
-
Explore patterns in drug abuse between cities, age groups and racial groups => Dateset: Accidental Drug Related Deaths in Connecticut, US)
Classification Models | Healthcare/Social Sciences
-
Can you predict the fuel-efficiency of a car? => Dateset: Auto MPG)
Regression Models | Automobiles -
Was that chest pain an indicator of a heart disease => Dateset: Heart Disease)
Classification Models | Health Sciences -
Predict total number of demand of orders => Dateset: Daily Demand Forecasting Orders)
Regression Models | Business -
Find out if a donor will give blood in March 2007 => Dateset: Blood Transfusion Service Center)
Classification Models | Business -
Forecast pollution level of a city => Dateset: Beijing PM2.5)
Regression Models | Environment -
Will the patient survive for at least one year after a heart attack => Dateset: Echocardiogram)
Classification Models | Automobiles -
Estimate compressive strength of concrete => Dateset: Concrete Compressive Strength)
Regression Models | Civil Engineering/Construction -
Discover patterns relating liver disorder and alcohol consumption => Dateset: Liver Disorders)
Classification/Regression/Clustering Models | Healthcare -
Predict which stock will provide greatest rate of return => Dateset: Dow Jones Index)
Clustering/Regression/Classification Models | Business/Finance -
Assess heating and cooling load requirements of building => Dateset: Energy Efficiency)
Regression/Classification Models | Energy -
Determine the type of glass using oxide content => Dateset: Glass Identification)
Classification Models | Physical -
Predict chance of survival => Dateset: Hepatitis)
Classification Models | Healthcare -
Find patterns from spending data at wholesale => Dateset: Wholesale Customers)
Classification/Clustering | Business/Retail -
Group similar travel reviews => Dateset: Travel Reviews)
Clustering/Classification Models | Domain: Web -
Relate returns of Istanbul Stock Exchange with other international indices => Dateset: Istanbul Stock Exchange)
Regression/Classification Models | Business/Finance -
Predict bike rental count (hourly/daily) based on the environmental & seasonal settings => Dateset: Bike Sharing)
Regression Models | Social -
Detect Room Occupancy through Light, Temperature, Humidity and CO2 sensors => Dateset: Occupancy Detection)
Classification Models | Energy/Buildings -
Estimate whether a person’s income exceeds $50K/year => Dateset: Census Income)
Classification Models | Social/Government -
Predict the condition of a patients liver from their bloodwork] Classification Models | Healthcare
-
Predict future poverty trends in EU Countries => Dateset: EU Population Poverty Status Dataset)
Regression Models | Social/Government -
Predict the spread of Tuberculosis across the US => Dateset: US Tuberculosis Dataset)
Regression Models | Healthcare -
Determine if smoking, invasive birth control methods and a history of STDs can lead to Cervical Cancer => Dateset: Risk Factors for Cervical Cancer)
Classification Models | Healthcare
-
Detect Autistic Spectrum Disorder (ASD) cases => Dateset: Autism Screening Adult)
Classification Models | Healthcare/Social Sciences -
Estimate the probability of Default => Dateset: Default of Credit Card Clients)
Classification Models | Business/Finance -
Predict if a note is genuine => Dateset: Banknote Authentication)
Classification Models | Banking/Finance -
Find a short term forecast on electricity consumption of a single home => Dateset: Individual Household Electric Power Consumption)
Regression/Clustering Models | Electricity -
Predict the number of shares on social networks => Dateset: Online News Popularity)
Regression/Classification Models | Business/Web -
Analyze the text or sentiment of products on Amazon, or recommend products => Dateset: Amazon Product Reviews)
Text Analytics/Sentiment Analysis/Recommender Systems -
Explore predictive modelling and numerical forecasting techniques => Dateset: Portugal 2019 Election Dataset)
Regression Models | Social Sciences/Government -
Explore changes in brain activity in humans in the presence and absence of a visual stimulus => Dateset: EEG Eye State Dataset)
Classification Models | Neuroscience/Healthcare -
Explore patterns in brain activity based on multiple visual and non-visual stimuli => Dateset: EEG Steady State Evoked Potential Dataset)
Classification Models | Neuroscience/Healthcare