Skip to content

dasarpai/DAI-Datasets

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

22 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Datasets to Uplift your Datascience Skills

The repository carries a diverse range of themes, difficulty levels, sizes and attributes. They offer hands-on practice to boost their skills in exploratory data analysis, data visualization, data wrangling and machine learning.

Misc

GoogleNews-vectors-negative300.bin.gz can be download from internet/kaggle

Beginner:

  1. Find out the age of Abalone from physical measurements => Dateset: Abalone)
    Regression Models | Environment

  2. Predict student's knowledge level => Dateset: User Knowledge Modeling)
    Classification/Clustering | Education/Web

  3. Can you predict the price of a house? => Dateset: Real Estate Valuation)
    Regression Models | Real Estate

  4. Can you estimate location from WIFI Signal Strength => Dateset: Wireless Indoor Localization)
    Classification Models | Mobile/Location

  5. Predict acceptability of a car => Dateset: Car Evaluation)
    Classification Models | Automobile

  6. Predict seminal quality of an individual => Dateset: Fertility)
    Regression/Classification Models | Healthcare/Life

  7. Estimate chance of bankruptcy from qualitative parameters by experts => Dateset: Qualitative Bankruptcy)
    Classification Models | Finance/Banking

  8. Understand driving patterns of Birmingham with respect to time and date => Dateset: Birmingham Parking Dataset)
    Regression/Classification Models | Transport and Mobility

  9. Explore the effect of time, date and weather on traffic volume on a US Interstate] Regression Models | Transport and Mobility

  10. Explore patterns in drug abuse between cities, age groups and racial groups => Dateset: Accidental Drug Related Deaths in Connecticut, US)
    Classification Models | Healthcare/Social Sciences


Intermediate:

  1. Can you predict the fuel-efficiency of a car? => Dateset: Auto MPG)
    Regression Models | Automobiles

  2. Was that chest pain an indicator of a heart disease => Dateset: Heart Disease)
    Classification Models | Health Sciences

  3. Predict total number of demand of orders => Dateset: Daily Demand Forecasting Orders)
    Regression Models | Business

  4. Find out if a donor will give blood in March 2007 => Dateset: Blood Transfusion Service Center)
    Classification Models | Business

  5. Forecast pollution level of a city => Dateset: Beijing PM2.5)
    Regression Models | Environment

  6. Will the patient survive for at least one year after a heart attack => Dateset: Echocardiogram)
    Classification Models | Automobiles

  7. Estimate compressive strength of concrete => Dateset: Concrete Compressive Strength)
    Regression Models | Civil Engineering/Construction

  8. Discover patterns relating liver disorder and alcohol consumption => Dateset: Liver Disorders)
    Classification/Regression/Clustering Models | Healthcare

  9. Predict which stock will provide greatest rate of return => Dateset: Dow Jones Index)
    Clustering/Regression/Classification Models | Business/Finance

  10. Assess heating and cooling load requirements of building => Dateset: Energy Efficiency)
    Regression/Classification Models | Energy

  11. Determine the type of glass using oxide content => Dateset: Glass Identification)
    Classification Models | Physical

  12. Predict chance of survival => Dateset: Hepatitis)
    Classification Models | Healthcare

  13. Find patterns from spending data at wholesale => Dateset: Wholesale Customers)
    Classification/Clustering | Business/Retail

  14. Group similar travel reviews => Dateset: Travel Reviews)
    Clustering/Classification Models | Domain: Web

  15. Relate returns of Istanbul Stock Exchange with other international indices => Dateset: Istanbul Stock Exchange)
    Regression/Classification Models | Business/Finance

  16. Predict bike rental count (hourly/daily) based on the environmental & seasonal settings => Dateset: Bike Sharing)
    Regression Models | Social

  17. Detect Room Occupancy through Light, Temperature, Humidity and CO2 sensors => Dateset: Occupancy Detection)
    Classification Models | Energy/Buildings

  18. Estimate whether a person’s income exceeds $50K/year => Dateset: Census Income)
    Classification Models | Social/Government

  19. Predict the condition of a patients liver from their bloodwork] Classification Models | Healthcare

  20. Predict future poverty trends in EU Countries => Dateset: EU Population Poverty Status Dataset)
    Regression Models | Social/Government

  21. Predict the spread of Tuberculosis across the US => Dateset: US Tuberculosis Dataset)
    Regression Models | Healthcare

  22. Determine if smoking, invasive birth control methods and a history of STDs can lead to Cervical Cancer => Dateset: Risk Factors for Cervical Cancer)
    Classification Models | Healthcare


Advanced:

  1. Detect Autistic Spectrum Disorder (ASD) cases => Dateset: Autism Screening Adult)
    Classification Models | Healthcare/Social Sciences

  2. Estimate the probability of Default => Dateset: Default of Credit Card Clients)
    Classification Models | Business/Finance

  3. Predict if a note is genuine => Dateset: Banknote Authentication)
    Classification Models | Banking/Finance

  4. Find a short term forecast on electricity consumption of a single home => Dateset: Individual Household Electric Power Consumption)
    Regression/Clustering Models | Electricity

  5. Predict the number of shares on social networks => Dateset: Online News Popularity)
    Regression/Classification Models | Business/Web

  6. Analyze the text or sentiment of products on Amazon, or recommend products => Dateset: Amazon Product Reviews)
    Text Analytics/Sentiment Analysis/Recommender Systems

  7. Explore predictive modelling and numerical forecasting techniques => Dateset: Portugal 2019 Election Dataset)
    Regression Models | Social Sciences/Government

  8. Explore changes in brain activity in humans in the presence and absence of a visual stimulus => Dateset: EEG Eye State Dataset)
    Classification Models | Neuroscience/Healthcare

  9. Explore patterns in brain activity based on multiple visual and non-visual stimuli => Dateset: EEG Steady State Evoked Potential Dataset)
    Classification Models | Neuroscience/Healthcare

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published