Feb 8, 2023

10 Free Datasets to Start Building Your Portfolio

📌1. Supermarket Sales

Supermarkets are multiplying in the most populated cities, and market competition is high. This dataset contains the historical sales of a supermarket company, recorded across 3 different branches over 3 months.

Link : 🌐 https://lnkd.in/e86UpCMv


📌2. Credit Card Fraud Detection

It is important that credit card companies are able to recognize fraudulent credit card transactions so that customers are not charged for items that they did not purchase. The dataset contains transactions made by credit cards in September 2013 by European cardholders.

Link : 🌐 https://lnkd.in/eFTsZDCW


📌3. FIFA 22 complete player dataset

The datasets provided include the player data for the Career Mode from FIFA 15 to FIFA 22. The data allows multiple comparisons of the same players across the last 8 versions of the video game.

Link : 🌐 https://lnkd.in/eDScdUUM


📌4. Walmart Store Sales Forecasting

You are provided with historical sales data for 45 Walmart stores located in different regions. Each store contains a number of departments, and you are tasked with predicting the department-wide sales for each store.
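A simple starting point for this kind of task is a naive moving-average baseline per store and department. The sketch below uses made-up rows and a hypothetical `(store, dept, week, sales)` layout; it is not the dataset's actual schema, and a real entry would replace the baseline with a proper forecasting model.

```python
from collections import defaultdict
from statistics import mean

# Hypothetical weekly sales rows: (store, dept, week, sales).
# Values are invented for illustration only.
rows = [
    (1, 1, 1, 100.0), (1, 1, 2, 120.0), (1, 1, 3, 110.0),
    (1, 2, 1, 50.0), (1, 2, 2, 70.0), (1, 2, 3, 60.0),
    (2, 1, 1, 200.0), (2, 1, 2, 220.0), (2, 1, 3, 240.0),
]


def moving_average_forecast(rows, window=2):
    """Forecast next week's sales per (store, dept) as the mean of the last `window` weeks."""
    history = defaultdict(list)
    # Sort by week so each history list is in chronological order.
    for store, dept, week, sales in sorted(rows, key=lambda r: r[2]):
        history[(store, dept)].append(sales)
    return {key: mean(values[-window:]) for key, values in history.items()}


forecast = moving_average_forecast(rows)
for (store, dept), value in sorted(forecast.items()):
    print(f"store {store}, dept {dept}: {value:.1f}")
```

Any model you build later should beat this baseline; if it does not, the extra complexity is probably not paying off.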

Link : 🌐 https://lnkd.in/eVT6h-CT


📌5. Netflix Movies and TV Shows

Listings of movies and TV shows on Netflix, regularly updated. Netflix is one of the most popular media and video streaming platforms. It has over 8,000 movies and TV shows available on its platform, and as of mid-2021 it had over 200M subscribers globally.

Link : 🌐 https://lnkd.in/eZ3cduwK


📌6. LinkedIn Data Analyst job listings

This project was born out of curiosity. Being new to data analysis, I was perplexed about received opinions propagated by senior data analysts. I thought, why not check all this out?

The steps are simple: collect, clean and analyze data.
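Those three steps can be sketched in a few lines of Python. The records, field names, and cleaning rules below are hypothetical stand-ins for scraped job listings, not the dataset's real schema; treat it as a minimal outline of the workflow rather than the author's actual code.

```python
from collections import Counter


def collect():
    """Return raw records. Hypothetical sample standing in for scraped listings."""
    return [
        {"title": " Data Analyst ", "location": "Berlin"},
        {"title": "data analyst", "location": "Munich"},
        {"title": "Senior Data Analyst", "location": None},
        {"title": " Data Analyst ", "location": "Berlin"},
        {"title": "Junior Data Analyst", "location": "Berlin "},
    ]


def clean(records):
    """Normalize text, drop rows with missing fields, and remove duplicates."""
    seen = set()
    cleaned = []
    for rec in records:
        if not rec["title"] or not rec["location"]:
            continue  # skip incomplete rows
        row = (rec["title"].strip().lower(), rec["location"].strip().lower())
        if row in seen:
            continue  # skip exact duplicates after normalization
        seen.add(row)
        cleaned.append({"title": row[0], "location": row[1]})
    return cleaned


def analyze(records):
    """Count listings per location."""
    return Counter(rec["location"] for rec in records)


listings_per_location = analyze(clean(collect()))
print(listings_per_location)
```

Keeping each step in its own function makes it easy to swap the sample data for a real file or scraper later without touching the cleaning and analysis code.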

Link : 🌐 https://lnkd.in/ezqxcmrE


📌7. Top 50 Fast-Food Chains in USA

The key features of this dataset are: fast-food chain, U.S. systemwide sales (millions of U.S. dollars), average sales per unit (thousands of U.S. dollars), franchised stores, company stores, 2021 total units, and total change in units from 2020.

Link : 🌐 https://lnkd.in/esBjf5u4


📌8. Amazon and Best Buy Electronics

This is a list of over 7,000 online reviews for 50 electronic products from websites like Amazon and Best Buy provided by Datafiniti's Product Database. The dataset includes the review date, source, rating, title, reviewer metadata, and more.

Link : 🌐 https://lnkd.in/e4fBZvJ3


📌9. Forecasting Book Sales

The task of the DATA MINING CUP Competition 2009 is to forecast purchase quantities of 8 titles for 2,418 different locations. To build the model, simulated purchase data from an additional 2,394 locations is supplied. All data refers to a fixed period of time. The objective is to forecast the purchase quantities of these 8 titles for the 2,418 locations as accurately as possible.

Link : 🌐 https://lnkd.in/eXHN2XsQ


📌10. Real / Fake Job Posting Prediction

This dataset contains 18K job descriptions, of which about 800 are fake. The data consists of both textual information and meta-information about the jobs. The dataset can be used to build classification models that learn to identify fraudulent job descriptions.
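With roughly 800 fake postings out of 18K, the classes are heavily imbalanced, so plain accuracy is misleading. The short calculation below uses only the approximate counts quoted above; the class-weight formula is one common heuristic (the same one scikit-learn uses for its "balanced" mode), chosen here as an assumption rather than something the dataset prescribes.

```python
# Approximate figures taken from the dataset description above.
total_postings = 18_000
fake_postings = 800
real_postings = total_postings - fake_postings

# Share of fake postings in the data.
fake_share = fake_postings / total_postings

# Accuracy of a trivial model that labels every posting as real.
majority_baseline_accuracy = 1 - fake_share

# "Balanced" class weights: total / (n_classes * class_count).
# Rare classes get larger weights so errors on them cost more.
n_classes = 2
class_weights = {
    "real": total_postings / (n_classes * real_postings),
    "fake": total_postings / (n_classes * fake_postings),
}

print(f"fake share: {fake_share:.1%}")
print(f"majority baseline accuracy: {majority_baseline_accuracy:.1%}")
print(f"class weights: {class_weights}")
```

A model that always predicts "real" already scores above 95% accuracy here, which is why metrics such as precision, recall, or F1 on the fake class are a better yardstick for this dataset.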

Link : 🌐 https://lnkd.in/e5SDDW9G

We at Alphaa AI are on a mission to tell #1billion #datastories with their unique perspective. We are a community creating Citizen Data Scientists who bring a data-first approach to their work, core specialisation, and organisation. With Saurabh Moody and Preksha Kaparwan, you can start your journey as a citizen data scientist.
