Part 1: Exploratory Data Analysis (EDA)

In this stage of the project, decide with your team which aspect of the dataset you would like to explore and present.

Choose one of the datasets and conduct EDA on it. Different datasets may require different preprocessing methods. Keep in mind that you may need to:
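
As an illustration of what preprocessing and a first look at the data can involve, here is a minimal sketch in Python using pandas. The toy data, the column names, and the helper functions `load_and_clean` and `summarize` are hypothetical and not part of the assignment; the steps your chosen dataset needs may differ.

```python
import io

import pandas as pd

# Hypothetical toy data standing in for whichever dataset your team picks.
RAW_CSV = """city,temperature_c,visitors
Riga,21.5,120
Riga,21.5,120
Tartu,,95
Vilnius,19.0,not recorded
Tallinn,17.5,80
"""


def load_and_clean(csv_text: str) -> pd.DataFrame:
    """Load CSV text and apply a few common preprocessing steps."""
    df = pd.read_csv(io.StringIO(csv_text))
    # Remove rows that are exact duplicates of an earlier row.
    df = df.drop_duplicates().reset_index(drop=True)
    # Force a column that should be numeric to a numeric dtype;
    # entries that cannot be parsed become NaN instead of raising.
    df["visitors"] = pd.to_numeric(df["visitors"], errors="coerce")
    return df


def summarize(df: pd.DataFrame) -> dict:
    """Collect a few basic facts worth checking before any plotting."""
    return {
        "n_rows": len(df),
        "missing_per_column": df.isna().sum().to_dict(),
        "numeric_summary": df.describe(),
    }


df = load_and_clean(RAW_CSV)
summary = summarize(df)
print("rows after cleaning:", summary["n_rows"])
print("missing values:", summary["missing_per_column"])
print(summary["numeric_summary"])
```

Whether to drop, fill, or keep the remaining missing values is a decision for your team; the sketch only surfaces them so that the choice is deliberate.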

Part 2: Visualization

After exploring the data, decide on different ways to visualize it. Each of your visualizations should be unique.
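
One possible reading of "unique" is that each chart answers a different question about the data. The sketch below, using matplotlib, follows that reading: a bar chart compares categories and a scatter plot looks at the relationship between two numeric variables. The data, labels, and output filename are hypothetical, and this interpretation of "unique" is an assumption, not a stated requirement.

```python
import matplotlib

matplotlib.use("Agg")  # non-interactive backend, so the script runs without a display
import matplotlib.pyplot as plt

# Hypothetical toy data; replace with columns from your own dataset.
cities = ["Riga", "Tartu", "Vilnius", "Tallinn"]
visitors = [120, 95, 110, 80]
temperature_c = [21.5, 18.0, 19.0, 17.5]

fig, (ax_bar, ax_scatter) = plt.subplots(1, 2, figsize=(10, 4))

# Chart 1: compare one quantity across categories.
ax_bar.bar(cities, visitors)
ax_bar.set_title("Visitors per city")
ax_bar.set_xlabel("City")
ax_bar.set_ylabel("Visitors")

# Chart 2: look at how two numeric variables relate to each other.
ax_scatter.scatter(temperature_c, visitors)
ax_scatter.set_title("Visitors vs. temperature")
ax_scatter.set_xlabel("Temperature (deg C)")
ax_scatter.set_ylabel("Visitors")

fig.tight_layout()
fig.savefig("visitors_overview.png", dpi=150)  # hypothetical output filename
```

Giving every chart a title and axis labels makes it possible to reuse the figures in the presentation without redrawing them.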

Part 3: Reporting Results - Presentation

While visualizations are very useful for pointing out important features of a dataset, they are not sufficient for a full data analysis. The final stage of your work is to create a presentation of your results.

You must include:

Reporting results for grading

Your team should submit a link to a public GitHub repository containing all of your working materials to Arina Sitnikova. For your work to be assessed, the repository should contain:

Grading criteria for data work (20 points max)