Below is a quite comprehensive way of analysing your dataset. However, it can easy to get bogged down in the details here - you want to avoid this. Make a quick, dirty, hacky, end-to-end solution to your problem. Then once you have something very basic in place, it's time to get creative and start iterating on your intial approach. Try to improve each component of your solution and measure the impact to see where to spend your time. Many times acquiring more data or improving data cleaning and preprocessing steps have a higher ROI than optimizing the machine learning models themselves.
Now, a more comprehensive list of preprocessing steps to go through once you have a quick and dirty solution in place
- Create some hypotheses at the beginning of your analysis. This helps you engage with the problem and think of hwo different variables affect the outcome variable.
- What other features would you like?
- What features might you like to engineer?
 
- Then begin the process of EDA to test htese h