Tag: Data Science Journey

Another thirty days of machine learning with Kaggle (Part 3)

It took me a while to finally complete all 17 modules, but knowing that there was a clear outcome helped motivate me to keep taking the next step, and the step after that.

So, if you’re interested in getting a taste of what the world of machine learning has to offer, beginning with the (free!) online courses on Kaggle Learn is a great starting point. I know that it definitely was for me.

Continue reading

Another thirty days of machine learning with Kaggle (Part 1)

Last August, I completed Kaggle’s “30 Days of Machine Learning” challenge. In the spirit of continuous learning, I decided to take the plunge and finish all the remaining 14 modules in Kaggle Learn. It took me another 30 days, over a span of several months, to finally read through all the tutorials and complete all the coding exercises. Here is Part 1 of my learning journey.

Continue reading

Using a shotgun approach for the Titanic competition

After completing Alexis Cook’s very useful Titanic Tutorial, I couldn’t help myself and spent a couple of days hacking around to try and improve my score without going through the usual data science workflow of EDA, feature engineering, model selection, hyperparameter tuning, train/test iterations.

I know it’s not the proper way of doing data science, but like I said, I just couldn’t help myself.

Continue reading

Installing Anaconda Individual Edition

One of the first tasks is to set up a programming environment on my personal laptop so that I can start coding as quickly as possible. For data science purposes, you can’t go wrong choosing the Anaconda Individual Edition.

It’s open source, free for individuals, has more than 20 million users worldwide and comes with all the key components and packages needed to get started in less than 30 minutes.

Continue reading