# Road to Machine Learning and Data Science

The list of free and non-free resources and courses for quick start in Machine Learning and Data Science. I built this list by searching in Google and taking references from articles about ML and data science. It is updated from time to time and you also can suggest one in comments.

The discipline can be divided to the following areas:

• Mathematics, applied statistics and the probability theory
• Programming (R or Python in particular) and working with tools for data analysis
• Machine learning methods including deep learning

Courses and resource for mathematics and statistics are skipped because this subject is covered with courses for R and Python.

# Programming

According to the poll results of KDnuggets, R, Python Duel As Top Analytics, Data Science software two languages are the most popular in the discipline: R and Python. This article describes the difference between two.

Below is the list of resources and courses for both of them.

# Machine Learning

### Problems

General problems which are the foundation of all business related problems:

• Classification
• Regression
• Clustering
• Rule Extraction

More about Machine Learning problems here.

The problems which Machine Learning helped to solve at some level:

• Manual data entry
• Detecting Spam
• Product recommendation
• Medical Diagnosis
• Customer segmentation and Lifetime value prediction
• Financial analysis
• Predictive maintenance
• Image recognition

### Algorithms

There are a set of well-known algorithms which are used in most of the tasks. More about it here.

• Naïve Bayes Classifier Algorithm
• K Means Clustering Algorithm
• Support Vector Machine Algorithm
• Apriori Algorithm
• Linear Regression
• Logistic Regression
• Artificial Neural Networks
• Random Forests
• Decision Trees
• Nearest Neighbours

All algorithms are classified as follows:

• Supervised
• Unsupervised
• Reinforcement

1. Large Scale Machine Learning, building of algorithms when the model is studying with data which cannot be loaded in RAM of a single PC
2. IoT (Internet of Things)

Web sites related to the discipline:

• KDnuggets: Leading site on Business Analytics, Big Data, Data Mining, Data Science, and Machine Learning
• Kaggle: a platform for data-related competitions
• Hackerrank: Statistics and Machine Learning Challenges
• Tianchi: data sets & challenges

Other courses

E-Learning Platforms:

This site uses Akismet to reduce spam. Learn how your comment data is processed.

## Bike Sharing Demand QuestionsBike Sharing Demand Questions

Recently a group of enthusiasts in Data Science which I lead had a discussion about Kaggle competition Bike Sharing Demand. I presented a possible solution and during discussion were raised

## Siddhi: Update partially in-memory tableSiddhi: Update partially in-memory table

Siddhi is a language which belongs to the group of Streaming SQL languages, offering SQL-like approach to process data streams. This short code snippet shows one of the basics scenarios

## Data Science GlossaryData Science Glossary

In this article I put everything which I think important to be aware when you start learning Data Science. It includes math, statistic, DS concepts. I put examples where I