Tuesday, July 14, 2020

How to select a Machine Learning algorithm

Confused about where to start with Machine Learning coding? Not sure which algorithm to select for your Machine Learning problem? Instead of relying on trial and error, you can narrow down the choice using the scikit-learn algorithm cheat-sheet. It is a flow chart that guides you through how to approach a Machine Learning problem.

Follow the link below:
https://scikit-learn.org/stable/tutorial/machine_learning_map/index.html
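
To give a feel for how the flow chart works, here is a minimal sketch of its first few decisions in Python. The function name `suggest_family` and its parameters are made up for illustration and are not part of scikit-learn. It only covers the top-level branches (which family of problem you have), not the specific estimators the chart recommends further down.

```python
def suggest_family(n_samples, predicting_category=False, has_labels=False,
                   predicting_quantity=False, just_looking=False):
    """Hypothetical helper: mirrors the top-level questions of the cheat-sheet."""
    # The chart first asks whether you have more than 50 samples.
    if n_samples <= 50:
        return "get more data"
    # Predicting a category: labeled data leads to classification,
    # unlabeled data leads to clustering.
    if predicting_category:
        return "classification" if has_labels else "clustering"
    # Predicting a quantity leads to regression.
    if predicting_quantity:
        return "regression"
    # Just exploring the data leads to dimensionality reduction.
    if just_looking:
        return "dimensionality reduction"
    # Anything else is outside what this simplified sketch covers.
    return "no clear match"


print(suggest_family(1000, predicting_category=True, has_labels=True))
print(suggest_family(1000, predicting_quantity=True))
```

Once you know the family, follow the remaining questions on the chart itself (sample size, data type, and so on) to reach a concrete algorithm.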

Friday, May 1, 2020

Jargon of Statistics / Data Mining and Data Science

While reading mathematics books as part of learning data science, I often struggled to relate the jargon of mathematics to that of data science. I found that statisticians and data scientists often use different terms for the same thing. Here is a dictionary of jargon for Statistics and Data Science.


Overview of the Data Science Process


The big data ecosystem and data science

The most common names are considered here. However, many other distributed file systems exist: Google File System, Red Hat Cluster File System, Ceph File System, and Tachyon File System.

Deep learning libraries such as TensorFlow and PyTorch are not considered.
