My background is largely in machine learning as applied to physical science
The reason I’m here is to help introduce you to the skills and resources you’ll need to interact in the modern world of data mining
The first time I cut code for money was in 2014 and was for embedded data collection/analysis.
My area of interest is in the intersection of natural science and computer/data science and how to use principles from one domain to empower the other.
Algorithm Design
Statistics
Engineering
Optimization
Computer Science
Domain specific (expert) knowledge
Data mining is the act of extracting actionable information from a mass of data.
It indeed includes aspects of algorithm design, statistics, engineering, optimization, and computer science.
In addition, it required expertise in the domain you’re working in (much like the best data science)
Data mining can be summarized into the following steps
Person | Height | Short or Tall |
---|---|---|
1 | 5’6” | Short |
2 | 6’0” | Tall |
3 | 5’2” | Short |
4 | 6’6” | Tall |
What’s the parameter we care about?
Python
Jupyterlab
scikit-learn
Go read up to Page 13 of the text (In chapter 1)