Four steps to master machine learning with python (including free books amp;amp; resources)

本文主要是介绍Four steps to master machine learning with python (including free books amp;amp; resources),希望对大家解决编程问题提供一定的参考价值,需要的开发者们随着小编来一起学习吧!

To understand and apply machine learning techniques you have to learn Python or R. Both are programming languages similar to C, Java or PHP. However, since Python and R are much younger and “farer away” from the CPU, they are easier. The advantage of Python is that it can be adopted to many other problems than R, which is only used for handling data, analysing it with e.g. machine learning and statistic algorythms and ploting it in nice graphs. Because Python has a broader distribution (hosting websites with Jango, natural language proecssing, accessing APIs of websites such as Twitter, Linkedin etc.) and resembles more classical programming languages like C Python is more popular.

The four steps of learning machine learning in python

  1. First you have to learn the basics of Python using books, courses and videos.
  2. Then you have to master the different moduls such as Pandas, Numpy, Matplotlib and Natural Language Processing (NLP) in order to handle, clean, plot and understand data.
  3. Afterwards you have to able to scrap data from the web which is either done by using APIs of websites or the web-scraping moduls Beautiful Soup. Web scraping allows you to collect data which you feed into you machine learning algorithms.
  4. In the last step you have to learn machine learning (ML) tools like Scikit-Learn or implement ML-algorithm from scratch.

1. Getting started with Python:

And easy and fast way to learn Python is to register at codecademy.com and imediately start to code and learn the basics of python. A classic is the website learnpythonthehardway which is referenced by a lot of python programmers. A good PDF is a byte of python. A list of python resources for beginners is also provided by the python community. A book from O’Reilley is Think Python, which can be downloaded for free from here. A last resource is Introduction to Python for Econometrics, Statistics and Data Analysis which also covers the basics of Python.

2. Important Modules for machine learning

The most important modules for machine learning are NumPy, Pandas, Matplotlib and IPython. A book covering a couple of these modules is Data Analysis with Open Source Tools.  The free book Introduction to Python for Econometrics, Statistics and Data Analysis from 1. also covers Numpy, Pandas, matplotlib and IPython. Another resource is Python for Data Analysis: Data Wrangling with Pandas, NumPy, and IPython, which also covers the most important modules. Her are other free Numpy (Numerical Python, Numpy Userguide, Guide to NumPy), Pandas (Pandas, Powerful Python Data Analysis Toolkit, Practical Business Python, Intros to Pandas Data Structure) and Matplotlib books.

Other resources:

  • 10 minutes to Pandas
  • Pandas for machine learning
  • 100 NumPy exercises

3. Mining and scraping the data from websites and through APIs

Once you have understood the basics of python and the most important modules you have to learn how to collect data from different sources. This technique is also called web scrapping. Classic sources are text from websites, textual data through APIs to access websites such as twitter or linkedin. Good books on web scraping are Mining the Social Web (free book!), Web Scraping with Python and Web Scraping with Python: Collecting Data from the Modern Web. 

Lastly this textual data has to be transformed into numerical data, which is done with natural language processing techniques covered by Natural language processing with Python and Natural Language Annotation for Machine Learning. Other data are images and videos, which can be analysed using computer vision techniques: Programming Computer Vision with Python, Programming Computer Vision with Python: Tools and algorithms for analyzing images  and Practical Python and OpenCV are typical resources to analyse images.

Educational and interesting examples of what you can already do using basic python commands and web scraping techniques can be found in these examples:

  • Mini-Tutorial: Saving Tweets to a Database with Python
  • Web Scraping Indeed for Key Data Science Job Skills
  • Case Study: Sentiment Analysis On Movie Reviews
  • First Web Scraper
  • Sentiment Analysis of Emails
  • Simple Text Classification
  • Basic Sentiment Analysis with Python
  • Twitter sentiment analysis using Python and NLTK
  • Second Try: Sentiment Analysis in Python
  • Natural Language Processing in a Kaggle Competition for Movie Reviews

4. Machine learning with Python

Machine learning can be divided into four groups. Classification, clustering, regression and dimensionalty reduction.

drop_shadows_background2

 

Classification can also be called supervised learning and helps one to classify an image in order to identify a symbol or face in the image, or to classify a user from its profile and to grant him different credit scores. Clustering happens under unsupervised learning and allows the user to identify groups/clusters within its data. Regression permits to estimate a value from a paramter set and can be used to predict the best price for a house, apartment or car.

All important modules, packages and techniques to learn Machine Learning in Python, C, Scala, Java, Julia, MATLAB, Go, R and Ruby. Books about machine learning in python:

I especially recommend the book Machine learning in action. Although a bit short it is probably a classic in machine learning due to its age Programming Collective Intelligence. These two books let you build machine learning algorithms from scratch.

Most recent publications about machine learning are base on the Python module scikit-learn. It makes machine learning very easy since all the algorithm are already implemented. The only thing you do is to tell python which ML-technique should be used to analyse the data.

A free scikit-learn tutorial can be found on the official scikit-learn website. Other posts are be found here:

  • Introduction to Machine Learning with Python and Scikit-Learn
  • Data Science in Python
  • Machine Learning for Predicting Bad Loans
  • A Generic Architecture for Text Classification with Machine Learning
  • Using Python and AI to predict types of wine
  • Advice for applying Machine Learning
  • Predicting customer churn with scikit-learn
  • Mapping Your Music Collection
  • Data Science in Python
  • Case Study: Sentiment Analysis on Movie Reviews
  • Document Clustering with Python
  • Five most popular similarity measures implementation in python
  • Case Study: Sentiment Analysis on Movie Reviews
  • Will it Python?
  • Text Processing in Machine Learning
  • Hacking an epic NHL goal celebration with a hue light show and real-time machine learning
  • Vancouver Room Prices
  • Exploring and Predicting University Faculty Salaries
  • Predicting Airline Delays

Books about machine learning and the module scikit-learn in Python are:

  • Collection of books on reddit
  • Building Machine Learning Systems with Python
  • Building Machine Learning Systems with Python, 2nd Edition
  • Learning scikit-learn: Machine Learning in Python
  • Machine Learning Algorithmic Perspective
  • Data Science from Scratch – First Principles with Python
  • Machine Learning in Python

Books which are published in the coming months are:

  • Introduction to Machine Learning with Python
  • Thoughtful Machine Learning with Python: A Test-Driven Approach

Courses and blogs about Machine learning

You want to earn a degree, take an online course or attand a real workshop, camp or university course? Here are some links: Collection of links to online education in analytics, Big Data, Data Mining, and Data Science. Coursera course in machine learning and Data Analyst Nanodegree from Udacity are other recommended online courses. List of frequently updated blogs about machine learning.

A great youtube video is this class from Jake Vanderplas, Olivier Grisel about Exploring Machine Learning with Scikit-learn!

Theory of Machine Learning

Want to learn the theory of machine learning? The Elements of statistical Learning and Introduction to Statistical Learning are often cited classics. Other books are Introduction to machine learning and A Course in Machine Learning. The links contain free PDF, so you don’t have to pay them! Don’t want to read this? Watch 15 hours theory of machine learning!

原文地址:http://lernpython.de/four-steps-to-master-machine-learning-with-python-including-free-books-resources

翻译底子:   http://python.jobbole.com/84326/

这篇关于Four steps to master machine learning with python (including free books amp;amp; resources)的文章就介绍到这儿,希望我们推荐的文章对编程师们有所帮助!



http://www.chinasem.cn/article/581840

相关文章

Python中常用的四种取整方式分享

《Python中常用的四种取整方式分享》在数据处理和数值计算中,取整操作是非常常见的需求,Python提供了多种取整方式,本文为大家整理了四种常用的方法,希望对大家有所帮助... 目录引言向零取整(Truncate)向下取整(Floor)向上取整(Ceil)四舍五入(Round)四种取整方式的对比综合示例应

python 3.8 的anaconda下载方法

《python3.8的anaconda下载方法》本文详细介绍了如何下载和安装带有Python3.8的Anaconda发行版,包括Anaconda简介、下载步骤、安装指南以及验证安装结果,此外,还介... 目录python3.8 版本的 Anaconda 下载与安装指南一、Anaconda 简介二、下载 An

Python自动化处理手机验证码

《Python自动化处理手机验证码》手机验证码是一种常见的身份验证手段,广泛应用于用户注册、登录、交易确认等场景,下面我们来看看如何使用Python自动化处理手机验证码吧... 目录一、获取手机验证码1.1 通过短信接收验证码1.2 使用第三方短信接收服务1.3 使用ADB读取手机短信1.4 通过API获取

python安装whl包并解决依赖关系的实现

《python安装whl包并解决依赖关系的实现》本文主要介绍了python安装whl包并解决依赖关系的实现,文中通过图文示例介绍的非常详细,对大家的学习或者工作具有一定的参考学习价值,需要的朋友们下面... 目录一、什么是whl文件?二、我们为什么需要使用whl文件来安装python库?三、我们应该去哪儿下

Python脚本实现图片文件批量命名

《Python脚本实现图片文件批量命名》这篇文章主要为大家详细介绍了一个用python第三方库pillow写的批量处理图片命名的脚本,文中的示例代码讲解详细,感兴趣的小伙伴可以了解下... 目录前言源码批量处理图片尺寸脚本源码GUI界面源码打包成.exe可执行文件前言本文介绍一个用python第三方库pi

Python中多线程和多进程的基本用法详解

《Python中多线程和多进程的基本用法详解》这篇文章介绍了Python中多线程和多进程的相关知识,包括并发编程的优势,多线程和多进程的概念、适用场景、示例代码,线程池和进程池的使用,以及如何选择合适... 目录引言一、并发编程的主要优势二、python的多线程(Threading)1. 什么是多线程?2.

Python爬虫selenium验证之中文识别点选+图片验证码案例(最新推荐)

《Python爬虫selenium验证之中文识别点选+图片验证码案例(最新推荐)》本文介绍了如何使用Python和Selenium结合ddddocr库实现图片验证码的识别和点击功能,感兴趣的朋友一起看... 目录1.获取图片2.目标识别3.背景坐标识别3.1 ddddocr3.2 打码平台4.坐标点击5.图

Python自动化Office文档处理全攻略

《Python自动化Office文档处理全攻略》在日常办公中,处理Word、Excel和PDF等Office文档是再常见不过的任务,手动操作这些文档不仅耗时耗力,还容易出错,幸运的是,Python提供... 目录一、自动化处理Word文档1. 安装python-docx库2. 读取Word文档内容3. 修改

Python重命名文件并移动到对应文件夹

《Python重命名文件并移动到对应文件夹》在日常的文件管理和处理过程中,我们可能会遇到需要将文件整理到不同文件夹中的需求,下面我们就来看看如何使用Python实现重命名文件并移动到对应文件夹吧... 目录检查并删除空文件夹1. 基本需求2. 实现代码解析3. 代码解释4. 代码执行结果5. 总结方法补充在

Python自动化办公之合并多个Excel

《Python自动化办公之合并多个Excel》在日常的办公自动化工作中,尤其是处理大量数据时,合并多个Excel表格是一个常见且繁琐的任务,下面小编就来为大家介绍一下如何使用Python轻松实现合... 目录为什么选择 python 自动化目标使用 Python 合并多个 Excel 文件安装所需库示例代码