Resources for learning Python during COVID-19

written by Eric J. Ma on 2020-03-25 | tags: covid-19 python learning data science

With COVID-19 on hand, you might have some time to go deeper into Python programming.

I'd like to recommend some resources for you, in case you wish to brush up, learn, or go deeper.

DataCamp

If you're a complete beginner, I recommend going with DataCamp. I say this with a full up-front disclosure that I'm a DataCamp instructor, and if you make it to the Network Analysis courses, then I benefit from you. (If you don't, then I don't.)

The reasons why I recommend DataCamp is:

The reason why you wouldn't want to go with DataCamp is because you have to pay for it, and maybe because you don't trust the words of someone who has a conflict of interest. (I don't blame you, I'd take the same position.)

You might need to discuss with your management if your team is going to sponsor your learning.

Coursera

Coursera has a wide range of offerings for learning Python, and they are completely free with certification if your organization sponsors it (my current employer Novartis includes this as a perk). (Otherwise, it's just free - also hard to beat.) Just on the basis of price I would not hesitate to recommend it.

That said, there's a lot of courses to choose from.

One of the bigger issues here is that you might need to learn how to set up your own Python programming environment. For this, consult your friendly neighborhood parsertongue speaker (i.e. a colleague who knows Python) to help you.

Think Python

Think Python is an online book by Allen Downey, a professor of computer science and more at the Olin College of Engineering in Needham, MA. Allen is also a fellow Python community educator, and has generously let me test-drive my deep learning tutorials at his classes.

In this book, Allen leverages the friendliness of the Python programming language to teach you basic computing concepts.

Allen is incredibly generous, and has made the book freely available online, though you can buy it via Amazon or O'Reilly Media.

He's got other titles for those who already know Python:

Automate the Boring Stuff with Python

This is a book by a Python friend of mine, Al Swiegart, whom I met at the annual PyCon USA.

In this book, Al teaches you how to use Python code to automate all the boring, repetitive stuff that you encounter in your day-to-day work on a computer. It's this kind of project, which directly impacts your day-to-day, that can keep your motivation levels high while learning.

Al is also incredibly generous, and has made the book freely available online, and sometimes gives away the physical book for free. But as it's one of his income sources, I'd encourage you to buy the book (just as I did, even though I don't really need to read it anymore). 

e2eML by Brandon Rohrer

This is an online course created by Brandon Rohrer, who is a data scientist at iRobot (the maker of those fancy robot vacuums!).

Brandon is pretty active on social media, and is a fellow education enthusiast. (I sometimes wish my current role could include formal classroom teaching as part of my professional goals.) With this online course, he brings you through one opinionated path to getting good with machine learning and data science. As a pre-requisite, you should know how to set up your own Python programming environment, as there's no hosted computing environment for you.

Your Own Project

If you're proficient with some basic Python, and in particular, have learned how to use pandas, then you might want to take the time to re-do an analysis that you did once, except now done in pandas and Python.

Doing so will get you lots of practice in what actual day-to-day data science programming looks like, where you:

If possible, write a blog post as well, to document your learning journey, especially explaining how you solved the problem. I have found this to be an incredibly effective way of making that thing I just learned stick in memory.

SciPy/PyCon conferences

The SciPy and PyCon conferences are two annual Python conferences that I have attended since grad school, and they have a wealth of resources available for learners.

There are YouTube playlists available for each of them (SciPy and PyCon). PyCon has a new YouTube channel each year, while SciPy uses Enthought Media's own channel.

If you dig deep enough, my tutorials are available online as well, freely available for anybody to watch. (I link them from my personal website if you want a shortcut there... alrighty, enough of my shameless self-promotion.)

Some videos that have been helpful in my own learning journey:

The YouTube videos are quite good for those who have some basic knowhow on managing their own Python environments. There are some beginner-friendly ones that show you how to get started with Python too. In particular, this playlist should cover everything for you.

pandas Resources

If you're already proficient with Python, then learning pandas can only help you. pandas is the idiomatic package for working with data tables in Python. Knowing how to use it can help you be productive when working with tables that come from collaborators, or data from databases.

Here's some resources for picking up pandas:

Other notes

Setting up Python locally

Thus far, I've alluded to "setting up your own Python environment" many times. To demystify what I mean by that, it really boils down to installing the Anaconda distribution of Python more than anything else. (Don't download the Python 2.7 version, it's outdated!) The Anaconda Python distribution, distributed by the distribution namesake Anaconda, solved a lot of Python packaging problems that weren't actively being solved in the early 2010s, and "robustified" the distribution of Python packages. I myself was once skeptical about using it, until I screwed up my own system Python installation and broke iPhoto. That's when I finally bit the bullet and installed it - and never looked back since.

Jupyter

You might encounter the name "Jupyter", and think that "Jupyter" provides packages. This is incorrect - Jupyter is the name of an ecosystem of tools that data scientists use, and it covers

IPython is the precursor monolith project to Jupyter, and it was primarily focused on the computation engine and notebook.

Hope that disambiguates the terms for you.

The Most Important Trick For Learning...

...is nothing more than getting practice every single day.

Even if it's only for a single DataCamp exercise, getting that practice in daily is important for mastery. Otherwise, your time spent learning now will simply go to waste, filed away in the "I guess I learned it" cabinet never to be retrieved and tested again. If you want the knowledge to stick, you need to increase the odds that you'll get practice every day.

If you have a project you need to solve, you increase the odds that you'll get practice every day.

If you have sunken costs (time or money) into a course, you increase the odds that you'll practice every day.

If you have a community of learners to learn with, you increase the odds that you'll practice every day.

If you have a resource person with whom you click that you can ask questions of, you increase the odds that you'll practice every day.

Other resources

I clearly have a biased view of the world; if you have other resources for learning Python, don't hesitate to DM me or share them with your friends!

Finally...

Stay safe, y'all. Stay indoors, stay away from other people, and keep washing your hands!