Self-Study Python Course

Data analysis with Python is a practical introduction to data analysis using a large number of programming exercises. The course covers Python libraries such as NumPy, Pandas, Matplotlib and SciPy. These are used for data cleaning, grouping, creation of summary statistics, and for machine learning tasks such as linear regression, Naive Bayes, PCA, and clustering.
Course can be taken remotely.


Course Credits and License

Course Credits and License

Course material

The course material was written by Jarkko Toivonen. It was updated by Saska Dögnes.

The course material is licensed under the Creative Commons BY-NC-SA 4.0 -license, so you are free to use and distribute it as long as the names of the original authors are not removed. If you make changes to the material and want to distribute the modified version, you must license it under the same license. The use of materials for commercial purposes is prohibited without special permission.

Technology used on the course

Course page made by Henrik Nygren and Antti Leinonen. Course tasks were converted from TMC (Test My Code) format to Jupyter Notebook / nbgrader format by Petri Leinonen.

Begin your journey in the Data Analysis with Python course now