This is a draft version of a forthcoming book by Marek Gagolewski. Any remarks and bug fixes are appreciated — please submit them via this repository's Issues tracker. Thank you.
This repository hosts the HTML and PDF versions of the book.
You can browse them at:
- https://datawranglingpy.gagolewski.com/ (a browser-friendly version)
- https://datawranglingpy.gagolewski.com/datawranglingpy.pdf (PDF)
I, Marek Gagolewski (pronounced like Mark Gaggle-Eve-Ski), am currently a Senior Lecturer in Applied AI at Deakin University in Melbourne, VIC, Australia, and an Associate Professor in Data Science (on long-term leave) at the Faculty of Mathematics and Information Science, Warsaw University of Technology, Poland.
I'm actively involved in developing usable free (libre) and open source software, with a particular focus on data science and machine learning. I'm the main author and maintainer of stringi, one of the most frequently downloaded R packages, which targets natural language and string processing. I also maintain genieclust, a Python and R package implementing Genie, a fast and robust hierarchical clustering algorithm with noise point detection.
I'm an author of over 80 publications on machine learning and optimisation algorithms, data aggregation and clustering, statistical modelling, and scientific computing. I have also taught various courses on R and Python programming, algorithms, data science, and machine learning in Australia, Poland, and Germany (e.g., at Data Science Retreat).
Copyright (C) 2022, Marek Gagolewski.
This material is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License (CC BY-NC-ND 4.0).