Skip to content

gagolews/datawranglingpy

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

15 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Minimalist Data Wrangling with Python

This is a draft version of a forthcoming book by Marek Gagolewski. Any remarks and bug fixes are appreciated — please submit them via this repository's Issues tracker. Thank you.

About this Repository

This repository hosts the HTML and PDF versions of the book.

You can browse them at:

About the Author

I, Marek Gagolewski (pronounced like Mark Gaggle-Eve-Ski), am currently a Senior Lecturer in Applied AI at Deakin University in Melbourne, VIC, Australia and an Associate Professor in Data Science (on long-term leave) at Faculty of Mathematics and Information Science, Warsaw University of Technology, Poland.

I'm actively involved in developing usable free (libre) and open source software, with particular focus on data science and machine learning. I'm the main author and maintainer of stringi – one of the most often downloaded R packages that aims at natural language and string processing as well as the Python and R package genieclust implementing the fast and robust hierarchical clustering algorithm Genie with noise point detection.

I'm an author of over 80 publications on machine learning and optimisation algorithms, data aggregation and clustering, statistical modelling, and scientific computing. Moreover, I taught various courses related to R and Python programming, algorithms, data science, and machine learning in Australia, Poland, and Germany (e.g., at Data Science Retreat).


Copyright (C) 2022, Marek Gagolewski.

This material is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License (CC BY-NC-ND 4.0).