
<script type="text/javascript">
<!--
document.write('<div id="oa_widget"></div>');
document.write('<script type="text/javascript" src="https://www.openaire.eu/index.php?option=com_openaire&view=widget&format=raw&projectId=undefined&type=result"></script>');
-->
</script>
Minimalist Data Wrangling with Python is envisaged as a student's first introduction to data science, providing a high-level overview as well as discussing key concepts in detail. We explore methods for cleaning data gathered from different sources, transforming, selecting, and extracting features, performing exploratory data analysis and dimensionality reduction, identifying naturally occurring data clusters, modelling patterns in data, comparing data between groups, and reporting the results. This textbook is a non-profit project. Its online and PDF versions are freely available at https://datawranglingpy.gagolewski.com/. To order a paper copy, see https://datawranglingpy.gagolewski.com/order-paper-copy.html. Marek Gagolewski is an Associate Professor in Data Science at Warsaw University of Technology. His research interests are related to data science, in particular: modelling complex phenomena, developing usable, general purpose algorithms, studying their analytical properties, and finding out how people use, misuse, understand, and misunderstand methods of data analysis in research, commercial, and decision making settings. In his spare time, he writes books for his students and develops free (libre) data analysis software, such as stringi – one of the most often downloaded R packages, and genieclust – a fast and robust clustering algorithm in both Python and R. See also: Deep R Programming at https://deepr.gagolewski.com/.
Please cite this book as: Gagolewski M. (2025), Minimalist Data Wrangling with Python, Zenodo, Melbourne, DOI: 10.5281/zenodo.6451068, ISBN: 978-0-6455719-1-2, URL: https://datawranglingpy.gagolewski.com/
FOS: Computer and information sciences, Artificial Intelligence and Image Processing, scipy, 80308 Programming Languages, data frames, Computer Software, vectors, matrices, numpy, FOS: Mathematics, 80204 Mathematical Software, 80306 Open Software, 80304 Concurrent Programming, pandas, matplotlib, Data Wrangling, Data Science, Statistics, outliers, 80110 Simulation and Modelling, data cleansing, missing values, classification, Applied Computer Science, regression, scikit-learn, data science, text processing, time series, data wrangling, Python, clustering
FOS: Computer and information sciences, Artificial Intelligence and Image Processing, scipy, 80308 Programming Languages, data frames, Computer Software, vectors, matrices, numpy, FOS: Mathematics, 80204 Mathematical Software, 80306 Open Software, 80304 Concurrent Programming, pandas, matplotlib, Data Wrangling, Data Science, Statistics, outliers, 80110 Simulation and Modelling, data cleansing, missing values, classification, Applied Computer Science, regression, scikit-learn, data science, text processing, time series, data wrangling, Python, clustering
<script type="text/javascript">
<!--
document.write('<div id="oa_widget"></div>');
document.write('<script type="text/javascript" src="https://www.openaire.eu/index.php?option=com_openaire&view=widget&format=raw&projectId=doi_________::d492efb2e44c3ddca961c8f26f55097b&type=result"></script>');
-->
</script>
<script type="text/javascript">
<!--
document.write('<div id="oa_widget"></div>');
document.write('<script type="text/javascript" src="https://www.openaire.eu/index.php?option=com_openaire&view=widget&format=raw&projectId=doi_________::d492efb2e44c3ddca961c8f26f55097b&type=result"></script>');
-->
</script>
<script type="text/javascript">
<!--
document.write('<div id="oa_widget"></div>');
document.write('<script type="text/javascript" src="https://www.openaire.eu/index.php?option=com_openaire&view=widget&format=raw&projectId=doi_dedup___::3b34d8eb20fc2bc3cfeb06b10259eea5&type=result"></script>');
-->
</script>
<script type="text/javascript">
<!--
document.write('<div id="oa_widget"></div>');
document.write('<script type="text/javascript" src="https://www.openaire.eu/index.php?option=com_openaire&view=widget&format=raw&projectId=doi_dedup___::3b34d8eb20fc2bc3cfeb06b10259eea5&type=result"></script>');
-->
</script>
<script type="text/javascript">
<!--
document.write('<div id="oa_widget"></div>');
document.write('<script type="text/javascript" src="https://www.openaire.eu/index.php?option=com_openaire&view=widget&format=raw&projectId=doi_dedup___::80983c61cf9b65b8992534ce7d1e54a1&type=result"></script>');
-->
</script>
<script type="text/javascript">
<!--
document.write('<div id="oa_widget"></div>');
document.write('<script type="text/javascript" src="https://www.openaire.eu/index.php?option=com_openaire&view=widget&format=raw&projectId=doi_dedup___::80983c61cf9b65b8992534ce7d1e54a1&type=result"></script>');
-->
</script>
<script type="text/javascript">
<!--
document.write('<div id="oa_widget"></div>');
document.write('<script type="text/javascript" src="https://www.openaire.eu/index.php?option=com_openaire&view=widget&format=raw&projectId=doi_________::e0590265b553604d826453a872723428&type=result"></script>');
-->
</script>
<script type="text/javascript">
<!--
document.write('<div id="oa_widget"></div>');
document.write('<script type="text/javascript" src="https://www.openaire.eu/index.php?option=com_openaire&view=widget&format=raw&projectId=doi_________::e0590265b553604d826453a872723428&type=result"></script>');
-->
</script>
<script type="text/javascript">
<!--
document.write('<div id="oa_widget"></div>');
document.write('<script type="text/javascript" src="https://www.openaire.eu/index.php?option=com_openaire&view=widget&format=raw&projectId=dedup_wf_002::4d538f100a4d4169b7b88e385578c61a&type=result"></script>');
-->
</script>
<script type="text/javascript">
<!--
document.write('<div id="oa_widget"></div>');
document.write('<script type="text/javascript" src="https://www.openaire.eu/index.php?option=com_openaire&view=widget&format=raw&projectId=dedup_wf_002::4d538f100a4d4169b7b88e385578c61a&type=result"></script>');
-->
</script>
<script type="text/javascript">
<!--
document.write('<div id="oa_widget"></div>');
document.write('<script type="text/javascript" src="https://www.openaire.eu/index.php?option=com_openaire&view=widget&format=raw&projectId=dedup_wf_002::546b8b0dc6591daecaa261abd0eb76ce&type=result"></script>');
-->
</script>
<script type="text/javascript">
<!--
document.write('<div id="oa_widget"></div>');
document.write('<script type="text/javascript" src="https://www.openaire.eu/index.php?option=com_openaire&view=widget&format=raw&projectId=dedup_wf_002::546b8b0dc6591daecaa261abd0eb76ce&type=result"></script>');
-->
</script>
<script type="text/javascript">
<!--
document.write('<div id="oa_widget"></div>');
document.write('<script type="text/javascript" src="https://www.openaire.eu/index.php?option=com_openaire&view=widget&format=raw&projectId=dedup_wf_002::661e98994689a1acd676d60e7828c9a1&type=result"></script>');
-->
</script>
<script type="text/javascript">
<!--
document.write('<div id="oa_widget"></div>');
document.write('<script type="text/javascript" src="https://www.openaire.eu/index.php?option=com_openaire&view=widget&format=raw&projectId=dedup_wf_002::661e98994689a1acd676d60e7828c9a1&type=result"></script>');
-->
</script>
citations This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 0 | |
popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
views | 142 | |
downloads | 118 |