.. SPDX-FileCopyrightText: 2019 Veit Schiele .. .. SPDX-License-Identifier: BSD-3-Clause ======================= Python for Data Science ======================= This is a tutorial on Data Science with Python. This immediately raises the question: What is Data Science? The term has become ubiquitous, but there is no single definition. Some even consider the term superfluous, because what science does not have to do with data? Nevertheless, it seems to me that data science is more than just hype: scientific data has become increasingly voluminous and often can no longer be adequately tapped with conventional mathematical and statistical methods alone – additional hacking skills are needed. However, it is not a new field of knowledge that you need to learn, but a set of skills that you can apply in your field. Whether you are analysing astronomical objects, analysing machines, forecasting stock prices or working with data in other fields, the goal of this tutorial is to enable you to solve tasks programmatically in your field. This tutorial is not intended to be an introduction to Python or programming in general; for that there is the :doc:`python-basics:index` tutorial. Instead, it is intended to show the Python data science stack – libraries such as :doc:`/workspace/ipython/index`, :doc:`/workspace/numpy/index`, :doc:`/workspace/pandas/index`, and related tools – so that you can subsequently effectively analyse your data. We also offer the `Jupyter Tutorial `_ and the `PyViz Tutorial `_ as well as the instructions for `data visualisation `_ from the `cusy Design System `_. All tutorials serve as seminar documents for our harmonised training courses: +---------------+--------------------------------------------------------------+ | Duration | Topic | +===============+==============================================================+ | 3 days | `Introduction to Python`_ | +---------------+--------------------------------------------------------------+ | 2 days | `Advanced Python`_ | +---------------+--------------------------------------------------------------+ | 2 days | `Design patterns in Python`_ | +---------------+--------------------------------------------------------------+ | 2 days | `Efficient testing with Python`_ | +---------------+--------------------------------------------------------------+ | 1 day | `Software documentation with Sphinx`_ | +---------------+--------------------------------------------------------------+ | 2 days | `Technical writing`_ | +---------------+--------------------------------------------------------------+ | 3 days | `Jupyter notebooks for efficient data science workflows`_ | +---------------+--------------------------------------------------------------+ | 2 days | `Numerical calculations with NumPy`_ | +---------------+--------------------------------------------------------------+ | 2 days | `Analysing data with pandas`_ | +---------------+--------------------------------------------------------------+ | 3 days | `Read, write and provide data with Python`_ | +---------------+--------------------------------------------------------------+ | 2 days | `Cleanse and validate data with Python`_ | +---------------+--------------------------------------------------------------+ | 5 days | `Visualising data with Python`_ | +---------------+--------------------------------------------------------------+ | 1 day | `Designing data visualisations`_ | +---------------+--------------------------------------------------------------+ | 2 days | `Create dashboards`_ | +---------------+--------------------------------------------------------------+ | 3 days | `Versioned and reproducible storage of code and data`_ | +---------------+--------------------------------------------------------------+ | Subscription | `News from Python for data science`_ | | of 2 hours | | | per quarter | | +---------------+--------------------------------------------------------------+ .. _`Introduction to Python`: https://cusy.io/en/our-training-courses/introduction-to-python .. _`Advanced Python`: https://cusy.io/en/our-training-courses/advanced-python .. _`Design patterns in Python`: https://cusy.io/en/our-training-courses/design-patterns-in-python .. _`Efficient testing with Python`: https://cusy.io/en/our-training-courses/efficient-testing-with-python .. _`Software documentation with Sphinx`: https://cusy.io/en/our-training-courses/software-documentation-with-sphinx .. _`Technical writing`: https://cusy.io/en/our-training-courses/technical-writing .. _`Jupyter notebooks for efficient data science workflows`: https://cusy.io/en/our-training-courses/jupyter-notebooks-for-efficient-data-science-workflows .. _`Numerical calculations with NumPy`: https://cusy.io/en/our-training-courses/numerical-calculations-with-numpy .. _`Analysing data with pandas`: https://cusy.io/en/our-training-courses/analysing-data-with-pandas .. _`Read, write and provide data with Python`: https://cusy.io/en/our-training-courses/read-write-and-provide-data-with-python .. _`Cleanse and validate data with Python`: https://cusy.io/en/our-training-courses/cleanse-and-validate-data-with-python .. _`Visualising data with Python`: https://cusy.io/en/our-training-courses/visualising-data-with-python .. _`Designing data visualisations`: https://cusy.io/en/our-training-courses/designing-data-visualisations .. _`Create dashboards`: https://cusy.io/en/our-training-courses/create-dashboards .. _`Versioned and reproducible storage of code and data`: https://cusy.io/en/our-training-courses/versioned-and-reproducible-storage-of-code-and-data .. _`News from Python for data science`: https://cusy.io/en/our-training-courses/news-from-python-for-data-science .. toctree:: :hidden: :titlesonly: :maxdepth: 0 intro workspace/index data-processing/index clean-prep/index viz/index performance/index productive/index web/index genindex .. Indices and tables ================== * :ref:`genindex` * :ref:`modindex` * :ref:`search`