Python Data Wrangling
In this Python Data Wrangling course, you will learn how to use Python to extract and transform data from various sources, including large database vaults and Excel financial tables. You will also learn why traditional data-cleaning methods carried over from other languages are best avoided in favour of the specialised functions in NumPy and Pandas.
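As a rough illustration of the contrast described above, the sketch below cleans the same column twice: once with an explicit loop and once with a single vectorised Pandas call. The sample data and the column name `price` are hypothetical and are not taken from the course material.

```python
import numpy as np
import pandas as pd

# Hypothetical sample data: prices stored as strings, one entry unparseable.
raw = pd.DataFrame({"price": ["10.5", "n/a", "7.25", "3"]})

# Loop-based cleaning, written the way it might be done in a
# general-purpose language without array libraries.
cleaned_loop = []
for value in raw["price"]:
    try:
        cleaned_loop.append(float(value))
    except ValueError:
        cleaned_loop.append(float("nan"))

# The same cleaning as one vectorised call: values that cannot be
# parsed are coerced to NaN instead of raising an error.
cleaned_vec = pd.to_numeric(raw["price"], errors="coerce")

# Both approaches should agree, with NaN marking the unparseable entry.
same = np.allclose(cleaned_loop, cleaned_vec.to_numpy(), equal_nan=True)

# Series.mean() skips NaN values by default.
mean_price = cleaned_vec.mean()
```

The vectorised version is shorter and leaves the missing-value handling to the library, which is the kind of specialised function the course description refers to.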
Learning objectives
- Extract and parse data from various sources
- Transform and clean data using NumPy and Pandas
- Summarise and visualise data with Matplotlib
- Read HTML, XML, and JSON data from internet resources
- Search and filter data sets
- Apply Python tools and techniques to process data sets efficiently
- Continue learning and face new challenges with after-course one-on-one instructor coaching
Target audience
This course is for data analysts and data scientists who want to use Python to extract data from various sources and prepare it for machine learning modelling.
Prerequisites
To succeed in this course, you should have a working knowledge of Python basics, including data structures, importing and using modules, creating functions, and using the Jupyter Notebook platform.
Course content
Module 1: Introduction to Data Structure using Python
- Python for Data Wrangling
- Lists, Sets, Strings, Tuples, and Dictionaries
Module 2: Advanced Operations on Built-In Data Structure
- Advanced Data Structures
- Basic File Operations in Python
Module 3: Introduction to NumPy, Pandas, and Matplotlib
- NumPy Arrays
- Pandas DataFrames
- Statistics and Visualisation with NumPy and Pandas
- Using NumPy and Pandas to Calculate Basic Descriptive Statistics on the DataFrame
Module 4: Deep Dive into Data Wrangling with Python
- Subsetting, Filtering, and Grouping
- Detecting Outliers and Handling Missing Values
- Concatenating, Merging, and Joining
- Useful Methods of Pandas
Module 5: Getting Comfortable with Different Data Sources
- Reading Data from Different Text-Based (and Non-Text-Based) Sources
- Introduction to BeautifulSoup4 and Web Page Parsing
Module 6: Learning the Hidden Secrets of Data Wrangling
- Advanced List Comprehension and the zip Function
- Data Formatting
Module 7: Advanced Web Scraping and Data Gathering
- Basics of Web Scraping and BeautifulSoup libraries
- Reading Data from XML
Module 8: RDBMS and SQL
- Refresher of RDBMS and SQL
- Using an RDBMS (MySQL/PostgreSQL/SQLite)
Module 9: Application in Real Life and Conclusion of Course
- Applying Your Knowledge to a Real-life Data Wrangling Task
- An Extension to Data Wrangling
The course is delivered through the training partner Learning Tree.