常用工具
Jupyter Notebook
Manipulating Text with Regular Expression
Interesting Questions to Ask
google colab
Useful projects
increase speed
quick tips
Time Series
Scikit Learn – Notes
Good projects
Cheat Sheets
Text to Image
Utility functions
Image related resources: tutorials, Image Labelling, Image dataset
Kaggle and Linux commands
Training, Testing Data
Data Analysis Pipeline
data analysis common steps
Read Data, Save Data, Data related
handling missing values
Common Process for Exploratory Data Analysis
Charting (all types)
WHEN to use WHAT charts
Outliers
Feature Engineering
train, validation, and test set
Modelling, Hyperparameter tuning, Evaluate results on validation set
Pipeline Examples
Feature Importance
Others saved in other courses
Introduction to Data Science in Python
Fundamentals of Data Manipulation with Python
Data Processing with Pandas
Basic Statistical Testing
Applied Plotting, Charting & Data Representation in Python
Principles of Information Visualization
Basic Charting: Matplotlib
Applied Visualizations
Applied Machine Learning in Python
Fundamentals of Machine Learning – Intro to SciKit Learn
Supervised Machine Learning: Part 1
Model Evaluation & Selection
Supervised Machine Learning – Part 2
Unsupervised Machine Learning
Applied Text Mining in Python
Working with Text in Python
Basic Natural Language Processing
Topic Modelling
Applied Social Network Analysis in Python
Applications
Topic Modelling
ad hoc analysis
Dashboard
interative data analysis: PyGWalker
Sports Analytics
Python package nba_api
NBA games data and analysis
Data Science Project from scratch – NBA win loss prediction
Resources
Others
Kaggle Related Resources
All Kaggle Resources Links
Handling Time Series data