Data Visualization from Simple Plots to UMAP
Course Description
Visualizing data is often the key to understanding it. This course gives an overview of common data visualization techniques and considerations. Attention will given to exploratory visualization particular to chemometrics. We’ll explore the elements of different visual representations of data, the tools used to create plots, and ideas on how to enhance your visualizations. Both conventional and image data will be considered. Along the way we’ll see how Matlab and Eigenvector plotting tools work and how to use them. New methods for visualizing data will be introduced. The course includes hands-on computer time for participants to work example problems using PLS_Toolbox and MIA_Toolbox, or Solo+MIA.
Prerequisites
Chemometrics I — PCA, and Chemometrics II – Regression and PLS or equivalent experience.
Course Outline
- Overview and Motivation
- Data Structures and Features
- Exploratory vs. Explanatory
- Practical Considerations
- Basic Plotting Elements
- Matlab and PLS_Toolbox Plotting Organization
- Plot Types
- Popular types
- Images
- Custom Plotting
- Defaults (Matlab, Plotgui)
- Custom Class Symbols
- Reverse Engineering
- Fonts
- Static Plots
- Movies
- Exporting
- SVG vs Bitmap
- Publications
- Higher Order Data
- Colorby
- Layering
- Tiled plots
- Condensing Data with Factor Based Models
- Slicing and Motion
- Beyond PCA: New methods for Data Visualization
- t-SNE, t-distributed Stochastic Neighbor Embedding
- UMAP, Uniform Manifold Approximation and Projection
- Conclusions
- Resources
- Future Considerations