Explaining algorithms draws on storytelling principles: the spectrum from author-driven to reader-driven narratives, and structures such as the Martini glass, the interactive slideshow, and the drill-down story.
Dimensionality reduction is used in various domains, including document categorization, protein disorder prediction, drug discovery, and machine learning model debugging.
The disadvantages of dimensionality reduction include that the semantics of individual dimensions are hard to preserve, the resulting embeddings are hard to understand and interpret, and the projection error is not visible, which can inspire false confidence.
t-Distributed Stochastic Neighbor Embedding (t-SNE) produces highly clustered, visually striking embeddings, captures local structure well, and is non-linear; however, it may lose global structure in favor of preserving local distances, is computationally more expensive than linear methods, requires setting hyperparameters that influence the quality of the embedding, and is non-deterministic.
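As a minimal sketch of these trade-offs (assuming scikit-learn and its digits toy dataset, both chosen here only for illustration): `perplexity` is the hyperparameter that most shapes the result, and pinning `random_state` makes the otherwise non-deterministic run repeatable.

```python
from sklearn.datasets import load_digits
from sklearn.manifold import TSNE

X, y = load_digits(return_X_y=True)  # 1797 samples, 64 dimensions

# Non-linear and non-deterministic: fixing random_state makes the run repeatable,
# and perplexity (roughly, the expected neighborhood size) shapes the clusters.
tsne = TSNE(n_components=2, perplexity=30, random_state=42)
embedding = tsne.fit_transform(X)  # (1797, 2) array, ready for a scatter plot

print(embedding.shape)
```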
Dimensionality reduction techniques include linear approaches such as Principal Component Analysis (PCA) and Multidimensional Scaling (MDS), and non-linear approaches such as t-SNE, Uniform Manifold Approximation and Projection (UMAP), and Self-Organizing Maps (SOM).
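As a rough sketch (assuming scikit-learn for PCA, MDS, and t-SNE, and the separate umap-learn package for UMAP; SOMs need another library and are omitted here), these reducers all expose a similar fit_transform interface, which makes it easy to swap them on the same data:

```python
import umap  # from the umap-learn package
from sklearn.datasets import load_digits
from sklearn.decomposition import PCA
from sklearn.manifold import MDS, TSNE

X, _ = load_digits(return_X_y=True)
X = X[:500]  # subsample so the slower iterative methods finish quickly

reducers = {
    "PCA (linear)": PCA(n_components=2),
    "MDS (linear)": MDS(n_components=2, random_state=0),
    "t-SNE (non-linear)": TSNE(n_components=2, random_state=0),
    "UMAP (non-linear)": umap.UMAP(n_components=2, random_state=0),
}

for name, reducer in reducers.items():
    emb = reducer.fit_transform(X)
    print(name, emb.shape)  # each yields an (n_samples, 2) embedding
```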
The cons of PCA include that its linear projection limits the structure it can capture, and its embeddings may not separate clusters as distinctly as those of other algorithms.
Embeddings can be useful, but patterns in them should be interpreted with care: hyperparameters really matter, cluster sizes in a t-SNE plot mean nothing, distances between clusters might not mean anything, and random noise doesn't always look random.
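A small sketch of the "random noise doesn't always look random" caveat (assuming scikit-learn and matplotlib, with arbitrary dimensions and perplexity values chosen for illustration): projecting pure uniform noise with t-SNE at several perplexities can produce apparent clusters that are artifacts of the hyperparameter choice, not structure in the data.

```python
import matplotlib.pyplot as plt
import numpy as np
from sklearn.manifold import TSNE

rng = np.random.default_rng(0)
noise = rng.random((300, 50))  # 300 points of pure uniform noise in 50 dimensions

fig, axes = plt.subplots(1, 3, figsize=(12, 4))
for ax, perplexity in zip(axes, [2, 30, 100]):
    emb = TSNE(n_components=2, perplexity=perplexity, random_state=0).fit_transform(noise)
    ax.scatter(emb[:, 0], emb[:, 1], s=5)
    ax.set_title(f"perplexity={perplexity}")  # low values show spurious "clusters"
plt.show()
```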
The pros of PCA are that it is relatively computationally cheap, the fitted embedding model can be saved and reused to project new data points into the reduced space, and the result can be used to cluster data.
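A minimal sketch of the reuse property (assuming scikit-learn and an arbitrary train/new split of the digits data): the PCA model fitted on one batch can project previously unseen points into the same reduced space, something t-SNE's fit_transform-only interface does not offer.

```python
from sklearn.datasets import load_digits
from sklearn.decomposition import PCA

X, _ = load_digits(return_X_y=True)
X_train, X_new = X[:1500], X[1500:]  # pretend the tail arrives later

pca = PCA(n_components=2)
emb_train = pca.fit_transform(X_train)  # cheap: essentially one eigendecomposition

# The fitted model can be kept (e.g. pickled) and reused to project new points
# into the same 2-D space, so old and new embeddings stay directly comparable.
emb_new = pca.transform(X_new)
print(emb_train.shape, emb_new.shape)
```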
The disadvantages of UMAP include the need to set hyperparameters that influence the quality of the embedding, and the fact that the algorithm is non-deterministic.
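As a brief sketch (assuming the umap-learn package and the same digits data; the parameter values are illustrative defaults, not recommendations): n_neighbors and min_dist are the hyperparameters that most influence the embedding, and fixing random_state trades some speed for reproducibility of the otherwise non-deterministic result.

```python
import umap  # from the umap-learn package
from sklearn.datasets import load_digits

X, _ = load_digits(return_X_y=True)

# n_neighbors balances local vs. global structure; min_dist controls how tightly
# points are packed. random_state pins the stochastic optimization for repeat runs.
reducer = umap.UMAP(n_neighbors=15, min_dist=0.1, random_state=42)
embedding = reducer.fit_transform(X)
print(embedding.shape)  # (1797, 2)
```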