Principal Component Analysis (PCA) of Proteomics Data

In this video, we perform a principal component analysis (PCA) to identify outlier genes and to assess between-sample and within-sample variability in a typical proteomics dataset.
We prepare the data matrix of expression values and normalize it by subtracting each gene's sample mean and dividing the result by the gene's standard deviation. After that, we perform the actual PCA.
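
The preparation and PCA steps above can be sketched roughly as follows. This is a minimal illustration and not the code from the video: it assumes the matrix has samples in rows and genes in columns, reads the normalization as a per-gene z-score across samples, and computes the PCA with NumPy's SVD. The toy data and the helper names `standardize_genes` and `pca_svd` are hypothetical.

```python
import numpy as np


def standardize_genes(expr):
    """Z-score each gene (column) across samples (rows)."""
    expr = np.asarray(expr, dtype=float)
    mean = expr.mean(axis=0)
    std = expr.std(axis=0, ddof=1)
    std[std == 0] = 1.0  # constant genes stay at zero instead of dividing by zero
    return (expr - mean) / std


def pca_svd(z):
    """PCA of a column-centred matrix via SVD.

    Returns sample scores, gene loadings, and the variance of each PC.
    """
    u, s, vt = np.linalg.svd(z, full_matrices=False)
    scores = u * s                        # coordinates of samples on the PCs
    variances = s**2 / (z.shape[0] - 1)   # variance captured by each PC
    return scores, vt.T, variances


# Hypothetical toy data: 6 samples x 50 genes of random "expression" values.
rng = np.random.default_rng(0)
expr = rng.normal(loc=10.0, scale=2.0, size=(6, 50))

z = standardize_genes(expr)
scores, loadings, variances = pca_svd(z)
print(scores.shape, loadings.shape, variances.shape)
```

After standardization every gene has unit variance, so the PC variances add up to the number of genes. That is a quick sanity check on the decomposition.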

We make a plot that shows the variance of each principal component (PC). The first principal component captures the largest share of the variance in the dataset, the second captures the second largest share, and so on.

Usually, the first two PCs should explain most of the variation in the data. Here we check whether that is the case.
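
One way to run that check numerically is to compute the fraction of the total variance carried by each PC and add up the first two. The sketch below is an assumption-laden illustration, not the video's code: the helper `explained_variance_ratio` and the simulated data (one strong shared factor plus noise) are hypothetical.

```python
import numpy as np


def explained_variance_ratio(data):
    """Fraction of total variance captured by each PC, largest first."""
    z = np.asarray(data, dtype=float)
    z = z - z.mean(axis=0)                    # centre each gene (column)
    s = np.linalg.svd(z, compute_uv=False)    # singular values, descending
    var = s**2 / (z.shape[0] - 1)
    return var / var.sum()


# Hypothetical toy data: 8 samples x 40 genes driven by one shared factor.
rng = np.random.default_rng(1)
factor = rng.normal(size=(8, 1))
weights = rng.normal(size=(1, 40))
data = 5.0 * factor @ weights + rng.normal(size=(8, 40))

ratio = explained_variance_ratio(data)
top_two = ratio[:2].sum()
print("variance explained per PC:", np.round(ratio, 3))
print("PC1 + PC2 together:", round(float(top_two), 3))
```

Plotting `ratio` as a bar chart gives the variance plot described above.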

We usually plot the first principal component, which captures the largest variation, against the second principal component, which captures the second largest. In an ideal dataset, samples under the same experimental condition lie very close to each other in the PCA plot, while samples under different experimental conditions lie far apart.
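
The "close within a condition, far apart between conditions" criterion can also be checked with numbers. The sketch below projects samples onto PC1 and PC2 and compares the average distance of samples to their own group centroid with the distance between the group centroids. It is only an illustration: the two-condition toy data, the labels `control` and `treated`, and the helper `pc_scores` are hypothetical, and the centroid comparison is one of several possible summaries.

```python
import numpy as np


def pc_scores(data, n_components=2):
    """Coordinates of each sample on the first principal components."""
    z = np.asarray(data, dtype=float)
    z = z - z.mean(axis=0)                       # centre each gene (column)
    u, s, _ = np.linalg.svd(z, full_matrices=False)
    return (u * s)[:, :n_components]


# Hypothetical toy data: 4 control and 4 treated samples, 30 genes,
# with the first 10 genes shifted upward in the treated group.
rng = np.random.default_rng(2)
control = rng.normal(size=(4, 30))
treated = rng.normal(size=(4, 30))
treated[:, :10] += 4.0
data = np.vstack([control, treated])
labels = np.array(["control"] * 4 + ["treated"] * 4)

scores = pc_scores(data)  # column 0 is PC1, column 1 is PC2

# Centroid of each condition in the PC1-PC2 plane.
centroids = {g: scores[labels == g].mean(axis=0) for g in ("control", "treated")}

# Average distance of a sample to its own group centroid (within-group spread).
within = np.mean(
    [np.linalg.norm(scores[i] - centroids[labels[i]]) for i in range(len(labels))]
)
# Distance between the two group centroids (between-group separation).
between = np.linalg.norm(centroids["control"] - centroids["treated"])

print("within-group spread:", round(float(within), 2))
print("between-group distance:", round(float(between), 2))
```

A scatter plot of `scores[:, 0]` against `scores[:, 1]`, colored by `labels`, gives the usual PC1 versus PC2 figure.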
——
Document: https://compu-flair.com/pca
Code: https://colab.research.google.com/dri...

Video: Principal Component Analysis (PCA) of Proteomics Data, from the CompuFlair channel

Video information

November 19, 2022, 3:50:29

00:03:49

CompuFlair
