Dimensionality reduction is a type of learning in which we take high-dimensional data, such as images, and represent it in a lower-dimensional space.
The amount of data we are generating each day is unprecedented, and we need to find ways to make use of it. Having a high number of variables is both a boon and a curse. Dimensionality reduction is a very useful way to deal with this.

Note: Both Backward Feature Elimination and Forward Feature Selection are time-consuming and computationally expensive. In practice, they are used only on datasets that have a small number of input variables.

Factor Analysis: It divides the variables into groups based on their correlation and represents each group with a factor.

Principal Component Analysis: This is one of the most widely used techniques for dealing with linear data. It divides the data into a set of components which try to explain as much variance as possible. In other words, we want the axis of maximal variance! If we were to project our points onto this axis, they would be maximally spread.

Independent Component Analysis: We can use ICA to transform the data into independent components, which describe the data using fewer components.

Some techniques rest on stronger assumptions about the data. Linear Discriminant Analysis, in particular, assumes that the data for each class are normally distributed (Gaussian distribution). Manifold-based techniques assume that, for any pair of neighboring points on the manifold, the geodesic distance (the shortest distance between two points along a curved surface) is approximately equal to the Euclidean distance (the shortest distance between two points along a straight line).
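
To make the "axis of maximal variance" idea concrete, here is a minimal sketch for two-dimensional data using only the Python standard library. The sample points and the helper names `principal_axis_2d` and `projected_variance` are made up for illustration; in practice a library implementation such as scikit-learn's `PCA` would normally be used, and it handles any number of dimensions.

```python
import math

def principal_axis_2d(points):
    """Return (unit axis of maximal variance, mean) for a list of 2-D points."""
    n = len(points)
    mean_x = sum(x for x, _ in points) / n
    mean_y = sum(y for _, y in points) / n
    # Entries of the population covariance matrix of the centered data.
    sxx = sum((x - mean_x) ** 2 for x, _ in points) / n
    syy = sum((y - mean_y) ** 2 for _, y in points) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in points) / n
    # Closed-form angle of the leading eigenvector of a symmetric 2x2 matrix.
    theta = 0.5 * math.atan2(2 * sxy, sxx - syy)
    return (math.cos(theta), math.sin(theta)), (mean_x, mean_y)

def projected_variance(points, axis, mean):
    """Variance of the points after projecting them onto a unit axis."""
    scores = [(x - mean[0]) * axis[0] + (y - mean[1]) * axis[1] for x, y in points]
    # The scores are centered, so their mean square is their variance.
    return sum(s * s for s in scores) / len(scores)

# Hypothetical sample: points scattered roughly along the line y = x.
points = [(1.0, 1.1), (2.0, 1.9), (3.0, 3.2), (4.0, 3.8), (5.0, 5.1)]

axis, mean = principal_axis_2d(points)
var_pc = projected_variance(points, axis, mean)            # along the principal axis
var_x = projected_variance(points, (1.0, 0.0), mean)       # along the original x axis
var_y = projected_variance(points, (0.0, 1.0), mean)       # along the original y axis

print("principal axis:", axis)
print("variance along principal axis:", var_pc)
print("variance along x:", var_x, "and along y:", var_y)
```

Projecting onto the principal axis keeps more of the spread than projecting onto either original coordinate, which is why a single component can stand in for both variables when they are strongly correlated.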

