There's a shut link concerning machine learning and compression. A technique that predicts the posterior probabilities of a sequence given its full historical past can be utilized for optimal info compression (by making use of arithmetic coding within the output distribution). Illustration of linear regression on an information set Regression https://www.pinterest.com/pin/ventura-it--731131320735923558/