Using matrices to transform the plane: Composing matrices
2X2 matrices can define transformations for the entire plane. In this worked example, we see how to find a single matrix that defines the same transformation as the composition of two other matrices. Created by Sal Khan.
Want to join the conversation?
- At1:54Sal stops the video to think about how Matrices work as compositions. Generally for some rudimentary functions f(x) = x+5 and g(x) = x-3, (f o g)(x) would implement g(x) into f(x) like (f o g)(x) = (x-3)+5. However, at3:15when Sal composes B o A, he multiplies the columns of the B matrix by individual A entries, which appears backwards as B is being implemented into A, indicative of the composition A o B. I was wondering if somebody could explain the reasoning behind that choice and why it apparently flips or why my logic is flawed?(3 votes)
- Function composition is written right-to-left, so the composition of B, then A is written as the matrix AB. In your example, f◦g(x) first applies g to x, then applies f to the result. We define matrix multiplication so as to match this convention.(4 votes)
- When I multiply these two matrices, it is giving me different values from when I compose a matrix from the two matrices. This makes me conclude matrix multiplication is different from matrix composition is that right?
Edit: Matrix multiplication is the same thing as matrix composition. The key to understanding this is paying closer attention to the commutative property that does not exist in matrix multiplication. Matrix multiplication is read from the right to the left and the reason that I was getting results was because I was multiplying from left to right.
Hope this helps someone who faced the same issue as me.(3 votes)
- matrix multiplication and matrix composition are two different operations on matrices that can produce different results.
Matrix multiplication involves multiplying corresponding elements of two matrices and adding the results. It is denoted by *. The number of columns in the first matrix must equal the number of rows in the second matrix for multiplication to be defined.
Matrix composition is the operation of applying one matrix transformation followed by another. It is denoted by ∘. The composition of matrices A and B, written A ∘ B, has the effect of first applying B and then applying A. So if v is a column vector, (A ∘ B)v = A(Bv).
For example, let:
A = [[1, 2],
B = [[5, 6],
A * B = [[19, 22],
A ∘ B = [[43, 46],
As you can see, while matrix multiplication produces a matrix of the product sums, matrix composition produces a matrix by applying the transformations in succession. The results are quite different!
So in summary, the main differences between the two operations are:
Denoted by *
Defined only if the number of columns of the first matrix equals the number of rows of the second matrix.
Calculates the product sums of corresponding elements.
Denoted by ∘
Applies the matrix transformations in succession.
Defined for all matrices of appropriate sizes.
Gives a different result than matrix multiplication.(1 vote)
- So, when Sal says, something something prime, such as at0:53, what does he mean?(2 votes)
- "Prime" often means "variation of something".
Here, "[a,b] prime" means a "variation of [a,b]" and specifically here, he refers to [a,b] after a transformation with the matrix A.(1 vote)
- Am I the only one who is having a hard time seeing how this is a composition? It seems like the first and the second step are totally different? We don't even do anything with the first matrix. 🧐🤔(1 vote)
It seems like just matrix multiplication(2 votes)
- just treat this 2x2 matrix as 2 of the 2x1 matrix (which means two separate vectors).
[0;5] [2;-1] (';' means newline) is basically:
[0;5]= 0*[1;0] + 1*[0;1]
[2;-1] = 2*[1;0] + -1*[0;1]
Applying transformation [-3,0 ; 1,4] only means :
[0;5]'= 0*[-3;1] + 1*[0;4]
[2;-1]'= 2*[-3;1] + -1*[0;4](1 vote)
- That's correct! The transformation matrix [−3 0; 1 4] can be applied to each of the vectors [0 5] and [2 −1] separately by multiplying each vector by the transformation matrix.
So, we have:
[0;5]' = [−3 0; 1 4] [0;5] = 0*[-3;1] + 1*[0;4] = [0;4]
[2;-1]' = [−3 0; 1 4] [2;-1] = 2*[-3;1] + (-1)*[0;4] = [-6;-2]
Therefore, the transformation matrix maps the vector [0 5] to [0 4] and the vector [2 −1] to [−6 −2].(1 vote)
- How matrix multiplication is different from the transformation composition in this context? Can we just say that if you want to apply transformation of matrix A first and matrix B second, you can just multiply BA matrix by your vector? And if so, why not using multiplication notation here? It sounds much more intuitive to me(1 vote)
- Hi, The Dreams Wind
To me, this video is kind of an introduction to matrix multiplication, which is taught (at least in the precalculus course) a few lessons later. I think the purpose of this lesson is for us to understand matrix multiplication before we even get to it, so that when we do learn it we're like, "Oh, I already knew this; I just didn't know I did. Matrix multiplication doesn't seem so daunting after all!"
I don't know... that's my idea, heh.
Good luck!(1 vote)
- [Instructor] So what I have here is two different transformation matrices, and what we're going to think about in this video is can we construct a new matrix that's based on the composition of these transformations? Or a simpler way of saying that, a new transformation that's based on applying one of these transformations first, and then the other one right after that. So first, let's just review what's going on. If we have some random vector here, a, b, we know that we could view this as a times the one, zero vector, the unit vector in the x direction, plus b times the zero, one vector, the unit vector that goes in the vertical direction. Now, if you were to apply this transformational capital A here, it tells you instead of using one, zero and zero, one, use these two columns instead. So if you were to apply the transformation here, I guess we could call it a, b prime, that is going to be, if you apply capital A transformation matrix, it's going to be a times, not one, zero, you use zero, five instead. And then plus b times not zero, one, you use two, negative one instead. So that's just a little bit of review. But what we're gonna think about in this video is, what would be the transformation matrix for the composition? And I could write that as B of A right over here. And you might recognize this from function notation, where essentially it's saying you would apply the function A first, and then whatever the output of that is, you would then input that into B and you'd get the output of that. And that makes sense, because you can view transformation matrices really as functions, functions that map points on the coordinate plane. So, in this situation, what would be the transformation matrix that is a composition of these two? Pause this video and think about that. All right, well, what would happen is, we would first transform any point using these two vectors, the zero, five, the two, negative one, because that's the first transformation we do. And then we would apply this transformation to whatever the resulting vector is. Now, that seems pretty involved and we don't wanna write it in terms of A or B's, we just wanna write it in terms of a transformation matrix. So one way to think about it is, we can transform each of these vectors that you have in matrix A. Because remember, that says, what do you turn the vectors one, zero and zero, one into? So if we transform zero, five using the matrix B, and if we transform two, negative one using the matrix B and we put them in their respective columns, we should have the composition of this. So let me write it this way, create a little bit of space. So let's say that the composition B of A is equal to, I'll write a big two-by-two matrix right over here. The first thing we can do is apply transformation matrix B to the purple column right over here. And what is that going to tell us? Well, that's gonna be zero times negative three, one. So let me write it that way. It's going to be zero times negative three, one plus five times zero, four, zero, four. And this is going to give us a two by one vector right over here. So that you can view as filling up the first column of this transformation, this composition, I guess you could say. And then let's think about this second vector right over here, two, negative one. If you transform that using B, what are you going to get? You're going to get two times negative three, one. So right here, two times negative three, one, plus negative one, or maybe I just write it this way, minus one times zero, four. And this doesn't look like a matrix just yet, but if you work through it, it will become a matrix. For example, if I multiply, well, zero times all of this is going to be zero, and then five times zero is going to be, let me just write it this way. This would turn into five times zero is zero, five times four is 20. And then this matrix right over here is going to be two times negative three, one, is going to be negative six, two, and then we have minus zero, four. And now if we wanted to write this clearly as a two-by-two matrix, this would be equal to, and we get a little bit of a drum roll here, the first column is zero, 20. And then the second column is going to be, let's see, negative six minus zero is still negative six and two minus four is negative two. And we're done. We have just created a new transformation matrix. It's based on the composition B of A. So if you apply transformation A first to any vector and then apply transformation B to whatever you get there, that is equivalent to just applying this one two-by-two transformation matrix, B of A.