The encoder consists of three convolutional layers. The number of features changes from 1, the input data, to 16 for the first convolutional layer, then from 16 to 32 for the second layer, and finally, from 32 to 64 for the final convolutional layer.
While transacting from one convolutional layer to another, the shape undergoes an image compression:
Data flow of encoding phase