This article was automatically translated from the original Turkish version.
Xception is a convolutional neural network (CNN) architecture widely used in deep learning for image classification. Proposed by Google in 2017, its name stands for “Extreme Inception,” reflecting its inspiration from the Inception architecture. The Xception design is particularly notable for its reliance on the principle of depthwise separable convolutions, which enables higher performance with fewer parameters.
Xception improves the information processing pipeline by replacing standard convolutional layers with depthwise separable convolutions, making the structure more modular and efficient. This design is based on the principle of filtering each channel independently before combining them.

Xception Architecture (Source =
Depthwise separable convolutions consist of a two-step process:
These two steps simulate the operations of a standard convolution more efficiently.
Xception consists of 36 convolutional layers, organized into three main blocks:

(a) Standard CNN (b) Depthwise Separable CNN (Source =
The Xception architecture achieves more efficient feature extraction compared to standard convolutions by leveraging depthwise separable convolutions.
The Xception architecture has been successfully applied to various image classification and object detection tasks, most notably on ImageNet. Similar separable convolution principles have also been adopted in models such as MobileNetV2 and EfficientNet. Its primary application areas include:
Xception Architecture
Depthwise Separable Convolutions
Structural Features of Xception
Applications